UnclaimedDomain
Back to Blogs

NameBio Just Released a 100k+ Keyword Dataset (and an API to Query It)

A deep dive into NameBio's keyword dataset and how it can help you find valuable domain names.

Domain investing has always been about finding the right names at the right time. With NameBio's keyword dataset keyword dataset, that process just got a whole lot easier. We’ve been digging into NameBio’s latest release, and it’s one of the more interesting updates we’ve seen for domain research in a while. This isn’t just another API. It’s a dataset of 100,000+ keywords backed by real retail domain sales, along with an API that lets you query that data instantly. It feels like a meaningful step forward in how keyword value can be understood beyond just comps and intuition.

So what did NameBio actually release?

NameBio has been tracking domain sales data for over a decade. Their new keyword dataset takes this a step further by analyzing which keywords appear most frequently in high-value domain sales. They just released a new Retail Keyword Stats API, alongside a downloadable dataset of 100,000+ keywords derived from real domain sales. This is based on retail transactions, which makes the data far more relevant for anyone trying to understand end user demand rather than just wholesale activity.

The API allows you to search any keyword and see how it performs in actual retail sales. Instead of relying on scattered comps, you can pull structured data instantly for a single keyword and understand how it behaves across the market.

For each keyword, the data is broken down by position. This includes exact match where the keyword is the full domain, as well as when the keyword appears at the start, at the end, or somewhere in the middle. This positional breakdown is something that has always existed in theory, but has been difficult to measure at scale until now.

What data are they actually providing us?

For each keyword position, you can see the number of sales, total dollar volume, average sale price, highest sale price, and standard deviation. This gives a much clearer view of how a keyword performs depending on where it appears in a domain, something that’s historically been difficult to quantify in a consistent way.

The dataset is where this gets more interesting. (In my opinion)

On top of the API, NameBio also released a downloadable dataset of 100,000+ unique keywords. This includes all placements such as exact, start, end, and middle, along with full sales statistics for each keyword. Instead of querying one keyword at a time, you can work with the entire dataset and start spotting broader patterns across categories and niches.

Most domain decisions are still being made using intuition, pattern recognition, and comps. That works, but one thing that has always been harder to measure is how keyword placement impacts value. Instead of guessing which keywords are actually valuable or whether they work better at the start, end, or middle, you can now look at real retail sales data across thousands of keywords instantly and base decisions on what’s actually selling, not just what sounds good.

This feels like a step toward more data driven domain investing. Not everything can or should be reduced to numbers, but having access to structured keyword level data adds a level of clarity that has been missing. It is especially useful when compared to inflated automated valuations or guess based keyword trends that often do not reflect real buyer behaviour.

How are we using this new data?

We’ve been experimenting with this data internally to quickly check keyword performance, compare placement positioning, and get a sense of real demand before registering. It is not a replacement for judgement, but it adds another layer to the decision making process. We’ve also put together a simple interface for exploring this data if anyone wants to try it.

NameBio has always been one of the most valuable resources in the space. Making this level of keyword data accessible both as a dataset and via API opens the door for more informed decisions, better tools, and ultimately better domain registrations.

Curious to see how others here are planning to use it or if it's currently being used anywhere else and if so, how?