General questions

We’ve collected the most frequently asked questions right here. Can’t find what you are looking for? Contact us and we’ll try to get back to you within 24 hours.

What is Dataprovider.com?

Dataprovider.com is a database in which we collect information extracted from more than 280 million domains in 50 countries. We structure and show over 125 variables per website, including contact details, payment methods and technological aspects. Our database allows you to create your own data sets, so the data matches your exact criteria.

Is Dataprovider.com a search engine?

We do share some similarities with a search engine. Like Google or Bing, we crawl the web and collect information. However, we go beyond what other search engines do. We structure the data, and make this data available to you in a searchable database. This way you can find and extract any data you are looking for in a quick and convenient manner by using filters.

For example: if you want to find a Danish website that sells toys, your regular search engine can help you out. But what if you want a list of all the toy retailers in Denmark? That’s where we come in. In our database you can search for this list quickly and even download the information you find. We also collect and provide information that is difficult to retrieve with a conventional search, such as technological aspects and hosting information.

What is a data set?

A data set is a collection of information based on criteria set by you. You can select the variables and filters you need, save the data set and run it on a regular basis. This way you get up-to-date information without spending much time. For example: every two months, you can look for all websites in France that use WordPress and have more than 500 visitors a month.
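
As a rough illustration of the example above, a data set can be thought of as a saved filter applied to website records. The sketch below is not the actual Dataprovider.com interface: the function `run_data_set`, the field names (`country`, `cms`, `monthly_visitors`) and the sample records are all hypothetical.

```python
# Hypothetical website records; the field names are assumptions,
# not real Dataprovider.com variables.
SAMPLE_RECORDS = [
    {"hostname": "exemple-un.fr", "country": "FR", "cms": "WordPress", "monthly_visitors": 1200},
    {"hostname": "exemple-deux.fr", "country": "FR", "cms": "Joomla", "monthly_visitors": 900},
    {"hostname": "exemple-trois.fr", "country": "FR", "cms": "WordPress", "monthly_visitors": 300},
    {"hostname": "beispiel.de", "country": "DE", "cms": "WordPress", "monthly_visitors": 5000},
]


def run_data_set(records, country, cms, min_visitors):
    """Return the records that match every criterion of the saved data set."""
    return [
        r for r in records
        if r["country"] == country
        and r["cms"] == cms
        and r["monthly_visitors"] > min_visitors
    ]


# French WordPress sites with more than 500 visitors a month.
matches = run_data_set(SAMPLE_RECORDS, country="FR", cms="WordPress", min_visitors=500)
print([r["hostname"] for r in matches])
```

Running the same saved criteria again later, against refreshed records, is what keeps the resulting list up to date.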

Who is Dataprovider.com for?

Dataprovider.com is a powerful resource for businesses, developers, governments and universities. Our database can be used to generate sales leads, research competitors, look at technology trends and much more.

Is Dataprovider.com legal?

Yes, it is. We collect information using technologies similar to those used by search engines like Google and Bing. With these technologies we follow the best practice conventions outlined in the robot exclusion protocol. In addition, we only crawl information that is publicly available.

Where can I opt out?

If you don't want to be indexed by Dataprovider.com, you can always opt out here.

How it works

How does Dataprovider.com retrieve its data?

We gather all our data using advanced proprietary software, also known as a spider or crawler. This software downloads up to 50 pages per website. We then analyze and structure the data we receive. The only external data source we add to this is Alexa. On an average day we download, analyze and summarize over 50 million pages and make all this data accessible in our database.

Do you only download and analyze homepages?

No, we don’t. This is the key difference between Dataprovider.com and other services. We download up to 50 pages per website, which allows us to retrieve all sorts of data such as contact details and technical information. In addition, we’ve trained our spider to look at the type of page it indexes. This way we can provide more accurate data than other search engines. For example: when our spider finds contact details on both the contact page and a terms and conditions page, it knows that the information on the contact page is more likely to be correct.
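
The idea of trusting some page types more than others can be sketched as a simple weighted choice. This is only an illustration under stated assumptions: the crawler's real logic is not described here, and the `PAGE_TYPE_WEIGHT` table, its values and the function `pick_contact_detail` are hypothetical.

```python
# Hypothetical trust weights per page type; higher means more trusted.
PAGE_TYPE_WEIGHT = {"contact": 3, "about": 2, "terms": 1}


def pick_contact_detail(candidates):
    """Pick the value found on the most trusted page type.

    `candidates` is a list of (page_type, value) pairs.
    Unknown page types get a weight of 0.
    """
    if not candidates:
        return None
    best = max(candidates, key=lambda c: PAGE_TYPE_WEIGHT.get(c[0], 0))
    return best[1]


# The same site lists two different phone numbers (made-up values).
found = [("terms", "+31 20 000 0000"), ("contact", "+31 20 111 1111")]
print(pick_contact_detail(found))
```

In this sketch the number from the contact page wins because that page type carries the highest weight.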

Do you analyze every site within a country?

Like other search engines, we index virtually all websites in the countries we currently track. There are some exceptions, however, such as websites that do not allow us to index them.

How current is your data?

We aim to re-index all websites we track each month. In our experience this is frequent enough to make sure that all data stays up-to-date, without putting an unnecessary burden on the websites we index.

How accurate is the data?

Most of the data we collect is very accurate. We can say this because technical or operational data we collect, such as shopping cart systems or CMS information, is very straightforward to find and has little possibility of error. Site content such as contact details can be more complex to collect, because of the many ways in which people choose to present it and the national conventions used. We therefore crawl multiple pages to verify this information, and offer an ‘information certainty index’ in our database to give you an indication of how accurate the data presented is.

Does Dataprovider.com observe the robot exclusion protocol?

Yes, we do. The robot exclusion protocol is a set of instructions that specifies which areas of a website a robot can and cannot process. We adhere to these instructions and exclude any directory or content with an indication that it should not be indexed.
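
To show what such instructions look like in practice, the sketch below checks two URLs against a small robots.txt file using Python's standard `urllib.robotparser` module. The robots.txt content, the `example.com` URLs and the user agent name `ExampleBot` are made up for illustration and are not taken from the Dataprovider.com crawler.

```python
from urllib.robotparser import RobotFileParser

# A made-up robots.txt that closes one directory to all robots.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a placeholder user agent name.
allowed = parser.can_fetch("ExampleBot", "https://example.com/contact")
blocked = parser.can_fetch("ExampleBot", "https://example.com/private/report.html")
print(allowed, blocked)
```

A crawler that observes the protocol would skip the second URL, because it falls under the disallowed directory.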

What formats are data sets available in?

You can download the results of your searches and data sets in CSV format, or alternatively use our API.
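
A downloaded CSV file can be processed with standard tools. The sketch below uses Python's built-in `csv` module on an in-memory stand-in for an export; the column names (`hostname`, `country`, `cms`) and the rows are hypothetical and may not match a real export.

```python
import csv
import io

# Stand-in for a downloaded export file; column names are assumptions.
EXPORT = io.StringIO(
    "hostname,country,cms\n"
    "example.dk,DK,WordPress\n"
    "example.fr,FR,Drupal\n"
)

# Each row becomes a dict keyed by the header line.
rows = list(csv.DictReader(EXPORT))
danish = [row["hostname"] for row in rows if row["country"] == "DK"]
print(danish)
```

For a real file, `io.StringIO(...)` would be replaced by `open(path, newline="")`.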

Does Dataprovider.com provide API access?

Yes, we do. Contact us for more information.

Pricing

How does Dataprovider.com pricing work?

Our pricing is based on a monthly subscription plan with a minimum term of one or two years, depending on your contract. Contracts start at 10,000 euros per year. For more detailed information, contact us.


Trusted by

SIDN
Google
PayPal
HomeAway
GoDaddy
Homify
Catawiki
Donuts
Symantec
Creditsafe
Lexis Nexis
Lloyds bank
Cendris
1&1
Dun & Bradstreet
Government of The Netherlands
YellowPages.ca
MultiSafePay