- Our Product
- The Science
- Count Engine
1. Who is HG Data?
Every day, HG Data indexes more than one billion unstructured documents across the open Internet, the archived Web and offline resources to produce a detailed, accurate census of B2B technology installations in use at companies globally. Included among the many resources indexed are B2B social media, case studies, press releases, blog postings, government documents, content libraries, technical support forums, website source code, job postings…and much more. HG Data offers its technology clients access to the world’s largest global B2B database of installed technologies. Our database is bigger, more accurate and loaded with greater detail than ever before thought possible.
In less than two years, HG Data has become the de facto standard for installed-technologies intelligence. We are the trusted advisor and must-have technology partner of the largest and most sophisticated technology companies in the world. Whether your need is market analysis, competitive displacement, predictive modeling, marketing campaigns, client retention initiatives or sales playbooks, HG Data enables its clients to target by installed technology like never before.
Welcome to the Holy Grail of Installed-Technologies Intelligence.
HG Data by the Numbers:
More than 5,000 technologies categorized
…manufactured by more than 1,700 technology vendors
…installed @ more than 1,200,000 locations globally
…with a tech catalogue growing at over 100% annually
…built through large-scale data science
…indexing more than 1 billion documents daily
…being used by more than 30% of the F500 Tech companies
2. How does HG Data build its database?
It’s not what you think. No telephone surveys. No web forms. No modeling. No crowdsourcing overseas. It’s over 2 billion documents in a proprietary repository comprised of active Internet, historic web and offline source files. But is isn’t just big. The real value of HG Data is our ability to separate the signals from the noise. It’s years’ worth of algorithms, natural language processing, supervised machine learning, language translation, OCR and proprietary ontologies that allows us to make sense every day out of a tsunami of unstructured data. Tools didn’t exist a decade ago that would have made this possible. But now it is. It’s real, it’s big, and it’s global. This is 21st century data science.
And it’s built on a triad of “ingredients” that represent the leading edge in database product development. Here are the basics
Part 1: Huge Volume of Unstructured Data
We have discovered new ways to efficiently collect and extract value from billions of unstructured documents. Until now, the cost of processing this much material made the approach impractical. Today, using state-of-the-art “big data” technology, our team can efficiently sift through seas of information to locate valuable data nuggets.
Part 2: Proprietary Data Service Access
The end product is enhanced through an optimum blend of multiple quality information streams, including firmagraphic data from Dun & Brad Street.
Part 3: Curation by Data Scientists
Based on the specific objective, our data scientists make adjustments to the resulting data set to assure a superior result for our clients. The culmination of this process is on-target relevancy and high accuracy.
We apply our secret sauce of proprietary algorithms to extract, parse, scrub, curate, match, mash and integrate the data into a detailed census of who is using what technology products at which companies and at which locations.
Target by Installed Technology
B2B technology sales and marketing organizations are then able to target by installed technology across more than 3,000 technology products being used at more than 500,000 companies with more than seven million contacts.
By knowing which prospects are running what technology products where, our customers are able to improve demand generation response rates, make marketing spend more cost-effective, increase lead-handling capacity, increase competitive win rates and shorten sales cycles.
3. What specific sources of unstructured data do you use?
We apply our Natural Language Processing to over a billion unstructured documents. Unlike many vendors focusing exclusively on social signals via APIs on the internet, HG Data has created a proprietary database which is the ‘raw material’ upon which its technology install database is built. Less than 10% is the active web, but mostly composed of archived web or never-on-web data. As a result, the foundation of HG Data’s build process is a vastly largely and more valuable warehouse of data on which to extract information.
4. How large is the installed-technology database?
It’s growing every day. Growing in new verified install dates, growing in new technologies, growing in finding existing technologies in new companies. Download our current Size of File document to see the current size of file and what we track.
5. How fast is the master database growing?
Very fast. At last count we were growing the file at about 10% per month, which is over 100% annually. The growth is coming from two sources. First, we are constantly adding to the source material, both current and historic, that we use to extract information. Second, we’re constantly specifying new and more granular categories of technologies to profile. Download our current Size of File document to see the current size of file and what we track.
6. What are the main ways your data are bundled for delivery?
We sell columns and rows. A flat file of the most accurate and largest dataset of installed technologies on the planet. Most folks are surprised that we ship a flat file to our clients. But when your data is being integrated into multiple systems of record, such as business analytics, marketing automation and CRM, a flat file is the most efficient. See a sample of our file format here.
Need something more? HG Data is fortunate to have leading vendors that have incorporated our data into their products and services. Whether its contacts, a UI, predictive analytics or using installed technologies as a filter to generate another list, changes are you can find an HG Data partner who can help you with your project. See our partner list here.
7. What technology products do you track?
If a software or hardware is worth tracking, chances are HG Data is already doing so. SaaS, Cloud, Security, Networking, Storage, Open Source, Applications, Mobile, Big Data, CRM, Virtualization, Visualization and Analytics are all major sectors of our country-specific datasets. Dozens of new technologies get added monthly to our market ready file that comprises the largest, most accurate and detailed database of installed technologies available today. HG Data by the Numbers: Verified to 90%+ accuracy by some of the largest B2B technology companies in the world 1.2 billion unstructured data objects indexed daily 1.3 million business locations profiled 400,000+ companies tech installed rolled up to a single entity 5,000+ technology products tracked 1,700+ technology vendors And more than 100% annual data growth
Go to our Count Engine to run some queries yourself.
8. What do your services cost?
Our datasets are expensive. Until you consider the alternatives. What is the cost of spray-and-pray marketing outreach? What is the productivity loss of reps groping around random internet pages running into dead-end after dead-end trying to figure out what technology you’re competing against? What are the financial implications of churn rates driven by not knowing when your competition has infiltrated your customers? No doubt that HG Data is expensive and a fraction of the cost of how you might be doing business today.
Go to our What it Costs page for more details.
9. How often is the master database updated?
The master database is updated daily.
10. Is the database telephone-verified?
As part of our QA process, we do telephone survey a statistically relevant portion of our database to confirm its accuracy. In this manner, telephone surveys are used to verify, not build, our dataset.
11. How accurate is the data set?
We believe our dataset is over 90% accurate and have had clients validate that claim time and time again. Because we track both historic and ‘current’ intallations, it is important for clients to understand how to filter our results to determine the best subset of our data for your particular use case. We have multiple fields, such as ‘intensity’ and ‘verfication date(s)’ that can help you ensure the accuracy of your purchase.
12. What do you guarantee?
We warrant that the database you order from us meets the exact specifications in the written order. If it doesn’t, for whatever reason, we guarantee we will make it right.
13. What tools do you offer to match data to my in-house database?
Clients will typically match on either URL or DUNS numbers. It is very common for clients to ask us to append an entire class of technologies, such as ‘all networking technologies’, to their house file…and often expand that house file with net new companies for which we have verified installations of certain technologies.
14. If I re-order at a later time can I avoid getting duplicates?
Yes, if you specify what records to suppress in the re-order.
15. How can I use the data?
Your organization is licensed to use the data for marketing, sales or research purposes for the term of the license. You may not allow other organizations to use the data for any purpose.
16. May we license your data for applications that go beyond internal use?
17. How often can I use the data?
The data are available for use throughout the term of the license.
18. Can I buy the database one time?
Both one-time and annual subscription agreements with monthly or quarterly updates are available.
19. Is there a minimum order size?
Our minimum order size is $5,000.
20. How quickly can I expect delivery of the data?
Standard files will usually be delivered within 24 hours. Custom files might take a little longer depending on complexity and work load. We pride ourselves on fast turnaround.
© Copyright 2014 - HG Data - All rights reserved