Everyone into the (data) Pool
There’s a quiet revolution going on in agriculture, much of it riding under the label of “precision agriculture.” What this means is that farms are finding they can use data both to increase their productivity and their crop yields.
To provide just one vivid example, unmanned tractors now routinely plow fields, guided by GPS and information on how deep to dig in which sections of the field for optimal results. Seeds are being planted variably as well. Instead of just dumping seeds in the earth and hoping for the best, precision machinery, guided by soil data, now determines what seeds are planted and where, almost on an inch-by-inch basis.
It’s a big opportunity, with big dollars attached to it, and everyone is jockeying to collect and own this data. The seed companies want to own it. The farm equipment companies want to own it. Even farm supply stores – the folks who sell farmers their fertilizer and other supplies want to own it. In fact, everyone is clamoring to own the data, except perhaps the farmer.
Why not? Because a farmer’s own soil data is effectively a sample size of one. Not too valuable. Value is added when it is aggregated to data from other farmers to find patterns and establish benchmarks. It’s a natural opportunity for someone to enable farmers to share their data to mutual benefit. This is a content model we call the “closed data pool,” where a carefully selected group agrees to contribute its data, and pay to receive back the insights gleaned from the aggregated dataset.
One great example of this model is Farmers Business Network. Farmers pool their data and pay $500 per year to access the benchmarks and insights it generates. Farmers Business Network is staffed with data scientists to make sense of the data. Very importantly, Farmers Business Network is a neutral player: it doesn’t sell seeds or tractors. Its business model is transparent, and farmers can get data insights without being tied to a particular vendor. Farmers Business Network makes its case brilliantly in its promotional video, which is well worth watching: https://www.youtube.com/watch?v=IS4KIrcRMMU
Market neutrality and a high level of trust are essential to building content using the closed data pool model. But it’s a powerful, sticky model that benefits every player involved. Many data publishers and other media companies are well positioned to create products using this model because they already have the neutral market position and market trust. Closed data pools are worth a closer look. Google certainly agrees: it just invested $15 million into Farmers Business Network.
Is Your Data "Datanyzed"?
A new product by a cool young company called Datanyze is capitalizing on some well-established infocommerce best practices. Here’s how they did it.
The core business of Datanyze is identifying what SaaS software companies are using (sometimes called a company’s “technology stack”). To do this, Datanyze interrogates millions of company websites on a daily basis, looking for telltale clues as to the specific software they are employing online, and apparently a lot of categories of software can be divined this way. Datanyze aggregates and normalizes these data, then overlays company firmographic data (Alexa website rank, contact information, revenue estimates) to create a complete company profile.
Datanyze links directly to the Salesforce accounts of its customers, so it can add and update prospects on a real-time basis. At a basic level, the use case for this product is straightforward: a marketing automation platform like Eloqua could use it to find companies using a competitor or no marketing automation at all. But wait, there’s more!
Datanyze’s new product essentially flips this service. Now, Datanyze clients can have Datanyze analyze their existing best customers, and Datanyze will build a profile of these customers that can be used to predictively rank all their prospects, current and future. Here are the best practices to note:
- The transition of Datanyze from a data provider to an analytics provider, something that’s happening industry-wide
- The shift from passive (we supply the data, you figure out what to do with it), to active (here are top-rated prospects we’ve identified for you), and the associated increase in value being delivered by the data provider
- The tight integration with Salesforce means that Datanyze customers just need to say “yes” and Datanyze can get to work – no IT involvement, no data manipulation, no delays
- Datanyze is pouring leads into critical, core systems of its customers, a strong example of workflow integration
- The use of inferential data. Boil down a lot of the analytical nuance, and Datanyze has discovered that companies that buy expensive SaaS software are better prospects for other kinds of expensive SaaS software. Datanyze doesn’t know these companies have big budgets; but it does know that these companies use software that implies they have big budgets
Datanyze offers a concrete example of how data companies are evolving from generating mountains of moderate value data to much more precise, filtered and valuable answers. Are you still selling data dumps or analytics and answers?
The Award for Outstanding Performance Goes to Internet Movie Database
We awarded the Internet Movie database a Model of Excellence in 2003, and it is still a standout in terms of innovation and best practices.
The Internet Movie Database (often called by its acronym IMDB) originally started in the UK as a non-profit undertaking, and it may well be the earliest and most successful example of crowdsourcing – well over a decade before the term was even coined. Very simply, the IMDB was a site for movie buffs worldwide to build an enormously detailed database of every movie ever made. And we are talking about a serious level of detail. Want to know who was the hairstylist for the co-star of an obscure French drama from the 1950s? Well, IMDB was the go-to source. What also made IMDB interesting was that from its inception it was a true database, and despite the inherently unruly nature of crowdsourcing, there were enough committed volunteers to take on the unsexy work of removing duplicate entries and normalizing the data.
In 1998, IMDB was quietly acquired by Amazon and turned into a for-profit company. There are some great best practices to be observed here. Taking over and commercializing a site built by tens of thousands of unpaid, die-hard movie fans was a risky proposition. The backlash could have killed the business in short order. But Amazon left IMDB alone, infusing it with editorial resources so the database got bigger and better every year. Better data, less work and all free. Not much here to get upset about!
But Amazon (surprise!) wasn’t in this to be charitable. First, it started marketing to the substantial audience of IMBD users with links to its site. Like the movie? Great. Amazon can sell you a copy.
Amazon’s next move was sell sponsorships to movie studios eager to promote upcoming releases. From there, Amazon launched a subscription-based Pro version of the database that offered enhanced searching and even deeper content to movie industry professionals for research purposes. The core site remained free, meaning Amazon was a pioneer with the freemium model, well before that term had become popular.
Is Amazon now resting on its laurels? Absolutely not. To support both its Kindle and Amazon Prime offerings, Amazon has launched a service called X-Ray, powered by IMDB. Amazon also selectively licenses this new data capability. What X-Ray does is link movies to the IMDB database, so users can visually identify actors in the film, find movie trivia, explore the movie soundtrack and much more, right while watching the movie. It’s not all software magic, by the way. Amazon is doing a lot of the necessary linkages manually, but it already has thousands of movies coded. Also of interest, it’s touting its “X-Ray Enabled” badge that if it plays its card right, could someday become a differentiator for new movie releases.
Endless innovation. Strong support of its core e-commerce platform. Deft handling of often prickly enthusiast community. Endless monetization. This is where data is going!
Upping the Data Ante
Step back a bit from the fray and you’ll see an interesting evolution in the world of data: from providing lists of people or entities that might be prospects, to lists of people or entities that should be prospects, based on something they have done (think sales triggers). Now we’re beginning to move squarely into what used to be the realm of science fiction: identifying prospects before they have done anything at all.
We’re blazing new trails here, and pre-prospecting (for lack of a better name) depends heavily on lots of input data and Big Data analytics. The 800-pound gorilla in this space right now is a company called InsideSales that calls its analytical secret sauce “Neuralytics.”
All hype, you say? Well some level of hype is a given these days, but the company has raised over $139 million to date, and Salesforce.com in particular has fallen hard for the company’s pitch, and actually led its most current funding round, that also included Microsoft.
I don’t have any inside knowledge of what InsideSales is up to, but from the tantalizing tidbits that have surfaced in the press, it seems to be a combination of obvious inputs such as social media feeds, plus less intuitive things such as weather patterns and sports team scores. I can only guess that you’re a somewhat better prospect if it’s sunny out and your team won last night, but perhaps these data are being used in a more subtle and sophisticated way.
The other hint I picked up is that InsideSales depends on “email and phone records” to perform its analytical alchemy. Needless to say, these tend not to be public records, so to deliver the holy grail of sales prospecting, InsideSales apparently depends on the holy grail of input data as well!
I’m not dismissing InsideSales, primarily because I am doing some big league speculating here. But I will say there are data sources available today that get us a long way towards the notion of pre-prospecting. What excites me the most is what is going on today with online ad re-targeting. Ad re-targeting is based on what might be described as networked cookies. Visit a site, and a common cookie is placed on your computer. As you move to other sites that are part of the network, ads can be displayed based on sites you’ve previously visited. More importantly, your travels around the Internet can be centrally stored, creating a wealth of information about you, your interests, your habits and much more. While not easy, it is a straightforward leap to start learning about not only what interests you but also what are the early signs that you are beginning to contemplate a purchase.
Privacy isn’t the issue in re-targeting (at least for now), because nobody needs to know who you are for re-targeting to work. But as your movements around the Internet are recorded and analyzed, it is entirely possible that we’ll someday know when you’re thinking about buying something, and perhaps even a little before.
The next generation of sales insights likely isn’t all that far away, so now is a good time to do some pre-pondering on what it might mean to you and your business.
How Many Ways Can You Monetize Data?
I watch the real estate sales vertical with great interest. There’s a lot of data, and money here, which in turn means a lot of innovation and competition. Companies like Trulia, Zillow (which are poised to merge shortly), Move (which operates the Realtor.com site) and a host of fascinating and scrappy regional players such as PropertyShark makes for endless creativity and impressive user experiences. The first thing you notice about all the online real estate information services is that none of them is trying to disintermediate real estate brokers. Indeed, these services typically have business models that depend on agents for revenue. Thus what has happened in this very unusual market is that customers have taken on the primary work of discovery (formerly a big part of the agent’s job), even though agents haven’t reduced their commissions to reflect this.
The second thing you notice is the wealth of structured data that is available for parametric searching. Search by zip code, price range, bedrooms, lot size, and much, much more. In fact, such powerful searching is table stakes now. Map integration? Done. Alerts? Done. Rich multimedia? Done. So what’s left to innovate?
Zillow burst onto the scene (beautifully timed to coincide with our late, great real estate boom a few years back) with its audacious system that put a price valuation on every home in the country. That brought it tremendous visibility, but also introduced consumers to the power of predictive analytics.
Trulia later upped the ante by overlaying neighborhood crime statistics on its database. Not to be outdone, its competitors overlaid school district boundaries to map the schools nearest to each home. Trulia then upped the ante again, licensing data from our Model of Excellence winner GreatSchools.org, that showed the relative quality of each school. And that’s where the market seems to be headed today – qualitative assessments of neighborhoods, along with more predictive analysis.
As you might expect, qualitative assessment starts with Census demographic overlays. Real estate site Movoto.com is already there, with zip-level income, education and ethnicity. Some other sites are hesitating because of the vagaries of real estate anti-discrimination laws. But that is not an impediment to third-party data providers such as Onboard Informatics, which provides a raft of local data, including an innovative “lifestyle search engine.” Other sites like neighborhoodscout.com provide sophisticated demographic views of local areas. And we’d be remiss not to acknowledge diedinhouse.com for those who need to know if former home occupants left on their own power or not.
But what’s most fascinating is that this lifestyle analysis of neighborhoods has even been elevated to a personalized, consultative model. The New York Times recently profiled a New York area firm called Suburban Jungle that helps homebuyers target areas based both on demographics and deep market knowledge. Suburban Jungle doesn’t sell real estate; it refers its clients to real estate agents in exchange for a fee-share, another great example of how many different ways data can be monetized.