The Mirage of Data Big Data and data-driven marketing are two of the most prominent business buzzwords. New technologies are giving businesses access to larger amounts of data and tools that help us analyze data faster, in larger volumes than ever before. It feels empowering in our quest for the answers. We want data to illuminate the way for us. Data enables us to make the right decisions and to bulletproof our plans – Or does it? “It is possible to be drowning in data and still none the wiser” – Paul Laughlin A couple years ago, one of the World’s most innovative companies was about to launch a massive project. They invested in a research project as large as they come, hiring three of the top consulting companies at the time, and spending 8 figures to interview 200,000 people in 54 cities and 22,000 individuals at 3000 corporations. All this research and data should have guided the project and ensured success, right? Recommended for You “A great part of information obtained in war is contradictory, a still greater part is false, and by far the largest part is of doubtful character” – Karl von Clausewitz We are about to drown in an ocean of data Back in early 2013, Big Data was already an overhyped term with a questionable path to business value. Unfortunately, the hype continues to grow and companies are focusing on collecting more and more data. One of the better-known big data technologies helps organizations collect IT systems-information well in excess of 100 GBs of data per day. To put this number into context, consider the entire Encyclopædia Britannica, which uses less than 1 GB of data. How are business people supposed to analyze all this data? “There is an unlimited amount of data in a digital world. Many times people try to do the most complex way of using the data.” – Rishi Dave, CMO, Dun & Bradstreet A recent survey found that only 6% of marketers have the talent to leverage marketing analytics. Most marketers are not taking advantage of the data we have today, web analytics being a prime example. Then, a recent ReadWrite article appropriately titled “Gartner on Big Data: Everyone’s Doing It, No One Knows Why”, states “according to a recent Gartner report, 64% of enterprises surveyed indicate that they’re deploying or planning Big Data projects. Yet even more acknowledge that they still don’t know what to do with Big Data. Have the inmates officially taken over the Big Data asylum?” More data is harder. The value is not really in the quantity of the data, but on knowing what to ask for and deriving the right insights. We are entering an age where organizations will find they are rich in data but poor in customer-insights. “But of course, the quantity of data is entirely beside the point if the data aren’t of a good quality.” – Phil Rosenzweig, the Halo Effect The danger is all this data can be creating a Mirage The mirage is that the answer is in the data, that more data is better, that if we collect enough data and analyze it we will arrive at the right answer. A couple years back, Mohan Sawhney, one of the most brilliant marketing academics from the Kellogg School of Management, gave us a problem to solve. The case study included a couple data tables and multiple data points: prices, costs, volume, market research, etc. Everyone in the class, myself included, fired up our spreadsheets and came up with a couple conclusions – all wrong. Our mistake was assuming that because we had a lot of data, the answer must be in data analysis. The answer was in front of us the entire time. There was no need to analyze the data to find the answer, because the answer was found in basic marketing strategy. The data was a trap. “A flood of data should never be allowed to was away your common sense” – Jack Trout The 7 Hazards That Create a Mirage of Data 1. Data only looks at the past. Despite conventional wisdom, data is limited as a predictor of the future. Otherwise, it would be easy to predict stock prices, sports scores, or even lottery numbers. Einstein knew the act of observation itself alters reality. By the time you collect and analyze data, the environment will have changed, limiting its usefulness. 2. Data does not tell you why and fails to reflect emotion. It is so easy to confuse information with evidence. Modern management practices focus on quantification of all the elements of a business. We are all human. We make emotional decisions and then justify them rationally. From your morning Starbucks, to the car you drive, to billion-dollar decisions in corporations, it is all driven by emotions. You cannot fully understand customers through a numerical lens. Data can show us what customers purchased, how they paid, and how often they visited. It cannot tell us why. As Harley Manning and Kerry Bodine tell in their book, Outside In, “We recently spoke with the head of consumer insights at a major cosmetics manufacturer who was building a repository of shopper purchasing patterns from retailers PoS systems. Although he insisted that he wanted to create a ‘true 360 degree view’ of his customers, he wouldn’t even consider our suggestion to study how women actually use his company’s products in the bathroom every morning. Women of the world should be outraged that a guy in a suit has the audacity to think he understands them based solely on when and where they shop for makeup.” (P.s. I have lived with my wife for 16 years, and still don’t understand her, or her cosmetics buying process). HBR notes “Human behavior is nuanced and complex, and no matter how robust it is, data can provide only part of the story. Desire and motivation are influenced by psychological, social, and cultural factors that require context and conversation in order to decode. Data …reveals what people do, but not why they do it.” In other words, trusting data alone assumes the data you have is accurate, complete, and reflects or captures all relevant aspects of consumer behavior. 3. Data is always biased: Every step in data collection and analysis has potential for introducing bias. Your data is most likely biased by the method you use to collect it, the customers you collect it from, the data available to correlate it with, and the way you interpret the data. Psychologists tell us we are all victims of confirmation bias, giving more value to the data that confirms our hypothesis and ignoring the data that contradicts it. Iridium failed to ask if people would pay for the service at a particular price. That was their bias. Surveys are biased by the subset of data you survey and the subset of those who respond. Your transaction data is biased based on your existing customers, which may not accurately represent the entirety of the market. You are blind as to why non-customers fail to buy. Collection can result in a self-fulfilling prophecy: display the most popular products on your website, make them even more popular, not because they are the best products, but because you chose them as such. 4. We often look at the data we can collect, not at the data we need. Not all data can be useful. Not all data is interesting and most is not insightful. We tend to assume the data we have (or what we can collect or observe) must have the answer to our questions. The weight of a television set has nothing at all to do with the clarity of its picture. Even if you measure to a tenth of a gram, this precise data is useless. The number of Twitter followers a person has is probably not a good indicator of actual influence, even if it easy to measure. It takes guts to stop measuring things that are measurable, and even more guts to create things that don’t measure well by conventional means. An online retailer can focus on conversion rates (percentage of visitors who buy), while being blind to the fact those larger number of buyers have a lower lifetime customer value (LTCV). Conversion is far more easy to measure than measuring and correlating with LTCV. 5. Data makes it easy to confuse correlation with causation. In search for an ROI story, social marketers are quick to point out the higher customer value of those who follow a company on social networks, implying they buy more because they follow them. However, it is more plausible that they follow the retailer on social networks because they are loyal customers. Without understanding the why and the emotions behind people’s decisions, it is hard to properly interpret data. Your data shows a customer purchased a large ticket item and a small ticket item, which one was the original reason why he showed up at the store? Which one was the impulse buy. 6. The delusion of a single explanation. It is common to look for the (singular) root-cause for a problem – to try to find the one thing that drives customer buying decisions. The reality is that the way people make decisions is far more complex. Dozens of factors have some influence on a single decision. It is like proposing for marriage: would you be able to tell why she said ‘yes’? Was it your looks, your money, your smile, or your sense of humor? As much as we would like a simple answer that clearly points to a single explanation, chances are events and decisions are results of much more complex, interdependent factors. 7. More data (or big data) is not better data. You can survey thousands of people and ask them who invented the light bulb, which will result in a false sense of security in perfectly inaccurate data. We tend to believe that more data will give us more precision, when it usually has the opposite effect. It is easy to draw a chart with two axis and it is possible to compare data using three dimensions. Adding a fourth, fifth, or sixth dimension makes data visualization an impossible problem in itself. With more information and more data points to correlate, we are forced to ignore or minimize large sets of data, increasing the chances of arriving at the wrong conclusion. Malcolm Gladwell’s Blink makes a pretty good case for the accuracy of gut-based, split-second decisions over those made after more careful and pseudo-scientific evaluation. What to do, if data can create reality distortion? The best insights come from customers, not spreadsheets or analytics Sometimes we look for technology, sophisticated data models, complex analytics and expert advice to tell us more about customer behavior, when it would be easier to simply get out of the office and listen to them. The best insights come from observing and listening to customers. If you make it a habit to speak with a customer or three, every day, you will gain insights that no computer technology can give you. Bernadette Jiwa captured it beautifully: “We assume that the most valuable data is static and lives on graphs and spreadsheets….The truth of what we need to know and some of the most valuable data live in plain sight. The wrinkled nose of the diner. The sigh of the shopper waiting in line. The posture of the customer as she walks out the door. What she packed in her bag before she left home this morning. Noticing what people do is often more valuable to us than listening to what they say they think” I am a believer in the power of Big Data technology and the business value of insights. I have seen the success stories first hand. What I am pointing to is Gartner Research calls (link) ‘a peak of inflated expectations‘ that is almost always followed by a ‘trough of disillusionment’, that there is a danger of following buzzwords and trends blindly, and the danger in relying purely in metrics and analysis for decision making. Understanding the limits of what data can tell us, identifying the challenges in interpreting data, and being deliberate at capturing simple but very useful data can lead to insights and then to action. Taking advantage of data, along with non-data derived insights, results in better decision making. If he had relied purely on data, Steve Jobs would have never considered building the iPhone (click on the link for the story). It would be a misunderstanding to conclude I am suggesting marketers ignore data. To the contrary, data is fundamental for every businessperson. Data can be especially useful for marketers. The point is to be aware of the limitations and traps of data, to better comprehend its usefulness and to extract real value from the data we have. |