Text size
An earthquake has struck a developing country, devastating a large city. Relief agencies and the local government have responded, but they’re struggling to get an accurate picture of the situation so they can rush food, water, and medical supplies to where they’re needed most. First-hand observations and aerial photos are not enough to capture a fluid situation in which tens of thousands of people are on the move, making it hard to deliver supplies effectively or to track the spread of disease as sanitation breaks down.
Fortunately, data saves the situation. Analysis of millions of anonymous call records from a mobile-phone provider reveals where the SIM cards (and therefore subscribers) are, allowing responding organizations to track the daily movement of the population in and out of the zone. When combined with census data from before the disaster, it becomes much easier to assess needs accurately and prepare for a possible outbreak of disease. Many lives are saved.
This dramatic solution is now a real possibility. Around the world, social innovators are starting to use “big data”—datasets whose size is beyond the ability of typical software programs to capture, manage, and analyze—to address critical problems. As the McKinsey Global Institute (MGI) has written elsewhere, big data can create significant value in the world economy, enhancing the productivity and competitiveness of companies and the public sector and creating economic surplus for consumers.1 But big data may be equally important in the social sector, where they can be used to identify and address important societal issues. The disaster response scenario outlined above has already been tested by researchers; others are proving the utility of big data in fields like microcredit, education, and public health.2
There’s plenty of data for the sector to work with. Many organizations with a social purpose have been collecting their own big data for years, and there is significant data in the hands of institutions such as developing country governments, various branches of the UN, and public health organizations such as the Global Fund to Fight AIDS, Tuberculosis and Malaria. Some organizations such as the World Bank are moving aggressively to open this data to the public. And new data collection initiatives will produce even more data—at 4 to 8 MB of data per person, India’s Unique Identification Number database could quickly grow to as much as 20 petabytes of data, more than 80 times the size of the entire Library of Congress.3 Finally, many “big” datasets in the private sector might be adapted to help produce social impact, even if they were not designed for that purpose. The group of researchers that tested the disaster response scenario above also used location data from 138,000 cell phones to understand the magnitude and trend of population movements over the course of a cholera outbreak in Haiti.
For social innovators and other organizations interested in producing social change, the question is where to start. When is big data valuable in the social sector? How much impact could it have? And what do we need to do to produce that impact? Some of our current research in McKinsey’s Social Sector Office aims to answer these questions.
MGI’s original report argued that big data could create value in five ways in the private sector. Our research suggests that these five sources of value, slightly modified, apply in the social sector as well.
The social sector has barely begun to tap the potential of big data. As these examples suggest, organizations that systematically integrate big data into their strategies can generate significant new impact. But in order to do so, they must overcome several barriers.
The biggest, by far, is simply gaining access to data. Large amounts of potentially important data are held by organizations that may not make full use of it themselves (such as governments) or that use it mostly for proprietary reasons (such as many private companies), and privacy concerns hold more back. The “open data” movement now in evidence at organizations such as the World Bank is one step forward, but much more thought must be given to policies and incentives that would put more data into circulation in a sustainable and responsible way.
There are also questions about data compatibility and quality. Much of the data stored is not easily useable. Legacy systems and inconsistent data formats make it more difficult to integrate datasets in the ways that generate value. In addition, many existing data sets often suffer from poor quality—incorrect or mismatched information, gaps, or extensive “noise.” Even if they overcome these hurdles, many interested organizations lack the computing power and software tools required to integrate, analyze, and visualize big data—although the advent of cloud computing could address some of the problem.
A major shortage of talent—deep analytical talent, data-savvy managers, and supporting technology personnel—presents an additional hurdle. This is already an issue in the private sector, where MGI estimates that there will be an annual shortage of graduates in deep analytical fields of 140,000 to 190,000 by 2018 in the United States alone. Despite the efforts of organizations such as Data Without Borders, which matches scientists and statisticians with nonprofits for pro bono data work, shortages in these roles may be even more challenging in the social sector.
Philanthropists and practitioners can take a first step toward removing these barriers by starting to think strategically about big data—where big data might help advance their goals, what datasets and analysis would be required to do so, who they could partner with to produce the analysis, and what the partner’s incentives might be. They should also consider how to build a stronger infrastructure around big data in the social sector: for example, how to move to common policies and priorities, share best practices, and develop talent. The findings from our ongoing research will highlight some of these steps, as well as those that can be taken by partner organizations in the public and private sectors.
Done well, big data has the potential to advance important social goals in areas such as disease surveillance, student curricula, and microcredit. Understanding big data’s potential for social impact and the barriers to capturing it are the first steps toward its effective use.
1See Big data: The next frontier for innovation, competition, and productivity.
2For the disaster relief example, see Linus Bengtsson et al., “Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: A post-earthquake geospatial study in Haiti,” PLoS Medicine, 2011, Volume 8, Number 8, e1001083, doi:10.1371/journal.pmed.1001083.
2Vince Beiser, “Massive biometric project gives millions of Indians an ID,” Wired, August 19, 2011.
4Joanne Linnerooth-Bayer, Reinhard Mechlar, Pablo Suarez, and Marjorie Victor, “Drought insurance for subsistence farmers in Malawi,” Natural Hazards Observer, 2009, Volume 33, Number 5, pp. 1–8.
Text size
Commenting is closed for this article.
Send an e-mail to let us know how we can make our site better.
The Stanford Social Innovation Review is written for and by social change leaders in the nonprofit, business, and government sectors. Sample articles of particular interest to readers of What Matters are available below.
by Ben Hecht. Living Cities is working with five US municipalities to develop an ecosystem for solving urban problems.
by Clayton M. Christensen, Shuman Talukdar, Richard Alton, and Michael B. Horn. Unless clean tech follows well-established rules of innovation and commercialization, the industry’s promise to provide sustainable sources of energy will fail.
Agree. Data Analytics has been left as only a commercial corporate concept for a long time; with Governments only relying on a 5-yearly census to get key statistics.Social data as it is generated from social media is a powerful Risk Management and Social Planning tool. If Governments could setup teams to analyse social data; they could be in a better position to respond to disasters or innovate around the challenges of unemployment and poverty – they could even get to the tangible and fact-based root cause of social challenges in their states. It’s time Governments go quantitative.
Posted 23 December 2011, 03:48 by Mthandeni Mathabela
As the article mentions, the Open Data movement is definitely a step in the right direction. Data analytics needs a lot of ground work. In India, the common data definition across the various departments within the state governments and central government itself is a task at large. I hope the UID project of India has also taken the lead to set the data definitions, which is thought through from a national and cross-functional perspective. Otherwise, we will still talk about e-governance as transfer of certain scanned files.
Posted 16 November 2011, 02:48 by Jacob Varghese