Does Big Data have the flu?

by Angela Herring

March 13, 2014

These days, when people start feeling a fever and a sore throat coming on, often times their first move isn’t to the medicine cabinet. Instead, it’s to a computer or smartphone to Google their symptoms.

These queries, which make up only a tiny fraction of the more than 7 billion total queries the search engine handles each day, are all stored by Google. The company uses this data for a variety of reasons; it can help Google improve its search results for users—which also boosts the company’s bottom line—and can also benefit the population as a whole in other ways.

One example of the latter is Google Flu Trends, a statistical model developed by engineers at Google.org—the company’s foundational arm—in an effort to “now-cast” what’s happening with the flu on any given day.

But research has shown that GFT often misses its target. These results led Northeastern University network scientists and their colleagues to take a closer look at how Big Data should be used to advance scientific research. Their report was published online Thursday in the journal Science.

“Big Data have enormous scientific possibilities,” said Northeastern professor David Lazer. “But we have to be aware that most Big Data aren’t designed for scientific purposes.” Fully achieving Big Data’s enthusiastically lauded potential, he added, requires a synthesis of both computer science approaches to data as well as traditional approaches from the social sciences.

The paper was co-authored by Lazer, who holds joint appointments in the Department of Political Science and the College of Computer and Information Science; Alessandro Vespignani, the Sternberg Family Distinguished University Professor of Physics at Northeastern who has joint appointments in the College of Science, Bouvé College of Health Sciences, and the College of Computer and Information Science; Northeastern visiting research professor of political science Ryan Kennedy; and Gary King, a professor in the Harvard University Department of Government.

Northeastern network science researchers David Lazer (left) and Alessandro Vespignani (right) examine how Big Data can best be utilized for scientific gain in a report published online on Thursday in the journal Science. Photos by Brooks Canaday.

“In a sense, Google Flu Trends is not bad, but it’s no better than any basic approach to time series prediction,” Vespignani said. “So the issue is in the claims and the disregard of other techniques or data more than the actual result.”

In their paper, the researchers explain where Google Flu Trends went wrong and examine how the research community can best utilize the outputs of Big Data companies as well as how those companies should participate in the research effort.

By incorporating lagged data from the Centers for Disease Control and Prevention as well as making a few simple statistical tweaks to the model, Lazer said, the GFT engineers could have significantly improved their results. But in a companion report also released Thursday on the Social Science Research Network—an online repository of scholarly research and related materials—Lazer and his colleagues show that an updated version of GFT, which came about in response to a 2013 Nature article revealing GFT’s limitations, does little better than its predecessor.

While Big Data certainly holds great promise for research, Lazer said, it will only be successful if the methods and data are made—at least partially—accessible to the community. But that so far has not been the case with Google.

“Google wants to contribute to science but at the same time does not follow scientific praxis and the principles of reproducibility and data availability that are crucial for progress,” Vespignani said. “In other words they want to contribute to science with a black box, which we cannot fully scrutinize and understand.”

If scientists are to “stand on the shoulders of giants,” as the old adage requires for moving knowledge forward, they will need some help from the giants, Lazer said. Otherwise failures like that with Google Flu Trends will be rampant, with the potential to tarnish our understanding of anything from stock market trends to the spread of disease.

Editor's Picks

What time is it on the moon? We may soon know, thanks to NASA project

A Black soldier receives a full military funeral — 83 years after his death. Here is his story

Northeastern researcher exposes child labor trafficking as a hidden crime after investigating 132 victims

‘Right problem, wrong solution.’ TikTok raises legitimate privacy concerns, but ban may be the wrong geopolitical move, experts say

We’re addicted to ‘true crime’ stories. This class investigates why

Featured Stories

From Northeastern to Iraq, groundbreaking brigadier general now a leader of veterans services in Massachusetts

Northeastern dean and distinguished professor join latest cohort of American Academy of Arts and Sciences

These Northeastern graduates are improving our neighborhoods one tree at a time

The FTC has banned non-compete agreements. What does that mean for workers, the economy and your paycheck?

What time is it on the moon? We may soon know, thanks to NASA project

A Black soldier receives a full military funeral — 83 years after his death. Here is his story

Harvey Weinstein’s New York rape conviction was overturned. But is a retrial really a good idea?

The FTC has banned non-compete agreements. What does that mean for workers, the economy and your paycheck?

Northeastern’s Roux Institute announces 10 health care technology startups for second year-long mentorship program

From Northeastern to Iraq, groundbreaking brigadier general now a leader of veterans services in Massachusetts

Northeastern University announces speakers for global campus commencements, and college and school ceremonies

What time is it on the moon? We may soon know, thanks to NASA project

Will the US ban the use of single-use plastics like England, India, Hong Kong and other countries?

Why Microsoft is opening an AI office in London and what its challenges will be

What time is it on the moon? We may soon know, thanks to NASA project

‘Right problem, wrong solution.’ TikTok raises legitimate privacy concerns, but ban may be the wrong geopolitical move, experts say

Is AI revolutionizing rehabilitation care? This Northeastern expert is digging deep on the issue

Award-winning student film serves as ‘homage’ to friendship and Northeastern’s graduating students

Does Hollywood have a pain problem? Researchers study Netflix and find that depictions of adolescent pain in TV and movies could be reinforcing stereotypes

From right swipe to writing: How this Northeastern professor wrote a book with a fellow entrepreneur she met on a dating app

The FTC has banned non-compete agreements. What does that mean for workers, the economy and your paycheck?

She went from marketing exec and part-time singer to opening her own art studio — while leaning on her Northeastern MBA

Start Summit at Northeastern’s Portland campus focuses on inclusivity and welcoming new entrepreneurs to Maine

Drinking water in communities of color is more likely to be contaminated by ‘forever chemicals,’ research finds

At hospital co-op, this Northeastern student is helping bridge the gap between neonatal care and research

Malaria and maternity wards: This Northeastern student balances medical research and hospital work during Ghana co-op

Harvey Weinstein’s New York rape conviction was overturned. But is a retrial really a good idea?

The FTC has banned non-compete agreements. What does that mean for workers, the economy and your paycheck?

‘Right problem, wrong solution.’ TikTok raises legitimate privacy concerns, but ban may be the wrong geopolitical move, experts say

This co-op at a Napa Valley winery teaches students about wine ‘from grape to bottle’

Efforts to limit fast-food near homes need rethinking, Northeastern researcher says

Nike Dunks, Air Jordans, Yeezy slides: Huskick’s club is all about sneakers

Northeastern researcher exposes child labor trafficking as a hidden crime after investigating 132 victims

What is eldest daughter syndrome? Is it a real condition?

O.J. Simpson is dead. How the former NFL star’s double-murder trial captured the nation’s attention

Overheated or dehydrated after the Boston Marathon? These Northeastern physical therapy students will help you recover

March Madness is coming to a peak. Will collegiate basketball superstar Caitlin Clark maintain her momentum as she moves on to the WNBA?

Federal sports betting bill is introduced with assist from Northeastern’s Public Health Advocacy Institute

The thinking behind gender stereotypes

How to secure the cloud

What’s wiping out the Caribbean corals?

What time is it on the moon? We may soon know, thanks to NASA project

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine A Black soldier receives a full military funeral — 83 years after his death. Here is his story

Northeastern researcher exposes child labor trafficking as a hidden crime after investigating 132 victims

‘Right problem, wrong solution.’ TikTok raises legitimate privacy concerns, but ban may be the wrong geopolitical move, experts say

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine We’re addicted to ‘true crime’ stories. This class investigates why

From Northeastern to Iraq, groundbreaking brigadier general now a leader of veterans services in Massachusetts

Northeastern dean and distinguished professor join latest cohort of American Academy of Arts and Sciences

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine These Northeastern graduates are improving our neighborhoods one tree at a time

The FTC has banned non-compete agreements. What does that mean for workers, the economy and your paycheck?

Science & Technology

Recent Stories

A Black soldier receives a full military funeral — 83 years after his death. Here is his story

We’re addicted to ‘true crime’ stories. This class investigates why

These Northeastern graduates are improving our neighborhoods one tree at a time