Millions of research papers are published in a year. How do scientists keep up?

by Eva Botkin-Kowacki

April 26, 2022

Northeastern doctoral student Alessia Iancarelli works on her recent research in ISEC. Iancarelli is part of the Affective and Brain Sciences lab. Photo by Alyssa Stone/Northeastern University

If you want to be a scientist, you’re going to have to do a lot of reading.

Science is an endeavor focused on building and sharing knowledge. Researchers publish papers detailing their discoveries, breakthroughs, and innovations in order to share those revelations with colleagues. And there are millions of scientific papers each year.

Keeping up with the latest developments in their field is a challenge for researchers at all points of their careers, but it especially affects early-career scientists, as they also have to read the many papers that represent the foundation of their field.

Ajay Satpute, assistant professor of psychology and director of the Affective and Brain Science Lab. Photo by Ruby Wallau/Northeastern University

“It’s impossible to read everything. Absolutely impossible,” Ajay Satpute, director of the Affective and Brain Science Lab and an assistant professor of psychology at Northeastern. “And if you don’t know everything that has happened in the field, there’s a real chance of reinventing the wheel over and over and over again.” The challenge, he says, is to figure out how to train the next generation of scientists economically, balancing the need to read all the seminal papers with training them as researchers in their own right.

That task is only getting more difficult, says Alessia Iancarelli, a Ph.D student studying affective and social psychology in Satpute’s lab. “The volume of published literature just keeps increasing,” she says. “How are scientists able to develop their scholarship in a field given this huge amount of literature?” They have to pick and choose what to read.

But common approaches to that prioritization, Iancarelli says, can incorporate biases and leave out crucial corners of the field. So Iancarelli, Satpute and colleagues developed a machine learning approach to find a better—and less biased—way to make a reading list. Their results, which were published last week in the journal PLOS One, also help reduce gender bias.

“There really is a problem about how we develop scholarship,” Satpute says. Right now, scientists will often use a search tool like Google Scholar on a topic and start from there, he says. “Or, if you’re lucky, you’ll get a wonderful instructor and have a great syllabus. But that’s going to be basically the field through that person’s eyes. And so I think that this really fills a niche that might help create balance and cross-disciplinary scholarship without necessarily having access to a wonderful instructor, because not everyone gets that.”

Alessia Iancarelli, a doctoral student in Northeastern's Affective and Brain Sciences lab. Photo by Alyssa Stone/Northeastern University

The problem with something like Google Scholar, Iancarelli explains, is that it will give you the most popular papers in a field, measured by how many other papers have cited them. If there are subsets of that field that aren’t as popular but are still relevant, the important papers on those topics might get missed with such a search.

Take, for example, the topic of aggression (which is the subject the researchers focused on to develop their algorithm). Media and video games are a particularly hot topic in aggression research, Iancarelli says, and therefore there are a lot more papers on that subset of the field than on other topics, such as the role of testosterone, and social aggression.

So Iancarelli decided to group papers on the topic of aggression into communities. Using citation network analysis, she identified 15 research communities on aggression. Rather than looking at the raw number of times a paper has been cited in another research paper, the algorithm determines a community of papers that tend to cite each other or the same core set of papers. The largest communities it revealed were media and video games, stress, traits and aggression, rumination and displaced aggression, the role of testosterone, and social aggression. But there were also some surprises, such as a smaller community of research papers focused on aggression and horses.

“If you use community detection, then you get this really rich, granular look at the aggression field,” Satpute says. “You have sort of a bird’s-eye-view of the entire field rather than [it appearing that] the field of aggression is basically media, video games, and violence.”

In addition to diversifying the topics featured by using this community approach, the researchers also found that the percentage of articles with women first authors dubbed influential by the algorithm doubled in comparison to when they focused only on total citation counts. (Iancarelli adds there might be some biases baked into that result, as the team couldn’t ask the authors directly about their gender identity and instead had to rely on assumptions based on the author’s name, picture, and any pronouns used to refer to them.)

The team has released the code behind this algorithm so that others can use it and replicate their citation network analysis approach in other fields of research.

For Iancarelli, there’s another motivation: “I would love to use this work to create a syllabus and teach my own course on human aggression. I would really love to base the syllabus on the most relevant papers from each different community to give a true general view of the human aggression field.”

For media inquiries, please contact Shannon Nargi at s.nargi@northeastern.edu or 617-373-5718.

by Eva Botkin-Kowacki

April 26, 2022

More by Eva Botkin-Kowacki

This Northeastern graduate hiked the entire Appalachian Trail—in the winter

Moderna has an Omicron-specific booster shot. Does it change anything?

These fish live in sub-freezing waters. Why are so many getting sick?

Editor's Picks

What do corporations need to ethically implement AI? Turns out, a philosopher

Business leaders should use human-centered approaches to AI adoption, Northeastern dean says

Expert advice: Coping strategies for navigating the 24-hour news cycle

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Featured Stories

They’re living boulders on the ocean floor. Northeastern research explains the mysterious corallith

Wendy Parmet became a public health giant. In true Northeastern fashion, it started with a co-op

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Northeastern’s Summer Youth Employment Program expands in Oakland, empowering more high school students

What do corporations need to ethically implement AI? Turns out, a philosopher

Business leaders should use human-centered approaches to AI adoption, Northeastern dean says

Have MinuteClinics had their minute? Why retail health clinics are shutting their doors, and what’s next

Can you trust AI-powered search engines like OpenAI’s SearchGPT? Northeastern expert explains why she’s ‘extremely skeptical’

Shelley Stewart, a global supply chain leader, appointed to Northeastern University Board of Trustees

This Northeastern graduate is pioneering women’s leadership in Boston’s real estate development

What do corporations need to ethically implement AI? Turns out, a philosopher

Expert advice: Coping strategies for navigating the 24-hour news cycle

What can Kamala Harris learn from Donald Trump to win the 2024 presidential election?

How soon will pollsters have good data on a Harris-Trump matchup?

Can you trust AI-powered search engines like OpenAI’s SearchGPT? Northeastern expert explains why she’s ‘extremely skeptical’

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Legal scholar Patricia Williams explores race, bodily integrity and law in ‘The Miracle of the Black Leg’

10 books to add to your summer must-read list

Looking for cheese plate inspiration and recipes? This food stylist, connoisseur and influencer built a global community

Have MinuteClinics had their minute? Why retail health clinics are shutting their doors, and what’s next

Job applicants perceive AI-powered hiring process as more fair when it is blind to characteristics such as race or gender, new study finds

Why the Boston Celtics’ sale that could top $4.7 billion signals a booming market for sports franchises

Listeria outbreak linked to deli meats. Those who are pregnant are at severe risk, Northeastern expert warns

Northeastern cannabinoids researcher developing drugs to fight pain and inflammation

New treatments for Alzheimer’s cost tens of thousands of dollars a year. Here’s why

Is joking about Trump’s assassination attempt protected speech? You might not get charged, but you could lose your job, experts say

Can Donald Trump or Joe Biden play whatever music they want at a rally or convention? Legal expert says it’s more complicated

From factories to TikTok, how child labor laws are struggling to keep up with the digital revolution

Efforts to limit fast-food near homes need rethinking, Northeastern researcher says

Nike Dunks, Air Jordans, Yeezy slides: Huskick’s club is all about sneakers

Video: The story and science behind Rupee Beer, a lager designed to be paired with Indian food

From London to Paris: What the 2012 Olympics taught us about urban transformation

Falling out of a coconut tree into a ‘brat summer’ — why Kamala Harris is embracing meme culture

Donald Trump ‘has a new lease on life.’ Can a traumatic event like surviving a shooting change a person’s personality?

Northeastern graduate Fiona Howard named to 2024 U.S. Paralympic dressage team

Northeastern star Mike Sirota goes to the Cincinnati Reds in third round of Major League Baseball draft

Boston Unity Cup partners with Northeastern for international soccer celebration at Carter Playground

This Northeastern graduate hiked the entire Appalachian Trail—in the winter

Moderna has an Omicron-specific booster shot. Does it change anything?

These fish live in sub-freezing waters. Why are so many getting sick?

What do corporations need to ethically implement AI? Turns out, a philosopher

Business leaders should use human-centered approaches to AI adoption, Northeastern dean says

Expert advice: Coping strategies for navigating the 24-hour news cycle

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine They’re living boulders on the ocean floor. Northeastern research explains the mysterious corallith

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine Wendy Parmet became a public health giant. In true Northeastern fashion, it started with a co-op

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Northeastern’s Summer Youth Employment Program expands in Oakland, empowering more high school students

Science & Technology

Recent Stories

They’re living boulders on the ocean floor. Northeastern research explains the mysterious corallith

Wendy Parmet became a public health giant. In true Northeastern fashion, it started with a co-op