Your photos can hear you. AI and machine learning help researchers get audio from still images and silent videos

by Cody Mello-Klein

September 25, 2023

When you take a photo on your phone, the vibrations of your voice can create tiny bends in the light that are enough to extract audio, according to Kevin Fu, a professor of engineering and computer science at Northeastern University. Photo by Matthew Modoono/Northeastern University

With video calls becoming more common in the age of remote and hybrid workplaces, “mute yourself” and “I think you’re muted” have become part of our everyday vocabularies. But it turns out muting yourself might not be as safe as you think.

Kevin Fu, a professor of electrical and computer engineering and computer science at Northeastern University, has figured out a way to get audio from pictures and even muted videos. Using Side Eye, a machine learning assisted tool that Fu and his research team created, Fu can determine the gender of someone speaking in the room where a photo was taken –– and even the exact words they spoke.

“Imagine someone is doing a TikTok video and they mute it and dub music,” Fu says. “Have you ever been curious about what they’re really saying? Was it ‘Watermelon watermelon’ or ‘Here’s my password’? Was somebody speaking behind them? You can actually pick up what is being spoken off camera.”

Kevin Fu, professor of electrical and computer engineering and computer science at Northeastern. Photo by Matthew Modoono/Northeastern University

It sounds like the stuff of science fiction –– and it is. The idea for Side Eye was inspired by an episode of the sci-fi show “Fringe” that saw the main characters, a team of fringe science investigators working for the FBI, extracting audio from a melted pane of glass.

When the episode aired, one critic for Den of Geek called it a “ridiculous pseudo science technique.” Fu disagreed.

“I was like, ‘I bet we can do that,’” Fu says. “My lab specializes in the impossible. We usually expect the first reaction to anything we do to be ‘You can’t do that,’ and we say, ‘Well, we already did.’”

Side Eye takes advantage of the image stabilization technology that is now virtually standard across most phone cameras. To ensure a shaky hand doesn’t make for a blurry photo, cameras have small springs that hold the lens suspended in liquid. An electromagnet and sensors then push the lens in equal and opposite directions to reduce camera shake.

However, Fu says whenever someone speaks near a camera lens, it causes tiny vibrations in the springs and bends the light ever so slightly. The angle of the light changes almost imperceptibly –– “unless you’re looking for it,” Fu says.

Normally, it would be hard to extract sonic frequency from those microscopic vibrations. But Fu says rolling shutter, a method of photography most phone cameras use today, actually makes it easier to achieve the impossible.

“The way cameras work today to reduce cost basically is they don’t scan all pixels of an image simultaneously –– they do it one row at a time,” Fu says. “[That happens] hundreds of thousands of times in a single photo. What this basically means is you’re able to amplify by over a thousand times how much frequency information you can get, basically the granularity of the audio.”

As long as there is even a little bit of light, Side Eye will work, although the more imagery it has access to, the better. Fu says even a photo pointed at a ceiling would let Side Eye do its thing.

The end result of this process is audio that, even at its best, sounds more like the muffled sound of adults in the Peanuts cartoons. But by using machine learning and training Side Eye on certain words and audio, Fu is able to extract a lot of information.

“If you want to know if I said yes or no, you can train [Side Eye] on people saying yes and no and then look at the patterns and with high confidence when I get an image later know if someone said yes or no,” Fu says.

Side Eye can even identify the exact person who is speaking if it’s been trained on that person’s voice, although Fu says it’s not as accurate when it comes to that just yet.

From a cybersecurity perspective, Side Eye opens up an entirely new world of threats that people and cybersecurity experts should be aware of. However, Fu says the most interesting application for Side Eye could be as a new form of digital evidence for lawyers and others working in the criminal legal system.

“Maybe there’s an alibi and it’s being admitted to court and somebody wants to prove somebody was or wasn’t there,” Fu says. “You might be able to use this technique if you have an authenticated video with a known timestamp to confirm one way or the other. If you hear the person’s voice, they’re more than likely there.”

Cody Mello-Klein is a Northeastern Global News reporter. Email him at c.mello-klein@northeastern.edu. Follow him on Twitter @Proelectioneer.

by Cody Mello-Klein

September 25, 2023

More by Cody Mello-Klein

Shelley Stewart, a global supply chain leader, appointed to Northeastern University Board of Trustees

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

Galaxy clusters could be used as natural dark matter colliders to understand nature of invisible particles

Editor's Picks

What do corporations need to ethically implement AI? Turns out, a philosopher

Business leaders should use human-centered approaches to AI adoption, Northeastern dean says

Expert advice: Coping strategies for navigating the 24-hour news cycle

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Featured Stories

They’re living boulders on the ocean floor. Northeastern research explains the mysterious corallith

Wendy Parmet became a public health giant. In true Northeastern fashion, it started with a co-op

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Northeastern’s Summer Youth Employment Program expands in Oakland, empowering more high school students

What do corporations need to ethically implement AI? Turns out, a philosopher

Business leaders should use human-centered approaches to AI adoption, Northeastern dean says

Have MinuteClinics had their minute? Why retail health clinics are shutting their doors, and what’s next

Can you trust AI-powered search engines like OpenAI’s SearchGPT? Northeastern expert explains why she’s ‘extremely skeptical’

Shelley Stewart, a global supply chain leader, appointed to Northeastern University Board of Trustees

This Northeastern graduate is pioneering women’s leadership in Boston’s real estate development

What do corporations need to ethically implement AI? Turns out, a philosopher

Expert advice: Coping strategies for navigating the 24-hour news cycle

What can Kamala Harris learn from Donald Trump to win the 2024 presidential election?

How soon will pollsters have good data on a Harris-Trump matchup?

Can you trust AI-powered search engines like OpenAI’s SearchGPT? Northeastern expert explains why she’s ‘extremely skeptical’

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Legal scholar Patricia Williams explores race, bodily integrity and law in ‘The Miracle of the Black Leg’

10 books to add to your summer must-read list

Looking for cheese plate inspiration and recipes? This food stylist, connoisseur and influencer built a global community

Have MinuteClinics had their minute? Why retail health clinics are shutting their doors, and what’s next

Job applicants perceive AI-powered hiring process as more fair when it is blind to characteristics such as race or gender, new study finds

Why the Boston Celtics’ sale that could top $4.7 billion signals a booming market for sports franchises

Listeria outbreak linked to deli meats. Those who are pregnant are at severe risk, Northeastern expert warns

Northeastern cannabinoids researcher developing drugs to fight pain and inflammation

New treatments for Alzheimer’s cost tens of thousands of dollars a year. Here’s why

Is joking about Trump’s assassination attempt protected speech? You might not get charged, but you could lose your job, experts say

Can Donald Trump or Joe Biden play whatever music they want at a rally or convention? Legal expert says it’s more complicated

From factories to TikTok, how child labor laws are struggling to keep up with the digital revolution

Efforts to limit fast-food near homes need rethinking, Northeastern researcher says

Nike Dunks, Air Jordans, Yeezy slides: Huskick’s club is all about sneakers

Video: The story and science behind Rupee Beer, a lager designed to be paired with Indian food

From London to Paris: What the 2012 Olympics taught us about urban transformation

Falling out of a coconut tree into a ‘brat summer’ — why Kamala Harris is embracing meme culture

Donald Trump ‘has a new lease on life.’ Can a traumatic event like surviving a shooting change a person’s personality?

Northeastern graduate Fiona Howard named to 2024 U.S. Paralympic dressage team

Northeastern star Mike Sirota goes to the Cincinnati Reds in third round of Major League Baseball draft

Boston Unity Cup partners with Northeastern for international soccer celebration at Carter Playground

Shelley Stewart, a global supply chain leader, appointed to Northeastern University Board of Trustees

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

Galaxy clusters could be used as natural dark matter colliders to understand nature of invisible particles

What do corporations need to ethically implement AI? Turns out, a philosopher

Business leaders should use human-centered approaches to AI adoption, Northeastern dean says

Expert advice: Coping strategies for navigating the 24-hour news cycle

Google’s brand ads are a “sham” but companies have to buy them anyway, new report finds

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine They’re living boulders on the ocean floor. Northeastern research explains the mysterious corallith

.ngn-magazine__shapes {fill: var(--wp--custom--color--emphasize, #000) } .ngn-magazine__arrow {fill: var(--wp--custom--color--accent, #cf2b28) } NGN Magazine Wendy Parmet became a public health giant. In true Northeastern fashion, it started with a co-op

With the help of Northeastern, Tennessee Valley Authority experiments with a new forecast model to better predict extreme rainfalls

Northeastern’s Summer Youth Employment Program expands in Oakland, empowering more high school students

Science & Technology

Recent Stories

They’re living boulders on the ocean floor. Northeastern research explains the mysterious corallith

Wendy Parmet became a public health giant. In true Northeastern fashion, it started with a co-op