Close Menu
Tech Savvyed
  • Home
  • News
  • Artificial Intelligence
  • Gadgets
  • Apps
  • Mobile
  • Gaming
  • Accessories
  • More
    • Web Stories
    • Spotlight
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

Don’t let these 3 hidden streaming movie gems in May 2025 fly under your radar

10 May 2025

Key ChatGPT and Gemini features compared. Who did it better?

10 May 2025

Audio gear prices are climbing, but for how long?

10 May 2025
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Tech Savvyed
SUBSCRIBE
  • Home
  • News
  • Artificial Intelligence
  • Gadgets
  • Apps
  • Mobile
  • Gaming
  • Accessories
  • More
    • Web Stories
    • Spotlight
    • Press Release
Tech Savvyed
Home » AI headphones driven by Apple M2 can translate multiple speakers at once
News

AI headphones driven by Apple M2 can translate multiple speakers at once

News RoomBy News Room10 May 20254 Mins Read
Share
Facebook Twitter Reddit Telegram Pinterest Email

Google’s Pixel Buds wireless earbuds have offered a fantastic real-time translation facility for a while now. Over the past few years, brands such as Timkettle have offered similar earbuds for business customers. However, all these solutions can only handle one audio stream at once for translation. 

The folks over at the University of Washington (UW) have developed something truly remarkable in the form of AI-driven headphones that can translate the voice of multiple speakers at once. Think of it as a polyglot in a crowded bar, able to understand the speech of people around him, speaking in different languages, all at once. 

The team is referring to their innovation as a Spatial Speech Translation, and it comes to life courtesy of binaural headphones. For the unaware, binaural audio tries to simulate sound effects just the way human ears perceive them naturally. To record them, mics are placed on a dummy head, apart at the same distance as human ears on each side. 

The approach is crucial because our ears don’t only hear sound, but they also help us gauge the direction of its origin. The overarching goal is to produce a natural soundstage with a stereo effect that can provide a live concert-like feel. Or, in the modern context, spatial listening. 

The work comes courtesy of a team led by Professor Shyam Gollakota, whose prolific repertoire includes apps that can put underwater GPS on smartwatches, turning beetles into photographers, brain implants that can interact with electronics, a mobile app that can hear infection, and more. 

How does multi-speaker translation work?

“For the first time, we’ve preserved the sound of each person’s voice and the direction it’s coming from,” explains Gollakota, currently a professor at the institute’s Paul G. Allen School of Computer Science & Engineering.

The team likens their stack to a radar, as it kicks into action by identifying the number of speakers in the surroundings, and updating that number in real-time as people move in and out of the listening range. The whole approach works on-device and doesn’t involve sending user voice streams to a cloud server for translation. Yay, privacy!

In addition to speech translation, the kit also “maintains the expressive qualities and volume of each speaker’s voice.” Morever, directional and audio intensity adjustments are made as the speaker moves across the room. Interestingly, Apple is also said to be developing a system that allows the AirPods to translate audio in real-time.

How does it all come to life?

The UW team tested the AI headphones’ translation capabilities in nearly a dozen outdoor and indoor settings. As far as performance goes, the system can take, process, and produce translated audio within 2-4 seconds. Test participants appeared to prefer a delay worth 3-4 seconds, but the team is working to speed up the translation pipeline.

Phil Nickinson wearing the Sonos Ace headphones.

So far, the team has only tested Spanish, German, and French language translations, but they’re hopeful of adding more to the pool. Technically, they condensed blind source separation, localization, real-time expressive translation, and binaural rendering into a single flow, which is quite an impressive feat.

As far as the system goes, the team developed a speech translation model capable of running in real-time on an Apple M2 silicon, achieving real-time inference. Audio duties were handled by a pair of Sony’s noise-cancelling WH-1000XM4 headphones and a Sonic Presence SP15C binaural USB mic.

And here’s the best part. “The code for the proof-of-concept device is available for others to build on,” says the institution’s press release. That means the scientific and open-source tinkering community can learn and base more advanced projects on the foundations laid out by the UW team. 











Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleThe best video game trailers of all time
Next Article HP’s smallest gaming PC with RTX 3050 is on sale today at $150 off

Related Articles

Don’t let these 3 hidden streaming movie gems in May 2025 fly under your radar

10 May 2025

Key ChatGPT and Gemini features compared. Who did it better?

10 May 2025

Audio gear prices are climbing, but for how long?

10 May 2025

HP’s smallest gaming PC with RTX 3050 is on sale today at $150 off

10 May 2025

The best video game trailers of all time

10 May 2025

Charlize Theron says Uma Thurman should have won an Oscar for Kill Bill

10 May 2025
Demo
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo
Don't Miss

Key ChatGPT and Gemini features compared. Who did it better?

By News Room10 May 2025

Table of Contents Table of Contents Search  Shopping Deep research  The AI industry has blossomed…

Audio gear prices are climbing, but for how long?

10 May 2025

HP’s smallest gaming PC with RTX 3050 is on sale today at $150 off

10 May 2025

AI headphones driven by Apple M2 can translate multiple speakers at once

10 May 2025
Tech Savvyed
Facebook X (Twitter) Instagram Pinterest
  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact
© 2025 Tech Savvyed. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.