UW researchers designed a headphone system that translates the speech of several people talking at once, following the speakers as they move and preserving the direction and qualities of their voices. The team built the system, called Spatial Speech Translation, with off-the-shelf noise-cancelling headphones fitted with microphones.

A University of Washington team has developed an artificial intelligence system that lets someone wearing headphones look at a person speaking for three to five seconds to “enroll” them. The system then plays just the enrolled speaker’s voice in real time, even as the pair move around in noisy environments.