Orply.

ElevenLabs Shows Voice Isolator Cleaning Noisy iPhone Audio in Seconds

ElevenLabsFriday, May 8, 20264 min read

ElevenLabs presents Voice Isolator, a tool inside ElevenCreative, as a fast way to salvage noisy recordings that cannot be reshot. In the tutorial, the company demonstrates a single workflow on an iPhone recording made on a London street: upload or drag in the file, click send, and play back an isolated voice track. The presenter says the street noise is removed and the file is processed within nine seconds, while interviews, podcasts, social clips, audio files and video files are named as broader use cases.

One-click cleanup inside ElevenCreative

ElevenLabs presents Voice Isolator as a repair path for recordings whose content is worth keeping even when the capture environment was poor: an interview, podcast, phone clip, or social video with background noise. The tool sits inside ElevenCreative, where Voice Isolator appears in the workspace navigation alongside tools such as Instant speech and Audiobook.

The workflow shown is deliberately narrow: open Voice Isolator, provide media, send it for processing, and review the cleaned result. The upload page is labeled “Drop files here,” and the interface also shows options to “Upload media files” and “Record audio.” ElevenLabs says users can upload an audio file with bad background noise, upload a video file directly, record inside the tool, or drag and drop a file into the upload area.

That flexibility is part of the product pitch, but the on-screen demonstration uses one file. Video upload, podcast cleanup, multi-person interviews, and social clips are described as use cases. The worked example is a noisy iPhone audio recording made on a London street.

What the London street test actually shows

The file used for the cleanup is named “iphone Recording in the Street.m4a.” It is dragged into the Voice Isolator interface, then played before processing under the on-screen label “Original audio.”

The original sample begins: “So right now, I’m currently walking through the back streets of London.” After playback, the presenter describes the clip as having “a lot of background noise from the street.” The example is therefore not a studio take with a small flaw. It is a phone recording captured while walking outdoors, used to stand in for the kind of clip a creator may not be able to reshoot.

The processing step is a single action: click “send.” The interface then shows the uploaded file in a “Processing” state. A visible “28s” appears on the processing screen, while the presenter says ElevenLabs starts processing immediately and finishes “within nine seconds.”

9 seconds
processing time the presenter reports for the iPhone street recording

The on-screen sequence separates three versions of the same clip: the noisy source, the isolated voice, and the isolated voice with music added afterward.

State shownWhat it representsWhat the source claims
Original audioThe unprocessed iPhone street recordingThe clip contains a lot of street background noise
Voice Isolator audioThe processed recording after Voice IsolatorThe background noise is removed
Voice Isolator audio + ElevenMusicThe isolated voice paired with background musicThe result is presented as sounding like the intended original
The three audio states used in the before-and-after comparison

The repair is presented as automatic, not adjustable

After processing, the same recording plays again under the label “Voice Isolator audio.” The cleaned sample repeats the London street line and continues: “So right now, I’m currently walking through the back streets of London. And I had this idea,”

The presenter’s evaluation is direct: “there is now no background noise at all and the audio is fixed.” In this example, ElevenLabs is not merely saying the street noise is reduced. The claim made over the processed playback is that the background noise has been removed enough for the recording to be used.

The demonstration uses one iPhone street recording; the other use cases are described rather than shown. ElevenLabs says Voice Isolator works for interviews, podcasts, quick social clips, audio files, and video files. The walkthrough itself shows a single before-and-after pass on an outdoor phone recording.

The demo also does not show detailed controls, manual noise profiling, track separation settings, or comparisons across different kinds of noise. The operation is presented as a fast automated process: bring in the file, send it, and play back the isolated result.

Music is added after the voice has been isolated

The final version of the clip is labeled “Voice Isolator audio + ElevenMusic.” The same cleaned spoken line plays again, now paired with background music. The presenter says that once the isolated audio is combined with “some nice background music,” it becomes “impossible to notice that this wasn’t the original recording.”

That last step is a finishing layer rather than the repair mechanism. Voice Isolator is shown removing the background noise first. ElevenMusic is then added to make the recovered clip feel more complete. In practical terms, the source separates salvage from polish: isolate the voice to fix the noisy recording, then add music if the finished asset needs production texture.

The strongest use case ElevenLabs gives is an important clip that cannot be reshot. The closing examples are interviews and podcasts where “the audio messes up a little.” The tool is framed less as a general audio editor and more as a quick recovery path for captured material whose value is in the words, not the recording conditions.

The frontier, in your inbox tomorrow at 08:00.

Sign up free. Pick the industry Briefs you want. Tomorrow morning, they land. No credit card.

Sign up free