Audio Is Half the Edit: Fixing Bad Sound Without a Reshoot

Audio Is Half the Edit: Fixing Bad Sound Without a Reshoot

Most edits are let down by audio, not visuals. Here's our full process for producing clean, professional sound from any recording.

5 min read

a silhouette of a person
Imagine watching a video with technically perfect visuals but with muffled, echoey, inconsistent audio. Now imagine watching a video with average visuals but with crystal-clear, well-mixed sound. Almost everyone arrives at the same conclusion: the second video feels significantly more professional. And yet editors consistently spend 80 percent of their time on picture and only 20 percent on sound.
Bad audio doesn't just sound unpleasant. It makes the viewer work harder. The brain must spend continuous energy trying to decipher muffled words, filter out background noise, and compensate for inconsistent volume levels. Viewers rarely identify audio as the source of their discomfort — they simply know they want to stop watching, without understanding why.
The first step in any audio cleanup process is eliminating filler sounds and unnecessary silence from the dialogue track. Every um, every uh, every long breath, and every pause longer than a quarter of a second should be removed. A dialogue track cleaned of its verbal fillers feels dramatically tighter and more confident, and the whole edit inherits that energy.
Noise reduction addresses the background layer beneath the dialogue. Capturing a short sample of pure room tone from a quiet moment in the clip gives the algorithm a reference to work from. This approach handles air conditioning hum, camera noise, and ambient room buzz without introducing the unnatural artefacts that come from applying noise reduction blindly.
A high-pass filter at 80 to 100 hertz removes the low-frequency rumble that microphones pick up from vibration and movement. A slight boost in the presence range between 2 and 5 kilohertz brings forward the consonants and clarity frequencies that make speech intelligible. Gentle compression evens out the loudness differences between the loudest and softest passages.
Normalising the final mix to negative 14 LUFS is the standard for most streaming and social platforms. Background music should sit between negative 20 and negative 25 LUFS when dialogue is present — felt emotionally, but never competing with the spoken content for the listener's attention.

splicify

Address

234 Market Street



San Francisco, CA 94103 United States

234 Market Street



San Francisco, CA 94103 United States

Create a free website with Framer, the website builder loved by startups, designers and agencies.