Intermediate · 10 min

Edit Video by Editing Text in Descript

Descript flips editing on its head. It transcribes your video, and you edit the transcript like a document. Delete a sentence and the matching footage disappears. For talking-head and voiceover work, where the cut follows the words, this is usually much faster than scrubbing a timeline.
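To make the idea concrete, here is a minimal Python sketch of how deleting words in a transcript can translate into footage cuts. It assumes each transcribed word carries a start and end time. The `Word` class and `keep_ranges` function are hypothetical names for illustration, not Descript's internals or API.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds into the source footage
    end: float


def keep_ranges(words, deleted):
    """Merge the time spans of surviving words into contiguous keep ranges.

    `deleted` is a set of word indexes removed from the transcript.
    """
    ranges = []
    prev_kept = False
    for i, word in enumerate(words):
        if i in deleted:
            prev_kept = False
            continue
        if prev_kept:
            # Extend the current range to cover this adjacent kept word.
            ranges[-1] = (ranges[-1][0], word.end)
        else:
            # A deletion (or the start of the clip) precedes this word: new range.
            ranges.append((word.start, word.end))
        prev_kept = True
    return ranges


words = [Word("Hello", 0.0, 0.4), Word("um", 0.4, 0.9), Word("world", 0.9, 1.5)]
print(keep_ranges(words, deleted={1}))
```

Each returned pair is a stretch of footage that survives the edit. Everything between the pairs is what a text-based editor would cut.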

Step 1: Import and transcribe

Drop your footage or voiceover into Descript and let it transcribe. The accuracy is high enough that you can edit directly from the text in most cases.

Step 2: Cut filler by deleting words

Highlight the rambling sentence, delete it, and the video tightens automatically. Run the Remove Filler Words command to strip out filler such as um, uh, like, and you know in one pass.

Descript - transcript editor
You
So, um, today I want to, like, talk about three things that, you know, matter.
Agent
After Remove Filler Words: 'Today I want to talk about three things that matter.' Video cut to match.
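For intuition only, the cleanup shown above can be approximated with a few lines of Python. This is a naive sketch, not how Descript detects filler: the `FILLERS` list is an assumption chosen to match the example, and a plain word match like this would also remove legitimate uses of "like" or "so".

```python
import re

# Hypothetical filler list chosen to match the example; not Descript's own list.
FILLERS = ["you know", "um", "uh", "like", "so"]

# A filler as a whole word or phrase, plus any comma and spacing around it.
_FILLER_RE = re.compile(
    r",?\s*\b(?:" + "|".join(re.escape(f) for f in FILLERS) + r")\b,?\s*",
    flags=re.IGNORECASE,
)


def remove_fillers(text: str) -> str:
    # Replace each filler (and its surrounding commas) with a single space.
    cleaned = _FILLER_RE.sub(" ", text)
    # Collapse the leftover whitespace and trim the ends.
    cleaned = " ".join(cleaned.split())
    # Re-capitalize the first letter, since the original opener may be gone.
    return cleaned[:1].upper() + cleaned[1:]


line = "So, um, today I want to, like, talk about three things that, you know, matter."
print(remove_fillers(line))
# -> Today I want to talk about three things that matter.
```

A real tool works from the audio and word timings rather than from punctuation, which is why the video can be cut to match.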

Step 3: Fix mistakes with Overdub

Misspoke a word? Type the correction into the transcript and Descript's voice model can regenerate that snippet in your own trained voice. Use it for small fixes, not whole scripts, so the result stays natural.

Studio Sound is your audio rescue
Recorded in a noisy room? Studio Sound removes background hum and reverb and lifts the voice to sound close to a treated booth. Apply it before you export.

Step 4: Export to your finishing editor

Descript is excellent for the dialogue cut but lighter on motion graphics. Export the cleaned sequence and bring it into CapCut for captions, b-roll, and the final polish.

descript - export
$apply: studio sound + remove filler words
transcript edits applied to timeline
$export: 1080p mp4 + .srt captions
saved -> exports/interview-cut.mp4
$
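If the finishing editor needs the captions adjusted, it helps to know what is inside the .srt file: numbered cues, each with a `HH:MM:SS,mmm --> HH:MM:SS,mmm` time line followed by the caption text. The Python sketch below builds that layout from (start, end, text) tuples. The function names and the sample cue timing are illustrative assumptions, not output from Descript.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, e.g. 3.5 -> 00:00:03,500."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues) -> str:
    """Render (start, end, text) tuples as numbered SRT cues."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


# Sample cue with made-up timing, for illustration only.
cues = [(0.0, 3.5, "Today I want to talk about three things that matter.")]
print(to_srt(cues))
```

Because the format is plain text, caption timing can be checked or corrected in any text editor before the file goes into the finishing tool.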

Result: a tight, filler-free dialogue cut with clean audio, produced by editing text instead of frames. For interviews and voiceover, this is often the biggest speed gain in the whole pipeline.

Hands-on tasks