Voice Acting and ADR for Vertical Micro-Dramas: Complete Guide

The mix engineer flagged it mid-session. Three scenes from a rooftop shoot, wind noise sitting under the dialogue, just audible enough to flatten the emotional charge of every line. The production audio was technically usable. On a studio monitor it passed. On a phone speaker in a quiet room, the wind competed with the performance at exactly the frequency range where the actor's delivery carried its weight.

Those three scenes went to ADR. The re-recorded dialogue took four hours. The scenes that made it into the final cut held their emotional impact on device. The scenes that did not go to ADR did not.

That decision, when to go to ADR, how to run the session, what the performance needs to do differently in a studio than it did on set, is one of the least documented craft areas in vertical drama production. This guide covers it completely.

Why Audio Performance Is Different in This Format

Vertical drama is a close-up format. The 9:16 frame places the viewer's eye directly on the actor's face for the majority of every episode. What the frame does visually, the audio has to do sonically: create intimacy, immediacy, and emotional charge in 90 seconds on a phone speaker.

That combination, close-up frame, phone speaker, 90-second runtime, creates a specific performance and audio requirement that neither film nor television fully prepares actors or sound teams for.

In film, the audience is at distance. Theatrical projection and calibrated cinema sound systems carry emotional nuance across a large room. In television, broadcast loudness standards and home speaker systems provide more headroom than a phone speaker. In vertical drama, the viewer is holding the device. The speaker is six inches from their face. Every breath, every slight change in vocal texture, every moment of controlled restraint in a performance, all of it either lands on that speaker or it does not.

Productions that understand this calibrate their voice performance direction and their ADR workflow to the device. Productions that do not deliver audio that sounds correct in a studio and wrong where viewers actually watch.

Voice Performance Direction: What the Format Requires

Before ADR is ever needed, the on-set performance direction has to be right. The voice performance in vertical drama is not the same as film performance, and it is not the same as theatre.

Controlled Intensity, Not Projection

The most common performance error from actors new to the format is projection, the instinct to push volume and energy outward, which reads as theatrical on a close-up phone frame. Vertical drama performance is inward. The intensity lives in the face and in the voice's texture, not in its volume.

On set, this means directing actors to find the emotional charge of a scene at a volume level that would feel slightly under-energised for theatrical performance. The camera and the phone speaker together amplify what is there. The actor who projects into the scene pushes the emotional register past where the device can carry it cleanly.

Clarity on Every Syllable

The dialogue in vertical drama is direct, high-stakes, and fast. Every line carries story weight. A single syllable lost to poor articulation or a rushed delivery can lose a viewer in a format where there is no time to recover context. On set, sound recordists flag lines where articulation is compromised. In ADR, the priority is to land every word with full clarity, not theatrical over-enunciation, but clean, precise delivery that survives phone speaker playback in ambient noise.

Matching Performance Energy Across Takes

The biggest challenge in ADR for vertical drama is not technical sync. It is performance consistency. An actor who delivered a confrontation scene on set with a specific emotional temperature needs to return to that temperature in a studio, watching themselves on a monitor, without the physical presence of the scene, the other actors, or the production energy. ADR gives actors the chance to refine delivery, add nuance, or completely shift tone, but it can also lose the spontaneous emotional quality of the original if the session is not directed carefully.

The session direction job in vertical drama ADR is to get the actor back into the emotional state of the original performance, not to ask them to improve on it. A technically cleaner delivery that is emotionally flatter than the original is a worse result than a slightly noisy original that carries the scene's charge.

When to Schedule ADR: The Decision Framework

Not every scene needs ADR. Productions that send everything to ADR create unnecessary post-production cost. Productions that under-schedule ADR arrive at delivery with compromised dialogue that fails the platform acquisition audio review.

Always Schedule ADR For

Scenes shot in locations with unavoidable ambient noise: busy streets, outdoor locations with wind or traffic, locations with HVAC systems that could not be silenced. The test is simple. Play the production audio on a phone in a quiet room. If any background element competes with dialogue intelligibility, the scene goes to ADR.

Scenes where the on-set recording captured a technically correct but emotionally flat performance. ADR is a second performance opportunity. If the actor delivered a paywall episode cliffhanger scene that works structurally but does not hit the emotional register the scene requires, the ADR session is where it gets fixed.

Scenes with script changes after principal photography. Vertical drama's compressed production timeline sometimes surfaces dialogue issues, a line that reads fine on the script page but delivers wrong, or a plot point that requires clarification that was not in the original shooting script. ADR allows those lines to be fixed without a pickup shoot.

Do Not Schedule ADR For

Scenes where the production audio is clean and the performance is strong. Running every scene through ADR regardless of whether it needs it creates performance flatness across the series. Studio-recorded ADR has a different acoustic quality from location audio, and mixing ADR with clean production audio requires careful room tone management to avoid audible inconsistency between episodes.

Running the ADR Session: The Practical Process

A vertical drama ADR session has specific requirements that differ from film ADR in format and pace.

Prepare the Cue Sheet in Advance

The cue sheet lists every line going to ADR with its timecode, the original line, any revised line, and the reason it is being re-recorded. For a 70-episode series, the cue sheet organisation determines whether the session runs efficiently or wastes time locating material. Group lines by actor and by episode, not by chronological order. An actor re-recording scattered lines from 40 different episodes in random sequence loses performance consistency.

Play the Original Performance Before Re-Recording

The actor watches the scene on a monitor and hears their original performance before attempting the ADR line. This is not optional. The original performance establishes the emotional temperature and delivery rhythm the re-recording has to match. Asking an actor to cold-record a replacement line without hearing what they originally did produces a different performance, not a better one.

Three-Beep Sync Method

The standard ADR sync method: the actor hears three beeps before the line, watches the picture, and records the replacement dialogue to sync with their lip movement on screen. For vertical drama specifically, where close-up framing makes lip movement highly visible, sync accuracy is not optional. A re-recorded line that is even slightly off-sync breaks the close-up and reads as obviously post-produced on the viewer's phone.

Room Tone Matching

The ADR recording environment has to have its room tone matched to the production audio. A line re-recorded in a dead studio and cut against location audio with ambient room tone creates an audible discontinuity. Record 30 seconds of room tone in the ADR space and use it to fill the acoustic environment of the re-recorded lines in the final mix. This is a step that production teams on tight post schedules frequently skip. It shows in the final audio.

Localization Dubbing: The Second Audio Pipeline

DramaBox operates in 84 markets. Holywater's MyDrama platform uses AI for dubbing and subtitles as part of its international distribution pipeline. A vertical drama series that performs in the English-language market has significant revenue upside in Spanish, French, Portuguese, and other language versions, but only if the localization audio holds the same emotional quality as the original.

Localization dubbing for vertical drama is a distinct discipline from standard film dubbing. The format's compressed runtime and close-up framing creates specific requirements.

Lip-Sync Precision Matters More Than in Long-Form

In a 90-minute film, a line that is slightly off-sync reads as normal. In a 75-second episode where the entire runtime is close-up, an off-sync dubbed line reads as wrong immediately. Localization dubbing for vertical drama requires the same sync precision as original-language ADR: three-beep method, frame-accurate sync, and a post-sync quality pass on every line before the dubbed master is locked.

Dub Adaptation, Not Direct Translation

Direct word-for-word translation of vertical drama dialogue rarely lip-syncs correctly because different languages have different syllable structures and speech rhythms. A Spanish dub of a line that runs three syllables in English may need six syllables in the translation, which breaks the sync. The translation must become a dub adaptation, rewriting the line in the target language to carry the same emotional content while matching the timing and mouth movement of the original. This is specialist work, not standard translation.

Voice Casting for Localization

The voice actor in the dubbed version is the lead performance for every viewer in that language market. Casting the wrong voice, one that does not match the emotional register of the original performance, produces a dubbed version that underperforms relative to the original regardless of production quality. The CEO romance lead needs the same controlled intensity in Spanish that the English-language performance delivers. The localization voice cast should be cast from auditions, not selected from a roster.

AI Tools in the ADR and Dubbing Pipeline

Holywater's February 2026 acquisition of Jeynix, an AI-VFX studio specialising in facial animation, face replacement, and lip-sync, signals where the professional end of the vertical drama market is moving. AI lip-sync tools that match dubbed audio to original lip movement without requiring the dubbed actor to precisely match the original performance timing are compressing the localization pipeline significantly.

AI Audio Cleanup Before ADR Decisions

Tools like Adobe Podcast and iZotope RX can salvage production audio that would otherwise require full ADR re-recording. A scene with moderate background noise that Adobe Podcast's Enhance Speech processes to clean dialogue may not need ADR at all. Running all production audio through AI cleanup before making ADR decisions reduces the ADR session scope and the associated cost.

AI Dubbing for Secondary Language Markets

For language markets where full professional dubbing sessions are not budget-justified, AI dubbing tools, ElevenLabs, Resemble AI, and platform-specific tools, generate dubbed audio tracks that can serve secondary distribution windows. The quality ceiling of AI dubbing is currently below professional human dubbing for primary market releases, but for seventh or eighth language market versions of a series, AI dubbing delivers a viable localised product at a fraction of the human session cost.

The honest limitation: AI dubbing tools in 2026 produce results that read as AI-generated to viewers in markets with high dubbing sensitivity, France, Germany, Spain, Latin America. For primary market localization in these territories, human voice actors remain the correct choice. AI dubbing is most appropriate for markets where viewers are less sensitive to dubbing quality or where the content is a secondary distribution window.

Axis AI Studios Perspective

ADR is where audio quality problems that were created on set get fixed, or do not. Productions that budget for ADR as a planned post-production stage arrive at delivery with clean, emotionally consistent audio across every episode. Productions that treat ADR as an emergency measure only spend more time and money fixing problems under deadline pressure than a planned ADR session would have cost.

The vertical drama format amplifies this more than any other production type. The close-up frame and the phone speaker delivery environment mean that audio quality issues that would be minor in long-form content are significant in 90-second close-up episodes. A single scene with compromised dialogue in episode six can affect platform acquisition review of the entire series if the audio is a consistent signal of production quality.

The ADR decision framework is simple: test every scene on a phone in a quiet room before the edit is locked. Flag every scene where background noise competes with dialogue. Flag every scene where the performance is technically correct but emotionally flat relative to what the script requires. Schedule those scenes for ADR before the post-production clock runs down. The cost of a half-day ADR session is significantly lower than the cost of a rejected platform submission that requires a post-delivery fix.

Common ADR Mistakes in Vertical Drama Production

Scheduling ADR After the Picture Lock Deadline

ADR that runs against a delivery deadline produces rushed performance and inadequate sync passes. Build ADR into the post-production schedule at the rough cut stage, not after picture lock.

Not Directing the Actor in the ADR Session

A sound engineer running cues without directorial input produces technically synced ADR that is emotionally disconnected. Every ADR session for vertical drama needs a director present, or at minimum, a director's note on every line about the emotional temperature required.

Over-Recording ADR

Sending every scene to ADR regardless of production audio quality creates two problems: cost overrun and performance flatness. Selective ADR on the scenes that genuinely need it preserves the production energy of location recording where it is clean.

Ignoring Room Tone in the Dub

A dubbed audio track that sounds acoustically different from the production audio scenes creates jarring inconsistency across the episode. Match room tone in every ADR and dubbing session.

Using AI Dubbing for Primary Market Localization Without a Quality Review

AI dubbing tools generate audible artifacts that professional voice actors do not. A primary market dubbing release, Spanish, French, Portuguese, that uses AI dubbing without a human quality review pass risks audience rejection in markets that are sensitive to dubbing quality.


FAQ

How Many ADR Sessions Does a Typical Vertical Drama Series Require?

It varies by production environment and pre-production discipline. A well-planned production shot in controlled locations with proper sound coverage may require ADR on 10–20% of scenes. A production with significant outdoor shooting, crowd scenes, or locations with uncontrollable ambient noise may require ADR on 40–60% of scenes. The investment in location sound recording quality during principal photography directly determines how much of the budget gets redirected to ADR in post.

Can Actors Record ADR Remotely for Vertical Drama Productions?

Remote ADR is technically feasible using platforms like Source-Connect or Cleanfeed, which allow real-time studio-quality recording with the actor in a different location. The limitation for vertical drama specifically is sync precision. Remote ADR sessions are harder to sync to frame accuracy than in-person sessions because of latency and the difficulty of the actor precisely matching their lip movement on a remote monitor. For lines where sync precision matters, close-up dialogue scenes, in-person ADR produces more reliable results. For off-camera lines and non-lip-synced scenes, remote ADR is viable and significantly more cost and schedule-efficient.

How Does ADR Affect the Localization Pipeline for International Distribution?

ADR-recorded dialogue creates a cleaner audio asset for the localization pipeline than production audio. The localization team receives clean, noise-free dialogue tracks without background competing with the voice. This simplifies the dub adaptation process and reduces the localization post-production workload. Productions that plan for international distribution from the start separate their audio stems, dialogue, music, effects, at the ADR and mix stage, which gives localization teams full control over the dialogue layer without having to separate it from a combined mix.

Vertical drama audio quality lives or dies in the six inches between the phone speaker and the viewer's ear. ADR is not a fallback for productions that failed to capture clean audio on set. It is a planned post-production tool that closes the gap between what location recording can deliver and what phone playback requires.

Schedule it early. Direct it properly. Test the result on device before the delivery master is locked.


Further Reading

For the IP decisions that determine which language markets a localized audio track can actually be distributed in, the IP licensing guide for vertical drama adaptation covers territory rights and exclusivity structures.

For the production costs context, where ADR and dubbing sit as line items inside a full series budget, the vertical drama production costs breakdown covers every tier with real figures.

For the DramaBox acquisition criteria that make localization and multi-language delivery increasingly relevant for any series targeting their 84-market footprint, the DramaBox platform breakdown covers their content model in full.

Stay connected

For studios moving beyond traditional production.

Let's set
the new standard together.

If you're working on something, we'd like to hear about it.