DX ASSIST
Smart Tool for Preliminary Dialogue Editing

Version 1.1.0 is ready, featuring a fully functional Trial and a redesigned Vertical Processing feature
Key Features of DX Assist:


DX Assist is an advanced software solution designed for dialog editors and sound engineers, automating the initial dialogue editing process by removing non-dialogue elements from regions. With cutting-edge AI algorithms and support for the AAF format, DX Assist streamlines workflow, significantly reducing manual editing time.

  • Automatic Dialogue Separation – Cut regions with silence, noise, and unwanted sounds while preserving speech.

  • AAF Format Support – Processes AAF files while maintaining track organization for efficient editing.

  • Multi-Language Compatibility – Works with dialogue in multiple languages, making it ideal for global productions.

  • Adjustable Processing Parameters – Provides customizable settings to fine-tune the editing process to your needs.

  • Designed for Professionals – Optimized for film, TV, podcast, You Tube and game audio post-production.

Discover better way to start your Dialogue Edit

Set parameters to fit  quality of production sound

Set a few key parameters to customize the algorithm to your needs.

It is based on AI algorithms and detects human speech to cut out unwanted sounds - and you can adjust its level of precision.

You can change the way it works to obtain separate words or sentences.

You can add a few frames at the beginning and the end, which helps with applying fades.

With the VERTICAL PROCESSING parameter, DX Assist selects the best source by removing regions with crosstalk from other microphones.

Select tracks to process.

Select the tracks you want to process. Deselect tracks with music or sounds, focusing the algorithm only on dialogues reducing processing time.

You can use it on many platforms.

Pro Tools, Nuendo, Samplitude or Final Cut. Just import AAF from DX Assist.

In version 1.1.0 of the software, you can choose which algorithms to run by selecting the corresponding checkboxes.
Horizontal Processing scans the tracks from start to finish of the AAF file, non-destructively removing regions that do not contain human speech. This process uses an advanced speech recognition model.
Vertical Processing, on the other hand, analyzes tracks vertically — across layers — and keeps only the best available versions.
You can use each process independently or combine them for optimal results.

Probability determines how strict the speech recognition model is when evaluating audio.
Lower values (e.g. 10–20%) allow breaths and subtle vocal sounds to be kept, but may also leave in unwanted noise or fragments. Higher values (around 70–80%) retain only clearly recognized speech — ideal for clean and well-recorded dialogue.

Separation Level defines the threshold at which the Vertical Processing algorithm decides to keep multiple audio signals. If the value is set to 0, only the single best signal will be retained. If it's set to 5, the algorithm will keep additional sounds whose loudness difference falls within that range. This gives you control over how selective the process should be — whether you prefer to let the algorithm decide everything, or retain more options for manual decision-making later.

Minimum Strip Gap is a parameter that defines the minimum number of frames between regions required to keep them separate. Lower values will result in many short, individual regions — sometimes even single words.
Higher values will merge nearby regions, preserving more fluid, continuous sentences. This setting can be adjusted according to your personal preference and editing style.

Strip Start & End Pad defines how many frames are added to the beginning and end of each region that remains after Horizontal Processing.Higher values will extend the preserved regions, which may reduce the precision of the cuts made by the algorithm — especially in dense dialogue sequences.

Workflows


Here is an example workflow for a full dialogue editing process — from receiving materials from the picture editor to the initial dialogue edit using DXAssist.
The process includes assembling audio from the recorder and then processing it with DXAssist.


The video below demonstrates how the Vertical Processing feature works in DXAssist.


Below is a video that shows how the parameters in DXAssist work and how they affect the resulting material.