Representation Chizzler

Two-stage audio processing: VAD-based speech extraction followed by MP-SENet denoising. Use the Single File tab for ad-hoc processing or the Dataset tab to clean and publish a dataset to the Hugging Face Hub.

Upload Audio File

VAD Threshold (higher = stricter voice detection; range 0.1 to 0.9)
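As a rough illustration of why a higher threshold is stricter, the sketch below labels frames as speech when a per-frame speech probability meets the threshold. The function name `speech_frames` and the probability values are hypothetical; the page does not say which VAD model is used or how it scores frames.

```python
def speech_frames(probabilities, threshold):
    """Mark each frame as speech (True) when its probability meets the threshold.

    Assumption: the VAD yields one speech probability per frame in [0, 1].
    """
    return [p >= threshold for p in probabilities]


# Hypothetical per-frame speech probabilities.
probs = [0.2, 0.55, 0.8, 0.95]

lenient = speech_frames(probs, 0.5)  # more frames kept as speech
strict = speech_frames(probs, 0.9)   # only the most confident frame kept
```

With these made-up values, raising the threshold from 0.5 to 0.9 reduces the frames kept as speech from three to one.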

Max Silence Gap (seconds; range 1 to 10)
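The page does not state how the silence gap setting is applied. One plausible reading is that speech segments separated by a pause no longer than the gap are joined into a single segment. The sketch below shows that interpretation only; `merge_segments` and the timestamps are hypothetical, not the tool's actual code.

```python
def merge_segments(segments, max_gap_s):
    """Join (start, end) speech segments whose separating silence is <= max_gap_s.

    Assumption: segments are given in seconds; this is one possible meaning
    of "Max Silence Gap", not a documented behavior.
    """
    merged = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] <= max_gap_s:
            # Short pause: extend the previous segment instead of starting a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Hypothetical VAD output: a 0.5 s pause, then a 4 s pause.
segments = [(0.0, 1.0), (1.5, 2.0), (6.0, 7.0)]
result = merge_segments(segments, max_gap_s=1.0)
```

Under this reading, a larger gap keeps more of the natural pauses inside a single speech segment, while a smaller one splits the audio more aggressively.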

Normalize volume

Target loudness (dBFS; range -35 to -10)

Max boost (dB; range 0 to 30)

Max attenuation (dB; range 0 to 20)
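The three normalization settings can be read together as a clamped gain: the gain needed to reach the target loudness, limited by the maximum boost and maximum attenuation. The sketch below assumes loudness is measured as RMS level in dBFS over float samples in [-1.0, 1.0]; the function names and example values are illustrative assumptions, since the page does not describe how loudness is measured.

```python
import math


def rms_dbfs(samples):
    """RMS level in dBFS for float samples in [-1.0, 1.0] (assumed measure)."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0.0:
        return float("-inf")
    return 10.0 * math.log10(mean_square)


def clamped_gain_db(current_dbfs, target_dbfs, max_boost_db, max_atten_db):
    """Gain toward the target, limited to [-max_atten_db, +max_boost_db]."""
    if math.isinf(current_dbfs):
        return 0.0  # pure silence: leave untouched rather than boost noise
    wanted = target_dbfs - current_dbfs
    return max(-max_atten_db, min(max_boost_db, wanted))


def apply_gain(samples, gain_db):
    """Scale samples by a gain expressed in dB."""
    factor = 10.0 ** (gain_db / 20.0)
    return [s * factor for s in samples]


# A quiet clip at -50 dBFS with a -20 dBFS target would need +30 dB,
# but a 20 dB max boost caps the applied gain.
gain = clamped_gain_db(-50.0, -20.0, max_boost_db=20.0, max_atten_db=10.0)
```

Capping the boost avoids amplifying the noise floor of very quiet recordings, and capping the attenuation keeps loud clips from being turned down more than intended.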

Original Audio

VAD Processed (Speech Only)

Final Denoised

Processing Details