← Back to jobsApply for this position
Innodatainc
Audio Engineer
engineeringfull-timeRemote - United States
SALARY
$120k – $160k/yr
WORK TYPE
remote
JOB TYPE
full-time
INDUSTRY
ai
✦ AutoApply Let us apply to roles like this on your behalf.
Learn more
About the role
Scope of the Role
Innodata produces the voice and audio datasets behind the world's leading speech AI. We're hiring an Audio Engineer to own the technical heart of that work: the signal chain, the post-processing recipe, the technical specifications, and the consistent acoustic quality — the “sound signature” — that defines an Innodata dataset.
You'll ensure every hour of audio we deliver, across dozens of languages and recording conditions, meets a precise and consistent technical bar. You'll design the recipe, build the validation, and continuously push the quality and efficiency of how we capture and process audio.
What You’ll Own
- Own the end-to-end audio signal chain and post-processing pipeline for all collection programs.
- Define and document technical specifications: sample rates, bit depth, formats, loudness (LUFS) targets, noise floors, channel configurations.
- Design and maintain the “Innodata sound signature” — a consistent, spec-compliant acoustic profile across studio, remote, real-world, and telephonic captures
- Build technical QA: automated and manual checks that validate audio against spec before delivery.
- Specify and validate recording setups for vendors and remote contributors (signal-chain testing in a small in-house studio).
- Partner with the Solutions Architect to translate customer acoustic requirements into achievable technical recipes.
- Drive tooling: help select and configure recording/QA/processing tools; automate where possible.
- Troubleshoot acoustic and signal issues across diverse capture environments.
You’ll Thrive in This Role If You Have
- Strong audio engineering background: signal chain, recording, post-processing, mastering, and acoustic QA.
- Deep fluency in audio technical specs (sample rate, bit depth, LUFS, formats, codecs) and the ability to define and enforce them.
- Experience producing consistent audio quality across varied recording conditions and locations.
- Comfort with audio tooling and automation (scripting for batch processing/QA is a strong plus).
- Precision and process orientation — you care about consistency at scale, not just one great recording.
- Experience with speech/voice data for AI/ML (TTS or ASR datasets).
- Familiarity with multilingual recording and remote/distributed capture.
- Knowledge of speech quality metrics and how acoustic choices affect downstream model performance.
- Scripting (Python) for audio processing pipelines (e.g., ffmpeg, sox, pydub, librosa).
✦ Let us apply for you
We find roles like this and apply on your behalf. Cover letter written for each one. Plans from $14.99/mo. Cancel anytime.
Join waitlist