Text-to-Speech Engineer

Engineeringfull timeRemotemid level$65k-$95k

About This Role

We're seeking a Text-to-Speech Engineer to design and optimize TTS systems that power our AI-first recruiting platform. You'll work on end-to-end speech synthesis pipelines, from model selection and fine-tuning to deployment and inference optimization, enabling natural voice interactions for thousands of recruiters globally. This is a high-impact role where your work directly improves candidate experience and recruiter efficiency across our B2B SaaS platform.

Requirements

3+ years of professional experience in TTS, speech synthesis, or audio ML engineering
Proficiency in PyTorch or TensorFlow for implementing and fine-tuning speech models
Strong Python skills with demonstrated experience in production ML systems
Experience with TTS frameworks (Tacotron, FastSpeech, WaveNet, or similar architectures)
Understanding of audio processing, signal processing, and spectrograms
Ability to optimize models for latency and inference efficiency in production environments

Skills

Text-to-Speech (TTS)Speech SynthesisDeep LearningPyTorchAudio ProcessingNeural NetworksPythonModel Optimization

Text-to-Speech Engineer

About This Role

Requirements

Skills

Check your profile with AI