Text-to-Speech Engineer
Engineeringfull timeRemotemid level$65k-$95k
About This Role
We're seeking a Text-to-Speech Engineer to design and optimize TTS systems that power our AI-first recruiting platform. You'll work on end-to-end speech synthesis pipelines, from model selection and fine-tuning to deployment and inference optimization, enabling natural voice interactions for thousands of recruiters globally. This is a high-impact role where your work directly improves candidate experience and recruiter efficiency across our B2B SaaS platform.
Requirements
- 3+ years of professional experience in TTS, speech synthesis, or audio ML engineering
- Proficiency in PyTorch or TensorFlow for implementing and fine-tuning speech models
- Strong Python skills with demonstrated experience in production ML systems
- Experience with TTS frameworks (Tacotron, FastSpeech, WaveNet, or similar architectures)
- Understanding of audio processing, signal processing, and spectrograms
- Ability to optimize models for latency and inference efficiency in production environments
Skills
Text-to-Speech (TTS)Speech SynthesisDeep LearningPyTorchAudio ProcessingNeural NetworksPythonModel Optimization
