Multimodal AI Engineer
Engineering | Full-time | Remote | Mid-level | $85k-$130k
About This Role
We're seeking a Multimodal AI Engineer to design and deploy machine learning systems that process text, image, and audio data for our AI-first recruiting platform. You'll own the full lifecycle of multimodal model development, from architecture selection and training to production optimization, working with large language models, vision transformers, and audio processing pipelines. This is a high-impact role where you'll directly improve candidate matching and recruiter intelligence using multimodal AI techniques.
Requirements
- 3+ years building and deploying machine learning models in production environments
- Strong proficiency in PyTorch or TensorFlow for multimodal architectures
- Hands-on experience with LLMs, vision transformers, or multimodal fusion techniques
- Solid understanding of model optimization, inference acceleration, and serving at scale
- Experience designing experiments, evaluating metrics, and iterating on model performance
- Ability to write clean, testable code and collaborate effectively with product and backend teams
Skills
Multimodal Learning, Large Language Models (LLMs), Computer Vision, PyTorch, Model Fine-tuning, Production ML, Audio Processing, ML Ops
