Multimodal AI Engineer

Engineering | Full-time | Remote | Mid-level | $85k-$130k

About This Role

We're seeking a Multimodal AI Engineer to design and deploy machine learning systems that process text, image, and audio data for our AI-first recruiting platform. You'll own the full lifecycle of multimodal model development, from architecture selection and training to production optimization, working with large language models, vision transformers, and audio processing pipelines. Your work will directly improve candidate matching and recruiter intelligence.

Requirements

3+ years building and deploying machine learning models in production environments
Strong proficiency in PyTorch or TensorFlow for multimodal architectures
Hands-on experience with LLMs, vision transformers, or multimodal fusion techniques
Solid understanding of model optimization, inference acceleration, and serving at scale
Experience designing experiments, evaluating metrics, and iterating on model performance
Ability to write clean, testable code and collaborate effectively with product and backend teams

Skills

Multimodal Learning, Large Language Models (LLMs), Computer Vision, PyTorch, Model Fine-tuning, Production ML, Audio Processing, ML Ops
