Pivots Hiring

Multimodal AI Engineer

Engineeringfull timeRemotemid level$85k-$130k

About This Role

We're seeking a Multimodal AI Engineer to design and deploy machine learning systems that process text, image, and audio data for our AI-first recruiting platform. You'll own the full lifecycle of multimodal model development—from architecture selection and training to production optimization—working with large language models, vision transformers, and audio processing pipelines. This is a high-impact role where you'll directly improve candidate matching and recruiter intelligence using cutting-edge multimodal AI techniques.

Requirements

  • 3+ years building and deploying machine learning models in production environments
  • Strong proficiency in PyTorch or TensorFlow for multimodal architectures
  • Hands-on experience with LLMs, vision transformers, or multimodal fusion techniques
  • Solid understanding of model optimization, inference acceleration, and serving at scale
  • Experience designing experiments, evaluating metrics, and iterating on model performance
  • Ability to write clean, testable code and collaborate effectively with product and backend teams

Skills

Multimodal LearningLarge Language Models (LLMs)Computer VisionPyTorchModel Fine-tuningProduction MLAudio ProcessingML Ops

Check your profile with AI