ML Reliability Engineer
Engineeringfull timeRemotemid level$85k-$125k
About This Role
We're seeking an ML Reliability Engineer to build and maintain the infrastructure that powers our AI-first recruiting platform. You'll own the reliability, monitoring, and deployment pipelines for machine learning models in production, ensuring our B2B SaaS platform delivers consistent, high-quality predictions at scale. This is a critical role for a growth-stage startup where your work directly impacts recruiter productivity and hiring outcomes across Europe.
Requirements
- Design and implement monitoring systems for ML model performance, data drift, and production anomalies
- Build and maintain CI/CD pipelines for model training, validation, and deployment workflows
- Develop infrastructure-as-code solutions for reproducible ML environments using Docker and Kubernetes
- Collaborate with data scientists and backend engineers to operationalize machine learning models
- Troubleshoot production ML system failures and implement preventive reliability measures
- Write clean, well-tested Python code for ML infrastructure and automation tools
Skills
MLOpsModel MonitoringPythonKubernetesCI/CD PipelinesData Pipeline ArchitectureML Model DeploymentCloud Infrastructure
