Site Reliability Engineering Manager
EU engineers, ready to place with your US clients
Pre-screened on AI. Remote B2B contracts. View 5 full profiles free — AI score, skills report, interview questions included.
About This Role
Requirements
- 8+ years of professional experience in SRE, DevOps, or infrastructure engineering roles, with demonstrated impact on system reliability at scale
- 2+ years managing and developing technical teams — you've hired engineers, run 1:1s, and grown people's careers while maintaining your own technical credibility
- Deep hands-on expertise with Kubernetes in production environments — you've debugged pod scheduling issues at 3am, not just followed tutorials
- Strong infrastructure-as-code skills with Terraform or similar tooling, with a focus on building reusable, well-documented modules that teams actually want to use
- Production experience designing and operating monitoring and observability systems (Prometheus, Grafana, Datadog, or equivalent) — you know the difference between metrics, logs, and traces and when to use each
- Proven ability to lead incident response and build blameless post-mortem culture — calm under pressure, analytical when debugging cascading failures
- Track record of defining SLIs/SLOs/SLAs and using them to drive technical decisions and resource allocation, not just compliance checkboxes
- Effective communication skills that translate complex technical tradeoffs into clear recommendations for engineering leadership and product stakeholders
Required Skills
Similar Positions
Candidates may also fit these roles
MLOps Engineer
12 matchedBoston, MA
About the Role You will own the complete ML lifecycle infrastructure that bridges research and production at scale. This is a systems-ownership role where you'…
Platform Engineer
18 matchedAustin, TX
About the Role We're looking for an experienced platform engineer to architect and own the infrastructure foundations that power our engineering organization. …
Cloud Infrastructure Engineer
2 matchedDenver, CO
About the Role You'll design and maintain the production infrastructure that powers our platform at scale. As a core member of our infrastructure team, you'll …
Senior ML Engineer
148 matchedAustin, TX
About the Role We're building ML systems that serve millions of users with sub-100ms latency requirements. As a Senior ML Engineer, you'll own the full lifecyc…
