Remote

LLM Evaluation Engineer

Engineering · Full-time · Mid-level · $65k–$95k
AI Screened · Remote B2B · EU Talent Pool · 1 applicant
About This Role

We're seeking an LLM Evaluation Engineer to join our AI-first recruiting platform. In this role, you'll design and execute rigorous evaluation frameworks for large language models, develop benchmark datasets, and analyze model performance across recruitment-specific tasks. You'll work directly with our product and ML teams to ensure our AI systems deliver accurate, fair, and reliable candidate assessments at scale.

Responsibilities

  • Develop and maintain LLM evaluation metrics and benchmark datasets
  • Design A/B testing frameworks for model performance comparison
  • Write clean, production-grade Python code for evaluation pipelines
  • Analyze model outputs for bias, hallucination, and accuracy across recruitment use cases
  • Collaborate with ML engineers to implement evaluation findings into model improvements
  • Document evaluation methodologies and create clear performance reports
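
To make the first responsibility concrete, a benchmark-driven evaluation metric can be sketched in a few lines of Python. This is an illustrative minimal example, not this team's actual pipeline; the names `EvalCase`, `exact_match_score`, and `stub_model` are hypothetical, and in practice the stub would be replaced by a real LLM call:

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class EvalCase:
    prompt: str
    expected: str  # reference answer for this benchmark item

def exact_match_score(cases: List[EvalCase], predict: Callable[[str], str]) -> float:
    """Fraction of benchmark cases where the model output exactly matches
    the reference (case- and whitespace-insensitive)."""
    hits = sum(
        1 for c in cases
        if predict(c.prompt).strip().lower() == c.expected.strip().lower()
    )
    return hits / len(cases)

# Toy benchmark dataset and a stub standing in for a real model call.
benchmark = [
    EvalCase("Is Python a programming language? (yes/no)", "yes"),
    EvalCase("Is 2 + 2 equal to 5? (yes/no)", "no"),
]

def stub_model(prompt: str) -> str:
    return "yes" if "Python" in prompt else "no"

print(exact_match_score(benchmark, stub_model))  # 1.0 on this toy set
```

Real evaluation pipelines layer richer metrics on the same shape (per-category breakdowns, bias probes, hallucination checks against retrieved context), but the core loop of scoring model outputs against a curated benchmark stays the same.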

Required Skills

  • LLM evaluation frameworks
  • Prompt engineering
  • Python
  • Data annotation & labeling
  • Statistical analysis
  • Model benchmarking
  • RAG systems
  • NLP