Pick a job to read the details

Tap any role on the left — its description and apply link will open here.

Research Engineer – Benchmarking, Evals & Failure Analysis

Mercor · San Francisco

Apply now

Engineering San Francisco Posted Apr 8, 2026 130,000 – 500,000 USD

About Mercor

Mercor's mission is to organize human intelligence to power the AI economy. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development. Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $2 million a day.

Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society. Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our San Francisco, NYC, or London offices.

About the Role

As a Research Engineer at Mercor, you’ll work at the intersection of engineering and applied AI research. You’ll own benchmarking pipelines, evaluation systems, and failure analysis workflows that directly inform how we train and improve frontier language models.
Your work will define how we measure tool use, agentic behavior, and real-world reasoning. You’ll design and run evals, build rubrics and scorers, and turn failure analysis into actionable improvements for post-training, RLVR, and data pipelines.

What You’ll Do

Benchmarking: Design, implement, and maintain benchmarks and metrics for tool use, agentic behavior, and real-world reasoning; ensure benchmarks scale with training and stay aligned with product and research goals.
Evaluation systems: Build and operate LLM evaluation systems end-to-end runs, scoring, dashboards, and reporting, so researchers and applied AI teams can track model performance and compare runs at scale.
Failure analysis: Run systematic failure analysis on model outputs (e.g., wrong tool use, reasoning errors, safety/alignment issues); categorize failure modes, quantify prevalence, and feed findings into reward design, data curation, and benchmark design.
Rubrics and evaluators: Create and refine rubrics, automated evaluators, and scoring frameworks that drive training and evaluation decisions; balance rigor with scalability (human vs. model-as-judge, calibration, agreement).
Data quality and usability: Quantify data usability, quality, and impact on key benchmarks; use evals and failure analysis to guide data generation, augmentation, and curation.
Cross-team collaboration: Work with AI researchers, applied AI teams, and data producers to align evals with training objectives and to prioritize benchmarks and failure analyses that matter most.
Ownership in a fast-paced environment: Operate in a high-iteration research setting with strong ownership of benchmarks, evals, and failure-analysis workflows.

What We’re Looking For

Strong applied research background, with focus on model evaluation, benchmarking, and/or failure analysis.
Strong coding skills and hands-on experience with ML models and evaluation code.
Solid grasp of data structures, algorithms, and backend systems.
Comfort with APIs, SQL/NoSQL, and cloud platforms for running and storing eval results.
Ability to reason about model behavior, experimental results, and data quality from evals and failure analyses.
Excitement to work in person in San Francisco five days a week in a high-intensity, high-ownership environment.

Nice To Have

Industry experience on a post-training or evaluation/benchmarking team (highest priority).
Publications at top-tier venues (NeurIPS, ICML, ACL), especially in evaluation or benchmarking.
Experience building or running LLM evaluations, benchmarks, or failure-analysis pipelines.
Experience with synthetic data generation, rubric design, or RL-style workflows that use evals for reward shaping.
Work samples or code (e.g., eval frameworks, benchmark suites, failure-analysis reports or tooling) that demonstrate relevant skills.

Benefits

Bi-annual performance bonus structure
Generous equity grant vested over 4 years
Up to $15k Relocation bonus
$10K housing bonus (if you live within 0.5 miles of our office)
$1.5K monthly stipend for meals
Free Equinox membership
$200 monthly laundry reimbursement
$200 monthly personal wellness reimbursement
Health, Dental, Vision insurance

Ready to apply?

Apply to Mercor

Mercor

View all jobs →

Program Manager (Compliance/Audits)

TechOp Solutions International · Arlington, Virginia, United States

Apply now

Project / Program Mgmt Arlington, Virginia, United States Posted Jun 10, 2026

Ready to apply?

Apply to TechOp Solutions International

TechOp Solutions International

View all jobs →

Tool Coordination Lead

Valard Construction · Headingley, Manitoba, Canada

Apply now

Uncategorized Headingley, Manitoba, Canada Posted Jun 10, 2026

Ready to apply?

Apply to Valard Construction

Valard Construction

View all jobs →

Professional Land Surveyor / City Surveyor

BKF · Bakersfield, California, United States

Apply now

Construction San Francisco, California, United States Los Angeles, California, United States Riverside, California, United States Sacramento, California, United States San Diego, California, United States Bakersfield, California, United States Posted Jun 10, 2026

Ready to apply?

Apply to BKF

BKF

View all jobs →

Hingham, EEC Certified Pre-K Teacher

JCC Greater Boston · Hingham, Massachusetts, United States

Apply now

Education Hingham, Massachusetts, United States Posted Jun 10, 2026

Ready to apply?

Apply to JCC Greater Boston

JCC Greater Boston

View all jobs →

Senior Project/Product Manager

pubGENIUS · Brazil

Apply now

Product Portugal Poland Romania Brazil Spain Posted Jun 10, 2026

Ready to apply?

Apply to pubGENIUS

pubGENIUS

View all jobs →

Director, Engineering Project Management

RAVE Aerospace LLC · Brea, California, United States

Apply now

Engineering Mgmt Brea, California, United States Posted Jun 10, 2026

Ready to apply?

Apply to RAVE Aerospace LLC

RAVE Aerospace LLC

View all jobs →

Certified Personal Coach - Northwest Las Vegas

GOLFTEC · Las Vegas, Nevada, United States

Apply now

Uncategorized Las Vegas, Nevada, United States Posted Jun 10, 2026

Ready to apply?

Apply to GOLFTEC

GOLFTEC

View all jobs →

Infant Teacher

Action Day Schools · San Jose, California, United States

Apply now

Education San Jose, California, United States Mountain View, California, United States Posted Jun 10, 2026

Ready to apply?

Apply to Action Day Schools

Action Day Schools

View all jobs →

Resolutions Analyst

Concord Servicing · Chandler, Arizona, United States

Apply now

Data / Analytics Chandler, Arizona, United States Posted Jun 10, 2026

Ready to apply?

Apply to Concord Servicing

Concord Servicing

View all jobs →

Paraprofessional

CCRES, Educational & Behavioral Health Services · Hummelstown, Dauphin County, United States

Apply now

Uncategorized Hummelstown, Dauphin County, United States Posted Jun 10, 2026

Ready to apply?

Apply to CCRES, Educational & Behavioral Health Services

CCRES, Educational & Behavioral Health Services

View all jobs →

Paraprofessional

CCRES, Educational & Behavioral Health Services · Kennett Square, Chester County, United States

Apply now

Uncategorized Kennett Square, Chester County, United States Posted Jun 10, 2026

Ready to apply?

Apply to CCRES, Educational & Behavioral Health Services

CCRES, Educational & Behavioral Health Services

View all jobs →

Revenue Operation - Strategy & Planning

WATI.io · Hong Kong, Hong Kong, Hong Kong

Apply now

Uncategorized Hong Kong, Hong Kong, Hong Kong Posted Jun 10, 2026

Ready to apply?

Apply to WATI.io

WATI.io

View all jobs →

Accountant - Remote

Sleek · Philippines

Apply now

Finance / Accounting Philippines Posted Jun 10, 2026

Ready to apply?

Apply to Sleek

Sleek

View all jobs →

Customer Success - Singapore Clients

Sleek · Singapore

Apply now

Customer Support / Success Singapore Posted Jun 10, 2026

Ready to apply?

Apply to Sleek

Sleek

View all jobs →

Accountant - Fully Remote (Must be UK-Based)

Sleek · United Kingdom

Apply now

Finance / Accounting United Kingdom Posted Jun 10, 2026

Ready to apply?

Apply to Sleek

Sleek

View all jobs →

LinkedIn Sales & Business Development Representative | Remote SA

HireHawk · Cape Town, Western Cape, South Africa

Apply now

Sales / BD Cape Town, Western Cape, South Africa Pretoria, Gauteng, South Africa Johannesburg, Gauteng, South Africa Durban, KwaZulu-Natal, South Africa Gqeberha, Eastern Cape, South Africa Bloemfontein, Free State, South Africa Posted Jun 10, 2026

Ready to apply?

Apply to HireHawk

HireHawk

View all jobs →

LinkedIn Sales & Business Development Representative | Remote PH

HireHawk · Cagayan Valley, Philippines

Apply now

Sales / BD Cagayan Valley, Philippines Central Visayas, Philippines Davao Region, Philippines Metro Manila, Philippines Calabarzon, Philippines Posted Jun 10, 2026

Ready to apply?

Apply to HireHawk

HireHawk

View all jobs →

Lead Generation Specialist & Outreach Specialist (Remote) | LATAM

HireHawk · Bogotá, Bogota, Colombia

Apply now

Social Services Bogotá, Bogota, Colombia Mexico City, Mexico City, Mexico Buenos Aires, Buenos Aires, Argentina Managua, Managua, Nicaragua Lima, Callao Region, Peru Mexicali, Baja California, Mexico Posted Jun 10, 2026

Ready to apply?

Apply to HireHawk

HireHawk

View all jobs →

Residential Security Officer- Dodington Park Estate, Gloucestershire

Weybourne · Dodington, England, United Kingdom

Apply now

Security Dodington, England, United Kingdom Posted Jun 10, 2026

Ready to apply?

Apply to Weybourne

Weybourne

View all jobs →

Residential Security Officer- Ballynatray, County Waterford

Weybourne · Waterford, Munster, Ireland

Apply now

Security Waterford, Munster, Ireland Posted Jun 10, 2026

Ready to apply?

Apply to Weybourne

Weybourne

View all jobs →

Business Analyst - Integration Platform

Vista Group · Auckland, Auckland, New Zealand

Apply now

Data / Analytics Auckland, Auckland, New Zealand Posted Jun 10, 2026

Ready to apply?

Apply to Vista Group

Vista Group

View all jobs →

Senior Software Engineer

Vista Group · Auckland, Auckland, New Zealand

Apply now

Engineering Auckland, Auckland, New Zealand Posted Jun 10, 2026

Ready to apply?

Apply to Vista Group

Vista Group

View all jobs →

Test Engineer

Vista Group · Auckland CBD, Auckland, New Zealand

Apply now

QA / Test Auckland CBD, Auckland, New Zealand Posted Jun 10, 2026

Ready to apply?

Apply to Vista Group

Vista Group

View all jobs →

Country Sales Manager ( Digital )

YMT · Manila, Metro Manila, Philippines

Apply now

Sales / BD Manila, Metro Manila, Philippines Posted Jun 10, 2026

Ready to apply?

Apply to YMT

YMT

View all jobs →

Motion Graphics Editor (Contract)

Jump 450 Media · United States

Apply now

Content / Writing United States Posted Jun 10, 2026

Ready to apply?

Apply to Jump 450 Media

Jump 450 Media

View all jobs →

Night Manager

City Wide Facility Solutions · Kansas City, Missouri, United States

Apply now

Uncategorized Kansas City, Missouri, United States Posted Jun 10, 2026

Ready to apply?

Apply to City Wide Facility Solutions

City Wide Facility Solutions

View all jobs →

CNA or HHA- Full-time, Philadelphia/ Montgomery county

KeystoneCare · Philadelphia, Pennsylvania, United States

Apply now

Healthcare / Medical Philadelphia, Pennsylvania, United States Posted Jun 10, 2026

Ready to apply?

Apply to KeystoneCare

KeystoneCare

View all jobs →

Math Department Chair

BelovED Community & Empowerment Academy Charter Schools · Jersey City, New Jersey, United States

Apply now

Uncategorized Jersey City, New Jersey, United States Posted Jun 10, 2026

Ready to apply?

Apply to BelovED Community & Empowerment Academy Charter Schools

BelovED Community & Empowerment Academy Charter Schools

View all jobs →

Associate Recruiter

Lago · Colombia

Apply now

People / HR / Talent Colombia Brazil Argentina Nicaragua Honduras El Salvador Posted Jun 10, 2026

Ready to apply?

Apply to Lago

Lago

View all jobs →

Account Manager

Envalior · Shanghai, Shanghai, China

Apply now

Sales / BD Shanghai, Shanghai, China Shenzhen, Guangdong Province, China Guangzhou, Guangdong Province, China Posted Jun 10, 2026

Ready to apply?

Apply to Envalior

Envalior

View all jobs →

Interpreter - Legal/Courts (Oregon)

Prisma International, Inc. · Portland, Oregon, United States

Apply now

Uncategorized Portland, Oregon, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Khmer Interpreter (Onsite)

Prisma International, Inc. · Seattle, Washington, United States

Apply now

Uncategorized Seattle, Washington, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Spanish Interpreter (Simultaneous) - Court-Certified

Prisma International, Inc. · Sacramento, California, United States

Apply now

Uncategorized Sacramento, California, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Translator - Certified

Prisma International, Inc. · Sacramento, California, United States

Apply now

Uncategorized Sacramento, California, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Translator - Pacific Island Languages

Prisma International, Inc. · Honolulu, Hawaii, United States

Apply now

Uncategorized Vancouver, Washington, United States Honolulu, Hawaii, United States San Diego, California, United States Salt Lake City, Utah, United States Las Vegas, Nevada, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Interpreter (OPI & VRI - Legal)

Prisma International, Inc. · Pierre, South Dakota, United States

Apply now

Uncategorized Minneapolis, Minnesota, United States Milwaukee, Wisconsin, United States Pierre, South Dakota, United States Bismarck, North Dakota, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Interpreter (OPI) - Government Services

Prisma International, Inc. · New York, New York, United States

Apply now

Uncategorized New York, New York, United States Florida, United States Chicago, Illinois, United States Minneapolis, Minnesota, United States Los Angeles, California, United States Houston, Texas, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Interpreter (Simultaneous) - Court-Certified

Prisma International, Inc. · Sacramento, California, United States

Apply now

Uncategorized Sacramento, California, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Court Interpreter (New Hampshire)

Prisma International, Inc. · Concord, New Hampshire, United States

Apply now

Uncategorized Concord, New Hampshire, United States Posted Jun 10, 2026

Ready to apply?

Apply to Prisma International, Inc.

Prisma International, Inc.

View all jobs →

Dental Hygienist | Atlanta Area

United Dental Corporation · Stockbridge, Georgia, United States

Apply now

Healthcare / Medical Stockbridge, Georgia, United States Posted Jun 10, 2026

Ready to apply?

Apply to United Dental Corporation

United Dental Corporation

View all jobs →

Hygienist | Phoenix East Valley

United Dental Corporation · Phoenix, Arizona, United States

Apply now

Uncategorized Phoenix, Arizona, United States Posted Jun 10, 2026

Ready to apply?

Apply to United Dental Corporation

United Dental Corporation

View all jobs →

Hygienist | Phoenix Metro West

United Dental Corporation · Phoenix, Arizona, United States

Apply now

Uncategorized Phoenix, Arizona, United States Posted Jun 10, 2026

Ready to apply?

Apply to United Dental Corporation

United Dental Corporation

View all jobs →

Hygienist | Phoenix North Central

United Dental Corporation · Phoenix, Arizona, United States

Apply now

Uncategorized Phoenix, Arizona, United States Posted Jun 10, 2026

Ready to apply?

Apply to United Dental Corporation

United Dental Corporation

View all jobs →

Head of Operations: Production

ARVO · Klapmuts, Western Cape, South Africa

Apply now

Operations / Strategy Klapmuts, Western Cape, South Africa Posted Jun 10, 2026

Ready to apply?

Apply to ARVO

ARVO

View all jobs →

Special Projects Engineer

Andromeda Robotics · South Melbourne, Victoria, Australia

Apply now

Engineering South Melbourne, Victoria, Australia Posted Jun 10, 2026

Ready to apply?

Apply to Andromeda Robotics

Andromeda Robotics

View all jobs →

Director of US Insights and Commercial Analytics (ICA) - Job ID: 1927, 1975

Ascendis Pharma · Princeton, New Jersey, United States

Apply now

Uncategorized Princeton, New Jersey, United States Posted Jun 10, 2026

Ready to apply?

Apply to Ascendis Pharma

Ascendis Pharma

View all jobs →

Senior Fund Accountant

GXE · Melbourne, Victoria, Australia

Apply now

Finance / Accounting Melbourne, Victoria, Australia Posted Jun 10, 2026

Ready to apply?

Apply to GXE

GXE

View all jobs →

Remote Hospice Triage RN PT Monday-Friday 1:30p-7p (No Wknds)

IntellaTriage · Phoenix, Arizona, United States

Apply now

Healthcare / Medical Phoenix, Arizona, United States Posted Jun 10, 2026

Ready to apply?

Apply to IntellaTriage

IntellaTriage

View all jobs →

Registered Behavior Technician - RBT/BT - Full-Time

ICBD · Bedford, New Hampshire, United States

Apply now

Healthcare / Medical Bedford, New Hampshire, United States Manchester, New Hampshire, United States Merrimack, New Hampshire, United States Concord, New Hampshire, United States Posted Jun 10, 2026

Ready to apply?

Apply to ICBD

ICBD

View all jobs →

Registered Behavior Technician - RBT/BT - Full-Time

ICBD · Nashua, New Hampshire, United States

Apply now

Healthcare / Medical Nashua, New Hampshire, United States Derry, New Hampshire, United States Hudson, New Hampshire, United States Milford, New Hampshire, United States Pelham, New Hampshire, United States Amherst, New Hampshire, United States Posted Jun 10, 2026

Ready to apply?

Apply to ICBD

ICBD

View all jobs →

Find roles at companies that ship.

Research Engineer – Benchmarking, Evals & Failure Analysis

About Mercor

About the Role

What You’ll Do

What We’re Looking For

Nice To Have

Benefits

Program Manager (Compliance/Audits)

Tool Coordination Lead

Professional Land Surveyor / City Surveyor

Hingham, EEC Certified Pre-K Teacher

Senior Project/Product Manager

Director, Engineering Project Management

Certified Personal Coach - Northwest Las Vegas

Infant Teacher

Resolutions Analyst

Paraprofessional

Paraprofessional

Revenue Operation - Strategy & Planning

Accountant - Remote

Customer Success - Singapore Clients

Accountant - Fully Remote (Must be UK-Based)

LinkedIn Sales & Business Development Representative | Remote SA

LinkedIn Sales & Business Development Representative | Remote PH

Lead Generation Specialist & Outreach Specialist (Remote) | LATAM

Residential Security Officer- Dodington Park Estate, Gloucestershire

Residential Security Officer- Ballynatray, County Waterford

Business Analyst - Integration Platform

Senior Software Engineer

Test Engineer

Country Sales Manager ( Digital )

Motion Graphics Editor (Contract)

Night Manager

CNA or HHA- Full-time, Philadelphia/ Montgomery county

Math Department Chair

Associate Recruiter

Account Manager

Interpreter - Legal/Courts (Oregon)

Khmer Interpreter (Onsite)

Spanish Interpreter (Simultaneous) - Court-Certified

Translator - Certified

Translator - Pacific Island Languages

Interpreter (OPI & VRI - Legal)

Interpreter (OPI) - Government Services

Interpreter (Simultaneous) - Court-Certified

Court Interpreter (New Hampshire)

Dental Hygienist | Atlanta Area

Hygienist | Phoenix East Valley

Hygienist | Phoenix Metro West

Hygienist | Phoenix North Central

Head of Operations: Production

Special Projects Engineer

Director of US Insights and Commercial Analytics (ICA) - Job ID: 1927, 1975

Senior Fund Accountant

Remote Hospice Triage RN PT Monday-Friday 1:30p-7p (No Wknds)

Registered Behavior Technician - RBT/BT - Full-Time

Registered Behavior Technician - RBT/BT - Full-Time

Whoa — hold up

Catch your next role the second it’s posted.

Don't be the 201st applicant.

Find roles at
companies that ship.