Manufacturing Expert - Quality Evaluator

Remote, USA Full-time Posted 2026-06-13

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$25–$35/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting professional and consumer domain-specific guidance.
Evaluate AI-generated responses for factual accuracy and practical usefulness.
Identify fabricated claims and misleading reasoning in model outputs.
Score and rank model responses using structured rubrics.
Provide written justifications with specific evidence for evaluations.
*Qualifications
*Must-Have
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Similar Jobs

Senior Product Owner, IaaS (Remote)

Remote, USA Full-time

Staff Product Owner (Oracle Retail)

Remote, USA Full-time

Educational Technology AI Rater & Evaluator

Remote, USA Full-time

Vocational Evaluator

Remote, USA Full-time

AI Decision & Response Analyst

Remote, USA Full-time

NURSE EVALUATOR III, HEALTH SERVICES

Remote, USA Full-time

Finance Model Prompt Evaluator

Remote, USA Full-time

AI Quality Evaluator (Polish)

Remote, USA Full-time

Healthcare Research Evaluator (STEM) | $30/hr Remote

Remote, USA Full-time

Generative AI Evaluator (Russian) | $15/hr Remote

Remote, USA Full-time

Experienced Live Chat Remote Data Entry Specialist – Delivering Precision and Excellence in Customer Support

Remote, USA Full-time

Experienced Remote Data Entry Analyst – Driving Efficiency and Accuracy in arenaflex's Dynamic Operations

Remote, USA Full-time

Billing Manager & Credentialing Administrator

Remote, USA Full-time

Principal Software Engineer

Remote, USA Full-time

Experienced Work from Home Customer Support Representative – Remote Chat & Digital Customer Service Specialist (Entry Level)

Remote, USA Full-time

Remote Data Entry Specialist - Work From Home | Flexible Hours | Immediate Hiring

Remote, USA Full-time

Experienced Bilingual Customer Service Representative - Remote Opportunity in arenaflex

Remote, USA Full-time

Spanish Speaking Remote patient monitoring (RPM) Care Coordinator

Remote, USA Full-time

Senior Legal Executive/General Counsel Coach - EMEA

Remote, USA Full-time

Experienced Part-Time Remote Customer Service Representative – Flexible Work-from-Home Opportunities with arenaflex

Remote, USA Full-time