Information Systems Expert - AI Evaluator

Remote, USA Full-time Posted 2026-06-13

• *About The Job

*Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

*Benchmark**

,

*General Catalyst**

,

*Peter Thiel**

,

*Adam D'Angelo**

,

*Larry Summers**

, and

*Jack Dorsey**

.

*Position:**

AI Model Evaluation Specialist

*Type:
*Contract
Compensation:
$40–$60/hour
*Commitment:
*20 hours/week
*Role Responsibilities
Write realistic prompts reflecting how professionals and consumers seek domain-specific guidance.
Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
Score and rank multiple model responses using structured rubrics across dimensions.
Provide written justifications with specific evidence for each evaluation.
*Qualifications
*Must-Have
Master’s degree or higher in Computer Science, Information Systems, or a relevant professional field.
Professional experience applying domain expertise in a practitioner or advisory capacity.
Familiarity with industry-specific standards, regulations, or clinical guidelines.
Strong written communication and critical reasoning skills.
*Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.
Complete the Model Response Evaluation assessment.
*Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

For any help or support, reach out to: [email protected]
PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Similar Jobs

BDI Evaluator

Remote, USA Full-time

AI Writing Evaluators (Domain Experts) - English Expertise

Remote, USA Full-time

Business Research Evaluator | $30/hr Remote

Remote, USA Full-time

Social Media Evaluator (Ukrainian-United States)

Remote, USA Full-time

Qualified Medical Evaluator (QME) - Pain Medicine Physician - Part Time

Remote, USA Full-time

Regional Vocational Evaluation Specialist

Remote, USA Full-time

Lead Program Evaluator – Title III / Federal Education Grants

Remote, USA Full-time

Spanish Speaking CFTSS OLP Supervisor/Evaluator (Remote)

Remote, USA Full-time

Manufacturing Expert - Quality Evaluator

Remote, USA Full-time

Senior Product Owner, IaaS (Remote)

Remote, USA Full-time

DBA - PHP Backend Developer (Remote)

Remote, USA Full-time

Remote Data Entry Specialist – Precise Data Management & Administrative Support for Kolkata Operations (Work‑From‑Home)

Remote, USA Full-time

Experienced Lead Supervisor - Customer Care: Deliver Exceptional Service and Drive Team Success at arenaflex

Remote, USA Full-time

Experienced Part-Time Remote Data Entry Specialist – Supporting arenaflex's Operations with Accuracy and Efficiency

Remote, USA Full-time

Experienced Customer Service Representative – Live Chat Support for arenaflex

Remote, USA Full-time

Quality and Compliance Manager

Remote, USA Full-time

Experienced Technical Program Manager – Cloud Infrastructure and Data Analytics

Remote, USA Full-time

Mental Health Therapist - Tazewell - LCSW

Remote, USA Full-time

Remote Data Entry Operator – Precision Data Management & Reporting Specialist at arenaflex

Remote, USA Full-time

Sales Manager - Dealer Development - Rolltec Shading

Remote, USA Full-time