Job Description
- *About The Job**
- *Mercor**
- connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
- *Benchmark**
- ,
- *General Catalyst**
- ,
- *Peter Thiel**
- ,
- *Adam D'Angelo**
- ,
- *Larry Summers**
- , and
- *Jack Dorsey**
- .
- *Position:**
- AI Model Evaluation Specialist
- *Type:**
- *Contract**
- *Compensation:**
- *$50–$75/hour**
- *Commitment:**
- *~20 hours/week**
- *Role Responsibilities**
- Write realistic prompts that reflect how professionals and consumers seek domain-specific guidance.
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
- Score and rank multiple model responses using structured rubrics across dimensions.
- Provide written justifications with specific evidence for each evaluation.
- *Qualifications**
- *Must-Have**
- Master’s degree or higher in Legal or a relevant professional field.
- Professional experience applying domain expertise in a practitioner or advisory capacity.
- Familiarity with industry-specific standards, regulations, or clinical guidelines.
- Strong written communication and critical reasoning skills.
- *Application Process (Takes 20–30 mins to complete)**
- Submit your resume to begin.
- Complete the Model Response Evaluation assessment.
- *Resources & Support**
- For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
- For any help or support, reach out to: [email protected]
- PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*
,
Apply tot his job
Apply To this Job