Generalist Evaluator Expert

Remote Full-time
Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. It is ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### **Job Details:**

- **Design and Optimize Prompts**: Create detailed prompts with multiple constraints and instructions.
- **Define and Document Evaluation Standards**: Establish high-level expectations for correct responses in general consumer contexts, and develop a comprehensive rubric.
- **Conduct Model Testing and Grading**: Run prompts through models and assess preliminary outputs against expectations.
- **Support Benchmarking and Quality Assurance**: Collaborate in QA review processes to ensure that prompt tasks and rubrics are rigorous, consistent, and reliable before integration into official benchmarks.

### **Minimum Qualifications:**

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- US or Canada based.

### **Preferred Qualifications:**

- Experience in teaching or research.

### **Application & Onboarding Process:**

- Complete an AI-led interview, which should take around 15 minutes.
- Complete a 45-minute written assessment that will guide you through writing rubrics.
- If selected, you will be invited to work on the project.

### **More Details About This Role:**

- This is a **remote and asynchronous** role: work on your own schedule.
- Expect to contribute at least **20 hours per week**.
- Expect a commitment of around 1 month.
- You’ll be working in a structured project environment with clear goals and tools.

* * *

### **About Mercor**

- Our team is based in San Francisco, CA.
- We specialize in recruiting experts for top AI labs.
- Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey.