Hand-picked remote opportunities to evaluate AI — from home, on your schedule, with weekly pay.
Mercor — AI Research Lab Partner
Evaluate AI-generated responses and write structured, honest feedback for leading AI labs. You'll assess reasoning quality, accuracy, and nuance — helping shape the next generation of intelligent systems. Strong critical reading and writing skills required. No AI writing tools: your genuine human judgment is what they're paying for.
3-question quiz
Answer 3 quick questions — we'll show you the roles you're most likely to get hired for.
What best describes your background?
How many hours per week can you commit?
What matters most to you?
Your best matches:
Mercor
Emergency Medicine Expert
Design clinical EM scenarios, write attending-level reference responses, and grade AI outputs. For EM attendings, dual-boarded physicians, medical directors, and final-year residents. US & Canada only.
Mercor
Superstar Reviewer
High-paying AI review role for top performers. Evaluate AI outputs for leading labs. Flexible schedule, fully remote. Competitive rate reserved for standout reviewers.
Mercor
Cyber Benchmark — Blue Team Engineer
Design benchmark tasks grounded in real SOC and detection engineering work. Build realistic evaluation environments (multi-host networks, Active Directory, cloud). Hands-on blue-team experience required.
Mercor
Venture Capital Expert
Research project with a leading AI lab. Must have 2+ years in VC with experience in board materials, market sizing, investment memos, or deal sourcing. Min 10 hrs/week.
Mercor
Corporate Finance Expert
2+ years in corporate finance with experience in financial modeling, reporting, or analysis. Min 10 hrs/week, 4-week minimum engagement with leading AI lab.
Mercor
Accounting Expert — Document Understanding
Train AI on financial statement analysis and audit documentation. Evaluate balance sheets, income statements, audit workpapers, and compliance reports. CPA with Big 4 background ideal.
Mercor
STEM Python Expert (Math, Physics, Chemistry, Biology)
Translate research papers into decomposed subproblems, write gold-standard Python solutions, and author unit tests. 15–25 hrs/week, 1–2 month project. NumPy, SciPy, JAX experience helpful.
Mercor
STEM PhD AI Contributor
Craft difficult domain-specific STEM problems for a top AI lab's frontier model. Math, Biology, Chemistry, Physics, CS, Stats, Econ. 15–20 hrs/week with potential to scale to 40.
Mercor
AI Training Scenario Designer
Build detailed personas and simulated digital environments (Gmail, Slack, Calendar, Drive) to challenge AI agents. Write tasks, evaluate performance, document results. Great fit for writers, PMs, game designers.
Mercor
Economics & Finance Assessment Specialist
Author and verify multiple-choice questions across micro/macroeconomics, financial markets, econometrics, and personal finance. Write chain-of-thought solutions and academic references. 10+ hrs/week.
Mercor
Business & Commerce Assessment Specialist
Author and verify multiple-choice questions across business intelligence, marketing, ethics, and e-commerce. PhD/DBA/MBA preferred. Write chain-of-thought solutions and academic references. 10+ hrs/week.
Mercor
Enterprise Sales / Account Executive Expert
Recreate your real AE digital workspace and design multi-step tasks that challenge AI on enterprise sales workflows — pipeline management, account plans, proposals, contract redlines. 5+ years AE experience.
Mercor
Mathematics Assessment Specialist
Author and verify rigorous math questions across algebra, number theory, topology, and optimization. Write chain-of-thought solutions, rate difficulty, source academic references. PhD or master's preferred. 10+ hrs/week.
Mercor
Bilingual French STEM Expert
Create and evaluate French/English STEM prompts in biology, physics, or chemistry. Native French speaker from France, Canada, Belgium, or French-speaking Switzerland required. 20+ hrs/week, 3–6 months.
Mercor
Commerce Specialist — Consumer AI Agents
Train consumer-facing AI shopping agents. Evaluate AI on product categorization, catalog management, and merchandising. Ideal for retail merchandising managers or e-commerce catalog specialists.
Mercor
Bilingual German STEM Expert
Create and evaluate German/English STEM prompts in biology, physics, or chemistry. Native German speaker from Germany, Austria, or German-speaking Switzerland required. 20+ hrs/week, 3–6 month engagement.
Mercor
Generalist — Real World Understanding
Help train AI on real-world reasoning and visual understanding tasks. Evaluate AI on ambiguous, multi-modal challenges requiring common sense and spatial reasoning. Ideal for recent grads from selective universities.
Mercor
Bilingual Italian Generalist Evaluator
Author Italian/English prompt–answer pairs to train and evaluate AI models. Must be native to Switzerland (Italian-speaking) or Italy. Deep cultural and linguistic familiarity required. 20+ hrs/week.
Mercor
Voice Actor — CX Agent Voice Cloning (USA)
Record high-quality voice samples to train next-gen TTS AI. Native American English speaker based in the US required. Professional recording setup needed. 5–10 hrs/week. Note: your voice may be cloned for AI use.
The process
No lengthy process. No office. Apply, pass one interview, start earning.
Apply above
Click any Apply button — goes directly to the role page. About 10 minutes to complete.
One interview
20–30 minute screening. No take-home tests, no multi-round process.
Get matched
Mercor matches you to projects based on your background and availability.
Earn weekly
Set your own hours. Paid every Wednesday and Friday via Stripe or Wise.