Remote AI Jobs 2026 — Earn $25–$300/hr Evaluating AI From Home

🔥 Top pay

Mercor

Emergency Medicine Expert

Design clinical EM scenarios, write attending-level reference responses, and grade AI outputs. For EM attendings, dual-boarded physicians, medical directors, and final-year residents. US & Canada only.

US / CA only · Board certified · 20 hrs/wk

Mercor

Superstar Reviewer

High-paying AI review role for top performers. Evaluate AI outputs for leading labs. Flexible schedule, fully remote. Competitive rate reserved for standout reviewers.

Remote · Top performer · Weekly pay

New

Mercor

Cyber Benchmark — Blue Team Engineer

Design benchmark tasks grounded in real SOC and detection engineering work. Build realistic evaluation environments (multi-host networks, Active Directory, cloud). Hands-on blue-team experience required.

Remote · SOC / Detection / IR · Scripting required

Mercor

Venture Capital Expert

Research project with a leading AI lab. Must have 2+ years in VC with experience in board materials, market sizing, investment memos, or deal sourcing. Min 10 hrs/week.

US/UK/CA/EU/AU · 2+ yrs VC · 10+ hrs/wk

Mercor

Corporate Finance Expert

2+ years in corporate finance with experience in financial modeling, reporting, or analysis. Min 10 hrs/week, 4-week minimum engagement with leading AI lab.

US/UK/CA/EU/AU · 2+ yrs corp finance · 10+ hrs/wk

Mercor

Accounting Expert — Document Understanding

Train AI on financial statement analysis and audit documentation. Evaluate balance sheets, income statements, audit workpapers, and compliance reports. CPA with Big 4 background ideal.

Remote · CPA preferred · Big 4 ideal

Mercor

STEM Python Expert (Math, Physics, Chemistry, Biology)

Translate research papers into decomposed subproblems, write gold-standard Python solutions, and author unit tests. 15–25 hrs/week, 1–2 month project. NumPy, SciPy, JAX experience helpful.

BS/MS/PhD STEM · Python required · 15–25 hrs/wk

Mercor

STEM PhD AI Contributor

Craft difficult domain-specific STEM problems for a top AI lab's frontier model. Math, Biology, Chemistry, Physics, CS, Stats, Econ. 15–20 hrs/week with potential to scale to 40.

STEM PhD required · US/UK/CA/EU university · 15–40 hrs/wk

Mercor

AI Training Scenario Designer

Build detailed personas and simulated digital environments (Gmail, Slack, Calendar, Drive) to challenge AI agents. Write tasks, evaluate performance, document results. Great fit for writers, PMs, game designers.

Remote · Undergrad degree · 2+ yrs experience

Mercor

Economics & Finance Assessment Specialist

Author and verify multiple-choice questions across micro/macroeconomics, financial markets, econometrics, and personal finance. Write chain-of-thought solutions and academic references. 10+ hrs/week.

PhD / Master's Econ or Finance · 10+ hrs/wk · Async

Mercor

Business & Commerce Assessment Specialist

Author and verify multiple-choice questions across business intelligence, marketing, ethics, and e-commerce. PhD/DBA/MBA preferred. Write chain-of-thought solutions and academic references. 10+ hrs/week.

PhD / MBA preferred · 10+ hrs/wk · Async

Mercor

Enterprise Sales / Account Executive Expert

Recreate your real AE digital workspace and design multi-step tasks that challenge AI on enterprise sales workflows — pipeline management, account plans, proposals, contract redlines. 5+ years AE experience.

Remote · 5+ yrs AE / B2B · Async

Mercor

Mathematics Assessment Specialist

Author and verify rigorous math questions across algebra, number theory, topology, and optimization. Write chain-of-thought solutions, rate difficulty, source academic references. PhD or master's preferred. 10+ hrs/week.

PhD / Master's in Math · 10+ hrs/wk · Async

Mercor

Bilingual French STEM Expert

Create and evaluate French/English STEM prompts in biology, physics, or chemistry. Native French speaker from France, Canada, Belgium, or French-speaking Switzerland required. 20+ hrs/week, 3–6 months.

FR / CA / BE / CH · Native French · BS in STEM · 20+ hrs/wk

Mercor

Commerce Specialist — Consumer AI Agents

Train consumer-facing AI shopping agents. Evaluate AI on product categorization, catalog management, and merchandising. Ideal for retail merchandising managers or e-commerce catalog specialists.

Remote · Retail / e-commerce ops · Flexible hours

Mercor

Bilingual German STEM Expert

Create and evaluate German/English STEM prompts in biology, physics, or chemistry. Native German speaker from Germany, Austria, or German-speaking Switzerland required. 20+ hrs/week, 3–6 month engagement.

DE / AT / CH / US · Native German · BS in STEM · 20+ hrs/wk

Mercor

Generalist — Real World Understanding

Help train AI on real-world reasoning and visual understanding tasks. Evaluate AI on ambiguous, multi-modal challenges requiring common sense and spatial reasoning. Ideal for recent grads from selective universities.

Remote · Recent grad · Flexible hours

Mercor

Bilingual Italian Generalist Evaluator

Author Italian/English prompt–answer pairs to train and evaluate AI models. Must be native to Switzerland (Italian-speaking) or Italy. Deep cultural and linguistic familiarity required. 20+ hrs/week.

Italy / Switzerland · Native Italian · BS / BA required · 20+ hrs/wk

New

Mercor

Voice Actor — CX Agent Voice Cloning (USA)

Record high-quality voice samples to train next-gen TTS AI. Native American English speaker based in the US required. Professional recording setup needed. 5–10 hrs/week. Note: your voice may be cloned for AI use.

US only · Native American English · 5–10 hrs/wk

Remote work that pays you to think, not automate

Generalist AI Evaluator

Find your best-fit role
