Data Scientist Job at Mercor, New York, NY

  • Mercor
  • New York, NY

Job Description

Job Description: AI Task Evaluation & Statistical Analysis Specialist

Role Overview

We're seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. You'll identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types, file types, criteria, etc.).

Key Responsibilities

  • Statistical Failure Analysis: Identify patterns in AI agent failures across task components (prompts, rubrics, templates, file types, tags)

  • Root Cause Analysis: Determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations

  • Dimension Analysis: Analyze performance variations across finance sub-domains, file types, and task categories

  • Reporting & Visualization: Create dashboards and reports highlighting failure clusters, edge cases, and improvement opportunities

  • Quality Framework: Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings

  • Stakeholder Communication: Present insights to data labeling experts and technical teams

Required Qualifications

  • Statistical Expertise: Strong foundation in statistical analysis, hypothesis testing, and pattern recognition

  • Programming: Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis

  • Data Analysis: Experience with exploratory data analysis and deriving actionable insights from complex datasets

  • AI/ML Familiarity: Understanding of LLM evaluation methods and quality metrics

  • Tools: Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL

Preferred Qualifications

  • Experience with AI/ML model evaluation or quality assurance

  • Background in finance or willingness to learn finance domain concepts

  • Experience with multi-dimensional failure analysis

  • Familiarity with benchmark datasets and evaluation frameworks

  • 2-4 years of relevant experience
