-
Senior Business Process Owner Data Sharing (m/w/d)
Kaufland Dienstleistung GmbH & Co. KG Premium- 74072 Heilbronn
kein Anschreiben nötig -
JUNIOR DATA ENGINEER
Blue Reply- Düsseldorf
-
Junior Data Scientist
Wik Far East Limited, Hongkong, Zweigniederlassung Luzern- Essen
-
Fullstack Cloud Engineer / Data Scientist (AWS, React, Python) – Viel Gestaltungsspielraum & echte W
Optimus Search- Monheim am Rhein
-
Data Scientist (Python / Azure ML) – Real-Time AI Decisioning - Frankfurt - €85–100K
Findr- Frankfurt
-
Research Scientist - mRNA Process Development
Cipla- Reutlingen
-
Scientist Protein Chemistry (fmd)
Ascendis Pharma- Heidelberg
Neu -
Research Scientist 3D Deep Learning Senior (mfd)
RIIICO- Düsseldorf
Neu -
Senior Data Scientist (mfd)
Redcare Pharmacy- Cologne
Neu Homeoffice möglich -
Data Scientist (mwd) – AI Drug Discovery
Cube4 Consulting GmbH- Heidelberg
Neu -
Senior Data Scientist Germany (mwd)
CUJU- Frankenthal
Neu -
Data Scientist Robotik
SEARCH4 Global - POOL YOUR TALENT- Aachen
Neu Homeoffice möglich -
Senior Data Scientist GPTN Region
Canva- Düsseldorf
Neu Homeoffice möglich -
Career opportunities Senior Data Scientist (mwd)
SSI SCHAEFER- Dortmund
Neu -
Fullstack Cloud Engineer / Data Scientist (AWS, React, Python) – Viel Gestaltungsspielraum & echte W...
Optimus Search- Monheim am Rhein
Neu
Data Scientist
- Neu
- Veröffentlicht am 22.12.2025
- Festanstellung
Job Description: AI Task Evaluation & Statistical Analysis Specialist
Role Overview
Were seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. Youll identify patterns root causes and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types file types criteria etc.).
Key Responsibilities
-
Statistical Failure Analysis: Identify patterns in AI agent failures across task components (prompts rubrics templates file types tags)
-
Root Cause Analysis: Determine whether failures stem from task design rubric clarity file complexity or agent limitations
-
Dimension Analysis: Analyze performance variations across finance sub-domains file types and task categories
-
Reporting & Visualization: Create dashboards and reports highlighting failure clusters edge cases and improvement opportunities
-
Quality Framework: Recommend improvements to task design rubric structure and evaluation criteria based on statistical findings
-
Stakeholder Communication: Present insights to data labeling experts and technical teams
Required Qualifications
-
Statistical Expertise: Strong foundation in statistical analysis hypothesis testing and pattern recognition
-
Programming: Proficiency in Python (pandas scipy matplotlib/seaborn) or R for data analysis
-
Data Analysis: Experience with exploratory data analysis and creating actionable insights from complex datasets
-
AI/ML Familiarity: Understanding of LLM evaluation methods and quality metrics
-
Tools: Comfortable working with Excel data visualization tools (Tableau/Looker) and SQL
Preferred Qualifications
-
Experience with AI/ML model evaluation or quality assurance
-
Background in finance or willingness to learn finance domain concepts
-
Experience with multi-dimensional failure analysis
-
Familiarity with benchmark datasets and evaluation frameworks
-
2-4 years of relevant experience
Key Skills
Laboratory Experience,Immunoassays,Machine Learning,Biochemistry,Assays,Research Experience,Spectroscopy,Research & Development,cGMP,Cell Culture,Molecular Biology,Data Analysis Skills
Employment Type : Full Time
Experience: years
Vacancy: 1