Back

AI Decision & Response Analyst

Worldwide Salaried Open

Responsibilities

  • Evaluate AI model responses for personalization quality, including grounding, integration, and helpfulness.
  • Design and execute multi-turn prompts based on personal context to test AI capabilities.
  • Analyze responses for hallucinations, incorrect personalization, and poor inferences.
  • Perform side-by-side comparison of model outputs to determine quality and effectiveness.
  • Write clear and structured rationales for response evaluations and rankings.
  • Extract and verify debug information to ensure proper use of data sources.
  • Maintain strict data hygiene and ensure accurate documentation of evaluations.
  • Collaborate with cross-functional teams to improve AI model performance.

Requirements

  • Strong proficiency in Polish with excellent reading and writing skills.
  • Experience in data annotation, AI evaluation, content moderation, or a related role.
  • Strong analytical thinking and ability to assess nuanced AI responses.
  • Ability to design creative, multi-turn prompts based on personal context.
  • Understanding of personalization concepts, including identifying incorrect or forced personalization.
  • High attention to detail in evaluating subtle differences in model outputs.
  • Excellent written communication and structured reasoning skills.
  • Ability to work independently in a remote environment.
  • Willingness to use a personal Google account for evaluation purposes.
  • Full-time availability with at least 4 hours overlap with PST.
  • Bachelor’s degree or equivalent experience in a relevant analytical field.

Apply tot his job Apply To this Job

More jobs

NURSE EVALUATOR III, HEALTH SERVICES

Worldwide Salaried

Finance Model Prompt Evaluator

Worldwide Salaried

AI Quality Evaluator (Polish)

Worldwide Salaried

Healthcare Research Evaluator (STEM) | $30/hr Remote

Worldwide Salaried

Generative AI Evaluator (Russian) | $15/hr Remote

Worldwide Salaried

Product Manager - Healthcare (Remote)

Worldwide Salaried

Product Owner (Specialty Lines Insurance)

Worldwide Salaried

Product Owner – Digital Enablement

Worldwide Salaried

Product Owner (Data Center) || W.2 only, No C.2.C & No H.1s, E.A. Ds

Worldwide Salaried

AI Product Owner- Quote & Order Management

Worldwide Salaried

Sales Representative – Infectious Disease I Region Niederbayern & Oberpfalz (m/w/d)

Worldwide Salaried

Experienced Remote Customer Service Representative – Thrive in a Dynamic Environment at arenaflex

Worldwide Salaried

Senior Enablement Manager

Worldwide Salaried

Business Analyst III - CRM

Worldwide Salaried

Behavioral Health Care Advocate - Utilization Mgmt. - Remote

Worldwide Salaried

Import Specialist I

Worldwide Salaried

Experienced Data Entry Specialist – Workforce Management (WFM) Team at arenaflex – Flexible Part-Time Opportunity with Competitive Hourly Wage

Worldwide Salaried

Experienced Data Entry Specialist – Remote Work Opportunity at arenaflex

Worldwide Salaried

Experienced Full Stack Customer Support Associate – Bilingual French (arenaflex Starlink)

Worldwide Salaried

Experienced Data Entry Specialist – Flexible Remote Work Opportunity at arenaflex

Worldwide Salaried