Key Responsibilities
● Evaluate AI/LLM-generated responses for correctness, relevance, and groundedness in line with the guidelines
● Compare multiple model outputs and select or rank the best responses
● Identify errors, biases, or gaps in model outputs and provide structured feedback
● Follow detailed evaluation rubrics and project-specific instructions
● Maintain high quality and consistency while meeting productivity targets