Blueprint-Technologies
Labeler / Annotator – AI Response Evaluation (Italian)
Job Description
Remote
About the Role
We are seeking a detail‑oriented Labeler / Annotator to evaluate responses generated by AI systems in Italian . This role focuses on side‑by‑side (SBS) evaluation of outputs from different AI models across real‑world scenarios. You will play a key role in improving how AI systems understand and communicate in Italian. This is not a translation role; it is an evaluation and analysis role requiring strong judgment and attention to detail.
What You'll Work On
You will evaluate AI responses across scenarios such as:
- Web search results
- File‑based and image‑based responses
- Image and file generation tasks
- Single‑turn and multi‑turn conversations
Responsibilities
- Perform side‑by‑side (SBS) comparisons of AI‑generated responses
- Evaluate outputs based on:
- Accuracy
- Relevance
- Clarity
- Instruction‑f...