Data Analyst
Remoteok
We are sourcing independent Search Engine Evaluation Specialists to provide their expertise for an AI benchmark evaluation project. As AI models increasingly interpret search intent, analyze indexing protocols, and evaluate search rank responses, their accuracy relies entirely on robust, expert-crafted training data. The objective of this project is to autonomously produce high-quality evaluation tasks, strong prompts, and clear, well-structured rubrics that generate clean, reliable data for model training.
Operate autonomously to design complex evaluation frameworks and provide structured training data. Expected deliverables include: