Mindrift

Evaluation Engineer

Posted

3 weeks ago

Vietnam

Remote

USD 6K

Mid Level

Part Time

Sema Summary

Mindrift is seeking an Evaluation Engineer to design evaluation scenarios for AI agents. The role involves creating test cases and analyzing agent performance to enhance AI decision-making.

About Company

Mindrift is a platform dedicated to advancing artificial intelligence through collaborative online projects. It focuses on creating data for generative AI, and freelancers can contribute from anywhere.

Core Requirements

  • Bachelor's or Master's degree in a relevant field
  • Background in QA or software testing
  • Strong written communication skills in English
  • Experience with JSON/YAML formats
  • Basic knowledge of Python and JavaScript

Responsibilities

  • Design realistic evaluation scenarios for AI agents.
  • Create structured test cases simulating human workflows.
  • Define gold-standard behavior for agent actions.
  • Analyze agent logs and decision paths.
  • Iterate on test cases to improve clarity.
  • Ensure scenarios are reusable and easy to execute.
  • Collaborate with teams to validate scenarios.

Benefits

  • Flexible schedule
  • Remote work
  • Valuable experience
  • Influence AI development

Must-Have Skills

  • Analytical mindset
  • Attention to detail
  • Strong communication skills
  • Experience with test design principles
  • Familiarity with AI and NLP

Job Keywords

Evaluation Engineer, AI, Testing, Freelance, Remote
