About the role
This role is for one of our clients
Compensation: $90-$110 per hour
We are building a benchmark dataset to evaluate AI models on professional document understanding and instruction following within the Engineering & Built Environment domain.
Tasks consist of complex, multi-step requests grounded in real-world workspace files (technical drawings, project specifications, engineering reports), web search, and code execution — each paired with a clearly defined ground truth output and an objective evaluation rubric. You will be responsible for authoring tasks that test an AI's ability to interpret engineering documentation, follow multi-step instructions, and produce precise, well-structured outputs.
Requirements
We expect a minimum commitment of 15–20 hours per week.
Ideal candidates have 3+ years of hands-on experience in one or more of the following sub-domains:
- Mechanical engineering
- Civil engineering
- Industrial engineering
- Architecture
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Contract and Payment Terms
- You will be engaged as an independent contractor.
- This is a fully remote role that can be completed on your own schedule.
- Projects can be extended, shortened, or concluded early depending on needs and performance.
- Your work will not involve access to confidential or proprietary information from any employer, client, or institution.
- Payments are weekly on Stripe or Wise based on services rendered.
- Please note: We are unable to support H1-B or STEM OPT candidates at this time.