Software Engineering, Data Science, and Systems Design Experts, Powershell
Posted 1 week ago by Great Value Hiring
Negotiable
Undetermined
Undetermined
United Kingdom
Summary: The role involves evaluating LLM-generated responses to software engineering queries, ensuring accuracy and clarity through fact-checking and code validation. Candidates will assess code quality and model responses while adhering to established evaluation standards. A strong background in software engineering, particularly with Powershell, is essential for success in this position.
Key Responsibilities:
- Evaluate LLM-generated responses for accuracy, reasoning, clarity, and completeness.
- Conduct fact-checking using trusted public sources and authoritative references.
- Perform accuracy testing by executing code and validating outputs.
- Annotate model responses by identifying strengths and areas for improvement.
- Assess code quality, readability, algorithmic soundness, and explanation quality.
- Ensure model responses align with expected conversational behavior and system guidelines.
- Apply consistent evaluation standards by following clear taxonomies and guidelines.
Key Skills:
- BS, MS, or PhD in Computer Science or a closely related field.
- Significant (5+ years) real-world experience in software engineering or related technical roles.
- Expert in Powershell.
- Able to solve HackerRank or LeetCode Medium and Hard-level problems independently.
- Experience contributing to well-known open-source projects.
- Significant experience using LLMs while coding.
- Strong attention to detail and comfort evaluating complex technical reasoning.
Salary (Rate): £100.00/hr
City: undetermined
Country: United Kingdom
Working Arrangements: undetermined
IR35 Status: undetermined
Seniority Level: undetermined
Industry: IT
Software Engineering, Data Science, and Systems Design Experts, Powershell [$60-$100/hr]
Role Responsibilites
- Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness
- Conduct fact-checking using trusted public sources and authoritative references
- Conduct accuracy testing by executing code and validating outputs using appropriate tools
- Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies
- Assess code quality, readability, algorithmic soundness, and explanation quality
- Ensure model responses align with expected conversational behavior and system guidelines
- Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines
Good Candidature
- BS, MS, or PhD in Computer Science or a closely related field
- Significant (5+ years) real-world experience in software engineering or related technical roles
- Expert in Powershell
- Able to solve HackerRank or LeetCode Medium and Hard–level problems independently
- Experience contributing to well-known open-source projects, including merged pull requests
- Significant experience using LLMs while coding and understand their strengths and failure modes
- Strong attention to detail and are comfortable evaluating complex technical reasoning, identifying subtle bugs or logical flaws
Nice to Have
- Prior experience with RLHF, model evaluation, or data annotation work
- Track record in competitive programming
- Experience reviewing code in production environments
- Familiarity with multiple programming paradigms or ecosystems
- Experience explaining complex technical concepts to non-expert audiences