Software Engineering, Data Science, and Systems Design Experts, Bash

Posted 1 week ago by Great Value Hiring

Negotiable

Undetermined

United Kingdom

Bash (Scripting Language), Computer Science, Data Science, Generic Programming, Nice (Unix Utility), Open Source Development, Readability, Software Coding, Software Development, Software Engineering, Systems Design

Summary: The role involves evaluating LLM-generated responses to software engineering queries, ensuring accuracy and clarity through fact-checking and code execution. Candidates will annotate model responses and assess code quality while adhering to established evaluation standards. A strong background in software engineering and expertise in Bash is essential, along with experience in using LLMs effectively. The position requires attention to detail and the ability to evaluate complex technical reasoning.

Key Responsibilities:

Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness
Conduct fact-checking using trusted public sources and authoritative references
Conduct accuracy testing by executing code and validating outputs using appropriate tools
Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies
Assess code quality, readability, algorithmic soundness, and explanation quality
Ensure model responses align with expected conversational behavior and system guidelines
Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines

Key Skills:

BS, MS, or PhD in Computer Science or a closely related field
Significant (5+ years) real-world experience in software engineering or related technical roles
Expert in Bash
Able to solve HackerRank or LeetCode Medium and Hard-level problems independently
Experience contributing to well-known open-source projects, including merged pull requests
Significant experience using LLMs while coding, with an understanding of their strengths and failure modes
Strong attention to detail and comfort evaluating complex technical reasoning, including identifying subtle bugs or logical flaws
Prior experience with RLHF, model evaluation, or data annotation work (nice to have)
Track record in competitive programming (nice to have)
Experience reviewing code in production environments (nice to have)
Familiarity with multiple programming paradigms or ecosystems (nice to have)
Experience explaining complex technical concepts to non-expert audiences (nice to have)

Salary (Rate): £100.00/hr

City: undetermined

Country: United Kingdom

Working Arrangements: undetermined

IR35 Status: undetermined

Seniority Level: undetermined

Industry: IT

Detailed Description From Employer:

Software Engineering, Data Science, and Systems Design Experts, Bash [$60-$100/hr]

Role Responsibilities

Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness
Conduct fact-checking using trusted public sources and authoritative references
Conduct accuracy testing by executing code and validating outputs using appropriate tools
Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies
Assess code quality, readability, algorithmic soundness, and explanation quality
Ensure model responses align with expected conversational behavior and system guidelines
Apply consistent evaluation standards by following clear taxonomies, benchmarks, and detailed evaluation guidelines

Ideal Candidate

BS, MS, or PhD in Computer Science or a closely related field
Significant (5+ years) real-world experience in software engineering or related technical roles
Expert in Bash
Able to solve HackerRank or LeetCode Medium and Hard-level problems independently
Experience contributing to well-known open-source projects, including merged pull requests
Significant experience using LLMs while coding, with an understanding of their strengths and failure modes
Strong attention to detail and comfort evaluating complex technical reasoning, including identifying subtle bugs or logical flaws

Nice to Have

Prior experience with RLHF, model evaluation, or data annotation work
Track record in competitive programming
Experience reviewing code in production environments
Familiarity with multiple programming paradigms or ecosystems
Experience explaining complex technical concepts to non-expert audiences
