Remote

24-MAG

New York, New York (remote)

JOB DETAILS
SALARY
$10–$20 Per Hour
SKILLS
Academic Background, Analysis Skills, Artificial Intelligence (AI), Artificial Intelligence (AI) Programming Languages, Communication Skills, Consulting, Data Modeling, Detail Oriented, Editing, English Language, Fact Checking, Identify Issues, Linguistics, Modeling Languages, Multilingual, Performance Analysis, Performance Modeling, Project Evaluation, Project Execution, Quality Management, Technical Consulting, Training Data Sets, Urdu Language, Writing Skills
LOCATION
New York, New York
POSTED
4 days ago

We are sharing a specialized part-time consulting opportunity for Urdu-English bilingual professionals experienced in language evaluation, LLM response review, fact-checking, structured feedback, and high-quality written analysis in English.

This role supports current and upcoming remote consulting opportunities focused on Urdu-language AI response evaluation, bilingual quality review, factual accuracy assessment, reasoning analysis, rubric-based scoring, and high-quality project execution. Selected professionals will assess Urdu AI-generated responses, identify strengths and areas for improvement, fact-check outputs using trusted sources, and write clear English-language feedback based on structured evaluation criteria.

Key Responsibilities

Professionals in this role may contribute to:

Urdu Response Evaluation

  • Review Urdu AI-generated responses for accuracy, clarity, reasoning quality, tone, and completeness
  • Identify response strengths, improvement areas, factual inaccuracies, and communication gaps
  • Evaluate whether responses align with expected conversational behavior and project-specific guidelines
  • Apply native Urdu fluency and English writing ability to produce clear evaluation notes

Fact-Checking & Quality Review

  • Conduct fact-checking using trusted public sources and approved external tools
  • Assess whether responses are well-reasoned, complete, contextually appropriate, and useful
  • Identify subtle language issues, factual errors, unclear reasoning, or gaps in response quality
  • Generate high-quality human evaluation data through careful review and structured judgment

Rubric-Based Feedback & Evaluation Artifacts

  • Apply structured rubrics and quality criteria to assess model response performance
  • Write clear, consistent, and reproducible feedback in English
  • Compare outputs and make fine-grained qualitative judgments when required
  • Maintain accuracy, consistency, and strong attention to detail across submitted evaluations

Ideal Profile

Strong candidates may have:

  • Native fluency in Urdu
  • Strong English proficiency and excellent written communication skills
  • A bachelor's degree or equivalent academic background
  • Significant experience using large language models and an understanding of how people use AI tools
  • Strong ability to explain what makes an AI response accurate, incomplete, unclear, unrealistic, or poorly reasoned
  • Excellent attention to detail and ability to notice subtle issues in language, reasoning, and factual accuracy
  • Background or experience in structured analytical thinking, such as research, policy, analytics, linguistics, engineering, writing, or evaluation work

Educational Background

  • A bachelor's degree is preferred
  • Academic or professional experience in linguistics, translation, research, policy, analytics, engineering, writing, education, or related fields is highly relevant
  • Practical experience evaluating written content, bilingual language quality, or AI-generated outputs may also be valuable

Nice to Have

  • Prior experience with RLHF, model evaluation, data annotation, or AI response assessment
  • Experience writing, editing, or reviewing high-quality written content
  • Experience comparing multiple outputs and making fine-grained qualitative judgments
  • Familiarity with rubric-based evaluation, quality scoring, or structured feedback workflows
  • Ability to clearly explain factual inaccuracies, reasoning errors, and communication gaps

Why This Opportunity

  • Apply Urdu-English bilingual expertise to structured remote evaluation work
  • Contribute to high-quality AI response review, fact-checking, and language quality assessment
  • Work on flexible assignments aligned with language skills, analytical judgment, and LLM experience
  • Help improve response quality by identifying factual issues, reasoning gaps, and communication strengths
  • Remote structure with competitive hourly compensation

Contract Details

  • Independent contractor role
  • Fully remote with flexible scheduling
  • Eligible professionals may be based anywhere in the world, depending on project needs
  • Native Urdu fluency and strong English proficiency are required for project work
  • Applicants may be asked to complete a Bilingual Competency interview in Urdu as part of project screening
  • Part-time commitment depending on project availability
  • Competitive rates of $10–$20 per hour, depending on expertise and project scope
  • Weekly payments via Stripe or Wise
  • Projects may be extended, shortened, or adjusted depending on scope and performance
  • Work will not involve access to confidential or proprietary information from any employer, client, or institution

About the Platform

This opportunity is available through 24-MAG LLC. We connect experienced professionals with remote consulting opportunities across technical, evaluation, and project-based workstreams.

By submitting this application, you acknowledge that your information may be processed by 24-MAG LLC for recruitment and opportunity matching in accordance with our Privacy Policy: https://www.24-mag.com/privacy-policy.
