Analysis Skills, Artificial Intelligence (AI), Benchmarking, Communication Skills, Computer Engineering, Computer Programming, Computer Science, Cross-Functional, Data Science, Detail Oriented, Documentation Models, Electrical Engineering, Hardware Evaluation, Information Technology & Information Systems, Python Programming/Scripting Language, Record Keeping, Scientific Research, Stress Testing, Surface Modeling, Team Player, Technical Analysis, Technical Drawing, Technical Research, Workflow Analysis, Writing Skills
Job Title: Applied AI Research Scientist – Computer Science / Computer Engineering (Remote)
Exp - 5+ Yrs
> Job Overview
We are seeking skilled and motivated Applied AI Research Scientists in Computer Science and Computer Engineering with an MS or Ph.D. in a relevant technical field. In this role, you will contribute to the design, validation, and execution of expert-level evaluation tasks that probe the limits of state-of-the-art AI systems. Your work will focus on creating headroom-level, rigorously verifiable evaluation questions across hardware, systems, and computing domains to assess and stress-test advanced multimodal and reasoning-capable AI models. This position requires deep domain expertise, strong analytical rigor, and the ability to translate complex technical concepts into precise, evaluable challenges that expose model limitations beyond surface-level reasoning. You will work closely with a collaborative, cross-functional team and are expected to be a reliable team player who is highly detail-oriented and committed to accuracy and quality.
= Key Responsibilities
> Design headroom-level evaluation questions requiring advanced reasoning and graduate-level domain expertise in CS/CE
Ensure all tasks are objectively verifiable with clear, definitive ground-truth answers
= Develop high-quality multimodal prompts, including accurate technical diagrams or visuals when appropriate
=
Identify and document model headroom by conducting structured side-by-side evaluations of SOTA models such as Gemini and ChatGPT
= Document model failures and reasoning gaps, provide correct solutions, and maintain accurate records of prompts, answers, and evaluation results in shared tracking systems
< Qualifications
< MS or Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, Information Technology, Data Science, or a closely related field
> Strong expertise in two or more CS/CE domains such as hardware, computer architecture, systems, VLSI design, embedded systems and IoT, operating systems, compilers, systems security, or AI/ML
=, Proven experience with applied AI research, technical evaluation, or research-driven problem formulation in real-world or production-oriented settings
= Strong programming proficiency, with experience in Python for analysis, verification, and evaluation workflows
Strong written communication skills and the ability to collaborate effectively as a detail-oriented team player
> Familiarity with modern AI model capabilities, limitations, and benchmarking practices is a strong plus,
Please let me know if you have any questions.