Job Description
5+ years of experience building models for business applications
PhD, or Master's degree and 6+ years of experience in CS, CE, ML, or a related field
Experience programming in Java, C++, Python, or a related language
Practical experience with LLM pre-training or post-training, and experience working with organic, synthetic, agentic, or reasoning data for LLMs
Published research in top conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL) and/or demonstrated significant industry influence in the field of AI
Experience using Unix/Linux
Experience in professional software development
Experience working on SOTA LLMs
Multiple first-author LLM-related publications in top-tier conferences (e.g., NeurIPS, ICLR, ACL, EMNLP, NAACL)
Preferred Qualifications:
5+ years of experience building and deploying machine learning models or developing algorithms for business applications
A strong track record of publications at top-tier, peer-reviewed conferences or journals (e.g., NeurIPS, ICML, ACL, KDD, CVPR)
Proven ability to transition research to production-grade, scalable systems
Experience with Generative AI tools, Large Language Models (LLMs), Transformers, Reinforcement Learning, or advanced Computer Vision/NLP techniques
Expertise in Python, Java, or C++, as well as experience with distributed machine learning frameworks (e.g., Spark, Hadoop)