Drive research in areas such as LLM post-training (RLHF, GRPO, instruction tuning), data efficiency, and the design of benchmarks to evaluate LLM capabilities across safety, reasoning, and domain-specific performance.
Design and run experiments to validate hypotheses and iterate on research ideas.
Collaborate with research scientists and engineers to prototype and evaluate novel approaches.
Produce publication-ready research targeting top-tier AI/ML conferences.
Qualification & Experience
A Ph.D. or Masters in Computer Science, Machine Learning, or a related field.
2+ years of post-PhD or 5+ years of post-Masters research experience in an academic or industrial setting.
A strong publication record in top-tier AI/ML conferences and journals, demonstrating a history of impactful research.
Deep expertise in at least one of the following areas: Computer Vision (CV), Natural Language Processing (NLP), or Deep Learning, with a strong grasp of modern Generative AI techniques (e.g., Transformers, Diffusion Models, LLMs).
5+ years of industry or academic research experience in ML/AI.
Proven track record of training large models on distributed infrastructure for research purposes.
Demonstrated ability to rapidly prototype and iterate on research ideas, signaling curiosity and an experimental approach.
Experience collaborating effectively across an organization, not limited to just within teams, to drive research adoption.