Foundation Model Research Scientist

Responsibilities:

Advance AI innovation: Explore long-term memory, decision-making models, and autonomous agents to drive the evolution of large language models (LLMs), large multimodal models (LMMs), and next-generation artificial intelligence systems.
Produce high-impact research outcomes: Publish research at top-tier conferences, file patents, and contribute to the broader AI community through open-source datasets, models, and code.
Drive original research breakthroughs: Investigate frontier AI research directions and emerging industry trends, delivering original, high-impact research that shapes future AI capabilities.

Qualifications:

Bachelor’s degree or higher (or equivalent practical experience) in Computer Science, Software Engineering, or a related field.
Strong theoretical foundation: Deep expertise in machine learning, deep learning, natural language processing (NLP), computer vision (CV), and reinforcement learning (RL).
Excellent programming skills: Proficient in Python and C/C++ in Linux environments; able to independently implement complex deep learning models and system components, with strong debugging and performance optimization capabilities.
In-depth understanding of modern architectures: Familiarity with language models (Transformers and variants, Linear Attention), multimodal models (LLaVA-style models, native MLLMs), generative models (autoregressive models, DiT), and reasoning or decision-making models (e.g., PPO, o1-style reasoning approaches).
Strong analytical and problem-solving skills, with a collaborative mindset and effective communication abilities.

Careers