Foundation Model Research Scientist
Responsibilities:
- Advance AI innovation: Explore long-term memory, decision-making models, and autonomous agents to drive the evolution of large language models (LLMs), large multimodal models (LMMs), and next-generation artificial intelligence systems.
- Produce high-impact research outcomes: Publish research at top-tier conferences, file patents, and contribute to the broader AI community through open-source datasets, models, and code.
- Drive original research breakthroughs: Investigate frontier AI research directions and emerging industry trends, and turn them into original work that shapes future AI capabilities.
Qualifications:
- Bachelor’s degree or higher (or equivalent practical experience) in Computer Science, Software Engineering, or a related field.
- Strong theoretical foundation: Deep expertise in machine learning, deep learning, natural language processing (NLP), computer vision (CV), and reinforcement learning (RL).
- Excellent programming skills: Proficient in Python and C/C++ in Linux environments; able to independently implement complex deep learning models and system components, with strong debugging and performance optimization capabilities.
- In-depth understanding of modern architectures: Solid working knowledge of language models (Transformers and variants, linear attention), multimodal models (LLaVA-style models, native MLLMs), generative models (autoregressive models, DiT-style diffusion transformers), and reasoning or decision-making methods (e.g., PPO, o1-style reasoning approaches).
- Strong analytical and problem-solving skills, with a collaborative mindset and effective communication abilities.