Foundational Model Researcher

Responsibilities:

AI Innovation: Drive advancements in LLMs, LMMs, and next-generation AI by exploring long-term memory, decision-making models, and autonomous agents.
High-Impact Publications: Publish papers in top-tier conferences and journals, apply for patents, and contribute to the open-source AI community through the release of datasets, models, and code.
Original Research: Explore the frontiers of AI research and industry trends to lead influential, cutting-edge, and original research initiatives.

Requirements:

Strong theoretical foundation in machine learning, deep learning, natural language processing (NLP), computer vision (CV), reinforcement learning (RL), or related fields.
Solid programming skills; proficient in Python and C/C++ in Linux environments; capable of independently implementing complex deep learning models and system components with strong debugging and performance optimization capabilities.
Familiarity with core architectures such as language models (Transformers and variants, Linear Attention), multimodal models (LLaVA-like, native MLLMs), generative models (Autoregressive, DiT), and reasoning models (O1 / PPO).
Excellent analytical skills, with a strong sense of collaboration and effective communication.

Preferred Qualifications：

Ph.D. in Computer Science, Artificial Intelligence, or a related field from a top-tier university.
Publications in leading conferences/journals such as NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, or ICCV/ECCV.
Outstanding performance in academic competitions (e.g., ACM/ICPC, NOI/IOI, CMO/IMO, CPhO/IPhO).
Experience contributing to prominent open-source large model projects or winning awards in related competitions.

If you are interested in these job openings, please submit your resume and cover letter to shandahr@shanda.com. We also welcome assistance from recruitment agencies.