Responsibilities:

  • AI Innovation: Drive advancements in LLMs, LMMs, and next-generation AI by exploring long-term memory, decision-making models, and autonomous agents.
  • High-Impact Publications: Publish papers in top-tier conferences and journals, apply for patents, and contribute to the open-source AI community through the release of datasets, models, and code.
  • Original Research: Explore the frontiers of AI research and industry trends to lead influential, cutting-edge, and original research initiatives.

Requirements:

  • Strong theoretical foundation in machine learning, deep learning, natural language processing (NLP), computer vision (CV), reinforcement learning (RL), or related fields.
  • Solid programming skills; proficient in Python and C/C++ in Linux environments; capable of independently implementing complex deep learning models and system components with strong debugging and performance optimization capabilities.
  • Familiarity with core architectures such as language models (Transformers and variants, Linear Attention), multimodal models (LLaVA-like, native MLLMs), generative models (Autoregressive, DiT), and reasoning models (O1 / PPO).
  • Excellent analytical skills, with a strong sense of collaboration and effective communication.

Preferred Qualifications:

  • Ph.D. in Computer Science, Artificial Intelligence, or a related field from a top-tier university.
  • Publications in leading conferences/journals such as NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, or ICCV/ECCV.
  • Outstanding performance in academic competitions (e.g., ACM/ICPC, NOI/IOI, CMO/IMO, CPhO/IPhO).
  • Experience contributing to prominent open-source large model projects or winning awards in related competitions.
If you are interested in these job openings, please submit your resume and cover letter to shandahr@shanda.com. We also welcome assistance from recruitment agencies.