岗位概述
负责多模态大模型前沿技术研究与工程落地,将多模态能力应用于内容理解、生成等场景,推动模型效果与产品体验持续提升。
本岗位为 2026 年暑期实习,全职实习时长 5 个月以上。实习期间表现优秀者,将有机会获得留用。
职责描述
- 跟踪并探索多模态大模型方向的前沿技术,参与将相关技术应用于多模态内容理解任务
- 参与多模态大模型的结构设计、训练、微调以及下游功能与应用的开发工作
- 协助团队进行技术能力建设,持续提升模型效果
任职资格
- 计算机科学、人工智能等相关专业,硕士或博士在读
- 具备多模态/强化学习/NLP/CV 相关的研究或项目经验
- 良好的数学与编程基础,熟悉 PyTorch 深度学习框架
- 对大模型及新兴技术领域有浓厚兴趣,具备较强的学习能力和研究潜力
- 具备良好的沟通能力和团队协作精神
加分项
- 在 NLP、CV、ML 相关顶级会议或期刊发表论文
▸ Overview
You'll work at the frontier of multimodal large language models, applying cutting-edge capabilities to content understanding and generation tasks to continuously raise the bar on model performance.
This is a full-time summer 2026 internship (5+ months). Strong performers will be considered for a return offer.
▸ Responsibilities
- Track and explore cutting-edge multimodal LLM research; apply findings to multimodal content understanding tasks
- Contribute to model architecture design, pre-training, fine-tuning, and downstream application development
- Support team capability building through ongoing experimentation and model optimization
▸ Qualifications
- Master's or PhD student in CS, AI, or a related field
- Research or project experience in multimodal learning, RL, NLP, or CV
- Solid math and programming foundations; proficiency in PyTorch
- Strong interest in large models and emerging AI; self-motivated with genuine research potential
- Team player with good communication skills
▸ Bonus Points
- Publications at top NLP, CV, or ML conferences or journals