职位描述

About the Team

We are a visual R&D team focused on human aesthetic modelling and advanced 3D vision research. Our work spans human image understanding, 3D reconstruction, and intelligent aesthetic enhancement. By combining academic research methodologies with real-world product deployment, we continuously explore new frontiers in AI-driven image/video generation, editing, spatiotemporal consistency, and 3D structural understanding.

 

Location & Duration

Sydney Central; 6-12 months

 

Role Overview

You will participate in the research and development of human aesthetic enhancement and spatiotemporally consistent editing technologies at Meitu. You will work directly with real, product-scale datasets and state-of-the-art algorithms.

Depending on the internship track, your work may include (but is not limited to):

·      Fine-grained and controllable image / video aesthetic enhancement

·      2D / 3D human tracking and 3D reconstruction

·      Regression, reconstruction, and structural constraints of digital human models (e.g., SMPL)

 

This role offers the opportunity to produce both production-ready technical outcomes and high-quality academic research results. It is a research-and-engineering-oriented internship, ideal for candidates with strong interest and capability in 3D vision fundamentals, human visual quality enhancement, video generation models, and 3D human modelling.


 

Key Responsibilities

·      Research and implement algorithms related to depth estimation, multi-view generation, and 2D / 3D tracking with spatiotemporal reconstruction

·      Follow state-of-the-art 3D vision papers and open-source projects; reproduce experiments and adapt methods to practical applications

·      Collaborate with data teams to refine the 3D aesthetic development pipeline, improve data collection and quality evaluation, and establish foundations for high-quality scaling

·      Explore the integration of human structure priors (Skeleton / SMPL / Mesh) with multi-modal cues such as depth, normals, and optical flow in reconstruction and generative models

·      Assist in building data processing, evaluation, and visualization tools (e.g., immersive video aesthetic editing) to support rapid iteration

·      Enable high-quality projection of 3D features into 2D visual outputs, with the goal of producing A-level or above academic publications

 

Qualifications

Experience

·      PhD candidate in Computer Science, Artificial Intelligence, Intelligent Arts, or related fields

·      Proficient in Python, with experience using common libraries (e.g., NumPy, Transformers, Diffusers, Open3D), and strong hands-on experience with deep learning frameworks such as PyTorch

·      Strong, sustained interest in image/video aesthetics or 3D art; resilient mindset and openness to change

 

Skills & Abilities

·      Solid engineering implementation skills and rigorous experimental practices; able to independently complete module-level tasks

·      Proficient with AI-assisted programming tools; maintains clean, well-structured code

·      Strong ability to read, analyse, summarize, and reproduce academic papers

·      Clear logical thinking, strong problem-decomposition skills, and curiosity about underlying principles

 

Motivation & Values

·      Long-term interest in AI + imaging / 3D / aesthetics

·      Proactive, self-driven, and willing to explore new approaches and challenges

·      Alignment with a product culture centred on user experience and technical value

 

Bonus Points

·      Prior experience with SMPL, human pose estimation, 3D reconstruction, or tracking

·      Familiarity with video understanding, diffusion models, autoregressive models, or flow-matching methods

·      Research projects, competitions, or open-source contributions

·      Strong mathematical or geometric intuition; excellent aesthetic sensibility and passion for high-quality design



About Meitu

Founded in 2008, Meitu is an AI-driven technology company with “beauty” at its core. Guided by its mission to “Unite Art and Technology,” Meitu is committed to building world-class imaging products that make the creation of images, videos, and visual designs simple and efficient.

Meitu was listed on the Main Board of the Hong Kong Stock Exchange in 2016 (HKEX: 1357). As of June 30, 2025, Meitu serves over 280 million global monthly active users, including 98 million users outside Mainland China.

公司信息
美图公司成立于2008年,是一家以美为内核,以人工智能为驱动的科技公司。秉承着“让艺术与科技美好交汇”的使命,美图公司致力于打造优秀的影像与设计产品,让图像、视频、设计的制作变得更简单,并通过美业解决方案助力产业数字化升级。美图公司于2016年12月在香港联合交易所主板挂牌上市,股票代码:1357.HK。
部门信息
美图影像研究院(MT Lab)专注于计算机视觉、深度学习与计算机图形学等前沿算法的研究与应用。我们为美图产品提供核心技术支持,团队汇聚顶尖人才,致力于推动影像技术的突破,让科技与艺术美好交汇。