Computer Vision/Machine Learning Engineer (Photography Intelligence)
Lex
Software Engineering, Data Science
Beijing, China
Posted on Mar 25, 2026
Summary
If you are the kind of person who is passionate about pursuing excellence, embraces challenges, enjoys working with others, and likes learning new things along the way, Apple is the right place for you.
Description
The photography intelligence algorithm engineer will work in China Vision Lab as part of the Video Engineering org, which develops on-device computer vision and machine perception technologies across Apple’s products. The role is responsible for designing and implementing machine learning systems that understand the scene and the user’s intent before and during photo or video capture. It bridges visual perception, semantic understanding, and decision intelligence, enabling smart photography and videography experiences. We balance research and product to deliver the highest-quality, state-of-the-art experiences, innovating through the full stack and partnering with cross-functional teams to influence what brings our vision to life and into customers’ hands.
Responsibilities
- Build state-of-the-art (SOTA) capture intelligence models in areas such as visual reasoning, computational photography, camera control, vision-language-action (VLA) models, and multimodal large language models (MLLM)
- Optimize models for real-time on-device video processing
- Collaborate with hardware teams to integrate ML models into Apple devices
- File patents and publish papers in related areas
Minimum Qualifications
- M.S. or PhD in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on computer vision and/or machine learning
- Extensive experience in video machine learning covering at least one of the following topics: computational photography, visual reasoning algorithms, VLM or MLLM, camera control
- Proven prototyping skills and proficiency in coding (C, C++, Python)
- Excellent written and verbal communication skills, comfort presenting research to large audiences, and the ability to work hands-on in multi-functional teams
Preferred Qualifications
- Publications in top-tier conferences (e.g. NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, SIGGRAPH)
- Solid understanding of, and industry experience in, computational photography, visual perception or reasoning algorithms, MLLM, camera control pipelines, etc.
- Familiarity with the challenges of developing algorithms that run efficiently on resource-constrained platforms
- Team-oriented, results-oriented, and self-motivated