Research Scientist
Google DeepMind
Six Pancras Square, Kings Cross, London N1C 4AG
Email: yiya@google.com
I am currently a research scientist at Google DeepMind. My research interests are self-supervised geometric visual representation learning, i.e. motion, depth and segmentation for objects.
Previously, I was a research scientist at Baidu Research from 2013 to 2018. I obtained my Ph.D. degree in Computer Science at UC Irvine in 2013. I had summer internships at Google and Microsoft Research. I obtained my master degree in Industrial Engineering at Hong Kong University of Science and Technology in 2008, and bachelor degree in Automation at Tsinghua University in 2006.
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement
Carl Doersch, Yi Yang, Mel Vecerik, Dilara Gokay, Ankush Gupta, Yusuf Aytar, Joao Carreira, Andrew Zisserman
ICCV 2023
Paper • Project Page • Code
Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
NeurIPS 2023
TAP-Vid: A Benchmark for Tracking Any Point in a Video
Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang
NeurIPS 2022
Large-Scale Multilingual Audio Video Dubbing
Yi Yang, Brendan Shillingford, Yannis Assael, Miaosen Wang, Wendi Liu, Yutian Chen, Yu Zhang, Eren Sezener, Luis C. Cobo, Misha Denil, Yusuf Aytar, Nando de Freitas
Arxiv 2020
A Refined 3D Pose Dataset of Fine-Grained Object Categories
Yaming Wang, Xiao Tan, Yi Yang, Ziyu Li, Xiao Liu, Feng Zhou, Larry S. Davis
ICCV 2019 Workshop
Recognizing Part Attributes with Insufficient Data
Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu
ICCV 2019
UnOS: Unified Unsupervised Optical-flow and Stereo-depth Estimation by Watching Videos
Yang Wang, Peng Wang, Zhenheng Yang, Chenxu Luo, Yi Yang, Wei Xu
CVPR 2019
3D Pose Estimation for Fine-Grained Object Categories
Yaming Wang, Xiao Tan, Yi Yang, Xiao Liu, Errui Ding, Feng Zhou, Larry S. Davis
ECCV 2018 Workshop
Occlusion Aware Unsupervised Learning of Optical Flow
Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu
CVPR 2018
Feedback Convolutional Neural Network for Visual Localization and Segmentation
Chunshui Cao, Xianming Liu, Yi Yang, et al.
PAMI 2018
Depth-based Hand Pose Estimation: Data, Methods, and Challenges
James Supancic, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
IJCV 2018
Dynamic Computational Time for Visual Attention
Zhichao Li, Yi Yang, Xiao Liu, Feng Zhou, Shilei Wen, Wei Xu
ICCV 2017 Workshop
Attention to Scale: Scale-aware Semantic Image Segmentation
Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, and Alan L. Yuille
CVPR 2016
Paper • Project Page • Slides
CNN-RNN: A Unified Framework for Multi-label Image Classification
Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu
CVPR 2016
Paper • Slides • Video Talk
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu
CVPR 2016
Paper • Slides • Poster • Video Talk
DenseBox: Unifying Landmark Localization with End to End Object Detection
Lichao Huang, Yi Yang, Yafeng Deng, Yinan Yu
Arxiv 2015
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille
ICCV 2015
Paper • Project Page • Dataset
Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks
Chunshui Cao, Xianming Liu, Yi Yang, et al.
ICCV 2015
Depth-based Hand Pose Estimation: Data, Methods, and Challenges
James Supancic, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
ICCV 2015
Learning from Massive Noisy Labeled Data for Image Classification
Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang
CVPR 2015
Deep Captioning with Multimodal Recurrent Neural Networks
Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan Yuille
ICLR 2015
Paper • Project Page • Code • Slides • Video Talk
Explain Images with Multimodal Recurrent Neural Networks
Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan Yuille
NIPS 2014 Workshop
Paper • Project Page • Code • Slides • Video Talk
AutoCaption: Automatic Caption Generation for Personal Photos
Krishnan Ramnath, Simon Baker, et al.
WACV 2014
Parsing Occluded People
Golnaz Ghiasi, Yi Yang, Deva Ramanan, Charless Fowlkes
CVPR 2014
Articulated Human Detection with Flexible Mixtures of Parts
Yi Yang, Deva Ramanan
PAMI 2013
Paper • Project Page • Code • Slides • Poster • Video Talk • News
Recognizing Proxemics in Personal Photos
Yi Yang, Simon Baker, Anitha Kannan, Deva Ramanan
CVPR 2012
Paper • Project Page • Code • Slides • Poster
Layered Object Models for Image Segmentation
Yi Yang, Sam Hallman, Deva Ramanan, Charless Fowlkes
PAMI 2012
Paper • Project Page • Code • Slides • Poster • Video Talk
Articulated Pose Estimation with Flexible Mixtures of Parts
Yi Yang, Deva Ramanan
CVPR 2011
Paper • Project Page • Code • Slides • Poster • Video Talk • News
Sequential Convex Approximations to Joint Chance Constrained Programs
L. Jeff Hong, Yi Yang, Liwei Zhang
OR 2011
Layered Object Detection for Multi-Class Segmentation
Yi Yang, Sam Hallman, Deva Ramanan, Charless Fowlkes
CVPR 2010
Paper • Project Page • Code • Slides • Poster • Video Talk