Yi Yang (杨亿)

Research Scientist
Google DeepMind
Six Pancras Square, Kings Cross, London N1C 4AG
Email: yiya@google.com

About Me

I am currently a research scientist at Google DeepMind. My research interests are self-supervised geometric visual representation learning, i.e. motion, depth and segmentation for objects.

Previously, I was a research scientist at Baidu Research from 2013 to 2018. I obtained my Ph.D. degree in Computer Science at UC Irvine in 2013. I had summer internships at Google and Microsoft Research. I obtained my master degree in Industrial Engineering at Hong Kong University of Science and Technology in 2008, and bachelor degree in Automation at Tsinghua University in 2006.

Publications

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement
Carl Doersch, Yi Yang, Mel Vecerik, Dilara Gokay, Ankush Gupta, Yusuf Aytar, Joao Carreira, Andrew Zisserman
ICCV 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models
Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira
NeurIPS 2023

TAP-Vid: A Benchmark for Tracking Any Point in a Video
Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang
NeurIPS 2022

Large-Scale Multilingual Audio Video Dubbing
Yi Yang, Brendan Shillingford, Yannis Assael, Miaosen Wang, Wendi Liu, Yutian Chen, Yu Zhang, Eren Sezener, Luis C. Cobo, Misha Denil, Yusuf Aytar, Nando de Freitas
Arxiv 2020

A Refined 3D Pose Dataset of Fine-Grained Object Categories
Yaming Wang, Xiao Tan, Yi Yang, Ziyu Li, Xiao Liu, Feng Zhou, Larry S. Davis
ICCV 2019 Workshop

Recognizing Part Attributes with Insufficient Data
Xiangyun Zhao, Yi Yang, Feng Zhou, Xiao Tan, Yuchen Yuan, Yingze Bao, Ying Wu
ICCV 2019

UnOS: Unified Unsupervised Optical-flow and Stereo-depth Estimation by Watching Videos
Yang Wang, Peng Wang, Zhenheng Yang, Chenxu Luo, Yi Yang, Wei Xu
CVPR 2019

3D Pose Estimation for Fine-Grained Object Categories
Yaming Wang, Xiao Tan, Yi Yang, Xiao Liu, Errui Ding, Feng Zhou, Larry S. Davis
ECCV 2018 Workshop

Occlusion Aware Unsupervised Learning of Optical Flow
Yang Wang, Yi Yang, Zhenheng Yang, Liang Zhao, Peng Wang, Wei Xu
CVPR 2018

Feedback Convolutional Neural Network for Visual Localization and Segmentation
Chunshui Cao, Xianming Liu, Yi Yang, et al.
PAMI 2018

Depth-based Hand Pose Estimation: Data, Methods, and Challenges
James Supancic, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
IJCV 2018

Dynamic Computational Time for Visual Attention
Zhichao Li, Yi Yang, Xiao Liu, Feng Zhou, Shilei Wen, Wei Xu
ICCV 2017 Workshop

Attention to Scale: Scale-aware Semantic Image Segmentation
Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, and Alan L. Yuille
CVPR 2016

CNN-RNN: A Unified Framework for Multi-label Image Classification
Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu
CVPR 2016

Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks
Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu
CVPR 2016

DenseBox: Unifying Landmark Localization with End to End Object Detection
Lichao Huang, Yi Yang, Yafeng Deng, Yinan Yu
Arxiv 2015

Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images
Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang, Alan Yuille
ICCV 2015

Look and Think Twice: Capturing Top-Down Visual Attention with Feedback Convolutional Neural Networks
Chunshui Cao, Xianming Liu, Yi Yang, et al.
ICCV 2015

Depth-based Hand Pose Estimation: Data, Methods, and Challenges
James Supancic, Gregory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
ICCV 2015

Learning from Massive Noisy Labeled Data for Image Classification
Tong Xiao, Tian Xia, Yi Yang, Chang Huang, Xiaogang Wang
CVPR 2015

Deep Captioning with Multimodal Recurrent Neural Networks
Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan Yuille
ICLR 2015

Explain Images with Multimodal Recurrent Neural Networks
Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Alan Yuille
NIPS 2014 Workshop

AutoCaption: Automatic Caption Generation for Personal Photos
Krishnan Ramnath, Simon Baker, et al.
WACV 2014

Parsing Occluded People
Golnaz Ghiasi, Yi Yang, Deva Ramanan, Charless Fowlkes
CVPR 2014

Articulated Human Detection with Flexible Mixtures of Parts
Yi Yang, Deva Ramanan
PAMI 2013

Recognizing Proxemics in Personal Photos
Yi Yang, Simon Baker, Anitha Kannan, Deva Ramanan
CVPR 2012

Layered Object Models for Image Segmentation
Yi Yang, Sam Hallman, Deva Ramanan, Charless Fowlkes
PAMI 2012

Articulated Pose Estimation with Flexible Mixtures of Parts
Yi Yang, Deva Ramanan
CVPR 2011

Sequential Convex Approximations to Joint Chance Constrained Programs
L. Jeff Hong, Yi Yang, Liwei Zhang
OR 2011

Layered Object Detection for Multi-Class Segmentation
Yi Yang, Sam Hallman, Deva Ramanan, Charless Fowlkes
CVPR 2010