About Me
I’m a full-time senior algorithm engineer and researcher. My current work focuses on developing and accelerating on-device LLMs and VLMs. I am also keen on building interesting applications with AIGC. Before joining the industry, I worked on computer vision research during my PhD (I was selected as one of the world’s top 2% research scientists) and on mobile game development during an internship. My lifelong dream is to witness and take part in advancing more intelligent gaming and companionship experiences.
I received my Ph.D. degree from the School of Computer Science and Engineering at Nanyang Technological University in 2021, dual master's degrees in Computer Science from the joint programme of Shanghai Jiao Tong University and Waseda University in 2018, and a B.E. degree in Computer Science from the IEEE Pilot Class at Shanghai Jiao Tong University (SJTU) in 2015.
News
-
ECCV
Kaihua Tang, Mingyuan Tao, Jiaxin Qi, Zhenguang Liu, Hanwang Zhang
European Conference on Computer Vision (ECCV), 2022.
-
ECCV
Xuanyu Yi, Kaihua Tang, Xian-Sheng Hua, Joo-Hwee Lim, Hanwang Zhang
European Conference on Computer Vision (ECCV), 2022.
-
ECCV
Jiaxin Qi, Kaihua Tang, Qianru Sun, Xian-Sheng Hua, Hanwang Zhang
European Conference on Computer Vision (ECCV), 2022.
-
CVPR
Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
-
CVPR
Yulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
-
NeurIPS
Kaihua Tang, Jianqiang Huang, Hanwang Zhang
Conference on Neural Information Processing Systems (NeurIPS), 2020.
-
CVPR
Kaihua Tang, Yulei Niu, Jianqiang Huang, Jiaxin Shi, Hanwang Zhang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
PDF
Code
BibTex
Oral Presentation • 836 GitHub Stars • 257 Citations
-
CVPR
Xinting Hu, Yi Jiang, Kaihua Tang, Hanwang Zhang, Chunyan Miao, Jingyuan Chen
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
-
CVPR
Kaihua Tang, Hanwang Zhang, Baoyuan Wu, Wenhan Luo, Wei Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
PDF
Code
BibTex
Best Paper Finalists [45/5160] • Oral Presentation • 280 Citations
-
CVPR
Xu Yang, Kaihua Tang, Hanwang Zhang, Jianfei Cai
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Awards
- Outstanding Reviewer (Top 10%), ICML, 2022
- 2021 Alibaba Outstanding Interns in Academic Cooperation, Alibaba Group, 2021
- 2021 & 2019 PREMIA Best Student Paper Award, 2nd Place, PREMIA, 2021 & 2019
- CVPR 2019 Best Paper Finalists, 2019
- Honorable Judge Award, The 5th Cloud Programming World Cup, FORUM8 Tokyo, 2017
- Waseda Partial Tuition-Waiver Scholarship for International Students (10/300), Waseda University, 2015
- IPS Special Scholarship for International Students, Waseda University, 2014
- Monbukagakusho Honors Scholarship for International Students, JASSO, 2014
- Emerging Talent Award, The 1st Cloud Programming World Cup, FORUM8 Tokyo, 2013
Academic Services
Organizing Committees
Talks and Blogs
Invited Talk: To TechBeat, Hosted by TechBeat AI Community, Online Sharing, 2022.10
Invited Talk: To Alibaba Group, Hosted by Tianchi Team from Alibaba Cloud, Hangzhou, China, 2020.11
Blogs: Sharing Research Experiences at Zhihu, Language: Chinese
Paper Review
CVPR, ECCV, ICCV, WACV, NeurIPS, ICLR, ICML, AAAI, TPAMI
Projects
-
This project provides a new codebase for Scene Graph Generation (SGG). It is built on top of the well-known maskrcnn-benchmark and includes all the existing metrics for benchmarking SGG: R@K, mR@K, ngR@K, and zR@K.
-
This project provides a strong single-stage baseline for Long-Tailed Classification, Detection, and Instance Segmentation (on the LVIS dataset). It can be easily generalized to other tasks with unbalanced datasets.
-
An open-source visual question answering (VQA) codebase built on top of bottom-up-attention-vqa. It integrates several popular VQA papers published in 2018.
-
Indie Game Development
Out of interest, I independently developed several mobile games for iPhone. They were downloaded over 10k times on the Apple App Store within half a year.
Experience
Alibaba, DAMO Academy, Research Intern (2019.7 - 2021.11)
Major topic: Robust Machine Learning
Mentors: Mingyuan Tao, Chang Zhou, Jianqiang Huang
Tencent, AI Lab, Research Intern (2018.3 - 2018.6)
Major topic: Scene Graph Generation
Mentors: Wenhan Luo, Baoyuan Wu, Wei Liu
Mihoyo, Software Engineer Intern (2017.4 - 2017.12)
Mobile Game Development Using Unity 3D.
Toshiba, Research & Development Intern (2015.8 - 2015.9)
Major Project: Scenery Image Stitching and Inpainting.
Mentor: Kaoru Matsuoka
Speech Lab Intern, SJTU (2014.3 - 2014.9)
Major Project: Led a team to develop an Android app that unlocks the screen by voice recognition.
Mentor: Kai Yu
28th ACM-MM Volunteer, Seattle, USA (2020.10)
Received a Volunteer Appreciation Certificate at ACM Multimedia 2020 for helping organize the online presentations.
YAPM Summer Volunteer, Yunnan Province, China (2014.7 - 2014.8)
The Youth Ambassador Program for Minorities (TECC Organization) helps young people from ethnic minorities in remote areas of China inherit and protect their cultures.