Xuran Pan

Ph.D. Student

Tsinghua University

Biography

I am a fifth year Ph.D. student at Tsinghua University, advised by Prof. Gao Huang and Prof. Cheng Wu. My research interests lie in the model architecuture design, graph neural network and 3D computer vision.
I’m on job market now! If you are interested in me, contact me via Email.

Download my resumé (CN/EN).

Interests

Machine Learning
Computer Vision

Education

Ph.D. in Automation, 2018-Present

Tsinghua University
BSc in Automation, 2014-2018

Tsinghua University

Recent News

2023-03 One paper accepted by CVPR 2023 !

Slide-Transformer： Hierarchical Vision Transformer with Local Self-Attention Mar 2023

2023-01 One paper accepted by ICLR 2023 !

Budgeted Training for Vision Transformer Jan 2023

2022-09 Invited talk on ActiveNeRF at Zhiyi Technology (智一科技) !

Ai New Youth Lectures on Neural Radiance Field Sep 2022

2022-09 One paper accepted by NeurIPS 2022 !

Contrastive Language-Image Pre-Training with Knowledge Graphs Sep 2022

2022-07 One paper accepted by ECCV 2022 !

ActiveNeRF Learning where to See with Uncertainty Estimation Jul 2022

2022-06 One paper annominated in CVPR2022 best paper finalist !

Vision Transformer with Deformable Attention Jun 2022

2022-03 Two papers accepted by CVPR 2022 !

On the Integration of Self-Attention and Convolution / Vision Transformer with Deformable Attention Mar 2022

Selected Publications

Zhuofan Xia, Xuran Pan, Xuan Jin, Yuan He, Shiji Song, Gao Huang

January, 2023 In International Conference on Learning Representation (ICLR) 2023

Budgeted Training for Vision Transformer

In this paper, we address the high training cost problem of Vision Transformers by proposing a framework that enables the training process under any training budget from the perspective of model structure, while achieving competitive model performances.

Xuran Pan, Tianzhu Ye, Dongchen Han, Shiji Song, Gao Huang

September, 2022 In Neural Information Processing Systems (NeurIPS) 2022

Contrastive Language-Image Pre-Training with Knowledge Graphs

In this paper, we propose a knowledge-based pre-training framework, dubbed Knowledge-CLIP, that injects semantic information into the widely used CLIP model.

Xuran Pan, Zihang Lai, Shiji Song, Gao Huang

July, 2022 In European Conference on Computer Vision (ECCV) 2022

ActiveNeRF: Learning where to See with Uncertainty Estimation

We present a novel learning framework, ActiveNeRF, aiming to model a 3D scene with a constrained input budget. We first incorporate uncertainty estimation into a NeRF model, which ensures robustness under few observations and provides an interpretation of how NeRF understands the scene. On this basis, we propose to supplement the existing training set with newly captured samples based on an active learning scheme. By evaluating the reduction of uncertainty given new inputs, we select the samples that bring the most information gain. In this way, the quality of novel view synthesis can be improved with minimal additional resources.

Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang

March, 2022 In Computer Vision and Pattern Recognition (CVPR) 2022

On the Integration of Self-Attention and Convolution

In this paper, we show that there exists a strong underlying relation between them, in the sense that the bulk of computations of these two paradigms are in fact done with the same operation. This observation naturally leads to an elegant integration of these two seemingly distinct paradigms, i.e., a mixed model that enjoys the benefit of both self-Attention and Convolution (ACmix), while having minimum computational overhead compared to the pure convolution or self-attention counterpart.

Zhuofan Xia, Xuran Pan, Shiji Song, Li Erran Li, Gao Huang

March, 2022 In Computer Vision and Pattern Recognition (CVPR) 2022

Vision Transformer with Deformable Attention

In this paper, we present Deformable Attention Transformer, a general backbone model with deformable attention for both image classification and dense prediction tasks.

Xuran Pan, Zhuofan Xia, Shiji Song, Li Erran Li, Gao Huang

May, 2021 In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

3D Object Detection with Pointformer

In this paper, we propose Pointformer, a Transformer backbone designed for 3D point clouds to learn features effectively.

Yulin Wang, Xuran Pan, Shiji Song, Hong Zhang, Cheng Wu, Gao Huang

August, 2019 In Neural Information Processing Systems (NeurIPS) 2019

Implicit Semantic Data Augmentation for Deep Networks

In this paper, we propose a novel implicit semantic data augmentation (ISDA) approach to complement traditional augmentation techniques like flipping, translation or rotation.

Publications

Quickly discover relevant content by filtering publications.

Xuran Pan, Tianzhu Ye, Zhuofan Xia, Shiji Song, Gao Huang (2023). Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention. In Computer Vision and Pattern Recognition (CVPR) 2023.

Zhuofan Xia, Xuran Pan, Xuan Jin, Yuan He, Shiji Song, Gao Huang (2023). Budgeted Training for Vision Transformer. In International Conference on Learning Representation (ICLR) 2023.

PDF Cite

Xuran Pan, Tianzhu Ye, Dongchen Han, Shiji Song, Gao Huang (2022). Contrastive Language-Image Pre-Training with Knowledge Graphs. In Neural Information Processing Systems (NeurIPS) 2022.

PDF Cite Slides

Xuran Pan, Zihang Lai, Shiji Song, Gao Huang (2022). ActiveNeRF: Learning where to See with Uncertainty Estimation. In European Conference on Computer Vision (ECCV) 2022.

PDF Cite Code Talk

Xuran Pan, Chunjiang Ge, Rui Lu, Shiji Song, Guanfu Chen, Zeyi Huang, Gao Huang (2022). On the Integration of Self-Attention and Convolution. In Computer Vision and Pattern Recognition (CVPR) 2022.

PDF Cite Code Talk

Zhuofan Xia, Xuran Pan, Shiji Song, Li Erran Li, Gao Huang (2022). Vision Transformer with Deformable Attention. In Computer Vision and Pattern Recognition (CVPR) 2022.

PDF Cite Code Talk

Xuran Pan, Shiji Song, Yiming Chen, Liejun Wang, Gao Huang (2022). PLAM: A Plug-in Module for Flexible Graph Attention Learning. In Neurocomputing 2022.

PDF Cite

Xuran Pan, Zhuofan Xia, Shiji Song, Li Erran Li, Gao Huang (2021). 3D Object Detection with Pointformer. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021.

PDF Cite Code

Yulin Wang, Gao Huang, Shiji Song, Xuran Pan, Yitong Xia, Cheng Wu (2021). Regularizing Deep Networks with Semantic Data Augmentation. In IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI) 2021.

PDF Cite Code Slides

Yulin Wang, Xuran Pan, Shiji Song, Hong Zhang, Cheng Wu, Gao Huang (2019). Implicit Semantic Data Augmentation for Deep Networks. In Neural Information Processing Systems (NeurIPS) 2019.

PDF Cite Code Poster

Activities

Program Committee (PC) member of CICAI 2021

CAAI International Conference on Artificial Intelligence

May 2021 – Jun 2021

Program Committee (PC) member of CICAI 2022

CAAI International Conference on Artificial Intelligence

May 2022 – Jul 2022

Reviewer for CVPR, ICML, NeurIPS, ICLR, ICCV, ECCV, ICIG, IJRA, RiCO, Information Fusion

IEEE Computer Society / Elsevier

Dec 2019 – Present

Contact

pxr18@mails.tsinghua.edu.cn
Room 616, Central Main building, Tsinghua University, Beijing 100084