Xiaobin Hu (胡晓彬)

I am a research scientist at Tencent working closely with Dr. Ying Tai and Dr. Chengjie Wang through ‘腾讯技术大咖’ program. I receive Shanghai Overseas Talents Award (Baiyulan Young Talent Program) in 2023, Before that, I obtained my Ph.D. degree at the School of Computer Science and Engineering, Technische Universität München, Germany, under the joint supervision of Prof. Bjoern Menze and Prof. Kuangyu Shi. During Phd studies, I also worked as a long-term intern in Chinese Academy of Sciences with Prof. Wenqi Ren, and a half-year internship with Prof. Dengping Fan and Hang Dai in IIAI and Mbzuai of United Arab Emirates.

As a first or corresponding author, I have published over 19 papers on top-tier conferences and journals, such as TPAMI, ICCV, ECCV, AAAI, ACM MM, TCSVT, PR, MICCAI, JBHI, EJNM etc. (Total Top/Q1 Journal in Chinese Academy of Sciences ranking: 7; Top conference: 7;)

Email: xbhunanu [at] gmail.com

Avatar
My research interests are primarily around cutting-edge Generative AI research and its applications, with a particular focus on leveraging advanced large-scale vision and language models:

  • ID consistency image/video generation: AnyMaker (arxiv24), VTON-HandFit (arxiv24), DiffuMatting (ECCV24)

  • Image/video perception and understanding: JIF-MMFA (PR24), RLR (ECCV24), Manipvqa (IROS24), HitNet (AAAI23), M-RCNN (Friction22), DIS5K(ECCV22)

  • High-fidelity image/video restoration: MS-SVAN (TCSVT24), AutoGAN-Synthesizer (MICCAI22), MBL(CVPR2021), PyNAS (ICCV 2021)

  • Human-centric image/video editing and generation: TAFB (MM24), RealTalk (arxiv24), Plug-and-Play 3D (TPAMI22), FSR-3D (ECCV2020Spotlight)

  Email  /    Google Scholar  /    Github  / 

News

  • 07/2024 – 1 paper accepted by MM 2024

  • 07/2024 – 2 paper accepted by ECCV 2024

  • 07/2024 – 1 paper accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT).

  • 05/2024 – 1 paper accepted by Pattern Recognition (PR).

Selected Publications



pri3d DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Xiaobin Hu, Xu Peng, Donghao Luo, Xiaozhong Ji, Jinlong Peng, Zhengkai Jiang, Jiangning Zhang, Taisong Jin, Chengjie Wang, Rongrong Ji
European Conference on Computer Vision (ECCV), 2024
paper / video / bibtex / code

Our DiffuMatting shows several potential applications (e.g., matting-data generator, community-friendly art design and controllable generation).

pri3d Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring
Qian Xu*, Xiaobin Hu *, Donghao Luo, Ying Tai, Chengjie Wang, Yuntao Qian, (*equal contribution)
IEEE Transactions on Circuits and Systems for Video Technology, 2024
paper / video / bibtex / code

Video deblurring is a challenging task as the blur is often spatially variant. Existing methods mainly engage in building the spatial-temporal correspondence among the frames

pri3d 3D Priors-Guided Diffusion for Blind Face Restoration
Xiaobin Lu*, Xiaobin Hu *, Jun Luo, zhuben, paulruan, Wenqi Ren, (*equal contribution)
ACM Multimedia (MM), 2024
paper / video / bibtex / code

A customized multi-level feature extraction method is employed to exploit both structural and identity information of 3D facial images, which are then mapped into the noise estimation process.

pri3d Joint-Individual Fusion Structure with Fusion Attention Module for Multi-Modal Skin Cancer Classification
Peng Tang, Xintong Yan, Yang Nan, Xiaobin Hu #, Bjoern H Menze, Sebastian Krammer, Tobias Lasser. (corresponding author: #)
Pattern Recognition, 2024
paper / video / bibtex / code

Thus, this paper introduces a novel fusion method that integrates dermatological images (dermoscopy images or clinical images) with patient metadata for skin cancer classification, focusing on enhancing FS and FM components.

pri3d Automated segmentation of the human supraclavicular fat depot via deep neural network in water-fat separated magnetic resonance images
Yu Zhao, Chunmeng Tang, Bihao Cui, Arun Somasundaram, Johannes Raspe, Xiaobin Hu #, Christina Holzapfel, Daniela Junker, Hans Hauner, Bjoern Menze, Mingming Wu, Dimitrios Karampinos. (corresponding author # )
Quantitative Imaging in Medicine and Surgery, 2023
paper / video / bibtex / code

Human brown adipose tissue (BAT), mostly located in the cervical/supraclavicular region, is a promising target in obesity treatment. Magnetic resonance imaging (MRI) allows for mapping the fat content quantitatively.

pri3d High-resolution Iterative Feedback Network for Camouflaged Object Detection
Xiaobin Hu , Shuo Wang, Xuebin Qin, Hang Dai, Wenqi Ren, Donghao Luo, Ying Tai, Ling Shao
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 23)
paper / video / bibtex / code

To tackle this challenge, we aim to extract the high-resolution texture details to avoid the detail degradation that causes blurred vision in edges and boundaries.

pri3d AutoGAN-Synthesizer: Neural Architecture Search for Cross-Modality MRI Synthesis
Xiaobin Hu , Ruolin Shen, Donghao Luo, Ying Tai, Chengjie Wang, Bjoern Menze
MICCAI 2022
paper / video / bibtex / code

In this study, we present a novel MRI synthesizer, called AutoGAN-Synthesizer, which automatically discovers generative networks for cross-modality MRI synthesis.

pri3d Face Restoration via Plug-and-Play 3D Facial Priors
Xiaobin Hu, Wenqi Ren, Jiaolong Yang, Xiaochun Cao, David Wipf, Bjoern Menze, Xin Tong, Hongbin Zha,
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
paper / video / bibtex / code

Existing face restoration algorithms only employ 2D priors without considering high dimensional information (3D). The 3D morphable facial priors are the main novelty of this work and are completely different from recently related 2D prior works

pri3d Face Super-Resolution Guided by 3D Facial Priors
Xiaobin Hu, Wenqi Ren, John LaMaster, Xiaochun Cao, Xiaoming Li, Zechao Li, Bjoern Menze, Wei Liu,
European Conference on Computer Vision (ECCV), 2020,
(Spotlight Presentation)
paper / video / bibtex / code

In this paper, we propose a novel face super resolution method that explicitly incorporates 3D facial priors which grasp the sharp facial structures. Our work is the first to explore 3D morphable knowledge based on the fusion of parametric descriptions of face attributes (e.g., identity, facial expression, texture, illumination, and face pose)

pri3d Pyramid Architecture Search for Real-Time Image Deblurring
Xiaobin Hu, Wenqi Ren, Kaicheng Yu, Kaihao Zhang, Xiaochun Cao, Wei Liu, Bjoern Menze,
International Conference on Computer Vision (ICCV), 2021, Montreal, Canada
paper / video / bibtex / code

we propose a novel deblurring method, dubbed PyNAS, towards automatically designing hyper-parameters including the scales, patches, and standard cell operators. Our primary contribution is a real-time deblurring algorithm (around 58 fps) for 720p images while achieves state-of-the-art deblurring performance on the GoPro and Video Deblurring datasets.

pri3d SRGAT: Single Image Super-Resolution With Graph Attention Network
Yanyang Yan, Wenqi Ren, Xiaobin Hu, Kun Li, Haifeng Shen, Xiaochun Cao,
IEEE Transactions on Image Processing (TIP), 2021
paper / video / bibtex / code
pri3d Ultra-High-Definition Image Dehazing via Multi-Guided Bilateral Learning
Zhuoran Zheng, Wenqi Ren, Xiaochun Cao, Xiaobin Hu, Tao Wang, Fenglong Song, Xiuyi Jia,
Computer Vision and Pattern Recognition (CVPR), 2021
paper / video / bibtex / code
pri3d Weakly supervised deep learning for determining the prognostic value of 18 F-FDG PET/CT in extranodal natural killer/T cell lymphoma, nasal type
Rui Guo*, Xiaobin Hu *, Haoming Song, Pengpeng Xu, Haoping Xu, Axel Rominger, Xiaozhu Lin, Bjoern Menze, Biao Li, Kuangyu Shi, (*equal contribution)
European Journal of Nuclear Medicine and Molecular Imaging, 2021, Top journal
paper / video / bibtex / code
pri3d Morphological Residual Convolutional Neural Network (MRCNN) for Intelligent Recognition of Wear Particles From Artificial Joints
Xiaobin Hu, Jian Song, Zhenhua Liao, Yuhong Liu, Jian Gao, Bjoern Menze, Weiqiang Liu,
Friction, 2021, Top journal
paper / video / bibtex / code
pri3d Feedback Graph Attention Convolutional Network for MR Images Enhancement by Exploring Self-Similarity Features
Xiaobin Hu, Yanyang Yan, Wenqi Ren, Hongwei Li, Amirhossein Bayat, Yu Zhao, Bjoern Menze,
Medical Imaging with Deep Learning (MIDL), 2021
paper / video / bibtex / code
pri3d Coarse-to-Fine Adversarial Networks and Zone-Based Uncertainty Analysis for NK/T-Cell Lymphoma Segmentation in CT/PET Images
Xiaobin Hu , Rui Guo, Jieneng Chen, Hongwei Li, Diana Waldmannstetter, Yu Zhao, Biao Li, Kuangyu Shi, Bjoern Menze,
Journal of Biomedical and Health Informatics, 2020, Top journal
paper / video / bibtex / code
pri3d Spatial-Frequency Non-local Convolutional LSTM Network for pRCC Classification
Yu Zhao, et al., Xiaobin Hu # , Bjoern Menze; corresponding author #.
International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2019)
paper / video / bibtex / code
pri3d Toward a Brain-Inspired System: Deep Recurrent Reinforcement Learning for a Simulated Self-Driving Agent
Jieneng Chen, Jingye Chen, Ruiming Zhang, Xiaobin Hu # ; corresponding author #.
Frontiers in neurorobotics, 2019
paper / video / bibtex / code
pri3d Hierarchical Multi-class Segmentation of Glioma Images Using Networks with Multi-level Activation Function
Xiaobin Hu, Hongwei Li, Yu Zhao, Chao Dong, Bjoern H. Menze, and Marie Piraud
MICCAI BrainLes 2018
paper / video / bibtex / code

For many segmentation tasks, especially for the biomedical image, the topological prior is vital information which is useful to exploit.

pri3d The performance prediction and optimization of the fiber-reinforced composite structure with uncertain parameters
Xiaobin Hu, XY Cui, ZM Liang, GY Li
Composite Structures 2017
paper / video / bibtex / code

The fiber-reinforced composites display the random fiber orientations and uncertain material parameters because of the manufacturing error and scattering feature.

pri3d Stochastic analysis using the generalized perturbation stable node-based smoothed finite element method
Xiaobin Hu, XY Cui, H Feng, GY Li
Engineering Analysis with Boundary Elements 2016
paper / video / bibtex / code

The traditional stochastic finite element method based on the finite element method fails to give the fine solution in precise determination of reliable problems when the computer power consumption is limited.

Awards / Honors Challenge

  • China National Scholarship
  • Excellent Graduate of Hunan Province
  • Excellent Master Thesis of Hunan Province
  • The Winner of the National Postgraduate Mathematical Modeling Competition
  • MICCAI Multimodal Brain Tumor Segmentation Challenge 2018
  • LiTS - Liver Tumor Segmentation Challenge 2017
  • Quantification of Uncertainties in Biomedical Image Quantification Challenge 2020