Tsinghua清华大学 Ph.D.博士 UCLA加州大学洛杉矶分校 Berkeley伯克利 Kandao (Start-up)看到科技 (创业) SenseTime商汤科技 IDEAIDEA研究院 CV Researcher视觉研究员 Senior Engineer高级工程师
Python / C++ / GoPython / C++ / Go PyTorchPyTorch 3D Vision3D视觉 Lie Algebra李代数 Bundle Adjustment光束法平差 Optimization优化理论 Vibe Coding氛围编程 Linux / GitLinux / Git Imitation Learning模仿学习 RL强化学习 Badminton羽毛球 Dev Psychology发展心理学 VLA Robot 6D Pose 3D Detection 3DGS Object Reconstruction Hand Reconstruction Keypoint Detection Human Motion Defect Detection Person ReID NDN/Networking VR Streaming
avatar

Shock (Xiaoke) Jiang 蒋小可 (Shock Jiang)

Senior Researcher @ IDEA IDEA研究院 高级研究员

Explore the world as well as our inner heart 探索世界,也探索我们的内心

A gentle black sheep. I hold warmth toward others while remaining inwardly independent. I'm drawn to clear facts, vivid details, well-reasoned arguments, measured expression paired with a kind heart. I hold no claim to absolute truth, and have little patience for context-free moralizing.

I believe cognition is the act of imposing order — structuring data, layering knowledge, and finding the hierarchy within chaos.
一只羊,保持对他人的尊重和善意,内心独立有态度。喜欢清晰的事实,生动的细节,有条理的逻辑,妥善的表达和善良的心灵;知自己没有掌握真理,也厌烦被脱离场景的真理训诫。

我喜欢秩序,知识要有层次,数据需要结构化,认知的过程就是不断把数据和知识进行秩序化的过程。

33
Papers
10+
First Author
10+
Corresponding
2026-03 SpatialPoint released on arXiv, covered by QbitAI (量子位)! SpatialPoint 在 arXiv 发布,被量子位报道!
2025-11 DINO-XGrasp showcased at IDEA Day (demo video). DINO-XGrasp 在 IDEA Day 上亮相(演示视频)。
2025-07 One paper accepted to ICCV 2025: UniG! 一篇论文被ICCV 2025录用:UniG!
2025-06 One paper accepted to IJCAI 2025: SeqPose! 一篇论文被IJCAI 2025录用:SeqPose!
2025-02 Three papers accepted to CVPR 2025: LeanGaussian, HandOS, HumanMM! 三篇论文被CVPR 2025录用:LeanGaussian、HandOS、HumanMM!
2024-09 Geo6D accepted to IEEE Transactions on Multimedia (TMM). Geo6D 被 IEEE Transactions on Multimedia (TMM) 录用。
2023-01 Uni6Dv2 accepted to AISTATS 2023. Uni6Dv2 被 AISTATS 2023 录用。
2022-06 Shared Uni6D at SenseTime CVPR 2022 Paper Session. 商汤 CVPR 2022 论文分享会上分享 Uni6D。
2022-03 Uni6D accepted as Oral presentation at CVPR 2022! Uni6D 被 CVPR 2022 录用为 Oral 报告!

2026

VLARobot Qiming Zhu, Zhirui Fang, Tianming Zhang, Chuanxiu Liu, Xiaoke Jiang*, Lei Zhang, SpatialPoint: Spatial-aware Point Prediction for Embodied Localization, arXiv 2026. [arXiv]

2025

6D Pose3D Detection Shitian Yang, Deyu Li, Xiaoke Jiang*, Lei Zhang, 3DRot: Rediscovering the Missing Primitive for RGB-Based 3D Augmentation, arXiv 2025. [arXiv]

3DGSObject Reconstruction Jiamin Wu, Hongyang Li, Xiaoke Jiang*, Yuan Yao, Lei Zhang, Coca-Splat: Collaborative Optimization for Camera Parameters and 3D Gaussians, arXiv 2025. [arXiv]

3DGSObject Reconstruction Jiamin Wu, Kenkun Liu, Yukai Shi, Xiaoke Jiang*, Yuan Yao, Lei Zhang, UniG: Modelling Unitary 3D Gaussians for View-Consistent 3D Reconstruction, ICCV 2025.TOP [PDF]

3DGSObject Reconstruction Jiamin Wu, Kenkun Liu, Han Gao, Xiaoke Jiang*, Yuan Yao, Lei Zhang, LeanGaussian: Breaking Pixel or Point Cloud Correspondence in Modeling 3D Gaussians, CVPR 2025.TOP [PDF]

Hand ReconstructionKeypoint Detection Xingyu Chen, Zhuheng Song, Xiaoke Jiang, Yaoqing Hu, Junzhi Yu, Lei Zhang, HandOS: 3D Hand Reconstruction in One Stage, CVPR 2025.TOP [PDF]

Human MotionObject Reconstruction Yuhong Zhang, Guanlin Wu, Ling-Hao Chen, Zhuokai Zhao, Jing Lin, Xiaoke Jiang, Jiamin Wu, Zhuoheng Li, Hao Frank Yang, Haoqian Wang, Lei Zhang, HumanMM: Global Human Motion Recovery from Multi-shot Videos, CVPR 2025.TOP [PDF]

6D Pose Yuzhu Ji, Mingshan Sun, Jianyang Shi, Xiaoke Jiang, Yiqun Zhang, Haijun Zhang, SeqPose: An End-to-End Framework to Unify Single-frame and Video-based RGB Category-Level Pose Estimation, IJCAI 2025.TOP [PDF]

6D Pose Jianqiu Chen, Mingshan Sun, Ye Zheng, Tianpeng Bao, Zhenyu He, Donghai Li, Guoqiang Jin, Zhao Rui, Liwei Wu, Xiaoke Jiang, Geo6D: Geometric-Constraints-Guided Direct Object 6D Pose Estimation Network, IEEE Transactions on Multimedia (TMM) 2025. [PDF]

6D PoseRobot Mingshan Sun, Ye Zheng, Tianpeng Bao, Jianqiu Chen, Guoqiang Jin, Liwei Wu, Rui Zhao, Xiaoke Jiang, Uni6Dv2: Noise Elimination for 6D Pose Estimation, AISTATS 2025. [arXiv] [PDF]

2023

Defect Detection Jingtian Guan, Jingjing Fei, Wei Li, Xiaoke Jiang, Liwei Wu, Yakun Liu, Juntong Xi, Defect classification for specular surfaces based on deflectometry and multi-modal fusion network, Optics and Lasers in Engineering 163 (2023). [PDF]

2022

6D PoseRobot Xiaoke Jiang, Donghai Li, Hao Chen, Ye Zheng, Rui Zhao, Liwei Wu, Uni6D: A Unified CNN Framework without Projection Breakdown for 6D Pose Estimation, CVPR 2022 (Oral).ORAL [arXiv] [PDF]

2021

Person ReID3D Detection Xiaoke Jiang, Yu Qiao, Junjie Yan, Qichen Li, Wanrong Zheng, Dapeng Chen, SSN3D: Self-Separated Network to Align Parts for 3D Convolution in Video Person Re-Identification, AAAI 2021.TOP [PDF]

2019

NDN/Networking Shanshan Shi, Jun Li, Haibo Wu, Yongmao Ren, Xiaoke Jiang, ATSRA: An Accelerated Transmission Strategy Based on Request Aggregation in NDN, INFOCOM 2019 (poster).

2018

VR Streaming Hongwei Ma, Xiaoke Jiang, Rui Ma, Zhiyou Ma, Yizhen Cai, Dah Ming Chiu, Smart Streaming of Panoramic Video, ACM SIGCOMM 2018 VR Workshop.TOP

VR StreamingNDN/Networking Yi Zhang, Xiaoke Jiang, Yi Wang, Kai Lei, Cache and delivery of VR video over named data networking, IEEE INFOCOM Workshops 2018.

2017

NDN/Networking Alexander Afanasyev, Xiaoke Jiang, Yingdi Yu, Jiewen Tan, Yumin Xia, Allison Mankin, Lixia Zhang, NDNS: A DNS-Like Name Service for NDN, ICCCN 2017.

2016

NDN/Networking Xiaoke Jiang, Jun Bi, IS: Interest Set to Enhance Flow Transmission in Named-Data Networking, China Communications (IEEE), Vol.13, 2016.

2015

NDN/Networking Xiaoke Jiang, Jun Bi, Guoshun Nan, Zhaogeng Li, A Survey on Information-Centric Networking: Rationales, Designs and Debates, China Communications (IEEE), Vol.12, No.7, 2015.

2014

NDN/Networking Xiaoke Jiang, Jun Bi, nCDN: CDN Enhanced with NDN, IEEE INFOCOM 2014, NOM Workshop.

NDN/Networking Xiaoke Jiang, Jun Bi, You Wang, What Benefits Does NDN Have in Supporting Mobility, IEEE ISCC 2014.

NDN/Networking Xiaoke Jiang, Jun Bi, You Wang, MCBS: Matrix Computation Based Simulator of NDN, Journal of Computers, Vol.9, No.9, 2014.

2013

NDN/Networking Xiaoke Jiang, Jun Bi, Interest Set Mechanism to Improve the Transport of Named Data Networking, ACM SIGCOMM 2013 (poster).TOP

NDN/Networking Hongcheng Tian, Jun Bi, Xiaoke Jiang, An Adaptive Probabilistic Marking Scheme for Fast and Secure Traceback, Networking Science (Springer), Vol.2, No.1-2, 2013.

NDN/Networking Pingping Lin, Jun Bi, Hongyu Hu, Xiaoke Jiang, MSDN: A Mechanism for Scalable Intradomain Control Plane in SDN, Journal of Chinese Computer Systems, Vol.34, No.9, 2013.

2012

NDN/Networking Xiaoke Jiang, Jun Bi, You Wang, Pingping Lin, Zhaogeng Li, A Content Provider Mobility Solution of Named Data Networking, IEEE ICNP 2012.TOP

NDN/Networking Xiaoke Jiang, Jun Bi, You Wang, Pingping Lin, Zhaogeng Li, An Easy Matrix Computation based Simulator of NDN, IEEE ICNDC 2012.

NDN/Networking You Wang, Jun Bi, Xiaoke Jiang, Mobility Support in the Internet Using Identifiers, ACM CFI 2012.

NDN/Networking Zhaogeng Li, Jun Bi, Sen Wang, Xiaoke Jiang, The Compression of Pending Interest Table with Bloom Filter in Content Centric Network, ACM CFI 2012.

2011

NDN/Networking Xiaoke Jiang, Jun Bi, Yangyang Wang, Zhijie He, Wei Zhang, Hongchen Tian, IPv6 Evolution, Stability and Deployment, IEEE ICNP 2011.TOP

NDN/Networking Hongcheng Tian, Jun Bi, Xiaoke Jiang, Dekai Wang, Wei Zhang, Fast and Secure Probabilistic Marking Technology for IP Traceback, Journal of Tsinghua University, Vol.50, No.4, 2011.

NDN/Networking Hongcheng Tian, Jun Bi, Wei Zhang, Xiaoke Jiang, EasyTrace: Easily-Deployable Light-Weight IP Traceback on an AS-Level Overlay Network, IEEE ICNP 2011.TOP

NDN/Networking Pingping Lin, Jun Bi, Hongyu Hu, Tao Feng, Xiaoke Jiang, A Quick Survey on Selected Approaches for Preparing Programmable Networks, ACM AINTEC 2011.

DINO-XGrasp

Open-Set Robotic Grasping powered by DINO-X
VLARobotOpen-Set DetectionPick&Place

Enabling robots to grasp any object accurately using consumer-level hardware (camera, GPU, robot arm), powered by open-set detection/segmentation, real-time tracking, and 3D ReID.

DINO-XGrasp at IDEA Day
Robot's working day
Voice-command grasping
Open-set grasping for bottles

oVP: Optimized Visual Prompt

Supervised Prompt Tuning for Customized Detection
Open-Set Detection

oVP uses Supervised Prompt Tuning (SPT) to generate optimized visual prompts that customize open-set detection for specialized scenarios. Unlike text prompts that rely on category names, SPT learns domain-specific prompt embeddings from as few as 7 labeled images. Only prompt embeddings are updated during training — no model fine-tuning needed. SPT+Grounding DINO 1.6 consistently outperforms YOLOv8 across 12 industry domains (agriculture, construction, manufacturing, retail, etc.), with advantages intensifying as training data decreases.

Education

Aug 2010 - Jun 2016
Tsinghua University — Ph.D., Computer Science & Technology
Supervisor: Prof. Jun Bi
Jul 2014 - Jul 2015
University of California Los Angeles (UCLA) — Joint PhD Program
Supervisor: Prof. Lixia Zhang
Aug 2011 - Jul 2012
Tsinghua-Berkeley Global Technology Entrepreneurship Program
  • Served as tech leader of 2nd Award Team
Sep 2006 - Jun 2010
Tsinghua University — B.S., School of Software
Supervisor: Prof. Jun Bi and Prof. Fei He

Work Experience

2022 - now
IDEA — Senior CV Researcher
Open-set Grasping & VLA: Developing systems to enable robots to grasp any object using consumer-level hardware, powered by open-set detection/segmentation, real-time tracking, and 3D ReID
Open-set 3D Detection: Monocular 3D Detection and RGB-D based 3D Detection
Open-Set Detection & Prompt Tuning: Customize detection targets given labeled dataset and pretrained model
Keypoint Detection: Keypoints detection in the wild
3D Reconstruction & Novel View Synthesis: LeanGaussian (CVPR'25), UniG (ICCV'25), Coca-Splat (arXiv'25)
Inertial Navigation: IMU + GPS + wheel odometry fusion
2018 - 2022
SenseTime — Senior Researcher
Defect Detection: Structured light imaging, defect detection in automobile smart manufacturing
6D Pose Estimation: Uni6D (CVPR'22 Oral), Uni6Dv2 (AISTATS'23)
Person Re-identification: SSN3D (AAAI'21)
AI Automation in Transportation
2016 - 2018
Kandao Technology — Senior Engineer
VR Video Streaming & CDN: Smart Streaming of Panoramic Video (SIGCOMM'18 VR Workshop)

Skills

Mathematics: Linear Algebra / Lie Algebra / Rotation Group / Probability Theory / Taylor Expansion / Optimization Theory
Programming: Languages: Python, GoLang, C++ / Tools: Linux, git, make, gcc, ffmpeg / Libraries: PyTorch, mmengine, PyTorch Lightning, OpenCV, scikit-learn, matplotlib / Vibe Coding: Claude Code, Cursor

Hobbies

Badminton • Hiking • Programming to solve real-world problems • Reading novels • Philosophy

教育经历

2010.08 - 2016.06
清华大学 — 博士,计算机科学与技术系
导师:毕军教授
2014.07 - 2015.07
加州大学洛杉矶分校 (UCLA) — 联合培养博士
2011.08 - 2012.07
清华-伯克利全球技术创业项目
  • 担任二等奖团队技术负责人
2006.09 - 2010.06
清华大学 — 学士,软件学院

工作经历

2022 - 至今
IDEA 研究院 — 高级计算机视觉研究员
开放集抓取与 VLA:开发使机器人使用消费级硬件(相机、GPU、机械臂)精准抓取任意物体的系统
开放集 3D 检测:单目 3D 检测和 RGB-D 3D 检测
开放集检测与提示调优:基于标注数据集和预训练模型自定义检测目标
关键点检测:野外场景下的关键点检测
3D 重建与新视角合成:LeanGaussian (CVPR'25)、UniG (ICCV'25)、Coca-Splat (arXiv'25)
惯性导航:IMU + GPS + 轮式里程计融合
2018 - 2022
商汤科技 — 高级研究员
缺陷检测:结构光成像、汽车智能制造缺陷检测
6D 位姿估计:Uni6D (CVPR'22 Oral)、Uni6Dv2 (AISTATS'23)
行人重识别:SSN3D (AAAI'21)
交通系统 AI 自动化
2016 - 2018
看到科技 — 高级工程师
VR 视频流优化与 CDN:全景视频智能传输 (SIGCOMM'18 VR Workshop)

技能

数学: 线性代数 / 李代数/旋转群 / 概率论 / 泰勒展开 / 优化理论
编程: 语言:Python, GoLang, C++ / 工具:Linux, git, make, gcc, ffmpeg / 库:PyTorch, mmengine, PyTorch Lightning, OpenCV, scikit-learn, matplotlib / Vibe Coding:Claude Code, Cursor

爱好

羽毛球 • 徒步 • 编程解决实际问题 • 小说阅读 • 哲学思考

Visitors
-- Visitors
-- Page Views