Hi, my name is

Zherui Qiu.

I Empower Robots to Perceive, Imagine, and Execute

A passionate and innovative AI researcher. I design agents that perceive the world, imagine outcomes, and execute them with reliable control. I also harness cutting-edge techniques to craft virtual worlds that are visually stunning and immersive.

"To boldly go where no one has gone before." β€” Star Trek

About Me

I am a Ph.D. student in Data Science at the University of Science and Technology of China advised by Prof. Juyong Zhang. My research includes publications on embodied intelligence, reinforcement learning, and Neural Radiance Fields (NeRF). Currently, I am focused on applying these techniques to embodied intelligence, aiming to enhance spatial understanding and interaction within complex environments.

Beyond my academic pursuits, I actively participate in collaborative projects and workshops, seeking to stay at the forefront of technological advancements and foster interdisciplinary research.

Here are a few technologies I've been working with recently:
  • VLA Model
  • World Model
  • Diffusion Policy
  • CUDA C++
  • NeRF/3D-GS
  • ComfyUI

Experience

Research Intern (Full-Time) - Shanghai AI Lab
June 2025 - Present
I am currently working as a research intern at Shanghai AI Lab in Shanghai, China.
On the research side, as a core contributor to InternVLA-A1, I have learned how to design robust VLA architectures, build scalable training pipelines, and run principled evaluations, while sharpening my intuition for data curation and ablation analysis.
On the engineering side, building ROSGUI-AGX, a modern GUI for orchestrating ROS commands and teleoperation-based data collection with live process monitoring, has taught me how to turn collection protocols into operator-friendly workflows, provide real-time feedback and logging, and keep long-running robotics processes stable and reproducible.
Research Intern (Full-Time) - OpenDriveLab
Nov 2024 - May 2025
I worked as a research intern at OpenDriveLab in Shanghai, China. I researched embodied intelligence by integrating advanced algorithms with robotic systems and refining manipulation strategies through hands-on experimentation. I also assisted in comprehensive data collection, analysis, and documentation to ensure accurate reporting and clear presentation of experimental findings.

Education

2020.09 - Present
Ph.D. of Data Science
School of Artificial Intelligence and Data Science, University of Science and Technology of China
GPA: 3.71 out of 4.3

I have authored and co-authored 6 top‑tier papers in embodied AI and computer graphics.

  • πŸ… InternVLA-A1: Unifying Understanding, Generation and Action for Robotic Manipulation. Preprint 2026.
  • πŸ… Deformable NeRF using Recursively Subdivided Tetrahedra. ACM Multimedia 2024.
  • 🀝 InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy. Preprint 2025.
  • 🀝 F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions. Preprint 2025.
  • 🀝 Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method. NeurIPS 2020.
  • 🀝 PaletteGaussian: 3D Photorealistic Color Editing with Gaussian Splatting. ISMAR 2024.
2016.09 - 2020.06
B.Eng. of Electrical Engineering and Automation
School of Electrical Engineering, Southwest Jiaotong University
GPA: 3.88 out of 4.0

Honors & Awards

  • National Scholarship 2016, awarded by the Chinese Ministry of Education
  • National Scholarship 2017, awarded by the Chinese Ministry of Education
  • 1st Prize in Contemporary Undergraduate Mathematical Contest in Modeling
  • Outstading Graduate, awarded by Sichuan Provincial Department of Education
2013.09 - 2016.06
High School
Hangzhou Xuejun High School

Publications

InternVLA-A1
Embodied AI VLA World Model
InternVLA-A1
A Mixture-of-Transformers VLA model that unifies scene understanding, visual foresight, and action execution.
DeformRF
NeRF Physical Simulation Tetrahedral Mesh
DeformRF
A method that integrates the manipulability of tetrahedral meshes with the high-quality rendering capabilities of feature grid representations.
F1
Embodied AI VLA World Model
F1
A novel paradigm by integrating visual foresight generation into the decision-making pipeline.
InternData-A1
Embodied AI Synthetic Data VLA
InternData-A1
A large-scale synthetic dataset and training pipeline showing simulation-only data can match strong real-robot pretraining for VLA models.
Actor-Critic with Generalized Energy Distance
Reinforcement Learning Regularization
Actor-Critic with Generalized Energy Distance
An off-policy regularized reinforcement learning algorithm.
PaletteGaussian
3D Gaussian Splatting Color Editing
PaletteGaussian
A framework that combines a palette with 3D-GS to enable interactive color editing and real-time rendering.
StructuredField
3D Gaussian Splatting Structured Geometry Tetrahedral Mesh
StructuredField
A novel structured representation that unifies radiance field and structured geometry.

Get in Touch

My inbox is always open. Whether you have a question or just want to say hi, I’ll try my best to get back to you!