About

Hi! I’m Leo Sin (施延庆).

I am a visiting student at Westlake University, Intelligent Spatial Medicine Lab, supervised by Prof. Haoyang Li. I received my M.Sc. in Computer Engineering from The University of Hong Kong (supervised by Prof. Grantham Pang) and my B.S. in Electrical Engineering from Soochow University (supervised by Prof. Lu Bo).

My research focuses on 3D perception for surgical and biomedical scenes, spatial omics, and multimodal AI. I am particularly interested in applying generative AI and 3D reconstruction (NeRF/3DGS) to medical imaging and spatial transcriptomics.

Research Interests: Spatial Omics, 3D Reconstruction (NeRF/3DGS), Surgical Scene Analysis, Multimodal Medical AI, Embodied Intelligence, LLM Agent Systems

News

  • 2026.02 Joined Westlake University as visiting student.
  • 2026.01 Built Maze Mask at Global Game Jam 2026 — mask.colanns.me
  • 2025.12 Started research internship at Tongji University CDI on robot questioning strategies.
  • 2025.10 Working on autonomous driving digital twin with 3DGS + CARLA/Autoware.
  • 2025.06 Joined Tab Next (HK) as algorithm engineer intern, developing medical AI chatbot.

Selected Projects

Spatial Omics 3D Atlas

Spatial Omics — 3D Cell Atlas Reconstruction

Westlake University · 2025.02 - present · Visiting Student

Core research on 3D virtual slice interpolation for spatial transcriptomics. Synthesize virtual intermediate tissue slices between real 2D spatial transcriptomics sections to reconstruct 3D cell atlases. Working with Xenium sequencing data, H&E staining image to spatial omics mapping.

Spatial Omics 3D Reconstruction Optimal Transport AnnData
Medical Chatbot Architecture

Medical RAG Chatbot — AI Agent for Imaging Center

Tab Next (HK) · 2025.06 - 2025.09 · Algorithm Engineer Intern

Built ReAct agent-based medical chatbot for TST Imaging Center. Designed 7-tool architecture (business query, appointment, form analysis), CoT + FAISS retrieval, LoRA fine-tuning (SFT+DPO), vLLM acceleration (+70% performance), and Qwen2.5VL multimodal WhatsApp processing. CSAT on par with human agents, saving 20% labor.

LLM Agent RAG LoRA vLLM Qwen WhatsApp
NeRF Surgical Scene

AskThenRearrange — Robot Questioning Strategy via KCQ Encoding

Tongji University CDI · 2025.12 - 2026.03 · Research Intern · Supervisor: Prof. Guo Weiwei

Designed robot questioning strategy planning based on KCQ encoding from real human dialogues. Conducted 24-group 1v1 user experiments comparing result-oriented vs. process-oriented tasks. Found process-oriented tasks better suit user-preference strategies. Deployed on Unitree humanoid robot.

Paper: Task Matters: Investigating Human Questioning Behavior in Different Household Service for Learning by Asking Robots (in progress)

GitHub → Figma Slides →
Embodied AI LLM HRI Unitree
3DGS Digital Twin

Autonomous Driving Digital Twin — 3DGS + CARLA/Autoware

2025.10 - present

Built Image → COLMAP → 3D Gaussian Splatting reconstruction pipeline with multi-view preprocessing and camera pose recovery. gsplat-based training with PSNR/SSIM/LPIPS evaluation. CARLA simulation with vehicle, pedestrian, camera, LiDAR, GNSS, IMU deployment. Exploring digital twin for Autoware co-simulation.

3DGS COLMAP CARLA Autoware Digital Twin
Continuum Robot Platform

Continuum Robot for Intestinal Surgery + SLAM Navigation

Soochow University · 2021.09 - 2022.06 · Supervisor: Prof. Lu Bo

Built 6-stepper-motor continuum robot for intestinal tract with FTL precision testing via Raspberry Pi + STM32. A* path planning algorithm burned into STM32. Extended with LiDAR SLAM + ROS for autonomous scene reconstruction and obstacle avoidance. CAD-designed chassis platform.

Surgical Robot STM32 ROS SLAM Raspberry Pi

Maze Mask — Global Game Jam 2026

GGJ2026 · 2026.01.29 - 2026.01.31

Multiplayer online pixel-art game: multiple thieves navigate pathways, collect coins, find exits and escape the maze. Masks grant different abilities with roguelike power-up elements. Responsible for map design, planning, and mask feature implementation.

Play Online → Bilibili →
React TypeScript Node.js Game Dev
UAT-NET Blood Pressure Model

UAT-NET — Multimodal Blood Pressure Estimation

HKU · 2024.08 - 2025.06 · Supervisor: Prof. Grantham Pang

Attention-based mechanism for blood pressure estimation from multimodal PPG and ECG signals. Reproduced and improved upon existing methods for reconstructing BP signals from physiological waveforms. Evaluated with MAE/RMSE on PulseDB dataset using 3× NVIDIA RTX 4090.

Deep Learning PPG ECG TensorFlow