
Yutian Chen
MS. Robotics
BSc. Computer Science, Minor in Mathematical Science
Also find me at ...
About Me
I am dedicated to improve the spatial understanding ability of autonomous systems, which requires a combination of geometry and semantic information about the surroundings. My research experience on visual-inertial SLAM at the AirLAB focuses on improving the recognition of scene geometry in adversarial conditions, while my research experience in Natural Language Processing (NLP) and semantic segmentation focuses on grasping and interpreting the semantic of scenes.
Recent Research
Below is a highlight list of my recent works. For a full list of works, please see Here.
Projects

PyPose
PyPose is a Library for Robot Learning with Physics-based Optimization. It supports efficient automatic-differentiation on Lie Group and Lie Algebra. I'm an active contributor of the PyPose project.

C0 Visualizer
I designed and implemented the C0 program visualizer in 2022 Summer. It is a virtual machine that executes C0 language (a safe subset of C) and provide visualization and debugging tools for education purpose.
Skills
Robotics
Visual-Inertial SLAM, Computer Vision, Geometric Vision, ROS2, C++
Deep learning & Artificial Intelligence
Natural Language Processing, Semantic Segmentation, PyTorch, CUDA, TensorRT, Python, Triton
Miscellaneous
React, HTML, CSS, TypeScript, LaTeX, Blender
Open Source Notes
I believe knowledge is most impactful when shared freely. By open-sourcing my notes from high school AP courses to advanced university topics, I aim to improve the accessibility of knowledge for everyone.
Experience
- Sep 2022-Now
Learning-based Visual-Inertial SLAM
The AirLab, Robotics Institue, Carnegie Mellon University- AirIO: Learning Inertial Odometry with Enhanced IMU Feature Observability
- MAC-VO: Metric-Aware Covariance for Learning-based Stereo Visual Odometry
- AirIMU: Learning Uncertainty Propagation for Inertial Odometry
- PyPose v0.6: The Imperative Programming Interface for Robotics
Working with Professor Sebastian Scherer, I aimed to construct robust and accurate visual-inertial SLAM system using data-driven approach. I Developed the MAC-VO, a visual odometry that outperforms the state-of-the-art visual odomtries like DPVO by 30% on relative translation error (RTE) and relative rotation error (ROE) in multiple public datasets. I also Deployed the MAC-VO as ROS2 node on Orin-AGX on real drone and speedup the system by 4 times with TensorRT.
- Work In Progress, Currently under double-blind review
Working with Professor Chuang Gan, I developed a data pipeline for City-scale 3D scene reconstruction based on real world satellite/street-view image for multi-agent simulator.
- Token Prediction as Implicit Classification to Identify LLM-Generated Text
Working with Professor Rita Singh and Bhiksha Raj, built a LLM-generated content detector called "LLM-Sentinel". Reaches 98% accuracy on test dataset and outperform existing content detector by OpenAI and ZeroGPT. Collected the OpenLLMText dataset, a dataset contains 30k human written text from OpenWebText and its corresponding rephrased version by various LLMs such as GPT3.5, LLaMA, PaLM, etc.
- Myocardial Segmentation of Cardiac MRI Sequences With Temporal Consistency for Coronary Artery Disease Diagnosis
Mentored by Professor Yiyu Shi and Xiaowei Xu, I proposed an encoder-decoder architecture to perform semantic segmentation on cardiac MRI sequence. By introducing Temporal constraint on segmentation result, the model improved the accuracy by 2% on ACDC Dataset comparing to the baseline model.
Courses
- 16-833 Localization and Mapping
- 16-385 Computer Vision
- 15-451 Algorithm Design & Analysis
- 15-418 Parallel Computer Architecture and Programming
- 11-777 Multi-Modal Machine Learning
- 10-708 Probablistic Graphical Model
- 11-785 Intro to Deep Learning