Yuxiang Guo

Johns Hopkins University.

Yuxiang.png

307B Clark Hall

3400 N Charles St Baltimore, MD 21218

yguo87 at jhu dot edu

Hi! I’m Yuxiang! I’m currently a final year PhD student at AIEM advised by Prof. Rama Chellappa. I earned MS degree from Mccormick School of Engineering, Northwestern University in 2021 and BS degree from joint program hosted by University of Electronic Science and Technology of China and University of Glasgow.

My research lies in Machine Learning and Computer Vision, with a particular emphasis on Human-centered AI, Video Understanding, Multi-modal Large Language Models, and 3D Reconstruction. I develop machines to perceive the world from observations, analyze and reason about these perceptions through the knowledge encoded in MLLMs, and generate informed, context-aware responses. My recent work further considers temporal dynamics and focuses on building explainable models that generalize effectively to real-world scenarios.

I had a wonderful experience as a Research Intern at HRI (Spring 2024) mentored by Dr. Shao-Yuan Lo; a Research Intern at AMD mentored by Dr. Jiang Liu (Summer 2025) and a student researcher at Google (Fall 2025) hosted by Cheng Zhong.

I am on the job market and actively seeking full-time research scientist/engineer opportunities starting in 2026!

selected publications

  1. IJCB2024
    IJCB2024.jpg
    Distillation-guided Representation Learning for Unconstrained Gait Recognition
    Yuxiang Guo, Siyuan Huang, Ram Prabhakar, and 3 more authors
    In 2024 IEEE International Joint Conference on Biometrics (IJCB), 2024
  2. WACV2025
    GaitContour.png
    GaitContour: Efficient Gait Recognition based on a Contour-Pose Representation
    Yuxiang Guo, Anshul Shah, Jiang Liu, and 3 more authors
    2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025
  3. IJCV
    Stimuvar.png
    Stimuvar: Spatiotemporal stimuli-aware video affective reasoning with multimodal large language models
    Yuxiang Guo, Faizan Siddiqui, Yang Zhao, and 2 more authors
    Posted by JHU Whiting School
    International Journal of Computer Vision, 2024
  4. CVPR2025
    spars3r.png
    SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction
    Yutao Tang*, Yuxiang Guo*, Deming Li, and 1 more author
    2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  5. teaser_cropped.jpg
    ImageDoctor: Diagnosing Text-to-Image Generation via Grounded Image Reasoning
    Yuxiang Guo*, Jiang Liu*, Ze Wang, and 7 more authors
    2025