I’m a University student of SIST in ShanghaiTech University. My research interests include Deep Learning, Computer Vision and Computer Graphics. I’m currently working for MARS, a branch of ShanghaiTech Visual and Data Intelligence Center.
Undergrad in Computer Science
ShanghaiTech University
UCB COE Exchange program in Computer Science
University of California, Berkeley
We introduce OthelloGPT, a GPT model trained on Othello, to understand how LLMs learn internal representations. Despite simple training, it develops structured gameplay understanding—early layers detect board patterns, while deeper layers track dynamic moves. Using Sparse Autoencoders, we decode strategic features like tile stability, offering insights into LLM learning. This framework helps analyze representations in transformers and LLMs.
Jan 13, 2025
We introduce TransGS, a diffusion transformer that instantly translates physically-based facial assets into the corresponding GauFace representations.
Sep 26, 2024