Ruidi Chang

Ruidi Chang

 Ruidi Chang

  Department of Computer Science

  Rice University

  Address: Duncan Hall, 6100 Main St, Houston, TX 77005

  Email: rc151@rice.edu

           

About Me

I am a first year CS Ph.D. student in the Computer Science Department @ Rice University, advised by Prof. Hanjie Chen. My research interests lie in Interpretable Machine Learning, with a focus on the interpretability and understanding of language models. Prior to joining Rice, I was a master student at Carnegie Mellon University.

News

💬 Excited to present our work SAFR at NAACL2025 Findings. Superposition is powerful — but it buries interpretability. We control that!

🧠 Neurons often mix too many features (superposition) — making models a black box.
🎯 SAFR strategically redistributes neurons:

  • 🧩 Monosemantic for important tokens
  • 🔗 Polysemantic where interactions matter
Workflow of SAFR
Figure: SAFR improves interpretability by redistributing neurons.

Research Experience

Services

  • Reviewer: EMNLP BlackboxNLP Workshop 2024, COLING 2025
  • Volunteer: EMNLP BlackboxNLP Workshop 2024

Last update: 11/2024