Date & Time:
February 25, 2025 2:00 pm – 3:00 pm
Location:
JCL 390
Jiachen Wang (Princeton) - Fueling Responsible AI with Data Attribution (all times America/Chicago)

Abstract: Understanding how training data shapes model behavior is fundamental to building trustworthy AI systems. Data attribution techniques quantify the influence of individual training examples on machine learning models, providing key insights for developing data-centric algorithms (e.g., data curation) as well as addressing data-related challenges (e.g., privacy, safety, and copyright protection).

In this talk, I will present our recent advances in the foundations and practical frameworks of data attribution. First, I will introduce a general, game-theoretic data attribution framework that optimizes for stochastic learning algorithms. I will then discuss how we can efficiently conduct data attribution in the challenging setting of large-scale deep learning models (e.g., large language models). These techniques guide data quality management, explain model predictions, and boost trustworthy AI development from a data-centric perspective.

Speakers

Jiachen Wang

PhD Candidate, Princeton University

Jiachen (“Tianhao”) is a Ph.D. student at Princeton University, advised by Prof. Prateek Mittal. His research focuses on developing theoretical foundations and practical tools for trustworthy machine learning from a data-centric perspective. Most recently, he has been developing scalable, theoretically grounded data attribution and curation techniques for foundation models. His contributions have been recognized through multiple fellowships and oral/spotlight presentations at top AI/ML venues. He was selected as a Rising Star in Data Science in 2024.
