Peter West (U of B.C.)- Can Helpful Assistants be Unpredictable? Limits of Aligned LLMs

Date & Time:

March 6, 2025 1:00 pm – 2:00 pm

Location:

Crerar 346, 5730 S. Ellis Ave., Chicago, IL,

03/06/2025 01:00 PM 03/06/2025 02:00 PM America/Chicago Peter West (U of B.C.)- Can Helpful Assistants be Unpredictable? Limits of Aligned LLMs Crerar 346, 5730 S. Ellis Ave., Chicago, IL,

Abstract: The majority of public-facing language models have undergone some form of alignment–a family of techniques (e.g. reinforcement learning from human feedback) which aim to make models safer, more honest, and better at following instructions. In this talk, I will investigate the downsides of aligning LLMs. While the process improves model performance across a broad range of benchmark tasks, particularly those for which a “correct” answer is clear, it seems to mitigate some of the most interesting aspects of LLMs, including unpredictability and generation of text that humans find creative.

Speakers

Peter West

Assistant Professor, University of British Columbia

Peter is an Assistant Professor at UBC and a recent postdoc at the Stanford Institute for Human-Centered AI working in Natural Language Processing. His research broadly studies the capabilities and limits of large language models (and other generative AI systems). His work has been recognized with multiple awards, including best method paper at NAACL 2022, outstanding paper at ACL 2023, and outstanding paper at EMNLP 2023

Resources

Community

Finding the “Goldilocks” Solution to a Classic Math Problem: A Breakthrough in Numerical Integration

Ten Years of MSCAPP: Where Public Policy Meets Coding

Moderation at the Crossroads: How Generative AI Platforms Manage Creativity and Content Safety

The Future of AI Panel: Alumni Weekend

Can we authenticate human creativity?

AI and the Future of Work Panel: Featuring Nick Feamster

Speakers

Peter West

Finding the “Goldilocks” Solution to a Classic Math Problem: A Breakthrough in Numerical Integration

Ten Years of MSCAPP: Where Public Policy Meets Coding

Moderation at the Crossroads: How Generative AI Platforms Manage Creativity and Content Safety

Can a Doctor’s Notes Reveal When They’re Tired? New Research Illuminates the Hidden Signals of Physician Fatigue—And Raises Questions About AI in Healthcare

2025 Midwest Machine Learning Symposium Demonstrates Regional Excellence

PhD Candidate Bogdan Stoica Receives Distinguished Artifact Evaluator Award for Championing Reproducibility in Computer Science

Report from GlobusWorld 2025: Going Beyond Data

University of Chicago PhD Graduates Secure Tenure-Track Faculty Positions Amid a Competitive Job Market

Democratizing Digital Graphics: An Undergrad’s Unlikely Path To Putting Agency of 3D-Generation in Users’ Hands

Faculty Spotlight: Get to Know Kexin Pei

David Cash Receives 2025 Quantrell Award for Undergraduate Teaching

The Future of AI Panel: Alumni Weekend