Date & Time:
February 18, 2026 2:30 pm – 3:30 pm
Location:
DSI 105, 5460 S University, Chicago, IL,
02/18/2026 02:30 PM 02/18/2026 03:30 PM America/Chicago Shreya Shankar (UC Berkeley)- Building Effective Unstructured Data Systems DSI 105, 5460 S University, Chicago, IL,

Abstract: Databases and other data systems have successfully democratized data-oriented computation across domains, thanks to decades of research in system internals and end-user interfaces. However, such systems center on structured (i.e., tabular) data; unstructured data—the vast majority of data—has largely been ignored. Large language models (LLMs) now give us a building block for unstructured data analysis, and we face the same questions as in the early days of data systems—e.g., how should users author queries? How do we efficiently execute queries at scale?—but many well-established tenets from traditional data systems no longer hold. In my talk, I will present DocETL, a system I developed for unstructured data analysis. I will discuss how we had to rethink query optimization under these new assumptions, optimizing user-written pipelines for both accuracy and efficiency—as well as end-user interfaces for authoring, iterating on, and debugging pipelines. DocETL is open-source with 3.5k+ GitHub stars; our hosted interface has supported 4.1k+ pipelines across 30+ S&P-500 industries. Query optimization ideas from our work have been adopted in databases such as Snowflake and BigQuery, and our interface design principles have been adopted by companies like LangChain and OpenAI.

Speakers

headshot

Shreya Shankar

PhD student, UC Berkeley

Shreya Shankar is a fifth and final-year PhD student in the Data Systems and Foundations group at UC Berkeley, advised by Dr. Aditya Parameswaran. She is broadly interested in data systems, large language models, and human-computer interaction. Her PhD has been supported by an NDSEG Fellowship and a Bridgewater Research Fellowship, and her work has been recognized with EECS Rising Stars (2025) and a best paper honorable mention award at UIST. Beyond her research, Shreya authored the curriculum and companion book for AI Evals for Engineers and PMs, an industry course on evaluating AI applications taken by 4,000+ professionals from 500+ companies, including 50+ students each from Google, Microsoft, OpenAI, Meta, Amazon, Intuit, and First American. Before her PhD, Shreya worked as the first data/ML engineer at a startup after her undergraduate degree in CS at Stanford.

Related News & Events

chart
UChicago CS News

Who Gets Hired, Paid, and Liked? Who Gets Credit? New Research Examines AI’s Role in Writing and the Workplace

Apr 22, 2026
Jiayin presenting her work at CHI
UChicago CS News

The Time Constraints of AI Access Could Change How We Think

Apr 21, 2026
headshots
UChicago CS News

University of Chicago Wins Distinguished Laude Institute Moonshots Seed Grant

Apr 15, 2026
collage
UChicago CS News

Incredible Showing of UChicago CS Researchers to CHI 2026

Apr 10, 2026
ai cartoon
UChicago CS News

What If AI Scientists Could Talk to Each Other?

Apr 06, 2026
person using embodied AI to open a window
UChicago CS News

When AI Meets Muscle: Context-Aware Electrical Stimulation Promises a New Way to Guide Human Movements

Apr 03, 2026
graphic
UChicago CS News

UChicago Researchers Build a Tool to Help Fix Peer Review

Apr 02, 2026
iccc team photo
UChicago CS News

UChicago CS Team Qualified for 2026 ICPC World Final Championships in Dubai

Apr 01, 2026
AI wedding photos
UChicago CS News

Mapping the New Rules of “AI Slop”: How Social Media Platforms are Managing AI-Generated Content

Mar 23, 2026
robot
UChicago CS News

How Chicago Robot Tutors Are Teaching SEL Effectively–Without Pretending to Be Human

Mar 19, 2026
screen grab
UChicago CS News

Could AI Help Us Be More Thoughtful Voters?

Mar 17, 2026
nano carbons
In the News

Nanodiamonds and Beyond: Designing Carbon Materials with Artificial Intelligence at Exascale

Mar 16, 2026
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube