Structured Event Reasoning with Large Language Models

Zhang, Li

Structured Event Reasoning with Large Language Models

Files

Zhang_upenngdas_0175C_16511.pdf (9.83 MB)

Degree type

Doctor of Philosophy (PhD)

Graduate group

Computer and Information Science

Discipline

Computer Sciences

Subject

artificial intelligence
events and entities
large language models
machine learning
natural language processing
reasoning

Copyright date

01/01/2024

Permalink

https://repository.upenn.edu/handle/20.500.14332/60359

View all metadata

Author

Zhang, Li

Abstract

Reasoning about real-life events is a unifying challenge in AI and NLP that has profound utility in a variety of domains, while fallacy in high-stake applications could be catastrophic. Able to work with diverse text in these domains, large language models (LLMs) have proven capable of answering questions and solving problems. However, I show that end-to-end LLMs still systematically fail to reason about complex events, and they lack interpretability due to their black-box nature. To address these issues, I propose three general approaches to use LLMs in conjunction with a structured representation of events. The first is a language-based representation involving relations of sub-events that can be learned by LLMs via fine-tuning. The second is a semi-symbolic representation involving states of entities that can be predicted and leveraged by LLMs via few-shot prompting. The third is a fully symbolic representation that can be predicted by LLMs trained with structured data and be executed by symbolic solvers. On a suite of event reasoning tasks spanning common-sense inference and planning, I show that each approach greatly outperforms end-to-end LLMs with more interpretability. These results suggest manners of synergy between LLMs and structured representations for event reasoning and beyond.

Advisor

Callison-Burch, Chris

Date of degree

2024

Collection

Dissertations and Theses