Incidental Decomposition for Complex Reasoning

Zhou, Xuanyu

Incidental Decomposition for Complex Reasoning

Files

Zhou_upenngdas_0175C_16628.pdf (3.65 MB)

Degree type

Doctor of Philosophy (PhD)

Graduate group

Computer and Information Science

Discipline

Computer Sciences

Subject

analogy
decomposition
incidental supervision
language model
reasoning

Copyright date

01/01/2024

Permalink

https://repository.upenn.edu/handle/20.500.14332/60466

View all metadata

Author

Zhou, Xuanyu

Abstract

The primary impetus for developing Natural Language Processing (NLP) systems is facilitating effective human-computer interaction via natural language. This necessitates emulating human cognitive processes that involve complex reasoning with decomposed understanding. Compared with humans, who continually learn from daily experiences, decompose decision-making processes, and form new associations, machine learning systems often have difficulty doing the same because of the lack of supervision signals that provide the same level of granularity in reasoning processes and explanations that humans naturally form.One straightforward solution would be to ask humans to annotate decomposed reasoning processes, which would be extremely cost-ineffective and non-comprehensive. As an alternative, I propose using incidental decomposition, which refers to decomposed signals that can be automatically acquired from existing resources and contain finer and more structural details than end-task labels. Incidental decomposition can more effectively emulate human cognition and improve model performances on complex reasoning, as I will demonstrate through a series of works in this thesis. I first introduce an exploratory work of applying incidental decomposition at inference time for fine-grained entity-typing. Following inference-time methods, I show two use cases of incidental decomposition as supervision signals for temporal reasoning and argue its effectiveness for in-domain applications. I will then detail methods that make incidental decomposition generalizable and domain-agnostic. Finally, I will discuss the long-term significance of incidental decomposition in the era of large language models.

Advisor

Roth, Dan

Date of degree

2024

Collection

Dissertations and Theses