Making the Implicit Explicit: Implicit Content as a First Class Citizen in NLP

Alexander Hoyle,Rupak Sarkar,Pranav Goel,Philip Resnik
DOI: https://doi.org/10.48550/arXiv.2305.14583
2023-05-23
Computation and Language
Abstract:Language is multifaceted. A given utterance can be re-expressed in equivalent forms, and its implicit and explicit content support various logical and pragmatic inferences. When processing an utterance, we consider these different aspects, as mediated by our interpretive goals -- understanding that "it's dark in here" may be a veiled direction to turn on a light. Nonetheless, NLP methods typically operate over the surface form alone, eliding this nuance. In this work, we represent language with language, and direct an LLM to decompose utterances into logical and plausible inferences. The reduced complexity of the decompositions makes them easier to embed, opening up novel applications. Variations on our technique lead to state-of-the-art improvements on sentence embedding benchmarks, a substantive application in computational political science, and to a novel construct-discovery process, which we validate with human annotations.
What problem does this paper attempt to address?