Aggregation in Natural Language Generation |
| |
Authors: | Hercules Dalianis |
| |
Affiliation: | Department of Computer and Systems Sciences, The Royal Institute of Technology and Stockholm University |
| |
Abstract: | The content of real‐world databases, knowledge bases, database models, and formal specifications is often highly redundant and needs to be aggregated before these representations can be successfully paraphrased into natural language. To generate natural language from these representations, a number of processes must be carried out, one of which is sentence planning where the task of aggregation is carried out. Aggregation, which has been called ellipsis or coordination in Linguistics, is the process that removes redundancies during generation of a natural language discourse, without losing any information. The article describes a set of corpus studies that focus on aggregation, provides a set of aggregation rules, and finally, shows how these rules are implemented in a couple of prototype systems. We develop further the concept of aggregation and discuss it in connection with the growing literature on the subject. This work offers a new tool for the sentence planning phase of natural language generation systems. |
| |
Keywords: | natural language generation sentence planning aggregation |
|
|