A Symbolic Approach for Automatic Detection of Nuclearity and Rhetorical Relations among Intra-sentence Discourse Segments in Spanish

Nowadays automatic discourse analysis is a very prominent research topic, since it is useful to develop several applications, as automatic summarization, automatic translation, information extraction, etc. Rhetorical Structure Theory(RST) is the most employed theory. Nevertheless, there are not many...

Full description

Saved in:
Bibliographic Details
Published inComputational Linguistics and Intelligent Text Processing pp. 462 - 474
Main Authors da Cunha, Iria, SanJuan, Eric, Torres-Moreno, Juan-Manuel, Cabré, M. Teresa, Sierra, Gerardo
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg 2012
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3642286038
9783642286032
ISSN0302-9743
1611-3349
DOI10.1007/978-3-642-28604-9_38

Cover

More Information
Summary:Nowadays automatic discourse analysis is a very prominent research topic, since it is useful to develop several applications, as automatic summarization, automatic translation, information extraction, etc. Rhetorical Structure Theory(RST) is the most employed theory. Nevertheless, there are not many studies about this subject in Spanish. In this paper we present the first system assigning nuclearity and rhetorical relations to intra-sentence discourse segments in Spanish texts. To carry out the research, we analyze the learning corpus of the RST Spanish Treebank, a corpus of manually-annotated specialized texts, in order to build a list of lexical and syntactic patterns marking rhetorical relations. To implement the system, this patterns’ list and a discourse segmenter called DiSeg are used. To evaluate the system, it is applied over the test corpus of the RST Spanish Treebank. Automatic and manual rhetorical analyses of each sentence are compared, by means of recall and precision, obtaining positive results.
ISBN:3642286038
9783642286032
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-642-28604-9_38