Aproximación a la categorización textual en español basada en la semántica de marcos
ISSN: 1135-5948
Year of publication: 2008
Issue: 41
Pages: 65-71
Type: Article
More publications in: Procesamiento del lenguaje natural
Abstract
FrameNet is a resource based on Frame Semantics that comprises how languages account for daily situations linguistically. Frames represent information packets about how to convey information about a certain situation. This paper presents an approach to categorize texts by analysing the range of FrameNet situations that co-occur in a particular text. The set of FrameNet situations is used as a feature vector where the presence or absence of certain frames in a text is used to determine its category. Results show how our system was able to categorize texts in Spanish with high accuracy.