Aproximación a la categorización textual en español basada en la semántica de marcos

  1. Crespo Miguel, Mario
  2. Frías Delgado, Antonio
Revista:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Ano de publicación: 2008

Número: 41

Páxinas: 65-71

Tipo: Artigo

Outras publicacións en: Procesamiento del lenguaje natural

Resumo

FrameNet is a resource based on Frame Semantics that comprises how languages account for daily situations linguistically. Frames represent information packets about how to convey information about a certain situation. This paper presents an approach to categorize texts by analysing the range of FrameNet situations that co-occur in a particular text. The set of FrameNet situations is used as a feature vector where the presence or absence of certain frames in a text is used to determine its category. Results show how our system was able to categorize texts in Spanish with high accuracy.