Corpus ForenUCAdiseño, objetivos y estado actual en el marco del instituto de investigación en lingüística aplicada
-
1
Universidad de Cádiz
info
ISSN: 2444-197X
Year of publication: 2018
Issue: 4
Pages: 363-374
Type: Article
More publications in: E-Aesla
Abstract
One of the most recent areas of interest in Spanish studies is Forensic Linguistics, distinguished by the linguistic analysis to investigate crime. Among the main points of interest are the authorship attribution of electronic texts such as emails, social networks or mobile messaging. The study of dialectal and sociolinguistic parameters of a text is essential when characterizing the gender, age or educational level of a certain text sender. There is a lack of corpus of Spanish electronic texts linked to different sociolinguistic variables able to provide scientific support to Forensic Linguistics. This works presents the ForenUCA Corpus, under development at the Applied Linguistics Research Institute of the University of Cádiz, aiming at collecting texts from new social media — short mobile messaging, email and social networks —. This paper presents the guidelines, design and objectives of this corpus that currently contains more than 200 thousand words