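ForenUCA Corpus: design, objectives and current status within the framework of the Applied Linguistics Research Institute (original title: Corpus ForenUCA: diseño, objetivos y estado actual en el marco del instituto de investigación en lingüística aplicada)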

Mario Crespo Miguel

Universidad de Cádiz

Cádiz, Spain

ROR https://ror.org/04mxxkb11

Journal: E-Aesla

ISSN: 2444-197X

Year of publication: 2018

Issue: 4

Pages: 363-374

Type: Article

Abstract

One of the most recent areas of interest in Spanish studies is Forensic Linguistics, which applies linguistic analysis to the investigation of crime. One of its main points of interest is the authorship attribution of electronic texts such as emails, social network posts and mobile messages. Studying the dialectal and sociolinguistic parameters of a text is essential for characterizing the gender, age or educational level of its sender. However, there is a lack of corpora of Spanish electronic texts linked to sociolinguistic variables that could provide scientific support to Forensic Linguistics. This work presents the ForenUCA Corpus, under development at the Applied Linguistics Research Institute of the University of Cádiz, which aims to collect texts from new social media (short mobile messaging, email and social networks). The paper describes the guidelines, design and objectives of this corpus, which currently contains more than 200,000 words.

Data source: Dialnet

