®®®® SIIA Público

Título del libro: Gebnlp 2022 - 4th Workshop On Gender Bias In Natural Language Processing, Proceedings Of The Workshop
Título del capítulo: HeteroCorpus: A Corpus for Heteronormative Language Detection

Autores UNAM:
GEMMA BEL ENGUIX;
Autores externos:

Idioma:

Año de publicación:
2022
Palabras clave:

Classification (of information); Computational linguistics; Classification tasks; Gender bias; Language detection; Language technology; NLP systems; Performance; Natural language processing systems


Resumen:

In recent years, plenty of work has been done by the NLP community regarding gender bias detection and mitigation in language systems. Yet, to our knowledge, no one has focused on the difficult task of heteronormative language detection and mitigation. We consider this an urgent issue, since language technologies are growing increasingly present in the world and, as it has been proven by various studies, NLP systems with biases can create real-life adverse consequences for women, gender minorities and racial minorities and queer people. For these reasons, we propose and evaluate HeteroCorpus; a corpus created specifically for studying heterononormative language in English. Additionally, we propose a baseline set of classification experiments on our corpus, in order to show the performance of our corpus in classification tasks. © 2022 Association for Computational Linguistics.


Entidades citadas de la UNAM: