Filipa Calado

Gender Bias in NLP

This interdisciplinary project applies Queer Studies to Natural Language Processing (NLP) techniques for measuring and mitigating gender bias. It argues that these techniques assume a kind of "binary thinking" that limits their effectiveness, so that they cannot distinguish between different types of bias and their effects in language. The project also explores new methods to resist binary thinking, such as those that diversify and amplify gender expressions.

Read the most recent paper, "Some Myths About Bias: A Queer Studies Reading Of Gender Bias In NLP", published in the ACL Anthology.

Large Language Models for Studying Transphobia

This project uses machine learning technology to examine gender bias in language. It trains text generation models from custom datasets of anti-trans discourse in the USA.

Read more about the project on Github and see the models and datasests on Huggingface.

Python for Working with Text

This online workshop series, originally developed for the Princeton University Library, teaches Python programming from an ethical and feminist approach to working with text data. It begins with foundations of programming and proceeds to data gathering (with web scraping), and text analysis. Future units are currently being developed on text generation with machine learning.

See the online tutorial, python-for-text, published using Jupyter Book.

"Queer Tools For Studying Literature"

This dissertation, 'Since No Expressions Do': Queer Tools for Studying Literature explores how digital methods and tools for studying text engage with queer literature via Queer Studies frameworks. I critique digital methods and tools by exploring how computation, which disambiguates and fixes data for electronic processing, might be used to analyze the complexity of queerness expressed in textual style, form, and voice. Download the dissertation here.

Below are two small digital projects that demonstrate in practice how one might digital tools to work within Queer Studies frameworks.

Queer Distant Reading (qdr)

This project draws connections between programming logics and gender theory to propose a text analysis methodology that iterates through distant and close reading. Using Virginia Woolf's novel, Orlando: A Biography (1928), as a test case, I demonstrate how this method of text analysis leads from a binary understanding of gender into a plurality of gender significations in the novel, suggesting how language and gender are mutually constructed.

See the digital component of this project on the "Queer Distant Reading" Github repository.

Queer Text Encoding (qte)

This project uses the Text Encoding Initiative (TEI) standard, an electronic editing tool, to encode the homoerotic elements that Oscar Wilde edited while composing his novel, The Picture of Dorian Gray. (1890). It explores the mutually reinforcing nature of TEI's hierarchical structure and of dominance structures in archival data and practices. See a customized rendering of the manuscript's first chapter and access the XML/TEI files .