magistrate: The arc of the Earth in dark space. (Default)
Because definitely what I need is more webapp ideas that I don't have time to develop.



Anyway, following on from a Twitter conversation, I'm wondering how it would work to make a writing program which could track the genders of a number of characters and then arbitrarily shuffle them. What I'm picturing is, simplified, something like this:

• At the top of the document are a number of fields which ask for a character name (or a list of character references, such as name and nickname and other variations) and pairs the name with a gender (and its associated set of pronouns).

• Each character you add is arbitrarily assigned a color (or icon or other distinguishing visual marker).

• As you type, a parser will keep track of which name (or referent) has been typed last for each of the original genders. When you type a pronoun, it will look at the last character reference matching that pronoun's set, and highlight the pronoun (or assign it the correct icon) to associate it with the specific character. It'll also have some kind of (mouseover?) menu to allow users to correct its assumption about which character it refers to.

• When you finish writing, each pronoun will be associated with a character. So you can hit a shuffle button, and then the characters' genders will be shuffled, and each pronoun can be brought back into compliance with the character's gender.

Needless to say, this would fail in a lot of situations. Take, for example:

• Dialogue. "He's not coming today," he said. (I mean, I guess I could set up a sub-parser which kept track of the last character reference inside a set of quotes?)

• Ambiguiety. We'll just call this the Randall Munroe exploit. I guess people would just have to make close, personal friends with the drop-down menus?

• Gay porn. I am reliably informed by people who have tried to write gay porn that pronouns are a nightmare anyway. And humans are better at parsing language than computers are.

• Unexpected cases. Language is complicated, yo!

I feel like there should be a way to handle this, and that it probably involves algorithms. I'm a bit worried that trying to write a general-purpose pronoun shuffler would actually require re-inventing Google Translate. Any computational linguists out there who want to point out things I'm missing?

Date: 2015-01-21 06:39 pm (UTC)From: (Anonymous)
As an author, I absolutely totally love this idea. And I am a computational linguist (who coincidentally just happened across your site via the QDSF email that just went out). I haven't looked at coreference resolution for a couple of years, but basically, the pronoun labelling stage works *reasonably* for English and it's probably "good enough" if you're happy that people can correct it's labelling as they write. As you identify, it's going to need extra training for speech. That doesn't begin to cover the gendered nouns (she's X's daughter or an actress; he's Z's husband or headmaster of the school... etc), but again, it's a closed-class problem so probably tractable. If you want to open source this I'd contribute... drop me an email if you want to discuss it more?

Rachel
http://rachelcotterill.com (having problems with openid for every site I own!)

Profile

magistrate: The arc of the Earth in dark space. (Default)
magistrate

March 2024

S M T W T F S
     12
3456789
10111213141516
171819 20212223
24252627282930
31      

Page Summary

Style Credit

Expand Cut Tags

No cut tags
Page generated Jul. 31st, 2025 08:52 pm
Powered by Dreamwidth Studios