this post was submitted on 09 Jun 2024
292 points (90.1% liked)

Technology

59323 readers
5426 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] andrewrgross@slrpnk.net 13 points 5 months ago (1 children)

Why do you guarantee that? It seems obviously wrong, on a technical level.

The point I'm making is that even if we take it as a given that a shrewd enough AI could correctly distinguish sex at birth -- which I think is obviously impossible based on the appearances of many ciswomen and the nature of statistical prediction -- you'd still need a training data set.

If the dataset has any erroneous input, that corrupts its ability, and the whole point of this exercise is trying to find passing transwomen. Why would anyone expect that training set of hundreds of thousands of supposed cis women wouldn't have a few transwomen in it?

[–] AlligatorBlizzard@sh.itjust.works -4 points 5 months ago (1 children)

Because Facebook's data practices, and how much was volunteered by users on there, means that for some percentage of trans users Facebook knows that they're trans. And you also have a percentage of pregnancy photos uploaded, if someone identifies as a woman on Facebook, and has uploaded photos with a baby bump, she's cis (or at least a pre-hatching trans person). And at one point in time, a lot of people just volunteered that info to Facebook.

[–] andrewrgross@slrpnk.net 2 points 5 months ago

Yeah, but the training set is nowhere near clean. That's my point. "Close" is no where near good enough within this context,