• SavvyWolf@pawb.social
    link
    fedilink
    English
    arrow-up
    47
    ·
    8 months ago

    Gender PHP extension is a port of the gender.c program originally written by Joerg Michael. The main purpose is to find out the gender of firstnames. The current database contains >40000 firstnames from 54 countries.

    For anyone curious but not curious enough to go digging around.

  • schnurrito@discuss.tchncs.de
    link
    fedilink
    arrow-up
    39
    arrow-down
    1
    ·
    edit-2
    8 months ago

    If you click around a bit more in that documentation, you can see that that isn’t an enumeration of genders, it’s an interface for answering the question which gender any given name belongs to. (For example, “Andrea” is understood as exclusively feminine for German speakers, but it’s a common male first name in Italy.)

    • Aatube@kbin.melroy.org
      link
      fedilink
      arrow-up
      6
      ·
      8 months ago

      i’ve been trying to figure that out… why does each gender instance have a country getter? what’s a “DSN to open”?

    • T156@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      8 months ago

      Only slightly though. It hardly seems practical to try and infer gender from names, in a way where it can’t be obtained through historical records, or the user.

      • skisnow@lemmy.ca
        link
        fedilink
        English
        arrow-up
        7
        ·
        8 months ago

        For a given individual, sure. If you’re trying to do some statistics over a whole group that you have no other record for, it could be useful.

        • bss03@infosec.pub
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          4
          ·
          8 months ago

          Sounds like those statistics output would the heavily biased by whatever process you were using to turn names into genders. In short, a bad idea.

          • TangledHyphae@lemmy.world
            link
            fedilink
            arrow-up
            4
            arrow-down
            2
            ·
            8 months ago

            “Since the dataset isn’t 100% perfectly annotated for analysis, we should give up the whole project entirely.”

            • Shanmugha@lemmy.world
              link
              fedilink
              arrow-up
              2
              ·
              8 months ago

              No, since the dataset is bound to give nonsensical results, we search for sources that are more precise. Hint: “Andrea” already mentioned and Japanese names

  • jia_tan@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    19
    ·
    8 months ago

    Genuine question. Why does a program need to know the user’s gender? (I’m asking in general, not in this particular case). Just use gender neutral pronouns to refer to the user, or, better yet, don’t talk to me at all!

    Aside from niche things like targeted ads and gendered health tracking and stuff.

    • Xanza@lemm.ee
      link
      fedilink
      English
      arrow-up
      17
      arrow-down
      1
      ·
      edit-2
      8 months ago

      For some languages gender-neutral pronouns aren’t possible or aren’t appropriate. It’s really only in English and maybe five other languages that gender-neutral pronouns are a real thing and even in these languages if you’re not used to using neutral pronouns or reading them in common writing sounds like a mistake to begin with. It’s generally just easier to automate the task based on names, and the library itself comes from a time when that wasn’t a controversial thought. Lol

    • verstra@programming.dev
      link
      fedilink
      arrow-up
      6
      ·
      8 months ago

      Gender PHP extension is a port of the gender.c program … The main purpose is to find out the gender of firstnames.

      As of why, you don’t need a why in open source. Some people treat gender as a function of their firstname, apparently, and need that information somewhere - maybe for localization, maybe for personalization, maybe for form-filling auto-suggestion purposes.

    • lime!@feddit.nu
      link
      fedilink
      English
      arrow-up
      5
      ·
      8 months ago

      i learned from a friend that used to work in banking that some countries have laws that basically make contracts and invoices void if they are not correctly gendered.

    • cjk@discuss.tchncs.de
      link
      fedilink
      arrow-up
      4
      ·
      8 months ago

      HR software in Germany needs to know because we have to send this information to the government. Along with a lot of other information.