• brucethemoose@lemmy.world
    link
    fedilink
    arrow-up
    12
    ·
    edit-2
    2 months ago

    Would lemmy instances do this?

    I know they can’t afford to now, but hypothetically? A lot of people here don’t seem to like data scraping for AI.

    • Dave@lemmy.nz
      link
      fedilink
      arrow-up
      19
      ·
      edit-2
      2 months ago

      You don’t need to scrape. If you want to get all the content on Lemmy, just set up an instance and subscribe to all the top communities, and the instances will just send you all the content.

      So there isn’t really a way to monetise or block it. I guess you could only federate to a whitelist, but the biggest instances will federate by default with any new instances until they are given a reason to defederate.

    • cmnybo@discuss.tchncs.de
      link
      fedilink
      English
      arrow-up
      9
      ·
      2 months ago

      Some Lemmy instances disallow indexing in robots.txt, however indexers can choose to ignore that and actually blocking them takes a lot more effort.