• bluueberry@lemmy.ml
    1 year ago

    You make a great point about Wikipedia - the claim that scraping is actually why Twitter is doing this is laughable to me. They’re just trying to find a convenient explanation for their failures that doesn’t point to their own incompetence.

    • fubo@lemmy.world
      1 year ago

      The idea that “AI scraping” is any more expensive than search engine indexing is flat-out nonsense, credible only to people who have never run a network service at scale.

      Folks need to learn about Common Crawl. https://commoncrawl.org/