• UraniumBlazer@lemm.ee
    9 months ago

    Sorry, I’m ignorant in this matter. Why exactly would you want to scrape websites aside from collecting data for ML? What kind of irreplaceable API are you using? Someone please educate me here.

    • coltorl@programming.dev
      9 months ago

      An API might cost a lot of money for the number of requests you want to send. It may not include some of the fields you want. It may be rate limited, while scraping might not be. And an API requires agreeing to usage terms, while scraping does not (though the recent LinkedIn scraping case might weaken that argument).

      • olympicyes@lemmy.world
        9 months ago

        My understanding is that the result of the LinkedIn case is that you can scrape data you have permission to view, but you cannot access data you were not intended to see. The end result is that ClickWrap agreements are unenforceable.