My Lemmy Box
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
@[email protected] to [email protected]English • 11 months ago

Reddit blocking all major search engines, except Google

readwrite.com

external-link
message-square
183
fedilink
676
external-link

Reddit blocking all major search engines, except Google

readwrite.com

@[email protected] to [email protected]English • 11 months ago
message-square
183
fedilink
%%excerpt%% Reddit has commenced its assault on search engines, blocking those that don’t have a commercial relationship with the company, like Google.
  • @[email protected]
    link
    fedilink
    English
    29•11 months ago

    this is just going to cause indexers to ignore robots.txt

    • @[email protected]OP
      link
      fedilink
      English
      21•11 months ago

      “We always obey the robots.txt”

      • A bunch of corporations that have no accountability and plenty of incentive to just ignore it and have all been caught training AI on off-limits data.
    • Kairos
      link
      fedilink
      English
      6•11 months ago

      They’re likely blocking user agents too, which I think also doesn’t have legal enforcement (as in DuckDuckGo can just use “Google” unless they said otherwise.

      • Natanael
        link
        fedilink
        English
        8•
        edit-2
        11 months ago

        LinkedIn tried blocking scraping that way but as long as the scraping isn’t burdensome it’s basically legal but you can still be bound by TOS and civil claims

        https://natlawreview.com/article/hiq-and-linkedin-reach-proposed-settlement-landmark-scraping-case

    • capital
      link
      fedilink
      English
      6•11 months ago

      Rate limiting could “fix” that unfortunately.

[email protected]

[email protected]
Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @[email protected]
  • @[email protected]
  • @[email protected]
  • @[email protected]
  • 1 user / day
  • 1 user / week
  • 1 user / month
  • 25 users / 6 months
  • 0 subscribers
  • 8.98K Posts
  • 363K Comments
  • Modlog
  • mods:
  • @[email protected]
  • enu
  • L4sBot
  • Technopagan
  • BE: 0.18.4
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org