Andy Reid to [email protected]English • 1 year agoAI companies are violating a basic social contract of the web and and ignoring robots.txtwww.theverge.comexternal-linkmessage-square197fedilinkarrow-up11.09K
arrow-up11.09Kexternal-linkAI companies are violating a basic social contract of the web and and ignoring robots.txtwww.theverge.comAndy Reid to [email protected]English • 1 year agomessage-square197fedilink
minus-square@[email protected]linkfedilinkEnglish2•1 year agoJust thought of a nasty hack the browser makers (or hackers) could use to scrape unlisted sites - by surreptitiously logging user browser history for a crawl list
minus-square@[email protected]linkfedilinkEnglish3•1 year agoWhile there are some extensions that do this, last I saw Google didn’t use Chrome for populating Search: https://blogs.perficient.com/2017/03/15/does-google-use-chrome-to-discover-new-urls-for-crawling/
minus-square@[email protected]linkfedilinkEnglish3•1 year agoPerhaps some web extensions already do this and phone home about it.
Just thought of a nasty hack the browser makers (or hackers) could use to scrape unlisted sites - by surreptitiously logging user browser history for a crawl list
While there are some extensions that do this, last I saw Google didn’t use Chrome for populating Search:
https://blogs.perficient.com/2017/03/15/does-google-use-chrome-to-discover-new-urls-for-crawling/
Perhaps some web extensions already do this and phone home about it.