A purported leak of 2,500 pages of internal documentation from Google sheds light on how Search, the most powerful arbiter of the internet, operates.
The leaked documents touch on topics like what kind of data Google collects and uses, which sites Google elevates for sensitive topics like elections, how Google handles small websites, and more. Some information in the documents appears to be in conflict with public statements by Google representatives, according to Fishkin and King.
Some information in the documents appears to be in conflict with public statements by Google representatives
I would have never guessed that.
Can’t wait for selfhosted web search to become better.
You mean hosting your own crawler/indexer? That doesn’t really sound like a thing you could do cost-effectively.
Federated bookmarks?
Federated directories. We’re going back to Yahoo like it’s 1995