There’s not much room to squeeze in when your competitors hold the keys to 15 million top websites.
I find it wild that "at scale" we can bypass anti-bot measures, but just "normal" internet use (i.e Non-Google Browser or VPN) will throw a million captchas at you.
cgnat is pretty bad too.
So if another search engine does arise, it won't find anything useful, because the useful content on the web has been buried under slop, and largely removed. Your best bet today is a curated directory, sorta like the original Yahoo, where you allowlist the web to only real sites, download them, and make them searchable. I think this is actually Kagi's approach. But the open web as we knew and loved it is dead.
https://blogs.microsoft.com/blog/2023/02/07/reinventing-sear...
[1]: https://alternativeto.net/software/google-search/?license=co...
Very few of the smaller search engines actually do their own indexing for exactly this reason.
When I use google, usually from my phone, I am reminded of why I don't use google on desktop.
With the announcement of this move by them, I just manually removed google as an address bar search engine option in all my browsers on desktop and mobile.
Human produced content should be separated from sites primarily hosting slop. That seems solvable?