how to search the internet, set for 26/01 (published at 27)

This commit is contained in:
Wouter Groeneveld 2024-01-24 11:25:42 +01:00
parent 66aa816d0a
commit 2a6ed514bc
4 changed files with 42 additions and 9 deletions

View File

@ -9,6 +9,8 @@ tags:
- screenshots
- archiving
date: 2020-10-04
aliases:
- /museum
---
While browsing through archives of _very_ old files, I rediscovered backups of websites I once made. It felt a bit like scrolling [thehistoryofweb.design](https://thehistoryofweb.design/), an interactive journey through the history of webdesign. Thanks to the Internet Archive project, revisiting these now-offline websites was not only a very personal and nostalgic ride for me, but also an educational one.

View File

@ -0,0 +1,39 @@
---
title: How To Search The Internet
date: 2024-01-26T13:00:00+01:00
categories:
- webdesign
tags:
- indieweb
- search engines
---
Thanks to the multi billion dollar [advertisement industry](/post/2023/02/aggressiveness-of-modern-web-advertising/), searching for something on the internet has devolved from a joyous Altavista guess-the-keywords activity to a tiring chore where one has to wade through endless pools of generated [SEO-optimized crap](https://rubenerd.com/my-line-about-seo-being-a-red-herring/), hollow company blogs with more social media link embeds than actual content, and Reddit flame wars than ever before. In short: great stuff.
Suppose you're looking for a review of a video game. The first 100 hits will return the expected results: articles from huge journalism companies with a big enough budget to bribe any search engine to stay on top. And while these IGN et al. reviews _are_ interesting to a certain degree, I want to read about the honest opinion of another _person_, another human being---whether or not journalists are human beings is still up for debate.
I'm sure you know what I mean: I'm looking for small, independent websites carefully curated by people who care. I want to discover personal blogs, not professional cookie-laden ad-riddled heavyweight junk that has my router choke on `50 MB` of cruft instead of just loading one document and a few pieces of metadata. I do not want to click like and subscribe. I am not interested in Facebook embeds. I am not willing to consume words that have nothing to say besides _click here_. I am not bait.
How should you search the internet, while avoiding that cruft? I think by now it's clear that a simple Google search isn't the answer, and neither is migrating to a privacy-friendly DuckDuckGo search engine that fetches results from Bing: same shit different engine (and progressively worse results, I might add).
Instead, I have been relying on [Search My Site](https://searchmysite.net): an open source search engine specifically geared towards personal and independent websites (like this one). Unfortunately, but perhaps not unexpectedly so, Search My Site is not very good at finding things: if your website happens to be in their index, you're good, but if not... Is that different compared to the big ones? Not really. What do you do when you can't find something in one search engine? You revert to another strategy. [Marginalia Search](https://search.marginalia.nu/) is another great little gem. It clearly states its purpose:
> This search engine isn't particularly well equipped to answering queries posed like questions, instead try to imagine some text that might appear in the website you are looking for, and search for that. <br/>Where this search engine really shines is finding small, old and obscure websites about some given topic, perhaps old video games, a mystery, theology, the occult, knitting, computer science, or art.
My experiments recently made me switch from SearchMySite to Marginalia as my go-to small engine.
Both [seirdy.one](https://seirdy.one/posts/2021/03/10/search-engines-with-own-indexes/) and [Dan Luu](https://danluu.com/seo-spam/) provided outstanding overviews on alternative search engines optimized for the small and independent web, where Search My Site and Marginalia.nu happen to be just two of the many ways to tap into personal websites and blogs. Not every engine has their own index database, but those that do can curate entries more rigorously. It's now easier than ever to trust and rely on smaller engines, says Dan:
> If you want to make a useful search engine for a small number of users, that seems easier than ever because Google returns worse results than it used to for many queries. In our test queries, we saw a number of queries where many or most top results were filled with SEO garbage, a problem that was significantly worse than it was a decade ago, even before the rise of LLMs and that continues to get worse.
Another strategy is to try and skip the first thousand results of conventional search engines and focus on what lies below and forgotten: that's exactly what [Million Short](https://millionshort.com/) set out to do (based on the Bing index). As stated on their site, Million Short "Remove over SEOd sites that show up over and over again with ease.". It's just sad strategies like that exist and are needed to scour around the web nowadays. Million Short's about page does smell an awful lot like Silicon Valley VC-like brands though.
Then there are web directories such as [Indieseek.xyz](https://indieseek.xyz/2021/04/19/search-engines-for-the-indie-web-and-indieweb/) and [Blogroll.org](https://blogroll.org/) that simply list blogs as entry points, leaving the exciting spelunking up to you. There, you first search by general topic before diving deep.
Lastly, premium search engines like [Kagi](https://kagi.com/) started popping up that claim to deliver fast and personal results, free of ads and tracking---provided that you pay `$5` a month for 300 searchers or `$10` for unlimited queries. Kagi aims to replace your general Google-esque search and does not focus on small sites like SearchMySite or Marginalia does. With Kagi, you can block and/or filter domains you do (not) like, which to a certain degree is also possible in Million Short. I haven't tried it myself, but have read impressions from [Dave Heinemann](https://dheinemann.com/kagi-search-first-impressions/), [Horst Gutmann](https://zerokspot.com/weblog/2023/10/21/trying-kagi-search-for-real/), and [Kev Quirk](https://kevquirk.com/my-thoughts-on-kagi-search).
---
Google et al. are great "answer engines" when technical questions arise, but their cool new website discovery levels leave much to be desired. If you are going to rely on them, do make sure to bring protection such as a [Pi-Hole](/post/2022/08/six-months-with-pi-hole/) and [a good content blocker](https://ublockorigin.com/), as these big boys made ads and scam look like real results. Fortunately, alternative smaller engines focusing on personal sites do exist. They profoundly changed the way I use the internet---for the better---and for me made fooling around fun again, not unlike the good old StumbleUpon days.
Most browsers (and Alfred!) support custom search shortcuts that allow for quick searchers in many different engines, big and small. You owe it to yourself to check out at least a few of the ones mentioned here.

View File

@ -12,7 +12,7 @@ My mother-in-law bought a new laptop that came pre-installed with Windows 11. I
I wonder when Microsoft started missing the mark? Probably after Windows XP? Layer upon layer upon layer of unnecessary crap eventually became the achievement called Windows 11, where the AI-assisted Bing, the ridiculous Microsoft Store, the security "enhancements", and the unwanted integration of Xbox entertainment had me curse for almost two hours straight until I finally managed to successfully install a local government extension to get an eID card reader working.
As soon as you boot up Windows and hit the start button, you'll notice lots of moving components that weren't there last time I frequently used windows. Okay fine, that was 17 years ago, but still. Why should users have to put up with trending news articles right inside that start menu? Or with Xbox account activation questions? Or with Bing? Or with what kind of weather it is today? The flashing things on the screen, the more confused my mother-in-law is, and rightly so. So I tried to hide, disable, uninstall, revert, delete, bin, trash, kill, and burn everything I deemed unnecessary.
As soon as you boot up Windows and hit the start button, I noticed lots of moving components that weren't there last time I frequently used windows. Okay fine, that was 17 years ago, but still. Why should users have to put up with trending news articles right inside that start menu? Or with Xbox account activation questions? Or with Bing? Or with what kind of weather it is today? The flashing things on the screen, the more confused my mother-in-law is, and rightly so. So I tried to hide, disable, uninstall, revert, delete, bin, trash, kill, and burn everything I deemed unnecessary.
But I couldn't: Windows simply wouldn't let me. Either I couldn't immediately find where to configure the thing---made worse by a Dutch installation---or it simply was part of the "core Windows" experience and impossible to alter.

View File

@ -1,8 +0,0 @@
<html>
<head>
<meta http-equiv="Refresh" content="0; url='https://brainbaking.com/post/2020/10/a-personal-journey-through-the-history-of-webdesign/'" />
</head>
<body>
If your browser does not redirect you, click here: <a href="https://brainbaking.com/post/2020/10/a-personal-journey-through-the-history-of-webdesign/">https://brainbaking.com/post/2020/10/a-personal-journey-through-the-history-of-webdesign/</a>.
</body>
</html>