In the meanwhile, wikipedia ships wikidata, which uses RDF dumps (and probably 8x less compressed than it should be).
https://www.wikidata.org/wiki/Wikidata:Database_download
There is room for a third option leveraging commercial columnar database research.
I wouldn’t want to lose access to knowledge how to fix a sink or which medication is better, just because the local kingface currently feels that free exchange of opinions about him threatens his kingship.
In the 1950s, US Civil Defense had a set of microfilms on how to rebuild society. These were packaged with a sunlight reader and stored in larger fallout shelters. Someone should find one of those.
For the margins a $280 MSRP allows you'd think they'd at least try a little bit: maybe hook people up with the RPi Compute Module which has eMMC onboard
One "popular" example for those whose horizon doesn't extend over US country borders:
"Hurricane Katrina devastated communications infrastructure across the Gulf Coast, incapacitating telephone service, police and fire dispatch centers, and emergency radio systems. Almost three million customer phone lines were knocked out, telephone switching centers were seriously damaged, and 1,477 cell towers were incapacitated. Most of the radio stations and many television stations in the New Orleans area were knocked off the air. Paul McHale, the Assistant Secretary of Defense for Homeland Defense, summarized the damage by stating, “The magnitude of the storm was such that the local communications system wasn’t simply degraded; it was, at least for a period of time, destroyed."
https://georgewbush-whitehouse.archives.gov/reports/katrina-...
"Our preparedness culture must also emphasize the importance of citizen and community preparedness. […] Thus, citizens and communities can help themselves by becoming more prepared. If every family maintained the resources to live in their homes without electricity and running water for three days, we could allocate more Federal, State, and local response resources to saving lives. Similarly, if every family developed their own emergency preparedness plan, they almost certainly would reduce the demand for outside emergency resources. As the 9/11 Commission Report states, “One clear lesson of September 11 is that individual civilians need to take responsibility for maximizing the probability that they will survive, should disaster strike."
https://georgewbush-whitehouse.archives.gov/reports/katrina-...
Offline access and local models aren’t about assuming collapse—they’re about treating knowledge as infrastructure instead of something implicitly guaranteed.
That feels more like resilience than pessimism.
This isn’t prepping for anything it’s cosplaying as a vault dweller.
P.S. Having TED talks as part of the “educational” curriculum of this project is probably the biggest circle jerk imaginable.
AlexNet -> Tansformers -> ChatGPT -> Claude Code -> Small LMs serving KBs
Large LLMs could have a role in efficiently producing such KBs.
What if we build what i am calling WWTN (World wide text network). very low bit rate network that can at most send sms level data its a packet routing at lower level (possibly MAC addr is a hash of pub key of node like p2p networks work but fully p2p not ISP backed, censorship resistant. Reticulum + LORA + ... actually global.
someone come up with better name than WWTN tho
I do think having an LLM as an optional "sidecar" is a useful approach. If you can run a meaningful Ollama instance alongside your content, great!
The durable asset is the knowledge base itself. A local model can be useful on top, but it should stay a layer, not become the dependency.
I was planning to build my own offline repository, but will check out this repo.
>What is Project N.O.M.A.D.? Node for Offline Media, Archives, and Data
That's the first header, and the first sentence of the first paragraph, and I'm confused.
To "go offline" means for something to become inaccessible that was once accessible "online". ("Offline" is an adverb.)
Meanwhile, an "offline" thing is one which is usable even without ever being "online". ("Offline" is an adjective.)
So it becomes:
> "Knowledge That Never [Becomes Inaccessible]"
> "Node for [Accessible-Without-Connection] Media, Archives, and Data"
But definitely confusing to put them right next to each other like that. You'd think a copyeditor would flag it or something.
>Knowledge That Never Goes Offline
Means
>Knowledge That Never becomes inaccessible to you
While the next offline means you can access it even if you don't have access to a wider network.
At least that's how I would read it.
I used it on a long train trip. There was no internet due to drone attacks, and with Kiwix I could browse pre-downloaded Wikis
Maybe it's like linux distros: all based on the same software, but optimized for different use-cases or preferences.
whatever I think might be useful later, I capture through the web clipper extension. [0]