Does lemmy have any communities dedicated to archiving/hoarding data?

    • Random Dent@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      ·
      30 minutes ago

      You can also offline the whole of Project Gutenberg with Kiwix, it’s about 70GB IIRC.

    • Fmstrat@lemmy.world
      link
      fedilink
      arrow-up
      2
      ·
      edit-2
      7 hours ago

      120GB not including Wikimedia 😉

      Also, I wish they included OSM maps, not just the wiki.

    • clif@lemmy.world
      link
      fedilink
      arrow-up
      35
      ·
      edit-2
      21 hours ago

      Last time I updated it was closer to 120GB but if you’re not sweating 100 GB then an extra 20 isn’t going to bother anyone these days.

      Also, thanks for reminding me that I need to check my dates and update.

      EDIT: you can also easily configure a SBC like a Raspberry Pi (or any of the clones) that will boot, set the Wi-Fi to access point mode, and serve kiwix as a website that anyone (on the local AP wifi network) can connect to and query… And it’ll run off a USB battery pack. I have one kicking around the house somewhere

      • Fmstrat@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        edit-2
        7 hours ago

        Do you recommend adding anything else to it?

        For instance, OSM maps?

        I’ve been thinking about running the Kiwix app + OSMAnd on an old Android phone and auto updating it once a year.

        • clif@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          6 hours ago

          That’s a good question (and good idea) that I hadn’t really thought about past a collection of ZIMs. The one I built advertises it’s own AP SSID that anyone can connect to and then access the ZIMs that are served via kiwix-serve on HTTP/80. That is, I wanted a single, low power, headless device that multiple people could use simultaneously via wifi and browser rather than a personal device.

          I hadn’t really thought about other helpful services past that. I mean, we’ve got a (wee) server so why not use it? I like the idea of OSM and their website is open source but has a lot of dependencies :

          openstreetmap-website is a Ruby on Rails application that uses PostgreSQL as its database, and has a large number of dependencies for installation

          A fully-functional openstreetmap-website installation depends on other services, including map tile servers and geocoding services, that are provided by other software. The default installation uses publicly-available services to help with development and testing.

          I wonder how hard it would be to host everything it needs locally/offline… and what that would do to power consumption : )

          Thanks for the idea - something to look into, for sure.

          • Fmstrat@lemmy.world
            link
            fedilink
            arrow-up
            1
            ·
            1 hour ago

            I might beat you to it. I’ve got Kiwix running in docker, just did a PR to the kiwix-zim-updater so it can run in Docker on a cron schedule next to the server, and have spun those up with Karakeep (self-hosted web archive I use for bookmarking).

            Right now I’m adding a ZIM list feature to the updater to list available ZIMs by language, and then I’ll move on to OSM.

      • techwithjake@sh.itjust.works
        link
        fedilink
        arrow-up
        14
        ·
        20 hours ago

        Just built one of those using Dietpi as the OS and NVME M.2 for the storage. I have many different ZIMs and running different services and only using about 270GB.

        Works great for offline use. Probably should add an ISO or 2 as well.

        • clif@lemmy.world
          link
          fedilink
          arrow-up
          3
          ·
          edit-2
          5 hours ago

          What other services are you running?

          @[email protected] asked what else I was running in a sibling comment to yours and I didn’t have an answer because I’m not… yet : )

          • techwithjake@sh.itjust.works
            link
            fedilink
            arrow-up
            4
            ·
            3 hours ago

            DietPi makes it dead simple to run most of these things as their “software suite” is pretty robust and simple to setup.

            For “user facing” applications:

            • Homer Dashboard as the landing page when going to the .local address in a browser
            • Kiwix for the ZIMs
            • Hedgedoc for personal note taking/wiki
            • Lychee photos for a very lightweight photo album maker/viewer for keepsake photos.

            For “admin side” stuff:

            • Portainer to manage the containers/stacks
            • Watchtower to auto-update the containers while they’re still network connected
            • Transmission daemonized to download and seed the ZIMs or anything else non-pirate related
            • Use jojo2357’s ZIM updater to auto-update ZIMs via cron job while they’re still network connected
            • DietPi-Dashboard as an all-in-one dashboard to monitor and control the RPi from a web interface. (Yeah I know I can do everything SSH’ing in but I’m lazy.)
            • File Browser just in case I want other people to have access to files but since it’s in maintenance mode and I’m unsure I want others to have access, might strip it out

            I try to use containers from LinuxServer.io whenever possible. Mostly just cause it’s what I do on my main server.

            I’m still looking at adding/removing things as I get more time to sit down but I’m pretty happy with it’s current state.

    • mistermodal@lemmy.ml
      link
      fedilink
      arrow-up
      29
      ·
      23 hours ago

      Yeah also if you make a Zim wiki or convert a website into Zim then you can run that stuff too. If you use Emacs it’s easy to convert some pages to wikitext for Zim too

    • Gigasser@lemmy.world
      link
      fedilink
      arrow-up
      1
      ·
      18 hours ago

      I wonder if there’s anyways to edit these files afterwards? They tend to be read only, right? I must confess, I don’t have too much experience with this myself.

      • Prathas@lemmy.zip
        link
        fedilink
        arrow-up
        4
        ·
        14 hours ago

        It’s probably hundreds of thousands of HTML files, no? What is the fear about being able to edit or not?