• 19 Posts
  • 1.06K Comments
Joined 2 years ago
cake
Cake day: June 20th, 2023

help-circle






  • I see, fair enough. Replication is never instantaneous, so do you have definite bounds on how much latency you’ll accept? Do you really want independent git servers online? Most HA systems have a primary and a failover, so users only see one server. If you want to use Ceph, in practice all servers would be in the same DC. Is that ok?

    I think I’d look in one of the many git books out there to see what they say about replication schemes. This sounds like something that must have been done before.


  • Why do you want 5 git servers instead of, say, 2? Are you after something more than high availability? Are you trying to run something like GitHub where some repos might have stupendous concurrent read traffic? What about update traffic?

    What happens if the servers sometimes get out of sync for 0.5 sec or whatever, as long as each is in a consistent state at all times?

    Anyway my first idea isn’t rsync, but rather, use update hooks to replicate pushes to the other servers, so the updates will still look atomic to clients. Alternatively, use a replicated file system under Ceph or the like, so you can quickly migrate failed servers. That’s a standard cloud hosting setup.

    What real world workload do you have, that appeared suddenly enough that your devs couldn’t stay in top of it, and you find yourself seeking advice from us relatively clueless dweebs on Lemmy? It’s not a problem most git users deal with. Git is pretty fast and most users are ok with a single server and a backup.




  • Borg is a backup program not a synchronizer. Backing up to mutliple targets just means running a normal backup to target 1, then another to target 2, etc. Maybe what you really want is git. There are also some self-hosted multi-access notepad programs, sort of like how google docs work. Anyway if your problem requires a server or synchronization, look into self-hosting rather than some cloud thing.


  • It’s hard to understand what you want. Why multiple devices? Why remote storage? Why not just use your laptop with local storage and encrypted backups? If you must have remote storage, why not self-host it on a cheap VPS? Just who are you trying to protect your data from? It’s a lot different if you think Trump is after you or something like that: you have to check your bed for microphones, rather than just worrying about your computer software.

    So far I’m satisfied with just using my laptop for personal files, but if I were more paranoid I’d set up a separate laptop with no internet and take some additional precautions besides that. Anyway, the more machines you use, the more potential security holes you have to deal with.

    Multiple backups is just a matter of running a script that backs up to more than one place, right? I use Borg for backup, and pointing it to multiple targets is pretty easy.