git-annex allows managing large files with git, without storing the file contents in git. It can sync, backup, and archive your data, offline and online. Checksums and encryption keep your data safe and secure. Bring the power and distributed nature of git to bear on your large files with git-annex.

git-annex is designed for git users who love the command line. For everyone else, the git-annex assistant turns git-annex into an easy to use folder synchroniser.

To get a feel for git-annex, see the walkthrough.

key concepts

new features

the details

other stuff

use case: The Archivist

Bob has many drives to archive his data, most of them kept offline, in a safe place.

With git-annex, Bob has a single directory tree that includes all his files, even if their content is being stored offline. He can reorganize his files using that tree, committing new versions to git, without worry about accidentally deleting anything.

When Bob needs access to some files, git-annex can tell him which drive(s) they're on, and easily make them available. Indeed, every drive knows what is on every other drive.
more about location tracking

Bob thinks long-term, and so he appreciates that git-annex uses a simple repository format. He knows his files will be accessible in the future even if the world has forgotten about git-annex and git.
more about future-proofing

Run in a cron job, git-annex adds new files to archival drives at night. It also helps Bob keep track of intentional and unintentional copies of files, and logs information he can use to decide when it's time to duplicate the content of old drives.
more about backup copies

use case: The Nomad

Alice is always on the move, often with her trusty netbook and a small handheld terabyte USB drive, or a smaller USB keydrive. She has a server out there on the net. She stores data, encrypted in the Cloud.

All these things can have different files on them, but Alice no longer has to deal with the tedious process of keeping them manually in sync, or remembering where she put a file. git-annex manages all these data sources as if they were git remotes.
more about special remotes

When she has 1 bar on her cell, Alice queues up interesting files on her server for later. At a coffee shop, she has git-annex download them to her USB drive. High in the sky or in a remote cabin, she catches up on podcasts, videos, and games, first letting git-annex copy them from her USB drive to the netbook (this saves battery power).
more about transferring data

When she's done, she tells git-annex which to keep and which to remove. They're all removed from her netbook to save space, and Alice knows that next time she syncs up to the net, her changes will be synced back to her server.
more about distributed version control

If that describes you, or if you're some from column A and some from column B, then git-annex may be the tool you've been looking for to expand from keeping all your small important files in git, to managing your large files with git.

Recent news

version 10.20241202
Posted Mon Dec 2 16:36:34 2024
version 10.20241031
Posted Thu Oct 31 21:20:10 2024

devblog

day 652-664 git-remote-annex
Posted Thu May 16 17:23:42 2024
day 651 a major release and a conference
Posted Mon Jun 26 15:53:43 2023
day 649-650 speeding up repeated imports
Posted Thu Jun 1 22:43:33 2023
day 644-648 terminal escape sequences
Posted Wed Apr 12 19:03:10 2023
day 643 adjusted view branches
Posted Mon Feb 27 20:28:28 2023

Recent videos

Recent forum posts

clone and initialize with a given uuid
Posted Sun Dec 8 20:26:16 2024
less paranoid mode
Posted Sun Dec 1 10:37:15 2024
Requesting of files across disconnected devices
Posted Mon Nov 25 00:55:06 2024
How to get a list of all NOT unused files
Posted Fri Nov 15 08:44:53 2024
Clusters - what's the use case?
Posted Tue Nov 12 21:18:43 2024

git-annex is Free Software, written in Haskell. You can contribute!

git-annex's wiki is powered by Ikiwiki and hosted by Branchable.