Normally, git-annex repositories consist of symlinks that are checked into git, and in turn point at the content of large files that is stored in .git/annex/objects/. Direct mode gets rid of the symlinks.

The advantage of direct mode is that you can access files directly, including modifying them. The disadvantage is that most regular git commands cannot be used in a direct mode repository.

Normally, git-annex repositories start off in indirect mode. With some exceptions:

  • Repositories created by the assistant use direct mode by default.
  • Repositories on FAT and other less than stellar filesystems that don't support things like symlinks will be automatically put into direct mode.
  • Windows always uses direct mode.

enabling (and disabling) direct mode

Any repository can be converted to use direct mode at any time, and if you decide not to use it, you can convert back to indirect mode just as easily. Also, you can have one clone of a repository using direct mode, and another using indirect mode.

To start using direct mode:

git annex direct

To stop using direct mode:

git annex indirect

safety of using direct mode

With direct mode, you're operating without large swathes of git-annex's carefully constructed safety net, which ensures that past versions of files are preserved and can be accessed. With direct mode, any file can be edited directly, or deleted at any time, and there's no guarantee that the old version is backed up somewhere else.

So if you care about preserving the history of files, you're strongly encouraged to tell git-annex that your direct mode repository cannot be trusted to retain the content of a file. To do so:

git annex untrust .

On the other hand, if you only care about the current versions of files, and are using git-annex with direct mode to keep files synchronised between computers, and manage your files, this should not be a concern for you.

use a direct mode repository

You can use most git-annex commands as usual in a direct mode repository.

Direct mode also works well with the git-annex assistant.

The most important command to use in a direct mode repository is git annex sync. This will commit any files you have run git annex add on, as well as files that were added earlier and have been modified. It will push the changes to other repositories for git annex sync there to pick up, and will pull and merge any changes made on other repositories into the local repository.

what doesn't work in direct mode

A very few git-annex commands don't work in direct mode, and will refuse to do anything. For example, git annex unlock doesn't make sense in direct mode.

As for git commands, direct mode prevents using any git command that would modify or access the work tree. So you cannot git commit or git pull (use git annex sync for both instead), or run git status (use git annex status instead). These git commands will complain "fatal: This operation must be run in a work tree".

The reason for this is that git doesn't understand how git-annex uses the work tree in direct mode. Where git expects the symlinks that get checked into git to be checked out in the work tree, direct mode instead replaces them with the actual content of files, as managed by git-annex.

There are still lots of git commands you can use in direct mode. For example, you can run git log on files, run git push, git fetch, git config, git remote add etc.

forcing git to use the work tree in direct mode

This is for experts only. You can lose data doing this, or check enormous files directly into your git repository, and it's your fault if you do! Also, there should be no good reason to need to do this, ever.

Ok, with the warnings out of the way, all you need to do to make any git command access the work tree in direct mode is pass it -c core.bare=false

So, if I edit a "content file" (change a music file's metadata, say), what's the workflow to record that fact and then synchronise it to other repositories?

I can't do a git add, so I don't understand what has to happen as a first step. (Thanks for your quick reply above, BTW.)

Comment by Michael Tue Feb 19 03:03:14 2013
git annex add $file
git annex sync
git annex copy $file --to otherrepo
Comment by joeyh.name Tue Feb 19 03:05:35 2013

What happens to the object database (.git/annex/objects) when going to direct mode? Are the objects deleted, moved to another location, kept?

If the objects are kept, does it means that the file on the repository in direct mode is duplicated in the object database? If so, would it be relevant to use cp --reflink=auto to populate the working directory to enable copy on write on filesystems that supports it?

Comment by mildred Mon Jul 8 13:27:21 2013
.git/annex/objects does not typically contain any file contents in direct mode. The file contents are stored directly in the working tree.
Comment by joeyh.name Mon Jul 8 16:11:32 2013

Would it be safe to add largefiles to gitignore in direct mode?

Can git-annex still track large files ignored by git?

Thanks. :-)

Comment by asbraithwaite Mon Aug 12 18:06:21 2013

asbraithwaite: No, as far as I know it can not.

Comment by arand Mon Aug 12 18:12:32 2013

I too have issues with mixing direct and indirect mode repositories.

I have a regular, existing repository with ebooks, shared between various clones on proper :) filesystems; now I would need a copy of some of them on an ereader which only offers a FAT filesystem, so it has to be direct mode.

mount $READER
cd $reader
git clone $REPO

I get a directory full of small files, the way git manages links on FAT.

git annex init "ebook reader"

This detects the fact that it is working on a crippled filesystem, enables direct mode and disables ssh connection caching; up to now everything seems to be fine, but then

git annex get $SOME_BOOK

seems to work, downloads the file somewhere, but when I try to open $SOME_BOOK it is still the fake link, and the file has been downloaded in its destination, as if the repo wasn't in direct mode.

I use version 4.20130723 on debian jessie

Comment by valhalla Sun Aug 18 08:47:35 2013

There should be no obstacles to using direct mode on one clone of a git repository, and indirect mode on another clone. The data stored in git for either mode is identical, and I do this myself for some repositories.

@valhalla, you probably need to run git annex fsck, and if that does not solve your problem, you need to file a bug report.

Comment by joeyh.name Fri Aug 23 17:48:54 2013

@obergix asked:

But then, how can a direct repo sync with changes made in other remotes, if there no pull/fetch available.

The answer is simple: By running git annex sync, which handles all that.

Comment by joeyh.name Fri Aug 23 17:50:15 2013

Thanks for these details @joeyh. But AFAIU, one needs to proceed to the git annex copy before doing the git annex sync, otherwise, symlinks (or files containing the symlink path on SMB) will be created, instead of the plain "direct" files that are expected.

I'm still not sure whether the git annex sync needs to be issued on either of the indirect or direct remotes first, or both, then in which sequence. I think a "walkthrough" script would help.

Comment by obergix Fri Aug 23 19:59:35 2013
No, you can sync before you copy, get, or whatever. git-annex will replace the symlinks with the actual files when they arrive at the repository.
Comment by joeyh.name Sat Aug 24 15:56:47 2013