command to migrate object files from hashdirlower to hashdirmixed

This is something of a follow-up to https://git-annex.branchable.com/bugs/add_config_var_preventing_adjusted_branch_mode/ where we ended up in adjusted branch mode and want to get back to the original indirect mode using the thaw/freeze commands. Checking out the master branch is not sufficient, since .git/annex/objects uses a different layout, I guess to ensure that symlinks do not jeopardize the actual annex storage on systems without read-only protection. So we need some command to migrate the .git/annex/objects layout. Maybe it is already there and I just failed to find it.

fixed --Joey

comment 1

But adjusted branches do not affect the location of git-annex object files.

The git-annex adjust man page says to use git checkout to switch back, and it certainly does work.

If you are having a problem with this, you need to explain what the problem is and what is happening...

Comment by joey — Mon May 9 14:57:55 2022

hm...

I will need to figure out (or try to reproduce) how the user ended up with .git/annex/objects in the xxx/yyy layout instead of the xx/yy layout...
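For anyone wanting to check which layout a repository has, here is a minimal sketch; it is not part of git-annex. It assumes only what is described above: one layout nests objects under two 3-character directories (xxx/yyy, called hashdirlower in the title of this page) and the other under two 2-character directories (xx/yy, hashdirmixed). The function name classify_objects_layout is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: guess the layout of an annex objects directory from
# the length of its first- and second-level directory names.
#   xxx/yyy (3 characters each) -> counted as hashdirlower
#   xx/yy   (2 characters each) -> counted as hashdirmixed
classify_objects_layout() {
    objects_dir=$1
    lower=0
    mixed=0
    # Visit only second-level directories, e.g. objects/abc/def or objects/ab/cd.
    for d in "$objects_dir"/*/*/; do
        [ -d "$d" ] || continue          # skip the unexpanded glob if nothing matches
        second=$(basename "$d")
        first=$(basename "$(dirname "$d")")
        case "${#first}/${#second}" in
            3/3) lower=$((lower + 1)) ;;
            2/2) mixed=$((mixed + 1)) ;;
        esac
    done
    echo "hashdirlower=$lower hashdirmixed=$mixed"
}
```

It would be called as classify_objects_layout .git/annex/objects from the top of the repository. A repository that reports non-zero counts for both layouts has objects in both places.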

Comment by yarikoptic — Mon May 9 19:10:10 2022

comment 3

Ok, so the object location usually used in bare repositories..

One way that could happen is if core.symlinks=false and annex.crippledfilesystem=true. Then it does use the bare form of object filenames, which is kind of ok since it's not going to be using symlinks in that repository.
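The rule described above can be written down as a small decision function. This is only a sketch of the rule as stated in this comment, not git-annex's actual code; the function name expected_layout and its three arguments are hypothetical, and the layout names are the ones from the title of this page.

```shell
#!/bin/sh
# Hypothetical helper: which object layout would be expected for a repository,
# following the rule described in this comment.
# Usage: expected_layout BARE SYMLINKS CRIPPLED   (each "true" or "false")
expected_layout() {
    bare=$1
    symlinks=$2
    crippled=$3
    if [ "$bare" = true ]; then
        # Bare repositories use the bare form of object filenames.
        echo hashdirlower
    elif [ "$symlinks" = false ] && [ "$crippled" = true ]; then
        # core.symlinks=false together with annex.crippledfilesystem=true
        # also selects the bare form.
        echo hashdirlower
    else
        echo hashdirmixed
    fi
}
```

In a real repository the three arguments could be filled in from git config --bool core.bare, git config --bool core.symlinks and git config --bool annex.crippledfilesystem. An unset value prints nothing, which this sketch treats the same as not matching.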

Also, before 2016 (commit 2d00523609def535588b693a00d4092768e1c3c6), git-annex used those names whenever annex.crippledfilesystem=true, no matter what core.symlinks was set to. So if the files are that old..

This does seem to point to there needing to be a way to migrate the object files in a repository to the right names. It might be a reasonable thing for git-annex fsck to do, when it sees a symlink to an object file that is in the other location.

Comment by joey — Mon May 9 19:54:35 2022

comment 4

The rationale for using the bare object layout when on a crippled filesystem was given in f1b0a4b404ed835f1c4a27a92352180be8564f8a. Basically it may be more portable. Not a strong rationale at all, as the later change to not do it when symlinks are supported shows. But I don't think it is worth changing at this point.

So teaching fsck to move object files to the preferred location seems the best way. It will also deal with the situation where a bare repository gets converted by the user into a non-bare repo.

Comment by joey — Tue May 10 16:37:21 2022

comment 5

As well as moving the object file, fsck will need to move any other associated files, including the object lock file. It may as well move the whole object directory.

Locking is a concern for implementing this in fsck. There would be a race where another process that is locking the object file sees the object file in the old location, so tries to lock it in the old location, but by then the object file has been moved.

Experimentally: In v10, moving the object file after it has checked its location in preparation for locking for drop results in it making a separate lock file in the old object directory. That lock file remains after the drop succeeds. In v8/v9, it seems to not create the object file when trying to lock it. (Based on reading the code, I thought perhaps it would!) In v8-v10, moving the object directory in the race when it's locking content in place causes the lock to fail; it does not create any lock file or object file.

So, v10 post drop lock file cleanup is the problem. Or at least one problem, there could be other points in the race than the one I tested that have other behavior. This seems like an ugly race to insert fsck into the middle of; it would be much preferable if fsck could somehow avoid such races when moving the object directory. But how?

fsck could lock the object file for drop, and then rather than removing it, move it to a holding location. Then it could move the object file into the right place the same as get does. This should avoid the race. Interrupting fsck at the wrong time would leave the object file in this holding location though. Re-running fsck would need to recover from this situation. Putting it in .git/annex/tmp/ might make sense, although git-annex get does not necessarily recover when the object file is located there.
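To make the two-step move concrete, here is a rough sketch of only the file movements. It is not how fsck is implemented and it does no locking at all, which is the part that actually matters for the race. The function names move_via_holding and stranded_objects, and the idea of passing the holding directory as an argument, are assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical sketch: move an object directory to its preferred location
# by way of a holding directory. Locking is deliberately left out.
# Usage: move_via_holding OLD_OBJECT_DIR NEW_OBJECT_DIR HOLDING_DIR
move_via_holding() {
    old=$1
    new=$2
    holding=$3
    name=$(basename "$old")
    mkdir -p "$holding" "$(dirname "$new")" || return 1
    # Step 1: instead of removing the content, park it in the holding location.
    mv "$old" "$holding/$name" || return 1
    # Step 2: move it into the preferred location.
    mv "$holding/$name" "$new"
}

# List anything left behind in the holding directory, e.g. after an
# interruption between step 1 and step 2. A re-run would have to deal with these.
stranded_objects() {
    ls -A "$1" 2>/dev/null
}
```

If the process dies between the two mv calls, the object is in neither location, which is why a later run needs something like stranded_objects to find and recover it.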

Comment by joey — Tue May 10 16:56:47 2022

comment 6

I guess fsck could just lock the entire repo for its duration, forbidding any other operation? I would love to be able to migrate the layout also on "older" versions of repo/annex without upgrading all the way to 10. Meanwhile I think I am doomed to write a little helper to do those renames (once, and hopefully never ever again... I might even protect myself by making those top-level xxx directories, now known to me, non-writable at the ACL level, so that an attempt to migrate would lead to an error).

Comment by yarikoptic — Fri May 13 15:12:29 2022

shell helper

FWIW made this shell helper to migrate all keys into desired layout: https://raw.githubusercontent.com/datalad/datalad/maint/tools/convert-git-annex-layout

Comment by yarikoptic — Fri May 13 16:51:22 2022

comment 6

If fsck locks the content for removal, then moves it to the preferred location, how is that any different from git-annex first dropping content and then very quickly retrieving another copy and storing it in the other location? The only difference is timing, but things like being suspended and resumed can affect timing.

So, if there is a problem with fsck doing that, there would also be a more general problem, that could occur in other circumstances, even if only rarely.

One way to see the general problem happen would be to have two processes trying to drop the same object. One process finds the object location, then stalls. Meanwhile, the second process drops the object. Then the first process resumes, and locks for removal. Per comment #5 this will result in a dangling lock file in the object directory. I have not managed to get this to happen yet though.

A fix for the general problem is to make it not create the object directory when opening the object lock file. So I've made that change.
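The idea behind that change can be illustrated with a toy shell function. This is not the git-annex code, which is written in Haskell; the name open_lock_file and the lock file name passed to it are made up. The point is only that the lock file gets created when the object directory already exists, and that a missing directory is reported as a failure rather than being recreated.

```shell
#!/bin/sh
# Hypothetical sketch: create a lock file only inside an existing object
# directory. There is deliberately no "mkdir -p" here, so a directory that
# has been dropped or moved away is never brought back just to hold a lock.
# Usage: open_lock_file OBJECT_DIR LOCK_FILE_NAME
open_lock_file() {
    dir=$1
    lock=$2
    [ -d "$dir" ] || return 1    # object directory is gone: fail, create nothing
    touch "$dir/$lock"
}
```

A caller that gets a failure back knows the content is no longer at that location and can look it up again instead of leaving a dangling lock file behind.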

Comment by joey — Mon May 16 15:24:43 2022

comment 7

Made git-annex fsck move the object files to the preferred location for the repository type.

You can run it with --fast and it should solve your problem. I'm still not certain what circumstance led to you having the problem, but unless I hear back I'll assume it was something like an old version of git-annex. So will close this bug with this as the fix..

Comment by joey — Mon May 16 19:17:34 2022

comment 10

FWIW: confirming that git-annex fsck --fast worked out nicely on a sample test repo.

Re question above on how we got there: it is trivial. While having no globally defined thawcontent-command/freezecontent-command we created a new repo with git annex 10.20230126-1~ndall+1 (so - recent).

[d31548v@discovery7 test]$ git init; git annex init; echo 123 > 123 ; git annex add 123; git commit -m 123 123; ls -l 
Initialized empty Git repository in /dartfs-hpc/rc/lab/C/CANlab/labdata/projects/test/test/.git/
init  
  Filesystem does not allow removing write bit from files.

  Detected a crippled filesystem.

  Disabling core.symlinks.

  Entering an adjusted branch where files are unlocked as this filesystem does not support locked files.

Switched to branch 'adjusted/master(unlocked)'
ok
(recording state in git...)
add 123 
ok                                
(recording state in git...)
[adjusted/master(unlocked) 6fc2d66] 123
 1 file changed, 1 insertion(+)
 create mode 100644 123
total 24
-rwxrwx--- 1 d31548v rc-CANlab-admin 4 May 23 12:01 123
[d31548v@discovery7 test]$ ls -lta
total 120
drwxrwx--- 9 d31548v rc-CANlab-admin 293 May 23 12:01 .git
-rwxrwx--- 1 d31548v rc-CANlab-admin   4 May 23 12:01 123
drwxrwx--- 3 d31548v rc-CANlab-admin  43 May 23 12:01 .
drwxrwx--- 4 d31548v rc-CANlab-admin  62 May 23 12:01 ..

so we ended up in adjusted branches mode, with .git/config having those core.symlinks = false and annex.crippledfilesystem = true.

Then we wanted to move back to "normal": we enabled those thaw/freeze config options, ran git config --unset core.symlinks; git config --unset annex.crippledfilesystem, then ran git annex fsck --fast, and it seems we got it all right.
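Collected into one place, the steps above might look like the following sketch. The wrapper function restore_preferred_layout is made up for illustration. It assumes the thaw/freeze options (annex.thawcontent-command and annex.freezecontent-command) have already been configured with whatever commands suit the filesystem, since those values are site-specific and not shown here.

```shell
#!/bin/sh
# Hypothetical wrapper around the steps described above. Run it from inside
# the repository, after the thaw/freeze commands have been configured.
restore_preferred_layout() {
    # Drop the settings that git annex init recorded for the crippled filesystem.
    # "|| true" tolerates a key that is already unset.
    git config --unset core.symlinks || true
    git config --unset annex.crippledfilesystem || true
    # fsck moves object files to the preferred location; --fast skips checksumming.
    git annex fsck --fast
}
```

This only defines the function; nothing happens until restore_preferred_layout is called. Trying it on a test clone first seems prudent.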

Comment by yarikoptic — Tue May 23 16:12:42 2023
