A DataLad issue was raised about users inadvertently corrupting locked
files. That led to an example where a user could copy corrupted content
over to a special remote without the corruption being flagged until a
later get call. (A slightly different example, based on a directory
remote and without exporttree=yes, is included below.)
In the case of a regular remote, the copy call would fail earlier with:
verification of content failed
failed to send content to remote
verification of content failed
failed to send content to remote
failed
git-annex: copy: 1 failed
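For reference, that regular-remote failure should be reproducible by swapping the directory special remote in the script below for an ordinary clone; the remote name b and its location here are just illustrative:

git clone a b
(cd b && git annex init)
(
cd a
git remote add b ../b
git annex copy --to=b one    # fails with "verification of content failed"
)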
Should something similar happen when copying or exporting to a special remote? Perhaps verification before transfer to a special remote isn't worth it, but the successful transfer surprised me given the behavior when transferring to regular remotes.
cd "$(mktemp -d "${TMPDIR:-/tmp}"/dl-XXXXXXX)"
mkdir d
git init a
(
cd a
git annex init
echo one >one
git annex add one
git commit -mone
one_resolved=$(readlink -f one)
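# corrupt the locked object file in place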
chmod +w "$one_resolved"
echo more >>"$one_resolved"
chmod -w "$one_resolved"
git annex initremote d type=directory directory="$PWD"/../d encryption=none
git annex copy --to=d
git annex drop one
git annex get one
)
[...]
copy one (to d...)
ok
(recording state in git...)
drop one ok
(recording state in git...)
get one (from d...)
verification of content failed
Unable to access these remotes: d
Try making some of these repositories available:
e6878750-3a21-4ae9-b9ae-a241f17176a4 -- [d]
failed
git-annex: get: 1 failed
Verifying before transfer would especially make sense when sending files to a trusted special remote, where the transferred copy may end up being the only one. Maybe set the mtime of all files (and directories) in .git/annex/objects to some sentinel value after they're written, then check whether the mtime still has that value before sending the file elsewhere?
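As a rough sketch of that idea (not anything git-annex does today; the sentinel value and the GNU coreutils touch/stat invocations are my own assumptions):

# stamp the object with a fixed, recognizable mtime right after it is written
sentinel=946684800                            # e.g. 2000-01-01 00:00:00 UTC
obj=$(readlink -f one)                        # object file under .git/annex/objects
touch -m -d "@$sentinel" "$obj"

# later, before transferring the object to a (trusted) special remote:
if [ "$(stat -c %Y "$obj")" != "$sentinel" ]; then
    echo "mtime changed since write; re-verify $obj before sending" >&2
fi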
If a special remote supports named pipes, the verification could be done on-the-fly as the file is streamed to the remote.
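Something along these lines is what I have in mind, where cat >sent-copy just stands in for the actual transfer and the fifo name is arbitrary:

obj=$(readlink -f one)
mkfifo verify.fifo
sha256sum <verify.fifo >computed.sha256 &     # checksummer reads one copy of the stream
tee verify.fifo <"$obj" | cat >sent-copy      # the "transfer" reads the other copy
wait
# compare computed.sha256 against the checksum embedded in the SHA256E key
# and fail the transfer (or remove the uploaded copy) on a mismatch
rm verify.fifo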