Thank you for adding import options to handle duplicates. Very handy when consolidating data from various sources.
Can deletion of the source files be decoupled from annex duplication/deduplication options? For example, I would like to import source files without deleting them and at the same time do not import duplicates.
Better yet, since deletion of source files is potentially dangerous, a delete option could be required for deletion to be performed. Example:
git annex import --deduplicate --delete_all_source_files git annex import --deduplicate --delete_source_duplicates
Also, it would be great to have import "status" option which goes over files to be imported and logs their status ( to be imported, duplicate etc. ) without actually performing any changes. It would be great for testing and trial runs.
I hope the above make sense. It would make import feature more flexible.
Cheers,
Posted the above yesterday before realizing that it should probably go to wishlist requests. I am sorry about that.
Basically it is a request to extend import options and perhaps make the options easier to use/understand.
Suggested Import Options:
Many thanks for a great product!
Imported several thousand files to annex and would like to add the following comments:
it would be great to have an option to exclude hidden dot files from import,
empty directories should be deleted when files located in the directories are deleted,
"git annex add" seems to process directories and files alphabetically, unfortunately import processes files in a different order, which makes it hard to predict which files are deleted when deduplicating,
Cheers,