ENH: Speed up S3DataGrabber using prefix arg #2143

lukassnoek · 2017-08-04T12:41:53Z

Changes proposed in this pull request

When finding files on S3 (using S3DataGrabber), the command ...

bkt_files = list(k.key for k in bkt.list())

... finds all files in a given bucket, which takes very long for the openneuro and openfmri buckets. On my computer, this command (using the example from nipype/interfaces/tests/test_io.py) takes 170 seconds. However, as proposed in my PR, when you use the prefix argument with the bucket_path in the bkt.list() call, it only takes 116 milliseconds. This is because it restricts the filesearch to only the files in the specified bucket_path. The interface still works when bucket_path is not set as input (i.e., it works with the default '' value of the bucket_path parameter).

Add name/affiliation to zenodo-file (first PR!)

lukassnoek added 2 commits August 4, 2017 13:59

ENH: Speed up S3DataGrabber using prefix arg

4cfc6c9

Add name to zenodo

e762c3b

satra merged commit 3164602 into nipy:master Aug 4, 2017

satra added this to the 0.14.0 milestone Oct 20, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENH: Speed up S3DataGrabber using prefix arg #2143

ENH: Speed up S3DataGrabber using prefix arg #2143

Uh oh!

lukassnoek commented Aug 4, 2017

Uh oh!

Uh oh!

ENH: Speed up S3DataGrabber using prefix arg #2143

ENH: Speed up S3DataGrabber using prefix arg #2143

Uh oh!

Conversation

lukassnoek commented Aug 4, 2017

Uh oh!

Uh oh!