Closed
Description
Edit: See @susemeee's comment below (the image COCO_train2014_000000167126.jpg
is corrupted, and you can download a replacement at https://msvocds.blob.core.windows.net/images/262993_z.jpg)
I was trying to run prepro.py
but eventually ran into an issue in scipy
's pilutil
package (see below).
I've installed all dependencies, run the coco_preprocess.ipynb
, and downloaded train2014.zip
+ val2014.zip
and extracted them into coco/images
.
Am I missing something?
$ python prepro.py --input_json coco/coco_raw.json --num_val 5000 --num_test 5000 --images_root coco/images --word_count_threshold 5 --output_json coco/cocotalk.json --output_h5 coco/cocotalk.h5
parsed input parameters:
{
"output_json": "coco/cocotalk.json",
"images_root": "coco/images",
"input_json": "coco/coco_raw.json",
"word_count_threshold": 5,
"max_length": 16,
"output_h5": "coco/cocotalk.h5",
"num_test": 5000,
"num_val": 5000
}
example processed tokens:
['a', 'woman', 'riding', 'a', 'bike', 'down', 'a', 'bike', 'trail']
... lots of info deleted for brevity ...
inserting the special UNK token
assigned 5000 to val, 5000 to test.
encoded captions to array of size (616767, 16)
processing 0/123287 (0.00% done)
... lots of percentages deleted for brevity ...
processing 60000/123287 (48.67% done)
Traceback (most recent call last):
File "prepro.py", line 236, in <module>
main(params)
File "prepro.py", line 186, in main
Ir = imresize(I, (256,256))
File "/usr/local/lib/python2.7/site-packages/scipy/misc/pilutil.py", line 424, in imresize
im = toimage(arr, mode=mode)
File "/usr/local/lib/python2.7/site-packages/scipy/misc/pilutil.py", line 234, in toimage
raise ValueError("'arr' does not have a suitable array shape for "
ValueError: 'arr' does not have a suitable array shape for any mode.
Metadata
Metadata
Assignees
Labels
No labels