Can't seem to get sample_speaker.py to generate text for new images

I wish to generate caption text for images that I'll be providing. My understanding is that sample_speaker.py will do this. However, when I run it I get an error. Here's what I run in terminal, with the relevant parts of config.json.txt changed.

`python sample_speaker.py -speaker-saved-args config.json.txt -speaker-checkpoint best_model.pt -img-dir image_folder -out-file /Outputs/results.pkl`

When I do this, I get:

```
RuntimeError: Error(s) in loading state_dict for ModuleDict:
	size mismatch for decoder.word_embedding.weight: copying a param with shape torch.Size([14469, 128]) from checkpoint, the shape in current model is torch.Size([35466, 128]).
	size mismatch for decoder.next_word.weight: copying a param with shape torch.Size([14469, 512]) from checkpoint, the shape in current model is torch.Size([35466, 512]).
	size mismatch for decoder.next_word.bias: copying a param with shape torch.Size([14469]) from checkpoint, the shape in current model is torch.Size([35466]).

```
Can you advise what I'm doing wrong here? I can't quite get to the bottom of it. Thanks!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't seem to get sample_speaker.py to generate text for new images #5

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Can't seem to get sample_speaker.py to generate text for new images #5

Description

Activity

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions