Pre-trained Model Not Loading Correctly #1

Closed
@MrWeast

Description:

When following the example instructions to run a pre-trained model, loading the checkpoint fails with size mismatches in many layers.

Reproduction Steps:

  1. Get the model pre-trained on MolPCBA (default).
  2. Run the downstream evaluation:
  • python main.py --cfg configs/mol_bench/zinc-GPS+GPSE.yaml

Environment:

  • Operating System: Windows 10

Code Snippets:

RuntimeError: Error(s) in loading state_dict for GraphGymModule:
        size mismatch for model.post_mp.node_post_mps.0.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.0.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.0.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.1.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.1.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.1.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.2.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.2.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.2.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.3.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.3.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.3.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.4.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.4.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.4.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.5.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.5.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.5.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.6.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.6.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.6.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.7.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.7.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.7.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.8.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.8.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.8.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.9.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.9.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.9.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.10.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.10.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.10.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.11.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.11.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.11.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.12.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.12.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.12.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.13.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.13.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.13.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.14.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.14.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.14.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.15.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.15.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.15.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.16.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.16.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.16.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.17.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.17.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.17.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.18.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.18.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.18.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.19.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.19.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.19.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.20.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.20.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.20.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.21.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.21.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.21.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.22.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.22.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.22.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.23.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.23.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.23.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.24.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.24.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.24.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.25.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.25.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.25.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.26.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.26.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.26.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.27.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.27.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.27.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.28.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.28.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.28.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.29.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.29.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.29.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.30.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.30.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.30.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.31.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.31.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.31.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.32.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.32.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.32.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.33.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.33.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.33.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.34.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.34.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.34.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.35.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.35.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.35.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.36.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.36.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.36.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.37.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.37.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.37.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.38.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.38.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.38.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.39.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.39.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.39.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.40.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.40.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.40.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.41.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.41.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.41.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.42.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.42.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.42.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.43.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.43.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.43.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.44.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.44.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.44.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.45.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.45.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.45.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.46.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.46.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.46.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.47.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.47.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.47.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.48.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.48.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.48.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.49.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.49.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.49.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
        size mismatch for model.post_mp.node_post_mps.50.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
        size mismatch for model.post_mp.node_post_mps.50.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
        size mismatch for model.post_mp.node_post_mps.50.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
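The log above is long but repetitive, so a quick way to read it is to group the mismatch lines by their (checkpoint shape, model shape) pair. Below is a minimal sketch that does this with the standard library only. The helper names `parse_shape` and `summarize_mismatches` are hypothetical and not part of this repository, and the regular expression assumes the message wording shown in the log above. Applied to the full log, it should show that every mismatch has 32 in the checkpoint where the current model has 512, which may point to a width setting for the `post_mp` heads that differs between the checkpoint and the config (an inference from the log, not a confirmed cause).

```python
import re
from collections import Counter

# Matches one "size mismatch" line in the wording shown in the log above.
MISMATCH_RE = re.compile(
    r"size mismatch for (?P<name>\S+): copying a param with shape "
    r"torch\.Size\(\[(?P<ckpt>[\d, ]*)\]\) from checkpoint, "
    r"the shape in current model is torch\.Size\(\[(?P<model>[\d, ]*)\]\)"
)


def parse_shape(text):
    """Turn a string such as '32, 512' into the tuple (32, 512)."""
    return tuple(int(part) for part in text.split(",") if part.strip())


def summarize_mismatches(log_text):
    """Count mismatch lines per (checkpoint shape, model shape) pair."""
    counts = Counter()
    for match in MISMATCH_RE.finditer(log_text):
        key = (parse_shape(match["ckpt"]), parse_shape(match["model"]))
        counts[key] += 1
    return counts


# Three lines copied from the log above (head 0 only), used as sample input.
sample = """
size mismatch for model.post_mp.node_post_mps.0.model.0.Layer_0.layer.model.weight: copying a param with shape torch.Size([32, 512]) from checkpoint, the shape in current model is torch.Size([512, 512]).
size mismatch for model.post_mp.node_post_mps.0.model.0.Layer_0.layer.model.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([512]).
size mismatch for model.post_mp.node_post_mps.0.model.1.model.weight: copying a param with shape torch.Size([1, 32]) from checkpoint, the shape in current model is torch.Size([1, 512]).
"""

summary = summarize_mismatches(sample)
for (ckpt_shape, model_shape), n in summary.items():
    print(f"checkpoint {ckpt_shape} vs model {model_shape}: {n} parameter(s)")
```

To use it on the real output, paste the full `RuntimeError` text into `sample` or read it from a saved log file.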
