Hi, there! I saved parallel state_dict (requires_grad True only) with 8 GPUs remotely, how to load these state_dicts and save them as one locally? Thanks in advance.
collie_dp0_pp0_tp0.pt collie_zero_dp0_pp0_tp0.pt collie_zero_dp2_pp0_tp0.pt collie_zero_dp4_pp0_tp0.pt collie_zero_dp6_pp0_tp0.pt
collie.json collie_zero_dp1_pp0_tp0.pt collie_zero_dp3_pp0_tp0.pt collie_zero_dp5_pp0_tp0.pt collie_zero_dp7_pp0_tp0.pt