Skip to content

Commit 61d50cf

Browse files
committed
fix
1 parent 6fbe229 commit 61d50cf

File tree

1 file changed

+1
-3
lines changed
  • colossalai/shardformer/policies

1 file changed

+1
-3
lines changed

colossalai/shardformer/policies/gpt2.py

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -40,10 +40,8 @@ def preprocess(self):
4040
self.model.resize_token_embeddings(new_vocab_size)
4141
else:
4242
# Make vocab_size divisible by `make_vocab_size_divisible_by` to select a faster CUDA kernel operator.
43-
new_vocab_size = vocab_size
4443
multiple = self.shard_config.make_vocab_size_divisible_by
45-
while (new_vocab_size % multiple) != 0:
46-
new_vocab_size += 1
44+
new_vocab_size = (vocab_size // multiple + 1) * multiple
4745
self.model.resize_token_embeddings(new_vocab_size)
4846
return self.model
4947

0 commit comments

Comments
 (0)