Commit 62d8430

Update README.md
1 parent 735d05b commit 62d8430

1 file changed, +3 -3 lines changed

README.md

@@ -18,7 +18,9 @@ Due to the small size of public released dataset, we proposed to collect data fr
 ## Pretrained models
 
 we aims to train median-large pre-trained models (model size with 110M) based on GPT-Neo:
-- PyCodeGPT-110M: derived from GPT-Neo 125M with a vocabulary size of 32K. [Download PyCodeGPT 110M](https://github.com/microsoft/PyCodeGPT/releases/tag/PyCodeGPT-110M). We also continue-pretrained PyCodeGPT for Libary oriented scenarios, called [CERT](https://github.com/microsoft/PyCodeGPT/tree/main/cert), the models can be downloaded [here](https://github.com/microsoft/PyCodeGPT/releases).
+- PyCodeGPT-110M: derived from GPT-Neo 125M with a vocabulary size of 32K.
+
+PyCodeGPT-110M is available on [HuggingFace](https://huggingface.co/Daoguang/PyCodeGPT).
 
 ## Evaluation
 1. Install requirements (python 3.7)
@@ -100,8 +102,6 @@ Note: our model can have a comparable accuracy with Codex of similar model size.
 |PolyCoder 400M |2.96% |5.29% |11.59% |
 |PolyCoder 2.7B |5.59% |9.84% |17.68% |
 
-=======
-
 ## Reference
 If you want to use the models, you need to cite our following paper:
 