Code complete? #57
Comments
For pre-training from scratch, you'd need the pre-training data to be specified and available, which it isn't; the code is therefore by definition not complete.
Yes, you are right. I want to train Mistral 7B from scratch on a different dataset, so I'm eager to know whether the model code is complete. If you are familiar with the code and can tell me, I would be immensely grateful.
I'm not connected to Mistral and have the same questions as you. From what I can see, the repo is set up to share some code for running released versions of Mistral (and if you trawl the issues, you'll see that it is not complete even for that, e.g. people cannot get the instruct version to run). It does not look like this repo contains the code you would need to pre-train from scratch.
I'd like to know if the code in this repository is complete. Has anyone tried pre-training this model from scratch?