Code complete? #57

Open
zhoumengbo opened this issue Oct 28, 2023 · 3 comments

Comments

@zhoumengbo

I'd like to know if the code in this repository is complete. Has anyone tried pre-training this model from scratch?

@mdingemanse

for pre-training from scratch you'd need the pretraining data to be specified and available, which it isn't; therefore the code is by definition not complete.

@zhoumengbo
Author

> for pre-training from scratch you'd need the pretraining data to be specified and available, which it isn't; therefore the code is by definition not complete.

Yes, you are right. I want to train Mistral 7B from scratch on a different dataset, so I'd like to know whether the model code itself is complete. If you are familiar with the code and can tell me, I would be very grateful.

@mdingemanse

mdingemanse commented Oct 30, 2023

I'm not connected to Mistral and have the same questions as you. From what I can see, the repo is set up to share some code for running released versions of Mistral (and if you trawl the issues, you'll see that it is not complete even for that, e.g. people cannot get the instruct version to run).

It does not look like this repo contains the code you would need to pretrain from scratch.
