Give better error message in Tinystories data loader #319

Merged 1 commit into karpathy:master on Aug 19, 2023

Conversation

RahulSChand
Contributor

@RahulSChand RahulSChand commented Aug 19, 2023

There are a few open issues from people getting stuck running train.py: the tinystories.py data loader enters an infinite loop if there are no .bin files in the data directory. I have added an assert statement so that training fails with an informative error message instead of silently hanging.

Relevant issues
#296
#311
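
For readers who have not hit this, the failure mode and the fix can be sketched roughly as follows. This is a minimal illustration, not the actual diff: the helper names `list_shards` and `iter_shards` are hypothetical, and the real loader in tinystories.py may be structured differently.

```python
import glob
import os


def list_shards(bin_dir):
    """Return the sorted .bin shard paths in bin_dir, failing fast if there are none."""
    shard_filenames = sorted(glob.glob(os.path.join(bin_dir, "*.bin")))
    # Without this check, the endless loop in iter_shards would spin over an
    # empty list forever and never yield anything, so training appears stuck.
    assert len(shard_filenames) > 0, f"No bin files found in {bin_dir}"
    return shard_filenames


def iter_shards(bin_dir):
    """Yield shard paths endlessly, in the style of an infinite training stream."""
    shard_filenames = list_shards(bin_dir)
    while True:
        for shard in shard_filenames:
            yield shard
```

Note that `assert` statements are stripped when Python runs with `-O`, so an explicit `raise` would be the sturdier choice if that matters.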

@RahulSChand RahulSChand changed the title from "Give better error message in data loader" to "Give better error message in Tinystories data loader" Aug 19, 2023
Contributor

@madroidmaq madroidmaq left a comment

Great tip, I just ran into this problem. I suggest refining the wording of the message: when someone actually hits this problem, it could include a hint about what to check, like:

"No bin files found in {bin_dir}, Please check the pretokenize execution result."

@karpathy karpathy merged commit d2a546c into karpathy:master Aug 19, 2023
@karpathy
Owner

thank you for the PR, good add

vinhtran2611 pushed a commit to vinhtran2611/llama2.c that referenced this pull request Jan 20, 2024
Give better error message in Tinystories data loader

3 participants