Update documentation #392

StellaAthena · 2021-08-20T05:38:13Z

No description provided.

add the instructions to install triton

Update from main

* optimize data preprocessing semaphore is a little too small and slows down tokenizing * Make killall.sh less bruteforce * [temporary] fix to index errors * [temporary] fix to index errors * print sizes of tensors when inspecting checkpoint (#382) Co-authored-by: Samuel Weinbach <samuel.weinbach@gmail.com> * Use lru_cache for GPT2Tokenizer.bpe (#383) GPT2Tokenizer currently uses an unbounded cache, which causes very high memory usage with tools/preprocess_data.py * Fix bug with number of evaluation steps (#384) we were running way to many evaluation steps if the model is pipe parallel + has g.a.s on because of this line ```python for _ in range(neox_args.gradient_accumulation_steps): ``` - fixing this to 1 if the model is pipe parallel fixes the issue, as .eval_batch() already takes gradient accumulation steps into account. * Create CITATION.cff * Update CITATION.cff * Update documentation (#392) * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * add info about installing fused kernels * Update README.md * Update README.md * sparsity + minor typos add the instructions to install triton * change path to ssd-1 * typo * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md * Update README.md Co-authored-by: Shivanshu Purohit <42869065+ShivanshuPurohit@users.noreply.github.com> Co-authored-by: Stella Biderman <stellabiderman@gmail.com> Co-authored-by: Samuel Weinbach <samuel.weinbach@googlemail.com> Co-authored-by: Samuel Weinbach <samuel.weinbach@gmail.com> Co-authored-by: iczero <iczero4@gmail.com> Co-authored-by: Shivanshu Purohit <42869065+ShivanshuPurohit@users.noreply.github.com>

StellaAthena and others added 30 commits June 25, 2021 13:28

Update README.md

962eacc

Update README.md

50ea5cc

Update README.md

5018594

Update README.md

7a75cd4

Update README.md

c221109

Update README.md

8f43bac

Update README.md

3783d7f

Update README.md

f22769e

Update README.md

0b7d2fe

add info about installing fused kernels

5f978c8

Update README.md

20cadc3

Update README.md

2686396

sparsity + minor typos

94980dd

add the instructions to install triton

change path to ssd-1

7d44d97

typo

4333716

Update README.md

05249ab

Update README.md

28a830e

Update README.md

ae00018

Update README.md

5245c6d

Update README.md

4c6469e

Update README.md

c695714

Update README.md

1cccfd2

Update README.md

486ed38

Update README.md

1e97f7d

Update README.md

a3d06bc

Merge pull request #380 from EleutherAI/main

3e4f6d9

Update from main

Update README.md

173dfd4

Update README.md

91bb070

Update README.md

ff74c8a

Update README.md

74a6cdd

Merge pull request #385 from EleutherAI/main

e84a344

Update from main

StellaAthena requested a review from sdtblck August 20, 2021 05:38

StellaAthena requested a review from a team as a code owner August 20, 2021 05:38

StellaAthena requested a review from leogao2 August 20, 2021 05:38

StellaAthena linked an issue Aug 20, 2021 that may be closed by this pull request

Write Sampling Documentation #252

Closed

Update README.md

b6de20b

ShivanshuPurohit approved these changes Aug 21, 2021

View reviewed changes

ShivanshuPurohit merged commit 1d46283 into main Aug 21, 2021

ShivanshuPurohit deleted the documentation branch August 21, 2021 07:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update documentation #392

Update documentation #392

StellaAthena commented Aug 20, 2021

Update documentation #392

Update documentation #392

Conversation

StellaAthena commented Aug 20, 2021