Initial commit #1
Conversation
> ## License
>
> Lit-LLaMA is released under the [Apache 2.0](https://github.com/Lightning-AI/lightning-llama/blob/main/LICENSE) license.
>
> # FIXME
We probably want to refresh this
> > **Note**
> > All scripts support argument [customization](customize_paths.md)
>
> ### FIXME: update this
Need to try this on an A100.
lantiga left a comment:
Fantastic work @carmocca!
```python
if hasattr(self, "bias"):
    # causal self-attention; Self-attend: (B, nh, T, hs) x (B, nh, hs, T) -> (B, nh, T, T)
    # NOTE: cannot use flash attention because it takes q.size(-1) as the norm factor which is different to the
```
Why is this conditioned on `bias` being there?
See #2
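For context, here is a minimal sketch of the manual attention path being discussed. It assumes `bias` is a registered causal-mask buffer and that the desired softmax scale differs from flash attention's hard-coded `1 / sqrt(q.size(-1))`; the names, shapes, and scale value are illustrative, not the PR's exact code:

```python
import math

import torch


def manual_causal_attention(q, k, v, bias, scale=None):
    # q, k, v: (B, nh, T, hs); bias: (1, 1, block_size, block_size) causal mask
    T = q.size(-2)
    if scale is None:
        scale = 1.0 / math.sqrt(k.size(-1))  # the choice flash attention hard-codes
    # (B, nh, T, hs) x (B, nh, hs, T) -> (B, nh, T, T)
    att = (q @ k.transpose(-2, -1)) * scale
    att = att.masked_fill(bias[:, :, :T, :T] == 0, float("-inf"))
    att = torch.softmax(att, dim=-1)
    # (B, nh, T, T) x (B, nh, T, hs) -> (B, nh, T, hs)
    return att @ v


# usage with a custom norm factor
B, nh, T, hs = 2, 4, 8, 16
q, k, v = (torch.randn(B, nh, T, hs) for _ in range(3))
bias = torch.tril(torch.ones(T, T)).view(1, 1, T, T)
y = manual_causal_attention(q, k, v, bias, scale=1.0 / hs)
```

Passing `scale` explicitly decouples the norm factor from the head size, which is the limitation the NOTE in the snippet above refers to.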
```python
class Tokenizer:
    def __init__(self, vocabulary_path: Path, config_path: Path) -> None:
        # https://github.com/Stability-AI/StableLM/blob/e60081/configs/stablelm-base-alpha-3b.yaml#L108
```
I think we should just vendor the yaml file in the repo directly
Are you suggesting this as a showcase of the configs used? Since this is a GPT-NeoX config, we don't need to use it ourselves. Or do you want to add support for running the scripts by passing it?
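For reference, a hypothetical sketch of what the `Tokenizer` could look like with the config vendored in the repo. It assumes the vocabulary is a Hugging Face `tokenizers` JSON file and that the YAML is GPT-NeoX style; the `eos-token-id` key and default are illustrative, not necessarily what the repo uses:

```python
from pathlib import Path

import torch
import yaml
from tokenizers import Tokenizer as HFTokenizer


class Tokenizer:
    def __init__(self, vocabulary_path: Path, config_path: Path) -> None:
        # wraps the pretrained vocabulary/merges stored as a JSON file
        self.processor = HFTokenizer.from_file(str(vocabulary_path))
        # read the vendored YAML instead of pointing at the upstream URL
        with open(config_path) as fp:
            config = yaml.safe_load(fp)
        self.eos_id = config.get("eos-token-id", 0)  # hypothetical key name

    def encode(self, string: str) -> torch.Tensor:
        return torch.tensor(self.processor.encode(string).ids, dtype=torch.int)

    def decode(self, tokens: torch.Tensor) -> str:
        return self.processor.decode(tokens.tolist())
```

Vendoring the file would make `config_path` point into the repo rather than at the pinned StableLM commit.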
There's a test failing on Windows, and the README still needs to be completed. I can work on the README.
Co-authored-by: Luca Antiga <luca@lightning.ai>
* Make trainer configurable and add docker file
* Fix bugs
* Add dockerignore
* Fix config
* Fix bug
* Fix bug
* Fix bug
* Fix bug
* Try dlprof
* Fix bug
* Add PyTorch logger
* Fix import
* Add PyTorch profiler
* Fix bug
* Reorder docker file
* Fix bug
* Make PyTorch profiler optional
* Try to fix profiler
* PyTorch profiler working. Shunt torch comms again
* Tune profiler params
* Make PyTorch profiler configurable and run for global batches
* Fix bug
* Fix batch offset
* Fix bug
* Debug print issues
* More print stuff
* Add nvtx ranges
* Adjust model sizes
* Tune validation iters
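On the profiler-related commits above, a minimal sketch (with assumed names and settings, not the PR's actual trainer code) of an optional PyTorch profiler that records a few global batches and tags each step with an NVTX range:

```python
import torch
from torch.profiler import ProfilerActivity, profile, schedule, tensorboard_trace_handler

use_profiler = True  # would come from the trainer config
activities = [ProfilerActivity.CPU]
if torch.cuda.is_available():
    activities.append(ProfilerActivity.CUDA)

prof = None
if use_profiler:
    prof = profile(
        activities=activities,
        # skip 1 batch, warm up for 1, then record 3 global batches
        schedule=schedule(wait=1, warmup=1, active=3),
        on_trace_ready=tensorboard_trace_handler("./profiler_logs"),
    )
    prof.start()

for step in range(10):  # stand-in for the training loop
    if torch.cuda.is_available():
        torch.cuda.nvtx.range_push(f"global batch {step}")
    # forward / backward / optimizer step would go here
    if torch.cuda.is_available():
        torch.cuda.nvtx.range_pop()
    if prof is not None:
        prof.step()

if prof is not None:
    prof.stop()
```

The NVTX ranges are what tools like dlprof or Nsight Systems pick up when correlating kernels with individual training steps.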
Generation works.
For simplicity, I removed all the files that haven't been updated yet. We can port them from upstream on demand.