-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
-
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedDec 16, 2024 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
-
gpt-fast Public
Forked from pytorch-labs/gpt-fastSimple and efficient pytorch-native transformer text generation in <1000 LOC of python.
-
spark Public
Forked from apache/sparkMirror of Apache Spark
Scala Apache License 2.0 UpdatedAug 16, 2024 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
-
pytorch-jit-paritybench Public
Forked from jansel/pytorch-jit-paritybenchPython UpdatedDec 1, 2023 -
einops Public
Forked from arogozhnikov/einopsDeep learning operations reinvented (for pytorch, tensorflow, jax and others)
Python MIT License UpdatedAug 10, 2023 -
benchmark Public
Forked from pytorch/benchmarkTorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Python BSD 3-Clause "New" or "Revised" License UpdatedJun 7, 2023 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedOct 25, 2022 -
torchdynamo Public
Forked from pytorch/torchdynamoA Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 13, 2022 -
tensorflow Public
Forked from tensorflow/tensorflowComputation using data flow graphs for scalable machine learning
C++ Apache License 2.0 UpdatedDec 21, 2018 -
-
spark-vlbfgs Public archive
Vector-free L-BFGS implementation for Spark MLlib
-
bound-optimization Public archive
Linear regression and logistic regression under bound constrained optimization in Python.