
RL_game_env Public
personal rl environment / framework for games. For self teaching purposes
Python Updated Jul 27, 2025
torchtune Public
Forked from pytorch/torchtune
PyTorch native post-training library
Python BSD 3-Clause "New" or "Revised" License Updated Jul 11, 2025
Linux-NSPA-pkgbuild Public
Forked from nine7nine/Linux-NSPA-pkgbuild
Linux-NSPA packages for Archlinux - custom hack of a hack of a hack of a
Shell Updated Jun 29, 2025
llm-training Public
repo for my personal llm experiments with different optimizers, attention mechanisms, data preprocessing strategies, quantization schemes, and other stuff. Optimized for my nvidia rtx 4080 with 12gb of…
Python MIT License Updated Jun 15, 2025
llama.cpp Public
Forked from ggml-org/llama.cpp
custom fast LLM inference in C/C++ (optimized for single gpu local setup)
C++ MIT License Updated May 31, 2025
adheep_lm Public
medium-sized language model based on deepseek (in development)
Python Updated May 14, 2025
img-seg-transformer Public
custom transformer-based image segmentation model
Jupyter Notebook Updated Mar 26, 2025
InterPLM Public
Forked from SauersML/InterPLM
Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders
Python MIT License Updated Mar 12, 2025
lenet5-pure-pytorch Public
pure pytorch implementation of LeNet-5, a handwritten digit classifier; detailed documentation and comments included. model specifications derived from original paper.
ngram Public
company name generator based on bigram and trigram models
blt Public
Forked from facebookresearch/blt
Code for BLT research paper
Python BSD 3-Clause "New" or "Revised" License Updated Feb 5, 2025
megabyte-language-model Public
small language model using the megabyte architecture inspired by the transformer
Python Updated Feb 4, 2025
gpt-tokenizer Public
implementation of gpt's tokenizer (byte pair encoding) for personal learning purposes
Updated Feb 3, 2025
style-transfer-generator Public
An AI image generator that stylizes a photo given 2 inputs: the image whose style to mimic and the picture to apply that style to
llama8b-local-tests Public
testing gpt-fast inferencing using triton on wsl
Python BSD 3-Clause "New" or "Revised" License Updated Feb 3, 2025
personal-llm-mistral Public
running mistral-7b instruct locally and experimenting with it
Jupyter Notebook Apache License 2.0 Updated Jan 26, 2025
st-backbone-surgery Public
changing the style transfer model's backbone classifier from VGG into something that can handle variable input sizes
Python Updated Jan 15, 2025
fcn-urban-segmentation Public
pytorch implementation of FCN-8 as described in the attached paper. trained on the more difficult-to-segment cityscapes dataset
Jupyter Notebook Updated Jan 5, 2025
transformer-img-seg Public
test training an image segmentation model using swin-v2-t as a backbone and skip connections with fusion (fpn).
Jupyter Notebook Updated Jan 4, 2025
cpp-xor-neuralnet Public
implementing the xor classifier neural net in c++ to learn the language
C++ Updated Dec 30, 2024
numpy-xor-neuralnet Public
a multi-layered (2 -> 2 -> 1) perceptron made without pytorch or other deep learning frameworks; classifies the XOR samples.
Python Updated Dec 29, 2024
MEGABYTE-pytorch Public
Forked from lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Python MIT License Updated Dec 27, 2024
youtube-download Public
script to automate downloading the first search result as an mp3 using: yt_download 'search'
Python Updated Dec 13, 2024