Uniphore
- India
- manickavela1998@gmail.com
- in/manickavela
deploy-learn Public
Learnings and nuances of Docker and Kubernetes deployments
Dockerfile Updated Nov 7, 2024
C-Plus-Plus Public
Forked from TheAlgorithms/C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
C++ MIT License Updated Nov 7, 2024
GLiNER Public
Forked from urchade/GLiNER
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Python Apache License 2.0 Updated Oct 18, 2024
perftime_tools Public
Comparing tools used for performance metrics and validating their consistency
C++ Updated Oct 1, 2024
zip-optim Public
Optimizing the Zipformer transducer model for inference
Python Apache License 2.0 Updated Sep 21, 2024
sherpa-onnx Public
Forked from k2-fsa/sherpa-onnx
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, x86_64 servers, webs…
cuda-samples Public
Forked from NVIDIA/cuda-samples
Samples for CUDA developers that demonstrate features in the CUDA Toolkit
C Other Updated Jul 26, 2024
llama.cpp Public
Forked from ggerganov/llama.cpp
LLM inference in C/C++
C++ MIT License Updated Jul 13, 2024
QLLM Public
Forked from wejoncy/QLLM
A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.
Python Apache License 2.0 Updated Jul 8, 2024
llm-merging.github.io Public
Forked from llm-merging/llm-merging.github.io
SCSS MIT License Updated Jul 7, 2024
sequitur-g2p Public
Forked from sequitur-g2p/sequitur-g2p
This is a GitHub repository of the abandonware Sequitur G2P by Bisani & Ney
Python GNU General Public License v2.0 Updated Jul 3, 2024
onnxruntime Public
Forked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
tensorrt-cpp-api Public
Forked from cyrusbehr/tensorrt-cpp-api
TensorRT C++ API Tutorial
C++ MIT License Updated Jun 11, 2024
EmoTwitter Public
ONNX Runtime-based inference optimization of a RoBERTa model trained for sentiment analysis on a Twitter dataset
Jupyter Notebook Updated Jun 6, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated May 26, 2024
lectures Public
Forked from gpu-mode/lectures
Material for cuda-mode lectures
Jupyter Notebook Apache License 2.0 Updated May 14, 2024
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
optimum-nvidia Public
Forked from huggingface/optimum-nvidia
Python Apache License 2.0 Updated Apr 25, 2024
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 Updated Apr 16, 2024
Efficient-Computing Public
Forked from huawei-noah/Efficient-Computing
Efficient computing methods developed by Huawei Noah's Ark Lab
Master's assignments and coursework
Java Updated Jun 3, 2023
IBM-Hackchalllenge-Winner Public
Won IBM Hackchallenge 2020 Jury's Choice Award
Updated Jun 28, 2021