Skip to content
View JueonPark's full-sized avatar

Block or report JueonPark

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their performance and efficiency.

Python 10 1 Updated Sep 30, 2024

The Triton TensorRT-LLM Backend

Python 664 96 Updated Sep 30, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,303 929 Updated Sep 30, 2024
MLIR 10 11 Updated Sep 20, 2024

Repository of model demos using TT-Buda

Python 54 14 Updated Sep 26, 2024

The Mojo Programming Language

Mojo 22,959 2,588 Updated Sep 30, 2024

IREE's PyTorch Frontend, based on Torch Dynamo.

Python 43 24 Updated Sep 27, 2024

TVM for Tenstorrent ASICs

Python 18 6 Updated Sep 30, 2024

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 82,565 22,221 Updated Sep 30, 2024

🤘 TT-NN operator library, and TT-Metalium low level kernel programming model.

C++ 416 53 Updated Sep 30, 2024

Tenstorrent TT-BUDA Repository

Python 207 28 Updated Sep 24, 2024

A modern formatting library

C++ 20,537 2,457 Updated Sep 30, 2024

Flexible Intermediate Representation for RTL

Scala 720 175 Updated Aug 20, 2024

Fast and memory-efficient exact attention

Python 13,598 1,245 Updated Sep 30, 2024

Fast C++ logging library.

C++ 24,021 4,503 Updated Sep 23, 2024

A Borrow Checker and Memory Ownership System for C++20 (heavily inspired from Rust)

C++ 200 5 Updated Jun 24, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,619 4,072 Updated Sep 30, 2024

Circuit IR Compilers and Tools

C++ 1,638 286 Updated Sep 30, 2024

A new (MLIR based) high-level IR for clang.

LLVM 350 95 Updated Sep 28, 2024

FlatBuffers: Memory Efficient Serialization Library

C++ 23,149 3,230 Updated Sep 27, 2024

The ultimate Vim configuration (vimrc)

Vim Script 30,603 7,286 Updated Aug 18, 2024
Python 149 75 Updated Sep 28, 2024

Universal LLM Deployment Engine with ML Compilation

Python 18,745 1,528 Updated Sep 28, 2024

Automatic DNN generation for fuzzing and more

Python 117 26 Updated Mar 19, 2024

AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

Fortran 204 44 Updated Sep 30, 2024

An open-source efficient deep learning framework/compiler, written in python.

Python 648 52 Updated Aug 27, 2024

HIP: C++ Heterogeneous-Compute Interface for Portability

C++ 3,706 528 Updated Sep 30, 2024

This repository contains the codebase for Virtual FPGA Lab in Makerchip contributing as a participant in Google Summer of Code 2021, under FOSSi Foundation.

Tcl 133 23 Updated Jul 12, 2024

Yet Another Random Program Generator

C++ 469 52 Updated Aug 8, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,927 4,057 Updated Sep 30, 2024
Next