Rebellions Inc.
Seoul, South Korea
https://jueonpark.notion.site/Jueon-Park-1fcdd44a43134fe987f140c8881ac5e7
in/jueonpark11
Stars
The TT-Forge FE is a graph compiler designed to optimize and transform computational graphs for deep learning models, enhancing their performance and efficiency.
The Triton TensorRT-LLM Backend
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently.
Repository of model demos using TT-Buda
IREE's PyTorch Frontend, based on Torch Dynamo.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
🤘 TT-NN operator library and TT-Metalium low-level kernel programming model.
Flexible Intermediate Representation for RTL
Fast and memory-efficient exact attention
A Borrow Checker and Memory Ownership System for C++20 (heavily inspired from Rust)
A high-throughput and memory-efficient inference and serving engine for LLMs
FlatBuffers: Memory Efficient Serialization Library
Universal LLM Deployment Engine with ML Compilation
AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.
An open-source efficient deep learning framework/compiler, written in python.
HIP: C++ Heterogeneous-Compute Interface for Portability
This repository contains the codebase for the Virtual FPGA Lab in Makerchip, developed as a Google Summer of Code 2021 project under the FOSSi Foundation.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.