Pre-built wheels for llama-cpp-python across platforms and CUDA versions
-
Updated
Apr 18, 2026
Pre-built wheels for llama-cpp-python across platforms and CUDA versions
Designed and developed Dimensional Model and performed data profiling, ETL operations for staging using Alteryx. Created data integration workflow in Talend to load data in AzureSQL and BigQuery DW and visualized analytical data reports dashboard using Tableau and PowerBI to get the insights and single truth story of the data
Build, run, and setup scripts for the complete TensorRT-LLM pipeline on RTX A6000 Ada (SM89). Reproducible path from HuggingFace checkpoint to deployable .engine file, with FP16 baseline and FP8 quantization. Companion material to the 4-part blog series on ai-box.eu — in preparation for the NVIDIA TensorRT Edge-LLM ecosystem.
Add a description, image, and links to the ada-architecture topic page so that developers can more easily learn about it.
To associate your repository with the ada-architecture topic, visit your repo's landing page and select "manage topics."