Stars
An app and set of methodologies designed to evaluate the performance of various Large Language Models (LLMs) on the text-to-SQL task. Our goal is to offer a standardized way to measure how well the…
This SDK generates datasets for training Video LLMs from YouTube videos.
LaserGaze is an open-source video-focused tool for real-time gaze estimation, utilizing temporal data for enhanced accuracy in tracking eye positions and calculating gaze vectors, suitable for AR, …
This repository contains training code for the Gemamba VLM
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
A tool that converts scientific PDFs into plain text for your LLM-related needs, such as building RAGs or agents for academic knowledge. It was developed in collaboration with the LlamaIndex team.
Distribute and run LLMs with a single file.
FaceFlow is a PyTorch Lightning-based repository simplifying the creation of models for detecting facial biomechanics through Facial Action Units.
Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
TensorFlow API for the Scala Programming Language
A complete guide to start and improve in machine learning (ML) and artificial intelligence (AI) in 2025 without ANY background in the field and stay up-to-date with the latest news and state-of-the-ar…
Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
Data science interview questions and answers
Home assignments for data science positions
A technical report on convolution arithmetic in the context of deep learning
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
A browser extension that links video explanations to research papers on arxiv.org
A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Distributed Asynchronous Hyperparameter Optimization in Python