Stars
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Supporting code for the tutorials on https://www.baeldung.com/scala
Blazingly fast analytics database that will rapidly devour all of your data.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
The BusTub Relational Database Management System (Educational)
ClickHouse Native Protocol JDBC implementation
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Distributed SQL database in Rust, written as an educational project
A list of learning materials to understand databases internals
My collection of handwritten notes and resources for learning distributed systems
Clickhouse Scala Client with Reactive Streams support
Probabilistic Data Structures and Algorithms in Python
A connector for SingleStore and Spark
🦀 Small exercises to get you used to reading and writing Rust code!
TinyDB is a lightweight document oriented database optimized for your happiness :)
A better notebook for Scala (and more)
Presentations, meetups and talks about ClickHouse
sbt plugin to create a dependency graph for your project
Apache Spark - A unified analytics engine for large-scale data processing
An open-source toolkit for large-scale genomic analysis
This projects gives Kotlin bindings and several extensions for Apache Spark. We are looking to have this as a part of Apache Spark 3.x
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Scaraplate is a wrapper around cookiecutter which allows to repeatedly rollup project templates onto concrete projects.
Исследование формата протокола связи электросчетчика Меркурий
"Data Mining in Action Course", Moscow Institute of Physics and Technologies
DL course co-developed by YSDA, HSE and Skoltech