Skip to content
View radujica's full-sized avatar

Block or report radujica

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.

Go 700 27 Updated Dec 27, 2024

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

Python 2,732 67 Updated Dec 24, 2024

A curated list of awesome Apache Spark packages and resources.

Shell 1,738 333 Updated Oct 24, 2024

Jellyfin Samsung TV Client

JavaScript 1,026 81 Updated Dec 22, 2024

All Algorithms implemented in Python

Python 195,959 46,030 Updated Dec 27, 2024

An extremely fast Python package and project manager, written in Rust.

Rust 33,703 912 Updated Dec 27, 2024

OpenTofu lets you declaratively manage your cloud infrastructure.

Go 23,671 924 Updated Dec 26, 2024

Python Testing for Databricks

Python 56 5 Updated Nov 15, 2024

The most intuitive desktop API client. Organize and execute REST, GraphQL, and gRPC requests in a simple and intuitive app.

TypeScript 2,470 75 Updated Dec 23, 2024

the portable Python dataframe library

Python 5,411 605 Updated Dec 27, 2024

Lightweight and extensible compatibility layer between dataframe libraries!

Python 712 117 Updated Dec 27, 2024

A curated list of awesome big data frameworks, ressources and other awesomeness.

13,341 2,561 Updated May 7, 2024

pyspark methods to enhance developer productivity πŸ“£ πŸ‘― πŸŽ‰

Python 654 100 Updated Dec 6, 2024

A Python module for decorators, wrappers and monkey patching.

Python 2,084 233 Updated Dec 5, 2024

Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Jupyter Notebook 1,935 130 Updated Dec 24, 2024

Apache PyIceberg

Python 529 194 Updated Dec 27, 2024

Open-source scientific and technical publishing system built on Pandoc.

JavaScript 4,040 331 Updated Dec 27, 2024

Apache DataFusion SQL Query Engine

Rust 6,509 1,249 Updated Dec 27, 2024

Apache DataFusion Ballista Distributed Query Engine

Rust 1,592 199 Updated Dec 27, 2024

a Hassle-Free Python Experience

Rust 13,928 467 Updated Dec 27, 2024

🌹 Cookiecutter template featuring the modern and extensible Python project manager hatch

Python 71 7 Updated Nov 18, 2024

Package management made easy

Rust 3,597 206 Updated Dec 24, 2024

A library that provides useful extensions to Apache Spark and PySpark.

Scala 203 27 Updated Nov 30, 2024

PySpark test helper methods with beautiful error messages

Python 638 69 Updated Oct 24, 2024

Build mindmaps with plain text

TypeScript 9,784 681 Updated Dec 26, 2024

🎨 Diagram as Code for prototyping cloud system architectures

Python 39,999 2,556 Updated Dec 26, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,236 446 Updated Dec 27, 2024

Open-source BI for engineers

Rust 2,218 51 Updated Dec 16, 2024

β˜„πŸŒŒοΈ The minimal, blazing-fast, and infinitely customizable prompt for any shell!

Rust 46,076 1,986 Updated Dec 27, 2024

ZenML πŸ™: The bridge between ML and Ops. https://zenml.io.

Python 4,283 459 Updated Dec 27, 2024
Next