Skip to content
View mangrrua's full-sized avatar
  • Istanbul / Turkey

Block or report mangrrua

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based sentiment analysis tool that is specifically attuned to sentiments expressed in social …

Python 4,534 1,009 Updated Mar 16, 2024

A collective list of free APIs

Python 320,940 34,132 Updated Oct 31, 2024

FeatHub - A stream-batch unified feature store for real-time machine learning

Python 316 52 Updated May 27, 2024

real-time data + ML pipeline

Python 54 24 Updated Dec 19, 2024

The Open Source Feature Store for Machine Learning

Python 5,691 1,011 Updated Dec 28, 2024

Open Source AI/ML Platform

Python 8,386 778 Updated Dec 28, 2024

Open source platform for the machine learning lifecycle

Python 19,100 4,292 Updated Dec 27, 2024

A list of awesome data podcasts

370 57 Updated Apr 28, 2023

by ex-googlers, for ex-googlers - a lookup table of similar tech & services

14,679 1,045 Updated Jul 26, 2024

Flink CDC is a streaming data integration tool

Java 5,859 1,975 Updated Dec 27, 2024

Interesting readings and talks on computer science

675 71 Updated Jul 31, 2024

Druid Kubernetes Operator

Go 205 92 Updated Jun 4, 2024

The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.

Rust 5,847 466 Updated Dec 28, 2024

Code Samples for my Ververica Webinar "99 Ways to Enrich Streaming Data with Apache Flink"

Java 39 12 Updated Jan 4, 2022

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,529 2,439 Updated Dec 28, 2024

Mirror of Apache Samza

Java 819 336 Updated Dec 19, 2024

The official home of the Presto distributed SQL query engine for big data

Java 16,136 5,401 Updated Dec 27, 2024

Docker containers for testing in scala

Scala 639 126 Updated Dec 27, 2024

Mirror of Apache Helix

Java 470 229 Updated Dec 11, 2024

Apache Pinot - A realtime distributed OLAP datastore

Java 5,575 1,309 Updated Dec 27, 2024

Databricks Scala Coding Style Guide

2,747 581 Updated Apr 5, 2024

Readings in Databases

7,728 901 Updated Sep 9, 2024

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,336 542 Updated Dec 18, 2024

Notes on the book Clean Code - A Handbook of Agile Software Craftsmanship by Robert C. Martin

615 173 Updated Jan 18, 2023

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

Scala 129 57 Updated Jun 14, 2024

DataStax Connector for Apache Spark to Apache Cassandra

Scala 1,945 923 Updated Aug 27, 2024

Apache Druid: a high performance real-time analytics database.

Java 13,567 3,714 Updated Dec 25, 2024

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,353 858 Updated Aug 22, 2023

A very simple, sample, Akka HTTP RESTful service

JavaScript 9 2 Updated Jun 10, 2016
Next