- addcn.com
- SZ China
- http://ming.ws
Big Data & AI
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K…
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Pravega - Streaming as a new software defined storage primitive
A cloud-native database based on PostgreSQL developed by Alibaba Cloud.
Chinese models for SpaCy | Models for SpaCy that support Chinese
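📚 [.md & .ipynb] Series of Artificial Intelligence & Deep Learning, including Mathematics Fundamentals, Python Practices, NLP Application, etc. 💫 Artificial Intelligence and Deep Learning in practice: Mathematical Statistics | Machine Learning | Deep Learning | Natural Language Processing | Tools in Practice …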
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
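A wrapper packaged around the web application of FanRuan's FineBI. It rebuilds the login page and directory navigation on top of the login and directory APIs, which greatly improves usability for company reporting projects with many deeply nested directories.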
Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
Session replay and product analytics you can self-host. Ideal for reproducing issues, co-browsing with users and optimizing your product.
Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set up a real-time data pipeline in minutes, not days
Open Source Feature Flagging and A/B Testing Platform
Web-based SQL editor. Legacy project in maintenance mode.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Google BigQuery support for Spark, SQL, and DataFrames
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Score documents using embedding-vectors dot-product or cosine-similarity with ES Lucene engine
Repository holding configuration files for running an HDFS cluster in Kubernetes
Helm chart from stable/hadoop, updated to hadoop 3.2.1
SQL databases in Python, designed for simplicity, compatibility, and robustness.
Hive Storage Handler for interoperability between BigQuery and Apache Hive