Skip to content
Change the repository type filter

All

    Repositories list

    • Python API for Deequ
      Python
      Apache License 2.0
      134001Updated Oct 15, 2024Oct 15, 2024
    • pipelines

      Public
      Machine Learning Pipelines for Kubeflow
      Python
      Apache License 2.0
      1.6k005Updated Oct 13, 2024Oct 13, 2024
    • evidence

      Public
      Evidence enables analysts to deliver a polished business intelligence system using SQL and markdown
      Svelte
      MIT License
      202002Updated Sep 17, 2024Sep 17, 2024
    • This repository has code samples to use MarkovML SDK
      Python
      0201Updated Sep 17, 2024Sep 17, 2024
    • Ethnicolr implementation with new models in pytorch
      Jupyter Notebook
      MIT License
      2002Updated Jul 25, 2024Jul 25, 2024
    • guidance

      Public
      A guidance language for controlling large language models.
      Jupyter Notebook
      MIT License
      1k000Updated Jul 15, 2024Jul 15, 2024
    • Predict race from name and location
      Python
      MIT License
      1003Updated Jul 9, 2024Jul 9, 2024
    • dataprep

      Public
      DataPrep — The easiest way to prepare data in Python
      Python
      MIT License
      206000Updated Jun 27, 2024Jun 27, 2024
    • dask-ml

      Public
      Scalable Machine Learning with Dask
      Python
      BSD 3-Clause "New" or "Revised" License
      256000Updated May 22, 2024May 22, 2024
    • A Curated List of Computational Biology Datasets Suitable for Machine Learning
      23000Updated Apr 19, 2024Apr 19, 2024
    • vanna

      Public
      🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
      Python
      MIT License
      888000Updated Apr 16, 2024Apr 16, 2024
    • pandas-ai

      Public
      PandasAI is the Python library that integrates Gen AI into pandas, making data analysis conversational
      Python
      Other
      1.3k001Updated Mar 18, 2024Mar 18, 2024
    • modin

      Public
      Modin: Speed up your Pandas workflows by changing a single line of code
      Python
      Apache License 2.0
      651001Updated Mar 6, 2024Mar 6, 2024
    • A library for mechanistic interpretability of GPT-style language models
      Python
      MIT License
      289000Updated Feb 22, 2024Feb 22, 2024
    • splink

      Public
      Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
      Python
      MIT License
      147000Updated Feb 22, 2024Feb 22, 2024
    • Resources for working with time series and sequence data
      79000Updated Feb 5, 2024Feb 5, 2024
    • find any kind of occupation or job title in a text or file
      Python
      MIT License
      28000Updated Jan 15, 2024Jan 15, 2024
    • Multi Model Server is a tool for serving neural net models for inference
      Java
      Apache License 2.0
      231001Updated Sep 25, 2023Sep 25, 2023
    • analyzer

      Public
      0001Updated Aug 23, 2023Aug 23, 2023
    • skill-ner

      Public
      A (smart) rule based NLP module to extract job skills from text
      Python
      MIT License
      52001Updated Aug 22, 2023Aug 22, 2023
    • Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
      Python
      MIT License
      332000Updated Aug 10, 2023Aug 10, 2023
    • scrubadub

      Public
      Clean personally identifiable information from dirty dirty text.
      Python
      Apache License 2.0
      95000Updated Aug 5, 2023Aug 5, 2023
    • Curated list of open source tooling for data-centric AI on unstructured data.
      Creative Commons Attribution 4.0 International
      35000Updated Jul 12, 2023Jul 12, 2023
    • Jupyter Notebook
      MIT License
      0000Updated May 18, 2023May 18, 2023
    • Tough and flexible tools for data analysis, transformation, validation and movement.
      Python
      Other
      18001Updated May 1, 2023May 1, 2023
    • RedisAI integration for MLFlow
      Python
      Apache License 2.0
      3002Updated May 1, 2023May 1, 2023
    • mezmorize

      Public
      Memoization for python functions (based on Flask-Cache)
      Python
      Other
      185101Updated Apr 10, 2023Apr 10, 2023
    • MLBox

      Public
      MLBox is a powerful Automated Machine Learning python library.
      Python
      Other
      274003Updated Mar 25, 2023Mar 25, 2023
    • bosquet

      Public
      LLMOps tools to build, chain, evaluate and deploy prompts for GPT and other models.
      Clojure
      Eclipse Public License 1.0
      18000Updated Mar 22, 2023Mar 22, 2023
    • pixie

      Public
      Instant Kubernetes-Native Application Observability
      C++
      Apache License 2.0
      4260012Updated Mar 14, 2023Mar 14, 2023