Skip to content
Change the repository type filter

All

    Repositories list

    • MinerU-1

      Public
      A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
      Python
      GNU Affero General Public License v3.0
      1.6k000Updated Sep 30, 2024Sep 30, 2024
    • MinerU

      Public
      A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
      Python
      GNU Affero General Public License v3.0
      1.6k000Updated Sep 30, 2024Sep 30, 2024
    • Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
      Jupyter Notebook
      Other
      4.6k000Updated Sep 26, 2024Sep 26, 2024
    • Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
      Jupyter Notebook
      Other
      4.6k000Updated Sep 26, 2024Sep 26, 2024
    • LLMsBook

      Public
      大型语言模型实战指南:应用实践与场景落地
      Python
      Apache License 2.0
      6000Updated Sep 13, 2024Sep 13, 2024
    • An observability database aims to ingest, analyze and store Metrics, Tracing and Logging data.
      Go
      Apache License 2.0
      87000Updated Sep 12, 2024Sep 12, 2024
    • APM, Application Performance Monitoring System
      Java
      Apache License 2.0
      6.5k000Updated Sep 12, 2024Sep 12, 2024
    • dubbo

      Public
      The java implementation of Apache Dubbo. An RPC and microservice framework.
      Java
      Apache License 2.0
      26k000Updated Sep 11, 2024Sep 11, 2024
    • Free and Open, Distributed, RESTful Search Engine
      Java
      Other
      25k000Updated Sep 10, 2024Sep 10, 2024
    • kotaemon

      Public
      An open-source RAG-based tool for chatting with your documents.
      Python
      Apache License 2.0
      1.4k000Updated Sep 8, 2024Sep 8, 2024
    • The Sidecar Project of Apache SkyWalking
      Go
      Apache License 2.0
      49000Updated Sep 6, 2024Sep 6, 2024
    • 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。
      Python
      6000Updated Sep 6, 2024Sep 6, 2024
    • Monitor and profiler powered by eBPF to monitor network traffic, and diagnose CPU and network performance.
      Go
      Apache License 2.0
      44000Updated Sep 5, 2024Sep 5, 2024
    • kindling

      Public
      eBPF-based Cloud Native Monitoring Tool
      Go
      Apache License 2.0
      183000Updated Sep 2, 2024Sep 2, 2024
    • pixie-1

      Public
      Instant Kubernetes-Native Application Observability
      C++
      Apache License 2.0
      441000Updated Aug 29, 2024Aug 29, 2024
    • 为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
      Python
      GNU General Public License v3.0
      8.2k000Updated Aug 28, 2024Aug 28, 2024
    • DeepSpeed

      Public
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.2k000Updated Aug 28, 2024Aug 28, 2024
    • kubeedge

      Public
      Kubernetes Native Edge Computing Framework (project under CNCF)
      Go
      Apache License 2.0
      1.7k000Updated Aug 28, 2024Aug 28, 2024
    • Ongoing research training transformer models at scale
      Python
      Other
      2.4k000Updated Aug 27, 2024Aug 27, 2024
    • 🔥 Seata is an easy-to-use, high-performance, open source distributed transaction solution.
      Java
      Apache License 2.0
      8.8k000Updated Aug 26, 2024Aug 26, 2024
    • 从零实现一个小参数量中文大语言模型。
      Python
      42000Updated Aug 22, 2024Aug 22, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      Apache License 2.0
      27k000Updated Aug 21, 2024Aug 21, 2024
    • sofa-rpc

      Public
      SOFARPC is a high-performance, high-extensibility, production-level Java RPC framework.
      Java
      Apache License 2.0
      1.2k000Updated Aug 21, 2024Aug 21, 2024
    • bcc

      Public
      BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
      C
      Apache License 2.0
      3.9k000Updated Aug 21, 2024Aug 21, 2024
    • The Java agent for Apache SkyWalking
      Java
      Apache License 2.0
      605000Updated Aug 21, 2024Aug 21, 2024
    • Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
      Java
      Apache License 2.0
      6.8k000Updated Aug 21, 2024Aug 21, 2024
    • pixie

      Public
      Instant Kubernetes-Native Application Observability
      C++
      Apache License 2.0
      441000Updated Aug 20, 2024Aug 20, 2024
    • The Nginx Lua agent for Apache SkyWalking
      Lua
      Apache License 2.0
      71000Updated Aug 20, 2024Aug 20, 2024
    • sofa-boot

      Public
      SOFABoot is a framework that enhances Spring Boot and fully compatible with it, provides readiness check, class isolation, etc.
      Java
      Apache License 2.0
      1.3k000Updated Aug 19, 2024Aug 19, 2024
    • A production-grade java implementation of RAFT consensus algorithm.
      Java
      Apache License 2.0
      1.2k000Updated Aug 12, 2024Aug 12, 2024