Skip to content
View ZvanYang's full-sized avatar
🍊
🍊
  • beijing

Highlights

  • Pro

Block or report ZvanYang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 5,551 273 Updated Feb 17, 2025

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

Python 2,989 365 Updated Feb 21, 2025

Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP

Python 363 28 Updated Feb 20, 2025

Transform PDFs into AI podcasts for engaging on-the-go audio content.

Python 534 77 Updated Feb 4, 2025

稳定工作4年的微信公众号爬虫 Based on python and vuejs 微信公众号采集 Python爬虫 公众号采集 公众号爬虫 公众号备份

Python 363 75 Updated Feb 27, 2024

A crawler for submissions on leetcode-cn. 这是一个用来爬取力扣中国(LeetCode CN)提交代码的爬虫。

Python 93 27 Updated May 21, 2024

A list of learning materials to understand databases internals

9,775 1,127 Updated Aug 29, 2024

Linux running inside a PDF file via a RISC-V emulator

C 3,111 111 Updated Feb 2, 2025

This is an interview preparation guide for software engineers. Includes behavior interview, system design and coding(Chinese).

Java 161 46 Updated Nov 30, 2021

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 75,950 7,019 Updated Feb 20, 2025

LeetCode 101:力扣刷题指南

9,129 1,202 Updated Dec 8, 2024

Your AI Dino Pal on Menubar

106 Updated Jan 24, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero

Python 17,479 1,405 Updated Feb 19, 2025

git commit --fixup, but automatic

Rust 4,863 78 Updated Feb 18, 2025

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 15,955 1,348 Updated Feb 20, 2025

A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.

Python 344 33 Updated Oct 7, 2024

Find, verify, and analyze leaked credentials

Go 18,155 1,776 Updated Feb 20, 2025

Open-source framework for exporting your personal data.

TypeScript 1,417 68 Updated Dec 25, 2024

图灵社区可用银子兑换的图书

Python 12 4 Updated Aug 12, 2019

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 40,283 5,389 Updated Feb 20, 2025

DiceDB is an open-source in-memory database with query subscriptions.

Go 7,673 1,155 Updated Feb 17, 2025

🎨 Diagram as Code for prototyping cloud system architectures

Python 40,316 2,589 Updated Feb 19, 2025

Classical equations and diagrams in machine learning

TeX 7,557 1,273 Updated Jul 30, 2024

A collective list of free APIs

Python 328,039 34,778 Updated Oct 31, 2024

Maestro: Netflix’s Workflow Orchestrator

Java 3,397 206 Updated Feb 21, 2025

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 5,302 351 Updated Feb 21, 2025

Kspider 是一个爬虫平台,以图形化方式定义爬虫流程,无需代码即可实现一个爬虫流程,Kspider不仅限爬虫,也可用于WEB自动化测试,更多功能等你探索。

Java 1,226 120 Updated Aug 19, 2024

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、各种指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书、大众点评、推特、脉脉、知乎》

Python 1,726 443 Updated May 9, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 71,139 36,926 Updated Feb 17, 2025
Next