Skip to content
View ZvanYang's full-sized avatar
🍊
🍊
  • beijing

Highlights

  • Pro

Block or report ZvanYang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 5,572 274 Updated Feb 21, 2025

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

Python 2,994 365 Updated Feb 22, 2025

Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP

Python 368 28 Updated Feb 22, 2025

Transform PDFs into AI podcasts for engaging on-the-go audio content.

Python 535 78 Updated Feb 4, 2025

稳定工作4年的微信公众号爬虫 Based on python and vuejs 微信公众号采集 Python爬虫 公众号采集 公众号爬虫 公众号备份

Python 363 75 Updated Feb 27, 2024

A crawler for submissions on leetcode-cn. 这是一个用来爬取力扣中国(LeetCode CN)提交代码的爬虫。

Python 94 27 Updated May 21, 2024

A list of learning materials to understand databases internals

9,778 1,127 Updated Aug 29, 2024

Linux running inside a PDF file via a RISC-V emulator

C 3,134 113 Updated Feb 2, 2025

This is an interview preparation guide for software engineers. Includes behavior interview, system design and coding(Chinese).

Java 161 46 Updated Nov 30, 2021

Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown

TypeScript 75,981 7,020 Updated Feb 21, 2025

LeetCode 101:力扣刷题指南

9,131 1,202 Updated Dec 8, 2024

Your AI Dino Pal on Menubar

106 Updated Jan 24, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero

Python 17,520 1,407 Updated Feb 21, 2025

git commit --fixup, but automatic

Rust 4,864 78 Updated Feb 18, 2025

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 15,966 1,348 Updated Feb 21, 2025

A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.

Python 344 33 Updated Oct 7, 2024

Find, verify, and analyze leaked credentials

Go 18,169 1,776 Updated Feb 22, 2025

Open-source framework for exporting your personal data.

TypeScript 1,417 68 Updated Dec 25, 2024

图灵社区可用银子兑换的图书

Python 12 4 Updated Aug 12, 2019

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 40,326 5,393 Updated Feb 20, 2025

DiceDB is an open-source in-memory database with query subscriptions.

Go 7,673 1,155 Updated Feb 17, 2025

🎨 Diagram as Code for prototyping cloud system architectures

Python 40,320 2,589 Updated Feb 19, 2025

Classical equations and diagrams in machine learning

TeX 7,559 1,274 Updated Jul 30, 2024

A collective list of free APIs

Python 328,101 34,782 Updated Oct 31, 2024

Maestro: Netflix’s Workflow Orchestrator

Java 3,398 206 Updated Feb 21, 2025

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 5,305 351 Updated Feb 21, 2025

Kspider 是一个爬虫平台,以图形化方式定义爬虫流程,无需代码即可实现一个爬虫流程,Kspider不仅限爬虫,也可用于WEB自动化测试,更多功能等你探索。

Java 1,227 120 Updated Aug 19, 2024

爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、各种指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书、大众点评、推特、脉脉、知乎》

Python 1,728 443 Updated May 9, 2024

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 71,211 36,969 Updated Feb 17, 2025
Next