Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP
Transform PDFs into AI podcasts for engaging on-the-go audio content.
稳定工作4年的微信公众号爬虫 Based on python and vuejs 微信公众号采集 Python爬虫 公众号采集 公众号爬虫 公众号备份
A crawler for submissions on leetcode-cn. 这是一个用来爬取力扣中国(LeetCode CN)提交代码的爬虫。
A list of learning materials to understand databases internals
Linux running inside a PDF file via a RISC-V emulator
This is an interview preparation guide for software engineers. Includes behavior interview, system design and coding(Chinese).
Generation of diagrams like flowcharts or sequence diagrams from text in a similar manner as markdown
PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker/Zotero
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
A Gradio app that transcribes YouTube videos using audio extraction and OpenAI’s Whisper model.
Find, verify, and analyze leaked credentials
Open-source framework for exporting your personal data.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
DiceDB is an open-source in-memory database with query subscriptions.
🎨 Diagram as Code for prototyping cloud system architectures
Classical equations and diagrams in machine learning
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
Kspider 是一个爬虫平台,以图形化方式定义爬虫流程,无需代码即可实现一个爬虫流程,Kspider不仅限爬虫,也可用于WEB自动化测试,更多功能等你探索。
爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、各种指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书、大众点评、推特、脉脉、知乎》
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/