Stars
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Tesseract Open Source OCR Engine (main repository)
Legado 3.0 Book Reader with powerful controls & full functions❤️阅读3.0, 阅读是一款可以自定义来源阅读网络内容的工具,为广大网络文学爱好者提供一种方便、快捷舒适的试读体验。
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
A feature-rich command-line audio/video downloader
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
得到 APP 课程下载工具,可在终端查看文章内容,可生成 PDF,音频文件,markdown 文稿,可下载电子书。
👾 Fast and simple video download library and CLI tool written in Go
Chrome Extension for one click downloading all resources files and keeping folder structures.
#1 Locally hosted web application that allows you to perform various operations on PDF files
《软件设计的哲学》中文翻译 | Chinese translation of A Philosophy of Software Design
Learn how to design, develop, deploy and iterate on production-grade ML applications.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
A natural language interface for computers
Master programming by recreating your favorite technologies from scratch.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Curated list of project-based tutorials
Stable Diffusion web UI
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
↥ ↥ ↥ 点击关注更新,基于 Spring Cloud 2024 、Spring Boot 3.4、 OAuth2 的 RBAC 权限管理系统