microsoft / OmniParser
A simple screen parsing tool towards pure vision based GUI agent
See what the GitHub community is most excited about today.
A simple screen parsing tool towards pure vision based GUI agent
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
GenAI Cookbook
本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Python - 100天从新手到大师
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Course Files for Complete Python 3 Bootcamp Course on Udemy
Learn Agentic AI using CrewAI, LangChain, LangGraph, and Knowledge Graphs.
🦜🔗 Build context-aware reasoning applications
Understanding Deep Learning - Simon J.D. Prince
FinRL: Financial Reinforcement Learning. 🔥
High-Resolution Image Synthesis with Latent Diffusion Models