Skip to content
View EasyIsAllYouNeed's full-sized avatar

Block or report EasyIsAllYouNeed

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Repository of GaitRDAE.

2 Updated Jan 30, 2025

An Interpretable Deep Learning Approach for Morphological Script Type Analysis (IWCP 2024)

Jupyter Notebook 5 Updated Sep 17, 2024
Python 368 31 Updated Nov 22, 2024

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 12,602 1,495 Updated Jan 28, 2025
Python 4 Updated Nov 23, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 8,001 903 Updated Jan 30, 2025

LVBench: An Extreme Long Video Understanding Benchmark

Python 79 1 Updated Aug 30, 2024

Run AI workflows with TypeScript & Vercel AI SDK

TypeScript 185 5 Updated Jan 26, 2025

🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手,无需GPU一键高质量字幕视频合成!视频字幕生成、断句、校正、字幕翻译全流程。让字幕制作简单高效!

Python 3,525 306 Updated Jan 10, 2025

Cross-platform get display info for MacOS、Windows、Linux, Like electron Display Object.

Rust 30 16 Updated Jan 30, 2025

An open-sourced end-to-end VLM-based GUI Agent

Python 641 48 Updated Jan 27, 2025

ZXing-C++ WebAssembly as an ES/CJS module with types. Read or write barcodes in various JS runtimes: Web, Node.js, Bun, and Deno.

TypeScript 97 10 Updated Jan 30, 2025

⚡ Transfer files over wifi from your computer to your mobile device by scanning a QR code without leaving the terminal.

Go 10,081 535 Updated Jan 25, 2025

The HTML5 Creation Engine: Create beautiful digital content with the fastest, most flexible 2D WebGL renderer.

TypeScript 1 Updated Jan 26, 2025

World's first AI meeting copilot

JavaScript 1,679 76 Updated Jan 29, 2025

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

CSS 159 29 Updated Mar 31, 2024

SmartEraser, built with a new removing paradigm called Masked-Region Guidance. This paradigm retains the masked region in the input, using it as guidance for the removal process.

53 Updated Jan 15, 2025

猫步简历 – 一款开源免费的简历制作神器,支持导出超高清PDF、图片、源码级JSON数据等,AI简历生成、AI润色、AI语种翻译等。提供海量在线制作模版、主题任意切换、高度定制化的简历模块。使用猫步简历,您可以制作出一份独特、优美、专业的求职简历。

Vue 2,018 213 Updated Jan 29, 2025

DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis

308 16 Updated Feb 1, 2023

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

Python 162 11 Updated May 23, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 39,062 4,791 Updated Jan 30, 2025

文本纠错工具包(Text Correct, CSC), 支持中文拼写纠错/标点符号纠错(CSC, Chinese Spelling Correct / Check; Punct), CSC支持各领域数据(包括古文), 模型在大规模、各领域的、现代/当代语料上训练而得, 泛化性强.

Python 3 Updated Jan 22, 2025

Deobfuscate Javascript code using ChatGPT

TypeScript 1,887 82 Updated Jan 30, 2025

[SOICT 2024] LLM-Powered Video Search: A Comprehensive Multimedia Retrieval System

Jupyter Notebook 43 Updated Jan 13, 2025

This repository has the code for creating Video RAG using open source models.

Python 13 3 Updated Jan 16, 2025

This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 100 8 Updated Jan 27, 2025

An open-source AI content search engine designed specifically for content creators. Supports extraction of text, images, and short videos. Allows full local deployment (web app, RAG server, LLM ser…

TypeScript 524 65 Updated Jun 23, 2024

Video Search and Streaming Agent 🕵️‍♂️

Python 457 29 Updated Jan 31, 2024
Next