-
DSoft JSC
- Quang Nam, Viet Nam
-
04:02
(UTC +07:00) - https://nacriema.github.io/about
- https://leetcode.com/huytm/
Lists (32)
Sort Name ascending (A-Z)
3D
Action Recognition
Awesome
Bidding
Car
DeployTools
DocumentExtraction
EmbeddingSimmilaritySearch
Facial Domain
Flask
GANs
Golf
Ideas
Image Comparison
Image-To-Image Translation
InferenceEngineEdgeDevices
InferenceEngines
LLMs
MLOps
MultiCameraTracking
New network structures
Network structuresObject Detection
ObjectReID
OCR
Product Checkout
PromptEngineer
PromptOptimization
Segmentation
Segmentation Losses
Solutions_AICity_Challenges
Speed-To-Text
SOTA models for Speed To Text tasksTrading
Stars
Janus-Series: Unified Multimodal Understanding and Generation Models
NXMP is a video player for Nintendo Switch based on MPV
High-performance multiple object tracking based on YOLO, Deep SORT, and KLT 🚀
C++ implementation of a ScienceDirect paper "An accelerating cpu-based correlation-based image alignment for real-time automatic optical inspection"
PlayStation 4 emulator for Windows, Linux and macOS written in C++
A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Python quantitative trading strategies including VIX Calculator, Pattern Recognition, Commodity Trading Advisor, Monte Carlo, Options Straddle, Shooting Star, London Breakout, Heikin-Ashi, Pair Tra…
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
A Comprehensive Toolkit for High-Quality PDF Content Extraction
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
Optical character recognition for Japanese text, with the main focus being Japanese manga
OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.
A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognition
An AI-driven project for manga translation, employing DETR for object detection, Trocr for text recognition on Manga109 dataset, and a Transformer architecture to translate Japanese text to English.
Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。