# Multimodal AI Inference
This repository is a growing collection of scripts for running inference with cutting-edge multimodal AI models, in particular large vision models (LVMs), over secure SSH connections. It covers both vision and language tasks.
## Features

- **Qwen Inference**: Leverages Qwen, a robust language model, to process multimodal input through `qwen_inference.py` (illustrative sketch under Files below).
- **Llava Next Integration**: Adds advanced visual understanding with Llava Next via `llava_next_inference.py` (also sketched under Files below).
- **Continuous Expansion**: As more large vision models are explored and integrated, the repository will grow with additional inference scripts.
- **SSH-based Inference**: All inference runs remotely over SSH, providing scalable and secure access to compute resources; a minimal connection sketch follows this list.
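
To make the SSH workflow concrete, here is a minimal sketch using `paramiko`. The host name, username, and command-line flags are illustrative assumptions, not this repository's actual interface, and key-based authentication is assumed.

```python
# Minimal sketch of SSH-based remote inference (assumptions noted above).
import paramiko

def run_remote_inference(host: str, user: str, command: str) -> str:
    """Run an inference command on a remote GPU host and return its stdout."""
    client = paramiko.SSHClient()
    # For a sketch we auto-accept unknown hosts; prefer a known_hosts file in practice.
    client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
    client.connect(hostname=host, username=user)  # key-based auth assumed
    try:
        _, stdout, stderr = client.exec_command(command)
        err = stderr.read().decode()
        if err:
            raise RuntimeError(f"Remote inference failed: {err}")
        return stdout.read().decode()
    finally:
        client.close()

# Hypothetical invocation; the script's real CLI flags may differ.
print(run_remote_inference(
    host="gpu-server.example.com",
    user="researcher",
    command="python qwen_inference.py --image /data/cat.jpg --prompt 'Describe this image.'",
))
```

Running the script on the remote host keeps model weights and GPUs server-side; only the prompt and the generated text cross the wire.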
## Files

- `qwen_inference.py`: Script for running inference tasks using the Qwen model (sketch below).
- `llava_next_inference.py`: Inference script for Llava Next, aimed at advanced visual understanding (sketch below).
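
For orientation, here is a hedged sketch of what `qwen_inference.py` might contain, following the published Qwen-VL-Chat usage from its Hugging Face model card. The checkpoint name, image path, and prompt are illustrative assumptions, not this repository's actual code.

```python
# Hypothetical qwen_inference.py core, per the Qwen-VL-Chat model card.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen-VL-Chat", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    "Qwen/Qwen-VL-Chat",
    device_map="auto",       # spread layers across available GPUs
    trust_remote_code=True,  # Qwen-VL ships custom modeling code
).eval()

# Qwen-VL interleaves image references and text in a single query.
query = tokenizer.from_list_format([
    {"image": "/data/cat.jpg"},        # illustrative path
    {"text": "Describe this image."},
])
response, _history = model.chat(tokenizer, query=query, history=None)
print(response)
```

Likewise, a sketch of what `llava_next_inference.py` could look like, based on the `transformers` LLaVA-NeXT API; the checkpoint, prompt template, and image path are again assumptions.

```python
# Hypothetical llava_next_inference.py core, per the transformers LLaVA-NeXT docs.
import torch
from PIL import Image
from transformers import LlavaNextProcessor, LlavaNextForConditionalGeneration

model_id = "llava-hf/llava-v1.6-mistral-7b-hf"
processor = LlavaNextProcessor.from_pretrained(model_id)
model = LlavaNextForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("/data/cat.jpg")  # illustrative path
# Mistral-style chat template with an <image> placeholder, per the model card.
prompt = "[INST] <image>\nDescribe this image. [/INST]"

inputs = processor(images=image, text=prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(processor.decode(output[0], skip_special_tokens=True))
```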
This repository is designed for AI researchers and developers working with large vision models. It supports remote deployment and inference of state-of-the-art vision and multimodal models through secure SSH access.
## Future Plans

- Integration of additional large vision models for comprehensive multimodal tasks.
- Support for larger datasets and batch processing capabilities.
- Performance benchmarking and optimization for inference tasks.