Skip to content

AI Video Retrieval: A Semantic Search & Timestamp Alignment System

Notifications You must be signed in to change notification settings

iaminhri/semanticSearch

Repository files navigation

GitHub followers GitHub commit activity


Logo

AI Video Retrieval: A Semantic Search & Timestamp Alignment System

View Project
View Demo

Table of Contents
  1. About The Project
  2. Project Summary
  3. Built With
  4. Getting Started
  5. Usage
  6. Top Contributors
  7. License
  8. Acknowledgments

About The Project

The AI Video Retrieval (AIVR) system enhances video search by integrating deep learning models for speech recognition, image captioning, and embedding generation. It uses txtai for indexing and Django for integration, enabling real-time video processing and semantic search. The system retrieves precise video timestamps based on natural language queries. A usability study confirms its improved retrieval accuracy and efficiency over traditional methods. Future extensions include OCR, object detection, and action recognition for enhanced relevance.

Project Summary

Our Goal

To develop an AI-powered video retrieval system that enables precise semantic search within videos by leveraging deep learning models for automatic speech recognition, image captioning, and embedding generation. The system aims to enhance retrieval accuracy, efficiency, and usability by indexing multimodal data and providing timestamp-aligned search results, with potential extensions for OCR, object detection, and action recognition. The content will be formatted for various platforms, including:


Objectives

  • Develop an AI-driven semantic video retrieval system – Leverage deep learning models for speech recognition, image captioning, and embedding generation to enable accurate and efficient video search.
  • Enhance retrieval accuracy and timestamp precision – Implement multimodal indexing and vector embeddings to improve search relevance and ensure precise timestamp alignment for retrieved segments.
  • ** Ensure scalability and real-time processing** – Design a framework that supports real-time video uploads, indexing, and search queries while allowing future enhancements like OCR, object detection, and action recognition.

(back to top)

Built With

This section should list any major frameworks/libraries used to bootstrap your project. Leave any add-ons/plugins for the acknowledgements section. Here are a few examples.

  • Docker
  • Django
  • Python
  • HTML
  • CSS
  • Bootstrap

(back to top)

Getting Started

The project can be viewed and run locally by installing the following tools.

Installation, Building, and Running the Project

  1. Install Docker Container
https://www.docker.com/products/docker-desktop/
  1. Install GitHub
https://docs.github.com/en/desktop/installing-and-authenticating-to-github-desktop/installing-github-desktop
  1. Use Git Clone to create a local repo
git clone https://github.com/iaminhri/semanticSearch.git
cd semanticSearch
  1. Build the docker project by using the docker-compose-deploy.yml file.
docker-compose -f docker-compose-deploy build
  1. Run the Project:
docker-compose -f docker-compose-deploy up
  1. Access The Website:
127.0.0.1:8080

(back to top)

Usage

AI Video Retrieval System

Query & Retrieval

Web Interface: Home Page

Query & Retrieval - Compact View

Web Interface: Home Page

Upload Videos

Web Interface: Home Page

Video Archive and Index

Web Interface: Home Page

Single Query Search

Web Interface: Home Page

Multiple Query Search

Web Interface: Home Page

(back to top)

Top contributors:

(back to top)

License

See LICENSE.txt for more information.

(back to top)

Acknowledgments

The authors wish to acknowledge the Responsible & Applied Machine Learning Laboratory (RAML Lab) at the Department of Computer Science, Brock University, Canada.

(back to top)

About

AI Video Retrieval: A Semantic Search & Timestamp Alignment System

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published