Skip to content

Collection of reference workflows for building intelligent agents with NIMs

License

Notifications You must be signed in to change notification settings

Abdelhameed88/metropolis-nim-workflows

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

NVIDIA Metropolis Generative AI Workflows

Generative AI such as Large Language Models (LLMs) and Vision Language Models (VLMs) are used to build powerful workflows and agents to solve critical business problems. This repository hosts training materials, reference applications and workflows to build Visual AI Agents using NVIDIA NIM microservices. You can get started for free with no GPU required!

If you find this repository helpful, leave a star and share the link 🙂.

Table of Contents

News 📰

  • 9/25 - Llama 3.2 Vision NIM added to build.nvidia.com. Workflows updated to support Llama 3.2 Vision.
  • 9/24 - Added VIA workflows
  • 9/9 - VILA VLM NIM added to build.nvidia.com
  • 8/21 - Added NVDINOv2 Few Shot Classification workflow
  • 8/16 - Added Structured Text Extraction workflow
  • 8/7 - Added NVCLIP Semantic Search workflow
  • 6/26 - Added VLM Alert workflow

Getting Started 🚀

To access NIM Microservices, visit build.nvidia.com to create an account and generate an API key. Each new account can receive up to 5,000 free credits. These credits will give you free access to preview APIs that allow you to use state of the art generative AI models such as Llama3.2 Vision, Nemotron, Mistral, VILA and much more with no GPU required!

Prerequisites

  1. Go to build.nvidia.com and use your email to sign up.

  2. After making an account, you can get an API Key by selecting any of the available NIMs then in the example code section, click on "Get API Key" then "Generate Key".

  1. You will then see your API Key that will look something like "nvapi-xxx-xxxxxxxx_xxxxxxxxxxxxxxxx_xxxxxxxxxxx-xxxxxxx-xxxxxxxxxxxxxx". This full key is what will be used in the example notebooks and scripts.

You can now continue to explore the workflows in the next sections.

NIM microservice workflows

NVIDIA NIMs are GPU-accelerated AI models wrapped in an easy to use REST API interface. To help developers get started, each NIM has a hosted preview API that is accessible after generating an API token from build.nvidia.com. The preview APIs can be used for free to develop and experiment with state of the art AI models including LLMs, VLMs, Embedding and CV models.

The workflows listed under this section make use of the preview APIs and do not require a local GPU! You can run these workflows on nearly any computer and each one is quick and easy to launch. Each workflow includes a Jupyter notebook workshop that walks through how to build with NIM Preview APIs in Python.

Follow the links below to start running the workflows:

VIA microservice workflows

NVIDIA VIA Microservices are cloud-native building blocks to build AI agents capable of processing large amounts of live or archived videos and images with Vision-Language Models (VLM).

At a minimum VIA requires a NIM API Key and a local graphics card. A consumer RTX card is enough to get started.

First follow the VIA specific setup steps

Then you can explore the following VIA workflows from this repository:

Changelog 📝

  • rel-2.1: Add support for Llama 3.2 Vision
  • rel-2.0: Reorganize repository. Add VIA Workflows. Update READMEs.
  • rel-1.3.1: Update VLM Alert workflow with VILA
  • rel-1.3: Add NVDINOv2 Few Shot Classification workflow.
  • rel-1.2: Add Structured Text Extraction Workflow.
  • rel-1.1: Add Websocket server output for VLM Alert workflow. Add NVCLIP Semantic Search workflow.
  • rel-1.0: Add VLM Alert workflow

Questions, Discussion, and Bugs ❔

If you find any bugs, have questions or want to start a discussion around the workflows feel free to file an issue. Any suggestions, feedback and new ideas are also welcome 🙂.

Related Resources 🔗

Relevant technical blogs that explore building AI Agents. Technical Blogs

Web Pages
Relevant web pages to find more information about NIMs, Metropolis and VIA.

GitHub Repositories
The following GitHub repositories include more examples of how to build with NIM microservices.

NVIDIA Developer Forums For questions and discussions, feel free to post under the appropriate topic on our developer forums.

Contributors ⭐

Thanks to the following people for contributing to our workflows:

About

Collection of reference workflows for building intelligent agents with NIMs

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 52.4%
  • Python 45.9%
  • Shell 1.2%
  • Dockerfile 0.5%