Add guide for Multi-Agent RAG with Gen2 and Cortex #2750

Open

i3xpl0it wants to merge 5 commits into Snowflake-Labs:master from i3xpl0it:i3xpl0it-patch-1

i3xpl0it commented Dec 7, 2025

This document provides a comprehensive guide on building a production-grade multi-agent Retrieval-Augmented Generation (RAG) system using Snowflake Gen2 Warehouses and Cortex. It includes architecture, setup instructions, and sample queries.


          Add guide for Multi-Agent RAG with Gen2 and Cortex

aca26ad

i3xpl0it temporarily deployed to staging with GitHub Actions (Inactive), December 7, 2025 14:59

snowflake-github-workato bot commented Dec 7, 2025

https://publish-p57963-e462098.adobeaemcloud.com/en/developers/guides/multi-agent-rag-gen2-cortex-aca26adc7339a66444807969911d6b1e2b46f366

Note: Image uploads happen asynchronously, so they may not be available yet if you click this right away. If so, try again in a few minutes.


          Enhanced multi-agent RAG guide with complete production code

e04e706

- Complete 6-agent architecture with full SQL/Python implementations
- Semantic chunking UDF + Cortex embeddings + external vector DB
- Gen2 warehouse optimization with 30-50% performance gains
- Production deployment, monitoring, and troubleshooting guides
- Based on Medium article about multi-agent RAG with Gen2 warehouses

i3xpl0it temporarily deployed to staging with GitHub Actions (Inactive), December 7, 2025 17:56

snowflake-github-workato bot commented Dec 7, 2025

https://publish-p57963-e462098.adobeaemcloud.com/en/developers/guides/multi-agent-rag-gen2-cortex-e04e706b971827f41ed49ba0287421430705df5d


          Standardize duration format to include 'Hours'

0589dc1

Updated duration formatting throughout the document to include 'Hours'.

i3xpl0it temporarily deployed to staging with GitHub Actions (Inactive), December 7, 2025 18:11

i3xpl0it temporarily deployed to staging with GitHub Actions (Inactive), December 7, 2025 18:12

snowflake-github-workato bot commented Dec 7, 2025

https://publish-p57963-e462098.adobeaemcloud.com/en/developers/guides/multi-agent-rag-gen2-cortex-0589dc172ef157936afde65098b71db87987ffc2


          Revise duration estimates in multi-agent RAG guide

87789cb

Updated duration estimates for various sections of the document to reflect more accurate time requirements.

i3xpl0it temporarily deployed to staging with GitHub Actions (Inactive), December 7, 2025 18:38


          Update overview in multi-agent RAG guide

a588d31

Added overview section for multi-agent RAG system.

i3xpl0it temporarily deployed to staging with GitHub Actions (Inactive), December 7, 2025 18:38

snowflake-github-workato bot commented Dec 7, 2025

https://publish-p57963-e462098.adobeaemcloud.com/en/developers/guides/multi-agent-rag-gen2-cortex-87789cb730a5cc8708008258f86ac53587463424

snowflake-github-workato bot commented Dec 7, 2025

https://publish-p57963-e462098.adobeaemcloud.com/en/developers/guides/multi-agent-rag-gen2-cortex-a588d3148d251855f1045391fba1d55e7d284ea9

Author

i3xpl0it commented Dec 8, 2025

Could a maintainer please assign a reviewer for this PR? This guide covers building a production-grade multi-agent RAG system using Snowflake Gen2 Warehouses and Cortex. Thank you!

sfc-gh-kkomail requested review from jamescha-earley and sfc-gh-jreini

December 8, 2025 20:36

sfc-gh-jreini requested changes

Contributor

sfc-gh-jreini left a comment

Requested some changes

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md


		nltk.download('punkt', quiet=True)

		def chunk_doc(doc_text: str) -> List[str]:

Contributor

sfc-gh-jreini Dec 8, 2025

Can you use the native PARSE_DOCUMENT feature instead of a Python UDF?

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              import numpy as np
+              import _snowflake
+              def search_vectors(query_embedding: list, top_k: int) -> list:

Contributor

sfc-gh-jreini Dec 8, 2025

Can you replicate this with Cortex Search instead of Pinecone? Or is the Pinecone integration something you're trying to show in the guide?

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              ### Create Document Retriever Agent
+              ```sql
+              CREATE OR REPLACE FUNCTION document_retriever_agent(user_query STRING)

Contributor

sfc-gh-jreini Dec 8, 2025

This seems more like a retrieval tool than an agent: no agent reasoning is used here, it just returns documents. Suggest renaming it.

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              ```
+              ## Agent 2: SQL Generator
+              Duration: 1.5 Hours

Contributor

sfc-gh-jreini Dec 8, 2025

Very long duration here. Is that intended?

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              LANGUAGE SQL
+              AS
+              $$
+                SELECT SNOWFLAKE.CORTEX.COMPLETE(

Contributor

sfc-gh-jreini Dec 8, 2025

Use AI_COMPLETE instead of SNOWFLAKE.CORTEX.COMPLETE

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              ### Create SQL Generation Agent
+              ```sql
+              CREATE OR REPLACE FUNCTION sql_generator_agent(user_question STRING)

Contributor

sfc-gh-jreini Dec 8, 2025

Better to use Cortex Analyst instead of hand-rolling this SQL generator agent. The quality of SQL generation will be much higher, and the approach is more extensible.

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              ### Business Logic Layer
+              ```sql
+              CREATE OR REPLACE FUNCTION semantic_model_agent(entity STRING, operation STRING)

Contributor

sfc-gh-jreini Dec 8, 2025

What is this agent doing?

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+                synthesized_response AS (
+                  SELECT
+                    SNOWFLAKE.CORTEX.COMPLETE(
+                      'mixtral-8x7b',

Contributor

sfc-gh-jreini Dec 8, 2025

Would recommend using a stronger model here.

site/sfguides/src/multi-agent-rag-gen2-cortex/multi-agent-rag-gen2-cortex.md

+              AS
+              $$
+                WITH agent_results AS (
+                  -- Execute all agents in parallel on Gen2 warehouse

Contributor

sfc-gh-jreini Dec 8, 2025

Why execute all agents in parallel if some of them may not be needed, depending on the query?

Better to have the coordinator agent do planning/tool selection instead of just using all of the agents at once no matter what.
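
To illustrate the planning/tool-selection idea, here is a minimal sketch in Python. It is not the guide's code: the `Tool` class, the keyword hints, and the stub functions are all hypothetical, and the keyword match is only a stand-in for the reasoning step a real coordinator would delegate to an LLM. The point it shows is that the coordinator first decides which tools a query needs and then runs only those.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Tool:
    """A callable tool plus routing hints (hypothetical; stands in for LLM planning)."""
    name: str
    keywords: Tuple[str, ...]
    run: Callable[[str], str]


def retrieve_documents(query: str) -> str:
    # Stub for a document retrieval tool; a real one would query a search service.
    return f"docs for: {query}"


def generate_sql(query: str) -> str:
    # Stub for a text-to-SQL tool; a real one would call a SQL generation service.
    return f"sql for: {query}"


# Assumed tool registry; names and keywords are illustrative only.
TOOLS: List[Tool] = [
    Tool("document_retriever", ("policy", "document", "guide"), retrieve_documents),
    Tool("sql_generator", ("how many", "total", "average", "revenue"), generate_sql),
]


def plan(query: str, tools: List[Tool] = TOOLS) -> List[Tool]:
    """Select only the tools whose hints match the query; fall back to the first tool."""
    lowered = query.lower()
    selected = [t for t in tools if any(k in lowered for k in t.keywords)]
    return selected or [tools[0]]


def coordinate(query: str) -> Dict[str, str]:
    """Run only the planned tools and collect their outputs keyed by tool name."""
    return {tool.name: tool.run(query) for tool in plan(query)}
```

With this shape, a question such as "What is the total revenue by region?" only triggers the SQL tool, while a question that touches both structured and unstructured data triggers both. Swapping the keyword match for a model call would keep the same `plan` / `coordinate` split.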
