Examples, recipes, and best-practice pipelines for the Pipelex AI workflow framework.
Learn by doing with production-ready examples.
git clone https://github.com/Pipelex/pipelex-cookbook.git
cd pipelex-cookbook
# Create and activate virtual environment
python -m venv .venv
# or python3 -m venv .venv
source .venv/bin/activate
# Install
pip install . # or uv syncTo use AI models, you need an API key:
- Free Pipelex API Key: Join our Discord community and request your free API key (no credit card required) in the πγ»free-api-key channel.
- Bring your own API keys: OpenAI, Anthropic, Google, Mistral
- Local AI: Ollama, vLLM, LM Studio, llama.cpp... any endpoint based on the OpenAI API or not, as you can plug-in your own non-standard APIs.
See Configure AI Providers for details.
Try the hello world example:
python examples/_quick_start/hello_world.pyOr explore other cookbook examples:
# Extract data from a Gantt chart image
python examples/extract_gantt/extract_gantt.py
# Extract and summarize invoice information
python examples/invoice_extractor/invoice_extractor.py
# Multi-step text summarization
python examples/_quick_start/summarize_2_steps.pyCreate a complete AI workflow with a single command:
pipelex build pipe "Take a CV and Job offer in PDF, analyze if they match and generate 5 questions for the interview" --output results/cv_match.plxThis command generates a production-ready .plx file with domain definitions, concepts, and multiple processing steps that analyzes CV-job fit and prepares interview questions.
cv_match.plx
domain = "cv_match"
description = "Matching CVs with job offers and generating interview questions"
main_pipe = "analyze_cv_job_match_and_generate_questions"
[concept.MatchAnalysis]
description = """
Analysis of alignment between a candidate and a position, including strengths, gaps, and areas requiring further exploration.
"""
[concept.MatchAnalysis.structure]
strengths = { type = "text", description = "Areas where the candidate's profile aligns well with the requirements", required = true }
gaps = { type = "text", description = "Areas where the candidate's profile does not meet the requirements or lacks evidence", required = true }
areas_to_probe = { type = "text", description = "Topics or competencies that need clarification or deeper assessment during the interview", required = true }
[concept.Question]
description = "A single interview question designed to assess a candidate."
refines = "Text"
[pipe.analyze_cv_job_match_and_generate_questions]
type = "PipeSequence"
description = """
Main pipeline that orchestrates the complete CV-job matching and interview question generation workflow. Takes a candidate's CV and a job offer as PDF documents, extracts their content, performs a comprehensive match analysis identifying strengths, gaps, and areas to probe, and generates exactly 5 targeted interview questions based on the analysis results.
"""
inputs = { cv_pdf = "PDF", job_offer_pdf = "PDF" }
output = "Question[5]"
steps = [
{ pipe = "extract_documents_parallel", result = "extracted_documents" },
{ pipe = "analyze_match", result = "match_analysis" },
{ pipe = "generate_interview_questions", result = "interview_questions" },
]π Click to view the supporting pipes implementation
[pipe.extract_documents_parallel]
type = "PipeParallel"
description = """
Executes parallel extraction of text content from both the CV PDF and job offer PDF simultaneously to optimize processing time.
"""
inputs = { cv_pdf = "PDF", job_offer_pdf = "PDF" }
output = "Dynamic"
parallels = [
{ pipe = "extract_cv_text", result = "cv_pages" },
{ pipe = "extract_job_offer_text", result = "job_offer_pages" },
]
add_each_output = true
[pipe.extract_cv_text]
type = "PipeExtract"
description = """
Extracts text content from the candidate's CV PDF document using OCR technology, converting all pages into machine-readable text format for subsequent analysis.
"""
inputs = { cv_pdf = "PDF" }
output = "Page[]"
model = "extract_text_from_pdf"
[pipe.extract_job_offer_text]
type = "PipeExtract"
description = """
Extracts text content from the job offer PDF document using OCR technology, converting all pages into machine-readable text format for subsequent analysis.
"""
inputs = { job_offer_pdf = "PDF" }
output = "Page[]"
model = "extract_text_from_pdf"
[pipe.analyze_match]
type = "PipeLLM"
description = """
Performs comprehensive analysis comparing the candidate's CV against the job offer requirements. Identifies and structures: (1) strengths where the candidate's profile aligns well with requirements, (2) gaps where the profile lacks evidence or doesn't meet requirements, and (3) specific areas requiring deeper exploration or clarification during the interview process.
"""
inputs = { cv_pages = "Page[]", job_offer_pages = "Page[]" }
output = "MatchAnalysis"
model = "llm_to_answer_hard_questions"
system_prompt = """
You are an expert HR analyst and recruiter specializing in candidate-job fit assessment. Your task is to generate a structured MatchAnalysis comparing a candidate's CV against job requirements.
"""
prompt = """
Analyze the match between the candidate's CV and the job offer requirements.
Candidate CV:
@cv_pages
Job Offer:
@job_offer_pages
Perform a comprehensive comparison and provide a structured analysis.
"""
[pipe.generate_interview_questions]
type = "PipeLLM"
description = """
Generates exactly 5 targeted, relevant interview questions based on the match analysis results. Questions are designed to probe identified gaps, clarify areas of uncertainty, validate strengths, and assess competencies that require deeper evaluation to determine candidate-position fit.
"""
inputs = { match_analysis = "MatchAnalysis" }
output = "Question[5]"
model = "llm_to_write_questions"
system_prompt = """
You are an expert HR interviewer and talent assessment specialist. Your task is to generate structured interview questions based on candidate-position match analysis.
"""
prompt = """
Based on the following match analysis between a candidate and a position, generate exactly 5 targeted interview questions.
@match_analysis
The questions should:
- Probe the identified gaps to assess if they are deal-breakers or can be mitigated
- Clarify areas that require deeper exploration
- Validate the candidate's strengths with concrete examples
- Be open-ended and behavioral when appropriate
- Help determine overall candidate-position fit
Generate exactly 5 interview questions.
"""View the pipeline flowchart:
flowchart TD
subgraph PAR["extract_documents_parallel (PipeParallel)"]
direction LR
EXTRACT_CV["extract_cv_text (PipeExtract)"]
EXTRACT_JOB["extract_job_offer_text (PipeExtract)"]
end
subgraph MAIN["analyze_cv_job_match_and_generate_questions (PipeSequence)"]
direction TB
PAR
CV_PAGES[["cv_pages: Page"]]
JOB_PAGES[["job_offer_pages: Page"]]
ANALYZE["analyze_match (PipeLLM)"]
MATCH[["MatchAnalysis"]]
GENERATE["generate_interview_questions (PipeLLM)"]
OUT[["Question"]]
end
CV_IN[["cv_pdf: PDF"]] --> EXTRACT_CV
JOB_IN[["job_offer_pdf: PDF"]] --> EXTRACT_JOB
EXTRACT_CV --> CV_PAGES
EXTRACT_JOB --> JOB_PAGES
CV_PAGES --> ANALYZE
JOB_PAGES --> ANALYZE
ANALYZE --> MATCH
MATCH --> GENERATE
GENERATE --> OUT
classDef default stroke:#1976D2,stroke-width:2px,fill:#E3F2FD,color:#0D47A1
style EXTRACT_CV stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
style EXTRACT_JOB stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
style PAR fill:#FFF9C4,stroke:#F57C00,stroke-width:2px
style CV_PAGES stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
style JOB_PAGES stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
style ANALYZE stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
style MATCH stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
style GENERATE stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
style OUT stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
style CV_IN stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
style JOB_IN stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
style MAIN fill:#F3E5F5,stroke:#7B1FA2,stroke-width:2px
Run the pipeline:
# Via CLI with input file
pipelex run results/cv_match.plx --inputs inputs.jsonCreate an inputs.json file with your PDF URLs:
{
"cv_pdf": {
"concept": "PDF",
"content": {
"url": "https://pipelex-web.s3.amazonaws.com/demo/John-Doe-CV.pdf"
}
},
"job_offer_pdf": {
"concept": "PDF",
"content": {
"url": "https://pipelex-web.s3.amazonaws.com/demo/Job-Offer.pdf"
}
}
}Or via Python:
import asyncio
import json
from pipelex.pipeline.execute import execute_pipeline
from pipelex.pipelex import Pipelex
async def run_pipeline():
with open("inputs.json", encoding="utf-8") as f:
inputs = json.load(f)
pipe_output = await execute_pipeline(
pipe_code="cv_match",
inputs=inputs
)
print(pipe_output.main_stuff_as_str)
Pipelex.make()
asyncio.run(run_pipeline())Install AI assistant rules to easily modify your pipelines:
pipelex kit rulesThis installs rules for Cursor, Claude, OpenAI Codex, GitHub Copilot, Windsurf, and Blackbox AI. Now you can refine pipelines with natural language:
- "Include confidence scores between 0 and 100 in the match analysis"
- "Write a recap email at the end"
Pipelex is an open-source language that enables you to build and run repeatable AI workflows. Instead of cramming everything into one complex prompt, you break tasks into focused steps, each pipe handling one clear transformation.
Each pipe processes information using Concepts (typing with meaning) to ensure your pipelines make sense. The Pipelex language (.plx files) is simple and human-readable, even for non-technical users. Each step can be structured and validated, giving you the reliability of software with the intelligence of AI.
We highly recommend installing our extension for .plx files into your IDE. You can find it in the Open VSX Registry. It's coming soon to VS Code marketplace too. If you're using Cursor, Windsurf or another VS Code fork, you can search for it directly in your extensions tab.
.
βββ examples/ # Production-ready examples
β βββ _quick_start/ # Getting started tutorials
β βββ extract_gantt/ # Extract structured data from Gantt charts
β βββ extract_dpe/ # Extract energy performance diagnostics
β βββ invoice_extractor/ # Invoice data extraction
β βββ extract_table/ # Table extraction from images
β βββ synthesize/ # Data synthesis examples
β βββ wip/ # Work in progress examples
βββ assets/ # Sample data files for examples
βββ tests/ # Test suite for all examples
βββ utils/ # Helper utilities
The cookbook contains production-ready examples covering various use cases:
- Hello World - Your first Pipelex pipeline
- Simple OCR - Extract text from documents
- Summarization - Multi-step text summarization with structured output
- Invoice Extractor - Extract structured data from invoices
- Expense Report - Process and validate expense reports
- DPE Extraction - Extract energy performance diagnostics
- Generic Document - Extract content from any document type
- Gantt Chart - Extract project timelines from visual diagrams
- Table Extraction - Extract structured tables from images
- Data Synthesis - Generate synthetic data based on schemas
- Advisory Board (WIP) - Multi-agent advisory system
- Newsletter Generation (WIP) - Automated newsletter creation
Each example includes:
- Complete
.plxpipeline definition - Python execution script
- Sample input data in
assets/ - Structured output models where applicable
Learn More:
- Writing Workflows Tutorial - Complete guide with examples
- Build Reliable AI Workflows - Deep dive into Pipelex
- Configuration Guide - Set up AI providers and models
- Cookbook Examples - Detailed documentation of cookbook examples
We β€οΈ contributions! Before opening a pull request, please:
- Read
CONTRIBUTING.mdand the main repository's contributing guide. - Add your file under
examples/wip/<your-folder>; feel free to group related examples by topic. - Include a short README or docstring at the top describing purpose, inputs, and expected outputs.
- Verify the pipeline runs locally with a free/open LLM preset when possible, to lower the entry barrier for reviewers.
Tip: If you're unsure whether your idea fits, open a GitHub Discussion firstβfeedback is fast and public.
Join our vibrant Discord community to connect with other developers, share your experiences, and get help with your Pipelex projects!
| Channel | Use case |
|---|---|
| GitHub Discussions β "Show & Tell" | Share ideas, brainstorm, get early feedback. |
| GitHub Issues | Report bugs or request features. |
| Discord | Real-time chat β https://go.pipelex.com/discord |
| Email (privacy & security) | security@pipelex.com |
| Documentation | Comprehensive guides and API reference |
If you find Pipelex helpful, please consider giving us a star on both repositories! It helps us reach more developers and continue improving the tool.
This project is licensed under the MIT license. Runtime dependencies are distributed under their own licenses via PyPI.
Happy piping! π
"Pipelex" is a trademark of Evotis S.A.S.
Β© 2025 Evotis S.A.S.