docs2llm

A command-line tool to extract documentation from local directories and GitHub repositories, formatting it for use as context with Large Language Models (LLMs).

Purpose

docs2llm helps you capture documentation from codebases to use as context for AI assistants and large language models. It searches for documentation files (markdown, text, etc.), processes them, and creates a single consolidated file that can be used as reference material for LLMs.

Features

Extract documentation from local directories or GitHub repositories
Automatically identify and process common documentation files
Prioritize README files and important documentation
Support for multiple file formats (Markdown, RST, TXT)
Format output for optimal LLM context
Control scan depth to manage output size
Clone specific branches from Git repositories
Detailed logging with configurable verbosity

Installation

# Install from PyPI
pip install docs2llm

Usage

Command Line Interface

# Extract docs from a local directory
docs2llm /path/to/project --output context.txt

# Extract docs from a GitHub repository
docs2llm --git owner/repo --output context.txt

# Specify a branch
docs2llm --git owner/repo --branch develop

# Control scan depth
docs2llm /path/to/project --max-depth 2

# Enable verbose logging
docs2llm /path/to/project -v

# Write logs to a file
docs2llm /path/to/project --log-file extraction.log

Options

PATH: Local directory containing documentation files
--git: GitHub repository URL or owner/repo format
--output: Output file name (default: llm_context.txt)
--max-depth: Maximum directory depth to search (default: 3)
--branch: Specific branch to clone (only used with --git)
--verbose, -v: Enable verbose logging
--log-file: Log to this file in addition to console

Python API

from docs2llm import extract_documentation

# Extract from local directory
success = extract_documentation(
    local_path="/path/to/project",
    output_file="context.txt",
    max_depth=3,
    verbose=True
)

# Extract from GitHub repository
success = extract_documentation(
    git_repo="owner/repo",
    output_file="context.txt",
    branch="main",
    verbose=True
)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
src/docs2llm		src/docs2llm
tests		tests
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
test.log		test.log
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

docs2llm

Purpose

Features

Installation

Usage

Command Line Interface

Options

Python API

About

Uh oh!

Releases 1

Uh oh!

Languages

License

nklsw/docs2llm

Folders and files

Latest commit

History

Repository files navigation

docs2llm

Purpose

Features

Installation

Usage

Command Line Interface

Options

Python API

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Uh oh!

Languages