FeatherFlow

Project Vision

FeatherFlow is a Rust-based alternative to dbt (data build tool). It aims to provide statically checked data transformations and the speed of a compiled binary, addressing common pain points in the data transformation workflow by analyzing SQL before it runs.

Installation

Pre-built Binaries

You can download the latest pre-built binaries from the GitHub Releases page.

Linux

# For x86_64 systems
curl -L -o featherflow https://github.com/datastx/FeatherFlow/releases/latest/download/featherflow-linux-amd64
chmod +x featherflow
sudo mv featherflow /usr/local/bin/

# For ARM64 systems
curl -L -o featherflow https://github.com/datastx/FeatherFlow/releases/latest/download/featherflow-linux-arm64
chmod +x featherflow
sudo mv featherflow /usr/local/bin/

macOS

# For Intel Macs
curl -L -o featherflow https://github.com/datastx/FeatherFlow/releases/latest/download/featherflow-macos-amd64
chmod +x featherflow
sudo mv featherflow /usr/local/bin/

# For Apple Silicon (M1/M2)
curl -L -o featherflow https://github.com/datastx/FeatherFlow/releases/latest/download/featherflow-macos-arm64
chmod +x featherflow
sudo mv featherflow /usr/local/bin/

Windows

Download the Windows binary and add it to your PATH.

Why FeatherFlow?

Problems with existing tools like dbt:

Runtime SQL errors that could be caught earlier with static analysis
Performance limitations of Python-based tooling
Limited capabilities for deep SQL introspection and optimization

FeatherFlow's approach:

Parse SQL into Abstract Syntax Trees (ASTs) to perform static analysis
Catch schema issues, syntax errors, and semantic problems before execution
Compile transformation pipelines to efficient execution plans
Provide a robust, type-safe experience for data engineers
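
As a rough illustration of catching a problem before execution, the sketch below flags tables that a query references but that are not in a known set. It is not FeatherFlow's implementation: the function names are hypothetical, and a naive whitespace tokenizer stands in for real AST-based parsing so the example needs only the standard library.

```rust
use std::collections::BTreeSet;

/// Collect identifiers that directly follow FROM or JOIN.
/// NOTE: the whitespace tokenizer is a deliberate simplification. It ignores
/// CTEs, comments, and quoting, all of which a real AST would handle.
pub fn referenced_tables(sql: &str) -> BTreeSet<String> {
    let mut tables = BTreeSet::new();
    let mut expect_table = false;
    for token in sql.split_whitespace() {
        if expect_table {
            // Strip punctuation that may be glued to the table name.
            let name = token.trim_matches(|c: char| c == ',' || c == ';' || c == ')');
            // A token starting with '(' opens a subquery, not a table name.
            if !name.is_empty() && !name.starts_with('(') {
                tables.insert(name.to_lowercase());
            }
            expect_table = false;
        } else {
            let upper = token.to_uppercase();
            expect_table = upper == "FROM" || upper == "JOIN";
        }
    }
    tables
}

/// Return referenced tables missing from `known` (hypothetical static check).
pub fn unknown_references(sql: &str, known: &BTreeSet<String>) -> Vec<String> {
    referenced_tables(sql)
        .into_iter()
        .filter(|table| !known.contains(table))
        .collect()
}
```

A check of this kind can run over every model file and report missing tables without connecting to a database.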

Architecture & Features

Core Components

FeatherFlow is built on three main architectural components:

SQL Parser: Analyzes SQL statements and transforms them into Abstract Syntax Trees (ASTs)
- Uses the sqlparser-rs crate for robust SQL parsing
- Enables deep analysis of query structure and references
Graph Builder: Creates and manipulates the dependency model
- Constructs directed acyclic graphs (DAGs) from model dependencies
- Provides graph traversal and analysis algorithms
- Implements visualization capabilities for dependency graphs
CLI: Provides a user-friendly command-line interface
- Offers commands for parsing, analyzing, and visualizing SQL models
- Supports configuration options for customizing output

Key User Benefits

Early Error Detection: Catch schema mismatches and SQL errors before execution
Dependency Visualization: Understand and document your data pipeline's structure
Pipeline Validation: Verify your transformation flow is correctly structured
Performance: Compiled Rust implementation avoids the interpreter overhead of Python-based tooling
Multiple Output Formats: Generate dependency documentation as text, DOT graphs, or JSON
Workflow Integration: A standalone CLI binary whose text, DOT, and JSON output can be piped into existing scripts and tools

Current Development Status

FeatherFlow v0.1 focuses on a minimum viable implementation with these capabilities:

Parsing SQL files to extract table dependencies
Building dependency graphs (DAGs) between models
Detecting circular dependencies
Outputting dependency information in various formats (text, DOT, JSON)
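
Circular dependency detection can be sketched as a depth-first search that tracks the current path. This is one common approach, not necessarily the one FeatherFlow uses, and `find_cycle` and `visit` are hypothetical names.

```rust
use std::collections::{BTreeMap, BTreeSet};

/// Return one dependency cycle as a path whose first node is repeated at the
/// end, or None if the graph is acyclic. `graph` maps model -> dependencies.
pub fn find_cycle(graph: &BTreeMap<String, Vec<String>>) -> Option<Vec<String>> {
    let mut finished = BTreeSet::new(); // nodes already fully explored
    let mut stack = Vec::new(); // nodes on the current DFS path
    for start in graph.keys() {
        if let Some(cycle) = visit(start, graph, &mut finished, &mut stack) {
            return Some(cycle);
        }
    }
    None
}

fn visit(
    node: &str,
    graph: &BTreeMap<String, Vec<String>>,
    finished: &mut BTreeSet<String>,
    stack: &mut Vec<String>,
) -> Option<Vec<String>> {
    if finished.contains(node) {
        return None;
    }
    // Reaching a node that is already on the current path closes a cycle.
    if let Some(pos) = stack.iter().position(|n| n.as_str() == node) {
        let mut cycle = stack[pos..].to_vec();
        cycle.push(node.to_string());
        return Some(cycle);
    }
    stack.push(node.to_string());
    if let Some(deps) = graph.get(node) {
        for dep in deps {
            if let Some(cycle) = visit(dep, graph, finished, stack) {
                return Some(cycle);
            }
        }
    }
    stack.pop();
    finished.insert(node.to_string());
    None
}
```

Returning the full path rather than a boolean lets an error message name every model involved in the cycle.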

Usage Examples

FeatherFlow CLI currently supports two commands:

# Show version information
ff version

# Parse SQL files and build a dependency graph
ff parse --model-path <path> [--format <format>]

Parse Command Examples

# Parse SQL files in the models directory and show text output
ff parse --model-path ./models

# Generate DOT format for visualization
ff parse --model-path ./models --format dot > models.dot
dot -Tpng -o models.png models.dot

# Output as JSON for further processing
ff parse --model-path ./models --format json > models.json

You can also use the Makefile to run these examples:

# Run parser on example models with text output
make parse-example

# Generate DOT graph from example models
make parse-dot

# Generate JSON representation from example models
make parse-json
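
For readers unfamiliar with DOT, the sketch below shows how a dependency map could be rendered as a Graphviz digraph. The exact output of ff parse --format dot is not documented here, so the graph name, node quoting, and edge direction are assumptions, and `to_dot` is a hypothetical function.

```rust
use std::collections::BTreeMap;

/// Render a model -> dependencies map as a Graphviz DOT digraph.
pub fn to_dot(graph: &BTreeMap<String, Vec<String>>) -> String {
    let mut out = String::from("digraph models {\n");
    for (model, deps) in graph {
        // Declare every model so that isolated nodes still appear.
        out.push_str(&format!("    \"{}\";\n", model));
        for dep in deps {
            // Assumed direction: from a dependency to the model that uses it.
            out.push_str(&format!("    \"{}\" -> \"{}\";\n", dep, model));
        }
    }
    out.push_str("}\n");
    out
}
```

Text in this form can be passed to the dot command to produce an image.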

Development

Building the Project

# Build the project
make build

# Run linting checks
make lint

# Format code
make fmt

Builds and Releases

Continuous Builds

Every push to the main or master branch automatically:

Builds binaries for all supported platforms (Linux, macOS, Windows)
Archives the binaries as GitHub Actions artifacts for 14 days

No GitHub release is created for these builds.

To access these builds, go to the GitHub Actions tab, select the most recent workflow run, and download the featherflow-all-binaries artifact.

Creating a Release

To create an official release with prebuilt binaries:

Use the make command to create a release tag:
```
make release
git push origin v0.1.0
```
Alternatively, manually trigger the release workflow from the GitHub Actions tab, providing the version tag.

The GitHub Actions workflow will automatically:

Build binaries for Linux (x86_64, ARM64), macOS (x86_64, ARM64), and Windows
Create a GitHub Release with these binaries
Generate release notes based on the commits since the last release

Cross-Platform Builds

You can build for specific target platforms using:

# Build for a specific target platform
make target TARGET=<platform>

# Build release version for a specific target platform
make target-release TARGET=<platform>

# Example: Build for ARM64 Linux
make target-release TARGET=aarch64-unknown-linux-gnu

Running Tests

You can run tests using the following commands:

# Run all tests in the project
make test

# Run only the sql_engine/ast_utils tests
make test-module MODULE=sql_engine::ast_utils

# Run a specific test
make test-single TEST=test_simple_select

# Run tests with output (even for passing tests)
make test-verbose

# Run tests with coverage report
make test-coverage

Test Structure

For tests to be discovered by Cargo:

Unit tests should be in the same file as the code they test, in a mod tests module
Integration tests should be directly in the tests/ directory (not in tests/src/)

If your tests aren't running, ensure they follow this structure.
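
As an illustration of the unit test layout, the sketch below places a mod tests module in the same file as the code it tests. The function `normalize_table_name` is a hypothetical helper invented for this example and is not part of FeatherFlow.

```rust
/// Normalize a table reference for comparison (hypothetical helper).
pub fn normalize_table_name(raw: &str) -> String {
    raw.trim().trim_matches('"').to_lowercase()
}

// Compiled only for test builds; Cargo discovers the #[test] functions inside.
#[cfg(test)]
mod tests {
    use super::*;

    #[test]
    fn lowercases_and_trims() {
        assert_eq!(normalize_table_name("  Raw.Orders "), "raw.orders");
    }

    #[test]
    fn strips_surrounding_quotes() {
        assert_eq!(normalize_table_name("\"Orders\""), "orders");
    }
}
```

Assuming the make targets pass the test name through to Cargo, a single test from this module could be run with make test-single TEST=lowercases_and_trims.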

Additional Make Targets

Run make help to see all available make targets with descriptions.
