FastData

Description

FastData is a code generator that analyzes your data and creates high-performance, read-only data structures for static data with support for key/value and membership queries. It supports many output languages (C#, C++, Rust, etc.), ready for inclusion in your project with zero dependencies.

Download

Name	Link
Executable	GitHub Releases
C# source generator
C# library
.NET Tool
PowerShell

Getting started

Using the executable

Download the executable
Run FastData <lang> dogs.txt

<lang> can be one of rust, cpp, or csharp.

Using the .NET CLI tool

Install the Genbox.FastData.Cli tool: dotnet tool install --global Genbox.FastData.Cli
Run FastData <lang> dogs.txt

Using the PowerShell module

Install the PowerShell module: Install-Module -Name Genbox.FastData
Run Invoke-FastData -Language <lang> -InputFile dogs.txt

Using the .NET Source Generator

Add the Genbox.FastData.SourceGenerator package to your project
Add FastDataAttribute as an assembly level attribute.

using Genbox.FastData.SourceGenerator;

[assembly: FastData<string>("Dogs", ["Labrador", "German Shepherd", "Golden Retriever"])]

internal static class Program
{
    private static void Main()
    {
        Console.WriteLine(Dogs.Contains("Labrador"));
        Console.WriteLine(Dogs.Contains("Beagle"));
    }
}

Using the C# library

Add the Genbox.FastData.Generator.CSharp NuGet package to your project.
Use the FastDataGenerator.Generate() method:

internal static class Program
{
    private static void Main()
    {
        FastDataConfig config = new FastDataConfig();
        CSharpCodeGenerator generator = CSharpCodeGenerator.Create(new CSharpCodeGeneratorConfig("Dogs"));

        string source = FastDataGenerator.Generate(["Labrador", "German Shepherd", "Golden Retriever"], config, generator);
        Console.WriteLine(source);
    }
}

Why use FastData?

Generic data structures like arrays, hash tables, etc. are not optimized for your data. If you only need read-only access to a dataset, FastData can provide up to 14x better performance and less memory overhead.

Here is a classic example on just using an array:

string[] dogs = ["Labrador", "German Shepherd", "Golden Retriever"];

if (dogs.Contains("Beagle"))
    Console.WriteLine("It contains Beagle");

We know our data at compile-time, so why not let a program analyze it and come up with a better way?

FastData produces the following code:

internal static class Dogs
{
    public static bool Contains(string value)
    {
        if ((49280UL & (1UL << (value.Length - 1))) == 0)
           return false;

        switch (value)
        {
            case "Labrador":
            case "German Shepherd":
            case "Golden Retriever":
                return true;
            default:
                return false;
        }
    }

    public const int ItemCount = 3;
    public const int MinLength = 8;
    public const int MaxLength = 16;
}

Benefits of the generated code:

Early exit: A single-register bitmap of string lengths allows early termination for string lengths that cannot be in the set.
Efficient lookups: A switch-based data structure which is faster for small datasets.

As a bonus, we also get some metadata about the dataset as constants, which, when used, allows for better code generation by optimizing compiler.

Features

Data analysis: Optimizes the algorithms on the inherent properties of the dataset.
Many data structures: FastData automatically chooses the best data structure for your data.
Fast hashing: Strings are analyzed and the hash function is specially tailored to the data.
Zero dependencies: The generated code has no dependencies, making it easy to integrate into your project.
Minimal memory usage: The generated data structures are memory-efficient, using only the necessary amount of memory for the dataset.
High-perfromance: The generated data structures are generated without unnecessary branching or virtualization making the compiler produce optimal code.
Key/Value support: FastData can produce key/value lookup data structures

For more details about the data structures, see data structures.

FastData supports several output programming languages.

C#: FastData csharp <input-file>
C++: FastData cpp <input-file>
Rust: FastData rust <input-file>

Each output language has different settings. Run FastData <lang> --help to see the options.

Benchmarks

A benchmark of .NET's Array, HashSet<T> and FrozenSet<T> versus FastData's auto-generated data structure really illustrates the difference.

Membership queries

Method	Categories	Mean	Factor
Array	InSet	6.5198 ns	-
HashSet	InSet	6.2191 ns	1.05x
FrozenSet	InSet	1.6010 ns	4.07x
FastData	InSet	0.9378 ns	6.95x

Array	NotInSet	7.4015 ns	-
HashSet	NotInSet	4.6013 ns	1.61x
FrozenSet	NotInSet	1.5816 ns	4.68x
FastData	NotInSet	0.5284 ns	14.01x

Keyed queries

Method	Categories	Mean	Factor
Dictionary	InSet	6.890 ns	-
FrozenDictionary	InSet	1.484 ns	4.64x
FastData	InSet	1.375 ns	5.01x

DictionaryNF	NotInSet	5.832 ns	-
FrozenDictionaryNF	NotInSet	1.376 ns	4.24x
FastDataNF	NotInSet	1.349 ns	4.32x

FAQ

Why not use System.Collections.Frozen?

There are several reasons:

Frozen comes with considerable runtime overhead
Frozen is only available in .NET 8.0+
Frozen only provides a few of the optimizations provided in FastData
Frozen is only available in C#. FastData can produce data structures in many languages.

Does FastData use less memory than runtime structures?

Yes and no. For some data structures like Array, it uses the same amount of memory. For others, like HashTable, depending on the data, it can use considerably less memory.

Does it support case-insensitive lookups?

No, not yet.

Does it support custom equality comparers?

No, not yet.

Does it support key/value lookup?

Yes, you can specify key/value arrays as input data and FastData will generate a efficient key lookup function that returns a value.

Are there any best pratcies for using FastData?

Put the most often queried items first in the input data. It can speed up query speed for some data structures.
Enable string analysis when using string keys to produce a more efficient hash function.

Can I use it for dynamic data?

No, FastData is designed for static data only. It generates code at compile time, so the data must be known beforehand.

Name		Name	Last commit message	Last commit date
Latest commit History 352 Commits
.github		.github
Docs		Docs
Imports		Imports
Locals		Locals
Misc/PowerShell		Misc/PowerShell
Scripts		Scripts
Src		Src
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.globalconfig		.globalconfig
FastData.sln		FastData.sln
FastData.sln.DotSettings		FastData.sln.DotSettings
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

FastData

Description

Download

Getting started

Using the executable

Using the .NET CLI tool

Using the PowerShell module

Using the .NET Source Generator

Using the C# library

Why use FastData?

Features

Benchmarks

Membership queries

Keyed queries

FAQ

Why not use System.Collections.Frozen?

Does FastData use less memory than runtime structures?

Does it support case-insensitive lookups?

Does it support custom equality comparers?

Does it support key/value lookup?

Are there any best pratcies for using FastData?

Can I use it for dynamic data?

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Languages

Uh oh!

License

Genbox/FastData

Folders and files

Latest commit

History

Repository files navigation

FastData

Description

Download

Getting started

Using the executable

Using the .NET CLI tool

Using the PowerShell module

Using the .NET Source Generator

Using the C# library

Why use FastData?

Features

Benchmarks

Membership queries

Keyed queries

FAQ

Why not use System.Collections.Frozen?

Does FastData use less memory than runtime structures?

Does it support case-insensitive lookups?

Does it support custom equality comparers?

Does it support key/value lookup?

Are there any best pratcies for using FastData?

Can I use it for dynamic data?

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Languages