Mini Database Engine 🗄️

A lightweight, file-based relational database engine implemented in C++ with SQL-like query support. It supports CRUD operations on single tables through an interactive SQL command-line interface and stores each table in its own human-readable file.

Educational Project: Built as part of the ITI Object-Oriented Programming course to demonstrate database management system concepts, design patterns, and advanced C++ programming techniques.

🌟 Features

SQL-like Command Interface: Interactive CLI with familiar SQL syntax
Data Types Support:
- Nullable (NullType)
- Integer (INT)
- Double precision floating point (DOUBLE)
- Single character (CHAR)
- Variable-length text (VARCHAR, TEXT)
- Date (DATE)
Core SQL Operations:
- CREATE TABLE - Define tables with columns and constraints
- INSERT INTO - Add records to tables
- SELECT - Query data with filtering and conditions
- UPDATE - Modify existing records
- DELETE - Remove records from tables
Advanced Features:
- Primary key constraints (single and composite)
- WHERE clause filtering with comparison operators (=, !=, <, >, <=, >=)
- Logical operators (AND, OR)
- Persistent file-based storage
- NULL value support
- Automatic table metadata management

🎯 Project Highlights

💡 Technical Excellence

Modern C++ Implementation: Leverages C++17 features including std::variant, std::filesystem, and lambda expressions for clean, type-safe code
Zero External Dependencies: Completely self-contained using only STL, making it highly portable and easy to deploy
Robust Parser Design: Hand-crafted SQL parser with comprehensive syntax validation and error handling
Memory Efficient: Smart use of move semantics and RAII patterns for optimal resource management
Modular Architecture: Clean separation of concerns with dedicated parsers, models, and utilities

🏆 Key Achievements

Full SQL Lifecycle: Complete CRUD operations (CREATE TABLE, INSERT, SELECT, UPDATE, DELETE) using SQL-like syntax
Type System: Rich data type support including custom Date class with proper comparison operators
Query Engine: Sophisticated WHERE clause evaluation with complex boolean logic
Persistence Layer: Reliable file-based storage with automatic serialization/deserialization
User Experience: Interactive CLI with help system and intuitive error messages
Data Integrity: Primary key enforcement and NULL value handling

🎓 Educational Value

Demonstrates advanced OOP concepts (encapsulation, abstraction, polymorphism)
Showcases design patterns (Parser pattern, Catalog/Registry pattern)
Implements data structures (variant types, AST)
Explores file I/O and serialization techniques
Practices string parsing and tokenization algorithms
Applies software engineering principles (SOLID, DRY)

📋 Prerequisites

A C++ compiler with C++17 support or later (GCC, Clang, or MSVC)
The C++ standard library (no other dependencies)

🚀 Getting Started

Installation

Clone the repository

git clone https://github.com/yourusername/mini_db_engine.git
cd mini_db_engine

Compile the project

Using g++ (Linux/macOS/MinGW)

g++ -std=c++17 src/main.cpp -o mini_db

Using clang++

clang++ -std=c++17 src/main.cpp -o mini_db

Using MSVC (Windows)

cl /EHsc /std:c++17 src/main.cpp /Fe:mini_db.exe

Using CMake (Recommended)

mkdir build && cd build
cmake ..
cmake --build .

Running the Database

Linux/macOS:

./mini_db

Windows:

mini_db.exe

You'll be greeted with an interactive SQL prompt:

Welcome to mini database engine
Type 'help' for commands
SQL>
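For readers curious how such a prompt loop can be structured, here is a minimal sketch. It is not the project's `main.cpp`: the function names `trim` and `runRepl` are hypothetical, the sketch assumes one statement per line, and statement execution is replaced by a counter. It handles only the `help`/`?` and `exit`/`quit` commands listed in the Commands Reference.

```cpp
#include <cassert>
#include <iostream>
#include <sstream>
#include <string>

// Strips leading and trailing whitespace (hypothetical helper).
inline std::string trim(const std::string& s) {
    const char* ws = " \t\r\n";
    const auto first = s.find_first_not_of(ws);
    if (first == std::string::npos) return "";
    const auto last = s.find_last_not_of(ws);
    return s.substr(first, last - first + 1);
}

// Reads commands until exit/quit or end of input.
// Returns how many lines would have been handed to the SQL parsers.
inline int runRepl(std::istream& in, std::ostream& out) {
    out << "Welcome to mini database engine\n"
        << "Type 'help' for commands\n";
    int statements = 0;
    std::string line;
    while (out << "SQL> " && std::getline(in, line)) {
        const std::string cmd = trim(line);
        if (cmd.empty()) continue;                    // ignore blank lines
        if (cmd == "exit" || cmd == "quit") break;
        if (cmd == "help" || cmd == "?") {
            out << "Statements: CREATE TABLE, INSERT INTO, SELECT, "
                   "UPDATE, DELETE FROM\n";
            continue;
        }
        // Placeholder: a real engine would parse and execute cmd here.
        ++statements;
    }
    return statements;
}
```

Taking the streams as parameters means the loop can be driven by `std::cin`/`std::cout` in the real program and by string streams in a test.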

Quick Test

Try these commands to get started:

CREATE TABLE students (id INT PRIMARY KEY, name VARCHAR(50), grade DOUBLE);
INSERT INTO students (id, name, grade) VALUES (1, 'Alice', 95.5);
SELECT * FROM students;

📖 Usage Examples

Creating a Table

CREATE TABLE users (
    id INT PRIMARY KEY,
    name VARCHAR(100),
    email VARCHAR(255),
    age INT,
    created_at DATE
);

Composite Primary Key

CREATE TABLE enrollments (
    student_id INT,
    course_id INT,
    grade DOUBLE,
    PRIMARY KEY (student_id, course_id)
);

Inserting Data

INSERT INTO users (id, name, email, age, created_at) 
VALUES (1, 'John Doe', 'john@example.com', 30, '2024-01-15');

INSERT INTO users (id, name, email, age) 
VALUES (2, 'Jane Smith', 'jane@example.com', 28);

Querying Data

-- Select all records
SELECT * FROM users;

-- Select specific columns
SELECT name, email FROM users;

-- Filtering with WHERE clause
SELECT * FROM users WHERE age > 25;

-- Complex conditions
SELECT name, email FROM users 
WHERE age >= 25 AND age <= 35;

-- Multiple conditions
SELECT * FROM users 
WHERE age > 20 OR name = 'John Doe';

Updating Records

UPDATE users SET age = 31 WHERE id = 1;

UPDATE users SET email = 'newemail@example.com', age = 29 
WHERE name = 'Jane Smith';

Deleting Records

DELETE FROM users WHERE id = 2;

DELETE FROM users WHERE age < 18;

🗂️ Project Structure

mini_db_engine/
├── README.md                  # Project documentation
├── LICENSE                    # License file
├── src/
│   ├── main.cpp              # Entry point and CLI loop
│   ├── setup_test_data.cpp   # Test data setup utilities
│   └── README.md             # Source documentation
├── include/
│   ├── models.cpp            # Core data structures (Table, Column, Row, etc.)
│   ├── Helper.cpp            # Utility functions for parsing and file I/O
│   ├── CreateParse.cpp       # CREATE TABLE parser
│   ├── InsertParser.cpp      # INSERT INTO parser
│   ├── SelectParser.cpp      # SELECT query parser
│   ├── UpdateParser.cpp      # UPDATE statement parser
│   ├── DeleteParser.cpp      # DELETE statement parser
│   └── README.md             # Include documentation
└── data/
    └── README.md             # Data directory documentation

🏗️ Architecture

Design Principles

Separation of Concerns: Each parser handles one SQL command type
Object-Oriented Design: Leverages classes, inheritance, and polymorphism
Type Safety: Uses C++17 std::variant for type-safe value storage
File-Based Persistence: Simple, portable storage without external dependencies

Core Components

Models (models.cpp)
- Value: Variant type supporting multiple data types (Int, Double, Char, Text, Date, Null)
- Column: Table column definition with name, type, and constraints
- Row: Table row representation as a vector of values
- Table: Complete table structure with metadata and rows
- Catalog: Database-wide table registry and management
- AST: Abstract syntax tree for parsed queries
Parsers
- CreateParser: Handles table creation with column definitions and constraints
- InsertParser: Processes INSERT statements with value validation
- SelectParser: Executes SELECT queries with WHERE clause evaluation
- UpdateParser: Modifies existing records based on conditions
- DeleteParser: Removes records matching WHERE criteria
- Each parser validates syntax, converts queries to AST, and executes operations
Helper Utilities (Helper.cpp)
- String manipulation (trim, split, case conversion)
- SQL parsing (parentheses matching, quote handling)
- File I/O for persistent storage
- Table serialization/deserialization
- Query validation and formatting
- Help text and error message generation
Storage Engine
- File-based persistence in data/ directory
- Each table stored in a separate text file
- Automatic loading of existing tables on startup
- Human-readable format for easy debugging
- Atomic writes to prevent data corruption
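The list above mentions atomic writes without describing the mechanism. One common technique, sketched here as an assumption rather than as the project's actual code, is to write the new contents to a temporary file and then rename it over the target. The function name `writeFileAtomically` and the `.tmp` suffix are hypothetical. On POSIX systems the rename replaces the target atomically; the sketch does not force data to disk, so it guards against partially written files but not against power loss.

```cpp
#include <cassert>
#include <filesystem>
#include <fstream>
#include <string>
#include <system_error>

namespace fs = std::filesystem;

// Writes contents to "<target>.tmp", then renames it over target.
// Returns false (leaving any existing target untouched) on failure.
inline bool writeFileAtomically(const fs::path& target,
                                const std::string& contents) {
    fs::path tmp = target;
    tmp += ".tmp";
    {
        std::ofstream out(tmp, std::ios::binary | std::ios::trunc);
        if (!out) return false;
        out << contents;
        out.flush();
        if (!out) return false;
    }  // the stream is closed here, before the rename

    std::error_code ec;
    fs::rename(tmp, target, ec);
    if (ec) {
        fs::remove(tmp, ec);  // best-effort cleanup of the temporary file
        return false;
    }
    return true;
}
```

With this approach a reader of the table file sees either the old contents or the new contents, never a half-written mix.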

🔧 Commands Reference

| Command | Description |
| --- | --- |
| `help` or `?` | Display available commands and syntax |
| `exit` or `quit` | Exit the database engine |
| `CREATE TABLE ...` | Create a new table |
| `INSERT INTO ...` | Insert data into a table |
| `SELECT ...` | Query data from a table |
| `UPDATE ...` | Update existing records |
| `DELETE FROM ...` | Delete records from a table |

📊 Supported Data Types

| Type | Description | Example |
| --- | --- | --- |
| `NullType` | Nullable values | `NULL` |
| `INT` | Integer numbers | `42`, `-100` |
| `DOUBLE` | Floating-point numbers | `3.14`, `-0.5` |
| `CHAR` | Single character | `'A'`, `'x'` |
| `VARCHAR(n)` | Variable-length string (max n chars) | `'Hello'` |
| `TEXT` | Unlimited text | `'Long text...'` |
| `DATE` | Date in YYYY-MM-DD format | `'2024-01-15'` |
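The types above map naturally onto a `std::variant`, which is how the Architecture section describes the `Value` type. The following is a minimal sketch of that idea, not the definitions in `models.cpp`: the `Date` layout, the use of `std::monostate` for NULL, and the `typeName` helper are all assumptions made for illustration.

```cpp
#include <cassert>
#include <string>
#include <tuple>
#include <variant>

// Hypothetical date type; comparing (year, month, day) tuples gives
// chronological ordering.
struct Date {
    int year;
    int month;
    int day;
};

inline bool operator==(const Date& a, const Date& b) {
    return std::tie(a.year, a.month, a.day) == std::tie(b.year, b.month, b.day);
}
inline bool operator<(const Date& a, const Date& b) {
    return std::tie(a.year, a.month, a.day) < std::tie(b.year, b.month, b.day);
}

// std::monostate plays the role of NULL in this sketch.
using Value = std::variant<std::monostate, int, double, char, std::string, Date>;

// Returns a type label for whichever alternative is currently held.
inline std::string typeName(const Value& v) {
    struct Visitor {
        std::string operator()(std::monostate) const { return "NULL"; }
        std::string operator()(int) const { return "INT"; }
        std::string operator()(double) const { return "DOUBLE"; }
        std::string operator()(char) const { return "CHAR"; }
        std::string operator()(const std::string&) const { return "TEXT"; }
        std::string operator()(const Date&) const { return "DATE"; }
    };
    return std::visit(Visitor{}, v);
}
```

Because `std::visit` requires the visitor to handle every alternative, forgetting a type is a compile-time error rather than a runtime surprise.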

🔍 WHERE Clause Operators

Comparison: =, !=, <, >, <=, >=
Logical: AND, OR
Value Types: Numbers, strings (quoted), dates (quoted), NULL
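As a rough illustration of how these operators could be evaluated against a row, the sketch below represents a WHERE clause as OR-connected groups of AND-connected comparisons, which matches the usual SQL precedence of AND over OR. Everything here is hypothetical (`Condition`, `AndGroup`, `compare`, `matches`). It also assumes that comparisons involving NULL or mismatched types are false, which may differ from the engine's real behaviour.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <variant>
#include <vector>

using Value = std::variant<std::monostate, int, double, std::string>;
using Row = std::map<std::string, Value>;  // column name -> value

enum class Op { Eq, Ne, Lt, Gt, Le, Ge };

struct Condition {
    std::string column;
    Op op;
    Value literal;
};

// Applies one comparison operator to two values.
inline bool compare(const Value& lhs, Op op, const Value& rhs) {
    // Assumption: NULL never satisfies a comparison.
    if (std::holds_alternative<std::monostate>(lhs) ||
        std::holds_alternative<std::monostate>(rhs)) return false;
    // Assumption: no implicit conversion between types.
    if (lhs.index() != rhs.index()) return false;
    switch (op) {
        case Op::Eq: return lhs == rhs;
        case Op::Ne: return lhs != rhs;
        case Op::Lt: return lhs < rhs;
        case Op::Gt: return lhs > rhs;
        case Op::Le: return lhs <= rhs;
        case Op::Ge: return lhs >= rhs;
    }
    return false;
}

using AndGroup = std::vector<Condition>;

// True if at least one group has all of its conditions satisfied.
// An empty list of groups matches nothing; callers with no WHERE
// clause should skip this check entirely.
inline bool matches(const Row& row, const std::vector<AndGroup>& orGroups) {
    for (const AndGroup& group : orGroups) {
        bool all = true;
        for (const Condition& c : group) {
            const auto it = row.find(c.column);
            if (it == row.end() || !compare(it->second, c.op, c.literal)) {
                all = false;
                break;
            }
        }
        if (all) return true;
    }
    return false;
}
```

Under this representation, `WHERE age >= 25 AND age <= 35` becomes a single group with two conditions, while `WHERE age > 20 OR name = 'John Doe'` becomes two groups with one condition each.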

🛡️ Constraints

Primary Key: Ensures unique identification of records
- Single column: id INT PRIMARY KEY
- Composite: PRIMARY KEY (col1, col2)
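A minimal sketch of how primary key uniqueness could be enforced on insert is shown below. It uses a linear scan, in keeping with the project's stated preference for clarity over performance, but the `Table` layout and the function names `hasDuplicateKey` and `insertRow` are hypothetical. Rejecting NULL in key columns is also an assumption of this sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <variant>
#include <vector>

using Value = std::variant<std::monostate, int, double, std::string>;
using Row = std::vector<Value>;

struct Table {
    std::vector<std::size_t> primaryKey;  // indices of the key columns
    std::vector<Row> rows;
};

// True if an existing row agrees with candidate on every key column.
inline bool hasDuplicateKey(const Table& table, const Row& candidate) {
    if (table.primaryKey.empty()) return false;  // no key, nothing to enforce
    for (const Row& existing : table.rows) {
        bool sameKey = true;
        for (std::size_t col : table.primaryKey) {
            if (existing[col] != candidate[col]) {
                sameKey = false;
                break;
            }
        }
        if (sameKey) return true;
    }
    return false;
}

// Appends the row unless it would violate the primary key.
inline bool insertRow(Table& table, const Row& row) {
    for (std::size_t col : table.primaryKey) {
        // Assumption: key columns must be present and non-NULL.
        if (col >= row.size() ||
            std::holds_alternative<std::monostate>(row[col])) return false;
    }
    if (hasDuplicateKey(table, row)) return false;
    table.rows.push_back(row);
    return true;
}
```

For a composite key such as `PRIMARY KEY (student_id, course_id)`, two rows conflict only when they match on both columns, so the same student can appear in several courses.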

💾 Data Persistence

All data is automatically persisted to files in the data/ directory:

Automatic Saving: Tables are saved after each modification (INSERT, UPDATE, DELETE)
Auto-Loading: Existing tables are loaded automatically on startup
File-Per-Table: Each table is stored in a separate file for isolation
Human-Readable Format: Data files can be inspected and manually edited if needed
Metadata Storage: Column definitions and constraints are stored with the data

Storage Format Example

TableName: users
Columns: id:INT:PK, name:VARCHAR(100), email:VARCHAR(255), age:INT
Rows:
1, "John Doe", "john@example.com", 30
2, "Jane Smith", "jane@example.com", 28
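To make the format above concrete, here is a small sketch that produces the same layout from an in-memory table. It assumes the example is representative of the real files. The `Column` and `Table` structs and the functions `formatValue` and `serialize` are hypothetical, only INT and text values are handled, and embedded quotes in strings are not escaped.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>
#include <variant>
#include <vector>

using Value = std::variant<std::monostate, int, std::string>;

struct Column {
    std::string name;
    std::string type;       // e.g. "INT" or "VARCHAR(100)"
    bool primaryKey = false;
};

struct Table {
    std::string name;
    std::vector<Column> columns;
    std::vector<std::vector<Value>> rows;
};

// Renders one value: integers bare, strings in double quotes, NULL as NULL.
inline std::string formatValue(const Value& v) {
    if (std::holds_alternative<int>(v)) return std::to_string(std::get<int>(v));
    if (std::holds_alternative<std::string>(v)) {
        return "\"" + std::get<std::string>(v) + "\"";  // no escaping in this sketch
    }
    return "NULL";
}

// Produces the TableName / Columns / Rows layout shown above.
inline std::string serialize(const Table& table) {
    std::ostringstream out;
    out << "TableName: " << table.name << "\n";
    out << "Columns: ";
    for (std::size_t i = 0; i < table.columns.size(); ++i) {
        const Column& c = table.columns[i];
        if (i > 0) out << ", ";
        out << c.name << ":" << c.type;
        if (c.primaryKey) out << ":PK";
    }
    out << "\nRows:\n";
    for (const auto& row : table.rows) {
        for (std::size_t i = 0; i < row.size(); ++i) {
            if (i > 0) out << ", ";
            out << formatValue(row[i]);
        }
        out << "\n";
    }
    return out.str();
}
```

Keeping the column definitions in the same file as the rows is what lets a table be reloaded on startup without a separate schema file.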

🤝 Contributing

Contributions are welcome! Here are some areas for improvement:

Add support for JOIN operations
Implement indexes for faster queries
Add more constraints (FOREIGN KEY, UNIQUE, NOT NULL)
Support for aggregate functions (COUNT, SUM, AVG, etc.)
Transaction support
Multi-user concurrency control
Query optimization

📝 License

This project is licensed under the terms specified in the LICENSE file.

👥 Authors

Developed as part of the ITI Object-Oriented Programming course.

Ali Elshorpagi

Mohamed Wahba

Hazem Ibrahim

🐛 Known Limitations

As an educational project focused on core database concepts, certain advanced features are intentionally simplified:

Single-User Mode: No concurrency control; the engine is designed for one user at a time
Immediate Persistence: Every change is written to disk immediately; there are no transactions or rollback
Schema Simplicity: Table structure is fixed once created
Query Scope: Single-table operations only; JOINs and aggregate functions are not supported
Optimization: No indexes or query optimization; correctness is prioritized over performance

These design decisions allow the codebase to remain readable and educational while demonstrating fundamental database concepts effectively.

Resources

SQL Tutorial - Learn SQL basics
Database Design - Database design principles
C++ Reference - C++ language reference

💬 Support

If you encounter any issues or have questions:

Check the Known Limitations section
Review the Usage Examples
Open an issue on GitHub with:
- Your SQL command
- Expected behavior
- Actual behavior
- Error messages (if any)

Built with ❤️ and passion for excellence

A testament to the power of object-oriented design and modern C++ in building complex systems

Perfect for students, educators, and developers learning database internals
