Transformation of NoSQL-Data (JSON) in relational databases

This is a prototype for my Bachelor thesis "Data-Integration pipeline for the transformation of NoSQL-Data (JSON) in relational databases" at the University of Rostock. It allows the transformation of a document collection from MongoDB into a MySQL database.

The theory behind it can be read in the (German) Bachelor thesis itself (see paper.pdf).

TL;DR; version of the concept:

Objects become relations, with properties being attributes
Arrays become relations, with each element being a single tuple
Nested objects/arrays lead to new relations
Relations can potentially be inlined or merged into each other

Requirements

This project has four requirements to your system. Version numbers represent what I used. Older version might work, but aren't tested.All further dependencies are managed through the Gradle build file as listed in the following section.

Java 1.7
Gradle 2.5
MongoDB 3.2.6
MySQL 5.5.5

Dependencies

Installation

The installation procress should be straight forward.

git clone https://https://github.com/fbeuster/SchemaTransformation.git
Open folder in IntelliJ and follow the import dialog
Make project
Configure your environment (see below).
Run Main class

At least in theory. This is a prototype so everything can happen. Except world domination, that's one thing this code can't do for you.

Configuration

A lot of settings can be changed, including database names and credentials, along with a lot of transform related settings. You can find a full list of the settings in the defaults.yaml. Do yourself a favor and DO NOT change settings there. If you need to make changes, create a config.yaml for it and place it alongside the defaults.yaml.

Setting up your environment

You should create the config.yaml as described above. In there you need to configure your MongoDB instance, as well as your MySQL instance. The following settings are needed for this:

mongodb:
  database: mongodb_database_name
  collection: collection_name
sql:
  database: mysql_database_name
  host: host_name
  password: your_password
  port: port_number
  user: your_username

Please note that both, mongodb and sql, are top level entries in the YAML file.

Note

As said earlier, this is a prototype. While it worked fine with my test data sets, I can't guarantee that the program is free of bugs. Also there're lots of open ToDo's and the code is a long way from being perfect and optimized.

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
gradle/wrapper		gradle/wrapper
src/main/java		src/main/java
.gitignore		.gitignore
README.md		README.md
build.gradle		build.gradle
defaults.yaml		defaults.yaml
gradlew		gradlew
gradlew.bat		gradlew.bat
paper.pdf		paper.pdf
settings.gradle		settings.gradle

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transformation of NoSQL-Data (JSON) in relational databases

Requirements

Dependencies

Installation

Configuration

Setting up your environment

Note

About

Releases

Packages

Languages

fbeuster/SchemaTransformation

Folders and files

Latest commit

History

Repository files navigation

Transformation of NoSQL-Data (JSON) in relational databases

Requirements

Dependencies

Installation

Configuration

Setting up your environment

Note

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages