GitHub - wangli1426/IndexingTopology: Real time indexing and query-processing using Apache Storm

Distributed Append-Only Store

This project is a distributed append-only store designed for high-throughput data ingestion and real-time key range and temporal queries. It is implemented as an application-level topology and runs on top of Apache Storm.

Data & Query Model

Data tuples continuouly arrive at the system. Each tuple consists of a key, a timestamp and a payload. A payload is a collection of primitives and/or user-defined objects. We assume the timestamps of the tuples are roughly in increasing order. A user query contains a key range constraint and a temporal range constraint.

Requirement

JDK 8 or higher;
maven;
Apache Storm (not needed in local mode);
HDFS (not needed in local mode);

Quick Start

1. Local mode

Running our system in local model is the easiest way to get a feeling of the system. Local model is typically used internally to debug the topology. We highly encourage the users to run our system in cluster model to fully exploit the performance.

To run our systemin local mode, you should:

Make sure that <scope>provided</scope> is commented in pom.xml file.
download the source codes

$ git clone https://github.com/ADSC-Cloud/append-only-store

Change the configures in config/TopologyConfig accordingly.

set HDFSFlag = false to use local file system.

Create a local folder to store the data chunks generated by the system and set dataDir properly. Make sure that the local folder is writable.

Compile the source code

$ mvn clean install -DskipTests

Launch the system

$ mvn exec:java -Dexec.mainClass=indexingTopology.KingBaseTopology

2. Cluster model

Make sure that <scope>provided</scope> is uncommented in pom.xml file.
Deploy Apache Storm and make sure that the nimbus and supervisors are running properly.
Setup HDFS
Change the configures in config/TopologyConfig accordingly.

set HDFSFlag = false to use HDFS as the storage system.

Create a folder for the system in HDFS and set dataDir in the config file properly. Make sure that the folder is writable.

Compile the source code

mvn clean install -DskipTests

Submit the topology to Apache

$ storm jar SOURCE_CODE_PATH/target/IndexingTopology-1.0-SNAPSHOT.jar indexingTopologyNormalDistributionTopology append-only-store

Name		Name	Last commit message	Last commit date
Latest commit History 231 Commits
src		src
.gitignore		.gitignore
.travis.yml		.travis.yml
README.md		README.md
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Distributed Append-Only Store

Data & Query Model

Requirement

Quick Start

1. Local mode

2. Cluster model

About

Releases

Packages

Languages

wangli1426/IndexingTopology

Folders and files

Latest commit

History

Repository files navigation

Distributed Append-Only Store

Data & Query Model

Requirement

Quick Start

1. Local mode

2. Cluster model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages