From cf32ef1043cfb1d3fa72f8e4bc371b52e52d6e25 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Karri=20Niemel=C3=A4?= Date: Wed, 15 Feb 2017 22:16:44 +0200 Subject: [PATCH] Update README.md Adding embulk data loader --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index 9da7713..2490272 100644 --- a/README.md +++ b/README.md @@ -306,6 +306,7 @@ You can read more about this distinction on Prof. Daniel Abadi's blog: [Distingu * [Apache Kafka](http://kafka.apache.org/) - distributed publish-subscribe messaging system. * [Apache Sqoop](http://sqoop.apache.org/) - tool to transfer data between Hadoop and a structured datastore. * [Cloudera Morphlines](https://github.com/cloudera/cdk/tree/master/cdk-morphlines) - framework that help ETL to Solr, HBase and HDFS. +* [Embulk](http://embulk.org) - open-source bulk data loader that helps data transfer between various databases, storages, file formats, and cloud services. * [Facebook Scribe](https://github.com/facebookarchive/scribe) - streamed log data aggregator. * [Fluentd](http://www.fluentd.org) - tool to collect events and logs. * [Google Photon](http://research.google.com/pubs/pub41318.html) - geographically distributed system for joining multiple continuously flowing streams of data in real-time with high scalability and low latency.