A customized version of the HBase writer for Heritrix, mainly for testing purposes

apurtell/hbase-writer

Welcome to HBase-Writer README.

This document can also be found online here:
http://code.google.com/p/hbase-writer/wiki/README

Each HBase-Writer release supports a specific combination of
Heritrix and HBase versions. Please refer to
http://code.google.com/p/hbase-writer/wiki/VERSIONS
for the detailed compatibility list.


INTRODUCTION
============
This is a processor for Heritrix that writes fetched pages to HBase.

The layout of this contribution is modeled after Doug Judd's
heritrix-hadoop-dfs-processor, available from the Heritrix home page.

This software is licensed under the LGPL. See the accompanying LICENSE.txt document.


GETTING STARTED
===============
HBase-Writer now supports Heritrix 2 and 3. Please refer to the corresponding
README-*.txt files for specific instructions.


PING BACK
=========
Thanks to Questio for supporting the release of this project.
