eu.dicodeproject.analysis.hbase
Class HBaseLuceneTokenizerDriver

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.mahout.common.AbstractJob
          extended by eu.dicodeproject.analysis.hbase.HBaseLuceneTokenizerDriver
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class HBaseLuceneTokenizerDriver
extends org.apache.mahout.common.AbstractJob

Reads text from a configurable HBase table and column, tokenizes it with Lucene, and writes the resulting tokens to HDFS for further processing by the Mahout colloc driver.
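
As a rough illustration of the tokenization step only, the sketch below turns one piece of text into a list of lower-cased tokens. It is not this class's implementation: it uses only the JDK, with a regular-expression split standing in for a Lucene analyzer, and it touches neither HBase nor HDFS. The names TokenizeSketch and tokenize are hypothetical and do not appear in this API.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for the tokenization step; the real driver uses Lucene.
class TokenizeSketch {

    // Lower-cases the text and splits it on every run of characters
    // that are neither letters nor digits.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            // split() yields an empty leading piece when the text starts
            // with a separator, so empty pieces are skipped.
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Tokens are joined with spaces here purely for display.
        System.out.println(String.join(" ",
                tokenize("Reads text from HBase, tokenizes with Lucene.")));
        // prints "reads text from hbase tokenizes with lucene"
    }
}
```

A real Lucene analyzer may additionally remove stop words, apply stemming, or split on different boundaries, so its output can differ from this simplified version.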


Method Summary
static void main(String[] args)
           
 int run(String[] args)
           
 
Methods inherited from class org.apache.mahout.common.AbstractJob
addFlag, addInputOption, addOption, addOption, addOption, addOption, addOutputOption, buildOption, getInputPath, getOption, getOutputPath, hasOption, keyFor, maybePut, parseArguments, parseDirectories, prepareJob, shouldRunNextPhase
 
Methods inherited from class org.apache.hadoop.conf.Configured
getConf, setConf
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 
Methods inherited from interface org.apache.hadoop.conf.Configurable
getConf, setConf
 

Method Detail

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception

run

public int run(String[] args)
        throws ClassNotFoundException,
               IllegalAccessException,
               InstantiationException,
               InterruptedException,
               IOException
Throws:
ClassNotFoundException
IllegalAccessException
InstantiationException
InterruptedException
IOException


Copyright © 2011. All Rights Reserved.