This is a MapReduce job that generates a large data set and stores it in HDFS.
Inherits Configured and Tool.
Classes
class AmazonProducerInputFormat
class AmazonProducerOutputFormat
class AmazonProducerRecordReader
class AmazonProducerRecordWriter
class AmazonProducerSplit
class Map
Public Member Functions
int run (String[] args) throws Exception
    main function
Static Public Member Functions
static void main (String[] args) throws Exception
This is a MapReduce job that generates a large data set and stores it in HDFS.
The generated data is read later by a counting job. This class was created and used for experiments.
Usage:
hadoop jar <JARFILE> mapred.Producer -Dquantity=<quantity> -Dparamfile=<paramfile> <genoutput>
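The usage line above mixes two kinds of arguments: `-D<name>=<value>` properties and a positional output path. Because the class inherits Configured and Tool, it is likely (though this page does not confirm it) that the job is launched through Hadoop's ToolRunner, whose generic option handling moves the `-D` properties into the job configuration and passes only the remaining arguments to run(). The plain-Java sketch below mimics that split without depending on Hadoop. The class `GenericOptionsSketch`, its `parse` method, and the sample values (`1000`, `params.txt`, `/user/demo/genoutput`) are hypothetical and are not part of mapred.Producer or of Hadoop.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper: a simplified stand-in for Hadoop's generic option
// handling, NOT the actual Hadoop implementation.
class GenericOptionsSketch {

    /** Result of splitting a command line into -D properties and leftover arguments. */
    static final class Parsed {
        final Map<String, String> properties = new LinkedHashMap<>();
        final List<String> remaining = new ArrayList<>();
    }

    /** Separates "-Dkey=value" / "-D key=value" options from positional arguments. */
    static Parsed parse(String[] args) {
        Parsed parsed = new Parsed();
        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            String pair = null;
            if (arg.equals("-D") && i + 1 < args.length) {
                pair = args[++i];              // "-D key=value" form
            } else if (arg.startsWith("-D") && arg.length() > 2) {
                pair = arg.substring(2);       // "-Dkey=value" form
            }
            if (pair == null) {
                parsed.remaining.add(arg);     // positional argument, e.g. <genoutput>
                continue;
            }
            int eq = pair.indexOf('=');
            if (eq > 0) {
                parsed.properties.put(pair.substring(0, eq), pair.substring(eq + 1));
            }
            // Assumption: pairs without "key=" are silently dropped in this sketch.
        }
        return parsed;
    }

    public static void main(String[] args) {
        // Sample values are made up for illustration.
        Parsed p = parse(new String[] {
            "-Dquantity=1000", "-Dparamfile=params.txt", "/user/demo/genoutput"
        });
        System.out.println("properties: " + p.properties);
        System.out.println("remaining:  " + p.remaining);
    }
}
```

Under this reading, run() would see only `<genoutput>` in its argument array and would look up `quantity` and `paramfile` from the configuration rather than from `args`.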
For details on what data is generated and how records are formatted, see common.Producer.