Package org.apache.avro.mapred
Class AvroJob
java.lang.Object
org.apache.avro.mapred.AvroJob
Setters to configure jobs for Avro data.
-
Field Summary
Modifier and TypeFieldDescriptionstatic final String
The configuration key prefix for a binary output metadata.static final String
The configuration key for reflection-based input representation.static final String
The configuration key for a job's input schema.static final String
The configuration key for reflection-based map output representation.static final String
The configuration key for a job's intermediate schema.static final String
The configuration key for a job's output compression codec.static final String
The configuration key for a job's output schema.static final String
The configuration key prefix for a text output metadata. -
Method Summary
Modifier and TypeMethodDescriptionstatic GenericData
createDataModel
(Configuration conf) static GenericData
static GenericData
static Class
<? extends GenericData> Return the job's data model implementation class.static Schema
Return a job's map input schema.static Schema
Return a job's map output key schema.static Schema
Return a job's output key schema.static void
setCombinerClass
(JobConf job, Class<? extends AvroReducer> c) Configure a job's combiner implementation.static void
setDataModelClass
(JobConf job, Class<? extends GenericData> modelClass) Configure a job's data model implementation class.static void
setInputReflect
(JobConf job) Indicate that a job's input data should use reflect representation.static void
setInputSchema
(JobConf job, Schema s) Configure a job's map input schema.static void
Indicate that a job's input files are in SequenceFile format.static void
Indicate that a job's map output data should use reflect representation.static void
setMapOutputSchema
(JobConf job, Schema s) Configure a job's map output schema.static void
setMapperClass
(JobConf job, Class<? extends AvroMapper> c) Configure a job's mapper implementation.static void
setOutputCodec
(JobConf job, String codec) Configure a job's output compression codec.static void
setOutputMeta
(JobConf job, String key, byte[] value) Add metadata to job output files.static void
setOutputMeta
(JobConf job, String key, long value) Add metadata to job output files.static void
setOutputMeta
(JobConf job, String key, String value) Add metadata to job output files.static void
setOutputSchema
(JobConf job, Schema s) Configure a job's output schema.static void
setReducerClass
(JobConf job, Class<? extends AvroReducer> c) Configure a job's reducer implementation.static void
setReflect
(JobConf job) Indicate that all a job's data should use the reflect representation.
-
Field Details
-
INPUT_SCHEMA
The configuration key for a job's input schema.- See Also:
-
MAP_OUTPUT_SCHEMA
The configuration key for a job's intermediate schema.- See Also:
-
OUTPUT_SCHEMA
The configuration key for a job's output schema.- See Also:
-
OUTPUT_CODEC
The configuration key for a job's output compression codec. This takes one of the strings registered inCodecFactory
- See Also:
-
TEXT_PREFIX
The configuration key prefix for a text output metadata.- See Also:
-
BINARY_PREFIX
The configuration key prefix for a binary output metadata.- See Also:
-
INPUT_IS_REFLECT
The configuration key for reflection-based input representation.- See Also:
-
MAP_OUTPUT_IS_REFLECT
The configuration key for reflection-based map output representation.- See Also:
-
-
Method Details
-
setInputSchema
Configure a job's map input schema. -
getInputSchema
Return a job's map input schema. -
setMapOutputSchema
Configure a job's map output schema. The map output schema defaults to the output schema and need only be specified when it differs. Thus must be aPair
schema. -
getMapOutputSchema
Return a job's map output key schema. -
setOutputSchema
Configure a job's output schema. Unless this is a map-only job, this must be aPair
schema. -
setOutputCodec
Configure a job's output compression codec. -
setOutputMeta
Add metadata to job output files. -
setOutputMeta
Add metadata to job output files. -
setOutputMeta
Add metadata to job output files. -
setInputSequenceFile
Indicate that a job's input files are in SequenceFile format. -
setReflect
Indicate that all a job's data should use the reflect representation. -
setInputReflect
Indicate that a job's input data should use reflect representation. -
setMapOutputReflect
Indicate that a job's map output data should use reflect representation. -
getOutputSchema
Return a job's output key schema. -
setMapperClass
Configure a job's mapper implementation. -
setCombinerClass
Configure a job's combiner implementation. -
setReducerClass
Configure a job's reducer implementation. -
setDataModelClass
Configure a job's data model implementation class. -
getDataModelClass
Return the job's data model implementation class. -
createDataModel
-
createInputDataModel
-
createMapOutputDataModel
-