public class AvroAsTextInputFormat extends FileInputFormat<Text,Text>
InputFormat
for Avro data files, which
converts each datum to string form in the input key. The input value is
always empty. The string representation is
JSON.
This InputFormat
is useful for applications
that wish to process Avro data using tools like MapReduce Streaming.
By default, when pointed at a directory, this will silently skip over any
files in it that do not have .avro extension. To instead include all files,
set the avro.mapred.ignore.inputs.without.extension property to false.
FileInputFormat.Counter
INPUT_DIR_RECURSIVE, LOG, NUM_INPUT_FILES
Constructor and Description |
---|
AvroAsTextInputFormat() |
Modifier and Type | Method and Description |
---|---|
RecordReader<Text,Text> |
getRecordReader(InputSplit split,
JobConf job,
Reporter reporter) |
protected FileStatus[] |
listStatus(JobConf job) |
addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getInputPathFilter, getInputPaths, getSplitHosts, getSplits, isSplitable, makeSplit, makeSplit, setInputPathFilter, setInputPaths, setInputPaths, setMinSplitSize
protected FileStatus[] listStatus(JobConf job) throws IOException
listStatus
in class FileInputFormat<Text,Text>
IOException
public RecordReader<Text,Text> getRecordReader(InputSplit split, JobConf job, Reporter reporter) throws IOException
getRecordReader
in interface InputFormat<Text,Text>
getRecordReader
in class FileInputFormat<Text,Text>
IOException
Copyright © 2009–2017 The Apache Software Foundation. All rights reserved.