SpecificParquetRecordReaderBase — Hadoop RecordReader
SpecificParquetRecordReaderBase is the base Hadoop RecordReader for parquet format readers that directly materialize to T.
|
Note
|
RecordReader reads <key, value> pairs from an Hadoop InputSplit.
|
|
Note
|
VectorizedParquetRecordReader is the one and only SpecificParquetRecordReaderBase that directly materialize to Java Objects.
|
| Name | Description |
|---|---|
|
Spark schema Initialized when |
initialize Method
|
1 2 3 4 5 |
void initialize(InputSplit inputSplit, TaskAttemptContext taskAttemptContext) |
|
Note
|
initialize is part of RecordReader Contract to initialize a RecordReader.
|
initialize…FIXME
spark技术分享