public class ParquetPipelineOutput extends FileSinkPipelineOutput
internalId, internalName, log, TIMESTAMP_FORMAT
SERIALIZED_CLASS_NAME, TYPE
XML_SERIALIZED_CLASS_NAME
Constructor and Description |
---|
ParquetPipelineOutput() |
Modifier and Type | Method and Description |
---|---|
DataWriter |
createDataWriter() |
ParquetPipelineOutput |
fromRecord(Record source)
Loads this instance's state from a record and returns
this (for fluid API call chaining). |
ParquetPipelineOutput |
fromXmlElement(Element pipelineOutputElement) |
void |
generateJavaCode(JavaCodeBuilder code) |
String |
getCacheFolder() |
CompressionCodecName |
getCompressionCodecName()
Indicates the compression used for writing (default UNCOMPRESSED).
|
Map<String,String> |
getConfiguration()
Returns the Parquet configuration parameters.
|
int |
getDefaultBigDecimalScale()
Returns the default scale used when writing BigDecimal values (default 5).
|
int |
getDefaultBigNumberPrecision()
Returns the default precision used when writing BigDecimal & BigInteger values (default 25).
|
FileSink |
getFileSink() |
String |
getName() |
int |
getRecordsPerCacheFile() |
RoundingMode |
getRoundingMode()
Indicates the rounding algorithm used for all BigDecimal values (default is
RoundingMode.HALF_UP ). |
MessageType |
getSchema()
Returns the schema used to write the file.
|
boolean |
isDefaultAdjustedToUTC()
Indicates if all datetime fields should be marked as AdjustedToUTC.
|
ParquetPipelineOutput |
setCacheFolder(String cacheFolder) |
ParquetPipelineOutput |
setCompressionCodecName(CompressionCodecName compressionCodecName)
Indicates the compression used for writing (default UNCOMPRESSED).
|
ParquetPipelineOutput |
setConfiguration(Map<String,String> configuration)
Sets the Parquet configuration parameters.
|
ParquetPipelineOutput |
setConfiguration(String key,
String value) |
ParquetPipelineOutput |
setDefaultAdjustedToUTC(boolean defaultAdjustedToUTC)
Indicates if all datetime fields should be marked as AdjustedToUTC.
|
ParquetPipelineOutput |
setDefaultBigDecimalScale(int defaultBigDecimalScale)
Sets the default scale used when writing BigDecimal values (default 5).
|
ParquetPipelineOutput |
setDefaultBigNumberPrecision(int defaultBigNumberPrecision)
Sets the default precision used when writing BigDecimal & BigInteger values (default 25).
|
ParquetPipelineOutput |
setFileSink(FileSink fileSink) |
ParquetPipelineOutput |
setMaxRecordsAnalyzed(Long maxRecordsAnalyzed)
Indicates how many records should be analyzed and cached to generate the Parquet schema if no schema was explicitly set
on this writer (default is 1000).
|
ParquetPipelineOutput |
setRecordsPerCacheFile(int recordsPerCacheFile) |
ParquetPipelineOutput |
setRoundingMode(RoundingMode roundingMode)
Indicates the rounding algorithm used for all BigDecimal values (default is
RoundingMode.HALF_UP ). |
ParquetPipelineOutput |
setSchema(MessageType schema)
Sets the schema used to write the file.
|
Record |
toRecord() |
Element |
toXmlElement(Document document) |
generateSaveCode
getNestedPipelineOutput, getRootPipelineOutput
addExceptionProperties, assertValid, assertValid, clone, exception, exception, exception, getInternalId, getInternalName, resetInternalId
equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
fromArrayValue, fromArrayValue, fromArrayValue, fromJson, fromJson, newInstanceFromRecord, newInstanceFromRecord, toArrayValue, toJson, toJson, toJson, toRecord
newInstanceFromJson
forEachGrandChildElement, fromXml, fromXml, getAttribute, getAttribute, getAttribute, getAttribute, getAttribute, getAttribute, getAttribute, getAttribute, getAttribute, getAttribute, getChildElement, getChildElements, newInstanceFromXml, newInstanceFromXml, setAttribute, toXml, toXml, toXml, toXml, toXml, writeXml, writeXml
addExceptionProperties, contributeExceptionProperties, contributeExceptionProperties
public String getName()
getName
in class FileSinkPipelineOutput
public DataWriter createDataWriter()
public FileSink getFileSink()
getFileSink
in class FileSinkPipelineOutput
public ParquetPipelineOutput setFileSink(FileSink fileSink)
setFileSink
in class FileSinkPipelineOutput
public CompressionCodecName getCompressionCodecName()
public Map<String,String> getConfiguration()
public ParquetPipelineOutput setConfiguration(Map<String,String> configuration)
public ParquetPipelineOutput setConfiguration(String key, String value)
public ParquetPipelineOutput setCompressionCodecName(CompressionCodecName compressionCodecName)
public boolean isDefaultAdjustedToUTC()
public ParquetPipelineOutput setDefaultAdjustedToUTC(boolean defaultAdjustedToUTC)
public RoundingMode getRoundingMode()
RoundingMode.HALF_UP
).public ParquetPipelineOutput setRoundingMode(RoundingMode roundingMode)
RoundingMode.HALF_UP
).public MessageType getSchema()
public ParquetPipelineOutput setSchema(MessageType schema)
public int getDefaultBigDecimalScale()
public ParquetPipelineOutput setDefaultBigDecimalScale(int defaultBigDecimalScale)
public int getDefaultBigNumberPrecision()
public ParquetPipelineOutput setDefaultBigNumberPrecision(int defaultBigNumberPrecision)
public ParquetPipelineOutput setMaxRecordsAnalyzed(Long maxRecordsAnalyzed)
null
will cause all records to be read and cached to determine the schema.null
or a high record count can significantly slow down processing and cause an OutOfMemoryError
. public String getCacheFolder()
public ParquetPipelineOutput setCacheFolder(String cacheFolder)
public int getRecordsPerCacheFile()
public ParquetPipelineOutput setRecordsPerCacheFile(int recordsPerCacheFile)
public void generateJavaCode(JavaCodeBuilder code)
public Record toRecord()
toRecord
in interface RecordSerializable
toRecord
in class Bean
public ParquetPipelineOutput fromRecord(Record source)
RecordSerializable
this
(for fluid API call chaining).
For fluid API call chaining, the overridden method should change the declared return type to its class.fromRecord
in interface RecordSerializable
fromRecord
in class PipelineOutput
public Element toXmlElement(Document document)
toXmlElement
in interface XmlSerializable
toXmlElement
in class PipelineOutput
public ParquetPipelineOutput fromXmlElement(Element pipelineOutputElement)
Copyright (c) 2006-2024 North Concepts Inc. All Rights Reserved.