HiveFileFormat (Spark 3.0.2 JavaDoc)

Object
- org.apache.spark.sql.hive.execution.HiveFileFormat

All Implemented Interfaces:

org.apache.spark.internal.Logging, org.apache.spark.sql.execution.datasources.FileFormat, DataSourceRegister
```
public class HiveFileFormat
extends Object
implements org.apache.spark.sql.execution.datasources.FileFormat, DataSourceRegister, org.apache.spark.internal.Logging
```
FileFormat for writing Hive tables.
TODO: implement the read logic.

Constructor Summary

Constructors
Constructor and Description
`HiveFileFormat()`
`HiveFileFormat(org.apache.spark.sql.hive.HiveShim.ShimFileSinkDesc fileSinkConf)`

Method Summary

Modifier and Type	Method and Description
`scala.Option<StructType>`	`inferSchema(SparkSession sparkSession, scala.collection.immutable.Map<String,String> options, scala.collection.Seq<org.apache.hadoop.fs.FileStatus> files)`
`org.apache.spark.sql.execution.datasources.OutputWriterFactory`	`prepareWrite(SparkSession sparkSession, org.apache.hadoop.mapreduce.Job job, scala.collection.immutable.Map<String,String> options, StructType dataSchema)`
`String`	`shortName()` The string that represents the format that this data source provider uses.

Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface org.apache.spark.sql.execution.datasources.FileFormat
$init$, buildReader, buildReaderWithPartitionValues, isSplitable, supportBatch, supportDataType, vectorTypes

Methods inherited from interface org.apache.spark.internal.Logging
$init$, initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, initLock, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log__$eq, org$apache$spark$internal$Logging$$log_, uninitialize

Constructor Detail

HiveFileFormat

public HiveFileFormat(org.apache.spark.sql.hive.HiveShim.ShimFileSinkDesc fileSinkConf)

HiveFileFormat
```
public HiveFileFormat()
```

Method Detail

inferSchema

public scala.Option<StructType> inferSchema(SparkSession sparkSession,
                                            scala.collection.immutable.Map<String,String> options,
                                            scala.collection.Seq<org.apache.hadoop.fs.FileStatus> files)

Specified by: inferSchema in interface org.apache.spark.sql.execution.datasources.FileFormat

prepareWrite

public org.apache.spark.sql.execution.datasources.OutputWriterFactory prepareWrite(SparkSession sparkSession,
                                                                                   org.apache.hadoop.mapreduce.Job job,
                                                                                   scala.collection.immutable.Map<String,String> options,
                                                                                   StructType dataSchema)

Specified by: prepareWrite in interface org.apache.spark.sql.execution.datasources.FileFormat

shortName
```
public String shortName()
```
Description copied from interface: DataSourceRegister
The string that represents the format that this data source provider uses. This is overridden by children to provide a nice alias for the data source. For example:
```
override def shortName(): String = "parquet"
```
Specified by: shortName in interface DataSourceRegister

Returns: (undocumented)
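
The alias mechanism described for shortName can be illustrated with a minimal, self-contained Java sketch. It does not use Spark: `Register`, `ParquetLikeFormat`, `HiveLikeFormat`, `index`, and `resolve` are hypothetical names introduced only for this illustration, and the `"hive"` alias and the case-insensitive lookup are assumptions of the sketch rather than behavior documented on this page.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class ShortNameSketch {

    // Hypothetical stand-in for DataSourceRegister; not the real Spark interface.
    interface Register {
        String shortName();
    }

    // Hypothetical providers, each overriding shortName() with its own alias.
    static final class ParquetLikeFormat implements Register {
        @Override public String shortName() { return "parquet"; }
    }

    static final class HiveLikeFormat implements Register {
        // Assumed alias, chosen for illustration only.
        @Override public String shortName() { return "hive"; }
    }

    // Index providers by lower-cased alias so lookups ignore case
    // (an assumption of this sketch).
    static Map<String, Register> index(List<Register> providers) {
        Map<String, Register> byAlias = new LinkedHashMap<>();
        for (Register p : providers) {
            String alias = p.shortName().toLowerCase(Locale.ROOT);
            if (byAlias.putIfAbsent(alias, p) != null) {
                throw new IllegalStateException("Duplicate alias: " + alias);
            }
        }
        return byAlias;
    }

    // Resolve an alias to its provider, failing on unknown names.
    static Register resolve(Map<String, Register> byAlias, String name) {
        Register p = byAlias.get(name.toLowerCase(Locale.ROOT));
        if (p == null) {
            throw new IllegalArgumentException("Unknown data source: " + name);
        }
        return p;
    }

    public static void main(String[] args) {
        Map<String, Register> byAlias =
            index(Arrays.asList(new ParquetLikeFormat(), new HiveLikeFormat()));
        // Look up a provider by alias and report which class handled it.
        System.out.println(resolve(byAlias, "HIVE").getClass().getSimpleName());
    }
}
```

The point of the sketch is that callers refer to a format by a short string instead of a fully qualified class name, and each provider supplies that string by overriding a single method.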
