DataFrame class
A distributed collection of data grouped into named columns.
A DataFrame is equivalent to a relational table in Spark SQL, and can be created using various functions in SparkSession.
A DataFrame should not be directly created using the constructor.
Supports Spark Connect
Properties
| Property | Description |
|---|---|
| `sparkSession` | Returns the SparkSession that created this DataFrame. |
| `rdd` | Returns the content as an RDD of Row (Classic mode only). |
| `na` | Returns a DataFrameNaFunctions for handling missing values. |
| `stat` | Returns a DataFrameStatFunctions for statistical functions. |
| `write` | Interface for saving the content of a non-streaming DataFrame to external storage. |
| `writeStream` | Interface for saving the content of a streaming DataFrame to external storage. |
| `schema` | Returns the schema of this DataFrame as a StructType. |
| `dtypes` | Returns all column names and their data types as a list of tuples. |
| `columns` | Returns the names of all columns in the DataFrame as a list. |
| `storageLevel` | Returns the DataFrame's current storage level. |