object ParquetMetadataPath extends Logging with Serializable
- ParquetMetadataPath
- Serializable
- Serializable
- Logging
- AnyRef
- Any
Value Members
-
def
convertMetaDataLocationToPath(mdLocation: String): Path
Wrapper for stringToPath that returns a backend-specific error message
- mdLocation
string to convert to org.apache.hadoop.fs.Path
- returns
metadata hadoop fs path
- Exceptions thrown
ParquetMetaDataStoreException
if invalid path
-
def
getBaseMetadataPath(sparkSession: SparkSession, metaDataLocation: String, metadataType: String): Path
- returns
Metadata base location
- Exceptions thrown
ParquetMetaDataStoreException
if invalid metaDataLocation or invalid metadataType
-
def
getBasePathFromDB(sparkSession: SparkSession, db: String): String
Given a database name, extracts the base path from the database parameters
-
def
getMetadataPath(basePath: String, uri: String): Path
Constructs the metadata path for a given base path and URI by appending the SHA-256 digest of the URI to the base path
- basePath
the base path
- uri
the uri of the dataset (can be either a path or table identifier)
- returns
the path of the metadata
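The construction described for getMetadataPath can be sketched as follows. This is a minimal illustration, not the library's implementation: the names `MetadataPathSketch`, `sha256Hex`, and `metadataPathSketch` are hypothetical, the hex encoding of the digest and the `/` separator are assumptions, and the result is a plain String rather than an org.apache.hadoop.fs.Path so that the example has no Hadoop dependency.

```scala
import java.nio.charset.StandardCharsets
import java.security.MessageDigest

// Hypothetical sketch; not part of ParquetMetadataPath.
object MetadataPathSketch {
  // SHA-256 digest of the input, hex-encoded (the encoding is an assumption).
  def sha256Hex(s: String): String =
    MessageDigest.getInstance("SHA-256")
      .digest(s.getBytes(StandardCharsets.UTF_8))
      .map(b => "%02x".format(b))
      .mkString

  // Appends the digest of the dataset URI to the base path.
  // The "/" separator is an assumption about how the two parts are joined.
  def metadataPathSketch(basePath: String, uri: String): String =
    s"${basePath.stripSuffix("/")}/${sha256Hex(uri)}"
}
```

Hashing the URI gives every dataset a fixed-length, filesystem-safe directory name under the base path, whether the URI is a file path or a table identifier.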
-
def
resolveMetadataPath(sparkSession: SparkSession, uri: String, metaDataLocation: String, type: String): Path
Resolves the metadata path according to the given parameters
- sparkSession
SparkSession instance for accessing the catalog if needed
- uri
the URI of the dataset
- metaDataLocation
depending on the type, can be:
type = EXPLICIT_METADATA_LOCATION => an explicit location of the metadata
type = EXPLICIT_BASE_PATH_LOCATION => an explicit definition of the base path of the metadata
type = HIVE_TABLE_NAME => the name of the Hive table that contains, in its table properties, the exact path of the metadata. Note that if the parameter contains only the table name, the current database is used
type = HIVE_DB_NAME => the name of the Hive database that contains, in its database properties, the base path of the metadata
- returns
the path of the metadata
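The four resolution modes described for metaDataLocation can be sketched as a single dispatch. This is an illustration under stated assumptions, not the actual implementation: `ResolveSketch`, `Catalog`, and `resolve` are hypothetical names, the catalog lookups are stand-in functions rather than SparkSession calls, the type values are matched as plain strings, and paths are plain Strings joined with `/` and a hex-encoded SHA-256 digest.

```scala
import java.nio.charset.StandardCharsets
import java.security.MessageDigest

// Hypothetical sketch; not part of ParquetMetadataPath.
object ResolveSketch {
  // Stand-ins for catalog lookups that the real object performs via SparkSession.
  final case class Catalog(
      currentDatabase: String,
      tableMetadataPath: (String, String) => Option[String], // (db, table) => exact metadata path
      dbBasePath: String => Option[String])                  // db => metadata base path

  private def sha256Hex(s: String): String =
    MessageDigest.getInstance("SHA-256")
      .digest(s.getBytes(StandardCharsets.UTF_8))
      .map(b => "%02x".format(b))
      .mkString

  // Base path plus digest of the URI (separator and hex encoding are assumptions).
  private def underBase(base: String, uri: String): String =
    s"${base.stripSuffix("/")}/${sha256Hex(uri)}"

  def resolve(catalog: Catalog, uri: String, metaDataLocation: String, tpe: String): String =
    tpe match {
      case "EXPLICIT_METADATA_LOCATION"  => metaDataLocation
      case "EXPLICIT_BASE_PATH_LOCATION" => underBase(metaDataLocation, uri)
      case "HIVE_TABLE_NAME" =>
        // A bare table name falls back to the current database.
        val (db, table) = metaDataLocation.split('.') match {
          case Array(d, t) => (d, t)
          case _           => (catalog.currentDatabase, metaDataLocation)
        }
        catalog.tableMetadataPath(db, table).getOrElse(
          throw new IllegalArgumentException(s"no metadata path for $db.$table"))
      case "HIVE_DB_NAME" =>
        val base = catalog.dbBasePath(metaDataLocation).getOrElse(
          throw new IllegalArgumentException(s"no base path for $metaDataLocation"))
        underBase(base, uri)
      case other =>
        throw new IllegalArgumentException(s"unknown type: $other")
    }
}
```

In this sketch the two "exact path" modes return the location unchanged, while the two "base path" modes append the digest of the dataset URI, mirroring the description of getMetadataPath.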