object ParquetMetadataPath extends Logging with Serializable

Linear Supertypes
Serializable, Serializable, Logging, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. ParquetMetadataPath
  2. Serializable
  3. Serializable
  4. Logging
  5. AnyRef
  6. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. def convertMetaDataLocationToPath(mdLocation: String): Path

    Wrapper for stringToPath to return backend specific error message

    Wrapper for stringToPath to return backend specific error message

    mdLocation

    string to convert to org.apache.hadoop.fs.Path

    returns

    metadata hadoop fs path

    Exceptions thrown

    ParquetMetaDataStoreException if invalid path

  2. def getBaseMetadataPath(sparkSession: SparkSession, metaDataLocation: String, metadataType: String): Path

    returns

    Metadata base location

    Exceptions thrown

    ParquetMetaDataStoreException if invalid metaDataLocation or invalid metadataType

  3. def getBasePathFromDB(sparkSession: SparkSession, db: String): String

    Given a db name extracts the base path from the database parameters

  4. def getMetadataPath(basePath: String, uri: String): Path

    Constructs the metadata path for a given base path and a uri by concatenating sha256 digest of the uri to the base path

    Constructs the metadata path for a given base path and a uri by concatenating sha256 digest of the uri to the base path

    basePath

    the base path

    uri

    the uri of the dataset (can be either a path or table identifier)

    returns

    the path of the metadata

  5. def resolveMetadataPath(sparkSession: SparkSession, uri: String, metaDataLocation: String, type: String): Path

    Resolves the metadata path according to the given parameters

    Resolves the metadata path according to the given parameters

    sparkSession

    SparkSession instance for accessing the catalog if needed

    uri

    the URI of the dataset

    metaDataLocation

    depending on the type can be: type = EXPLICIT_METADATA_LOCATION => an explicit location of the metadata type = EXPLICIT_BASE_PATH_LOCATION => an explicit definition of the base path of the metadata type = HIVE_TABLE_NAME => the name of the hive table that contains the table properties the exact path of the metadata. Note that if the parameter contains only the table name then the current database is used type = HIVE_DB_NAME => the name of the hive database that contains in the db properties the base path of the metadata

    returns

    the path of the metadata