The DataLakeCatalog database engine enables you to connect ClickHouse to external
data catalogs and query open table format data without duplicating it.
This turns ClickHouse into a query engine that works with
your existing data lake infrastructure.
Supported catalogs
The DataLakeCatalog engine supports the following data catalogs:
- AWS Glue Catalog - For Iceberg tables in AWS environments
- Databricks Unity Catalog - For Delta Lake and Iceberg tables
- Hive Metastore - Traditional Hadoop ecosystem catalog
- REST Catalogs - Any catalog supporting the Iceberg REST specification
Creating a database
You need to enable the relevant settings before you can use the DataLakeCatalog engine.
A database with the DataLakeCatalog engine is created with a CREATE DATABASE statement, which accepts the following settings:
| Setting | Description |
|---|---|
| catalog_type | Type of catalog: glue, unity (Delta), rest (Iceberg), hive, onelake (Iceberg) |
| warehouse | The warehouse/database name to use in the catalog. |
| catalog_credential | Authentication credential for the catalog (e.g., API key or token) |
| auth_header | Custom HTTP header for authentication with the catalog service |
| auth_scope | OAuth2 scope for authentication (if using OAuth) |
| storage_endpoint | Endpoint URL for the underlying storage |
| oauth_server_uri | URI of the OAuth2 authorization server for authentication |
| vended_credentials | Boolean indicating whether to use vended credentials from the catalog (supports AWS S3 and Azure ADLS Gen2) |
| aws_access_key_id | AWS access key ID for S3/Glue access (if not using vended credentials) |
| aws_secret_access_key | AWS secret access key for S3/Glue access (if not using vended credentials) |
| region | AWS region for the service (e.g., us-east-1) |
| dlf_access_key_id | Access key ID for DLF access |
| dlf_access_key_secret | Access key secret for DLF access |
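
As a rough illustration of how these settings fit together, the sketch below creates a database backed by an Iceberg REST catalog. The database name `demo`, the catalog URL, the storage endpoint, and the warehouse name are all placeholders. Passing the catalog endpoint as the engine argument is an assumption not stated in the table above, so verify the exact form against your ClickHouse version.

```sql
-- Hypothetical example: every name and URL below is a placeholder.
CREATE DATABASE demo
ENGINE = DataLakeCatalog('http://localhost:8181/v1')       -- assumed: catalog endpoint as engine argument
SETTINGS
    catalog_type = 'rest',                                  -- Iceberg REST catalog
    warehouse = 'demo',                                     -- warehouse name in the catalog
    storage_endpoint = 'http://localhost:9000/lakehouse';   -- endpoint of the underlying storage

-- Once the database exists, its tables can be listed like those of any other database.
SHOW TABLES FROM demo;
```

For a Glue catalog, `catalog_type = 'glue'` would presumably be combined with `region` and either the AWS key settings or `vended_credentials` instead of the REST-specific values.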
Examples
See the sections below for examples of using the DataLakeCatalog engine:
- Unity Catalog
- Glue Catalog
- OneLake Catalog
The DataLakeCatalog engine can be used by enabling
allow_experimental_database_iceberg or allow_database_iceberg.
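
A minimal sketch of enabling the engine for the current session before creating the database. Only one of the two setting names is needed. Which one your server accepts likely depends on the ClickHouse version, so treat the choice below as an assumption.

```sql
-- Enable the engine for the current session (setting name taken from the note above).
SET allow_experimental_database_iceberg = 1;
```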