
Data Catalog Builder: Data Catalog Builder — Guide
Data Catalog Builder — Guide Automated metadata discovery, classification, quality scoring, and documentation for Databricks Unity Catalog. Table of Contents Prerequisites Scanner Configuration Running the Catalog Scan Metadata Enrichment Column Classification Quality Scoring Report Generation Search Index Dashboard Usage Troubleshooting Prerequisites Before using Data Catalog Builder, ensure: Databricks Runtime 13.3+ with Unity Catalog enabled Cluster access to the catalogs you want to scan Python packages : pyyaml , jinja2 (both included in DBR by default) Permissions : USE CATALOG , USE SCHEMA , and SELECT on information_schema / target tables For lineage features: access to system.access.table_lineage and system.access.column_lineage Recommended Cluster Config Setting Value Runtime 13.3 LTS or 14.x Node type Standard_DS3_v2 (or equivalent) Workers 1–2 (metadata-only workload) Unity Catalog Enabled Scanner Configuration The scanner is driven by configs/catalog_config.yaml . Key sett
Continue reading on Dev.to
Opens in a new tab



