Configure and control automated metadata harvesting for your databases. Scope by schema, schedule refreshes, monitor status, and clean up when needed.
Configure Sources
Schema Scoping
Scheduling
Monitor Coverage
Cleanup
_get_harvest_control_data
_set_harvest_control_data
Name | Definition | Format |
---|---|---|
connection_id | Connection id for the source to harvest. | String |
database_name | Database name to harvest. For BigQuery, use the project id. | String |
refresh_interval | Refresh cadence in minutes. Default 5. | Integer |
initial_crawl_complete | Set to false to trigger an immediate initial crawl. | Boolean |
status | Control row status. Use Include to enable or Exclude to disable. | String |
schema_inclusions | List of schemas to include. Empty list means include all. | Array |
schema_exclusions | Schemas to exclude from harvest. | Array |
_set_harvest_control_data
with initial_crawl_complete: false
._remove_harvest_control_data
Name | Definition | Format |
---|---|---|
source_name | Source or connection identifier of the control row to remove. | String |
database_name | Target database for the control row removal. | String |
_remove_metadata_for_database
Name | Definition | Format |
---|---|---|
source_name | Source or connection identifier for metadata purge. | String |
database_name | Database whose harvested metadata will be deleted. | String |
_get_harvest_summary
ANALYTICS
and MARTS
while excluding INFORMATION_SCHEMA
._remove_metadata_for_database
deletes stored metadata for the database. Make sure you truly do not need it before proceeding._get_harvest_control_data
and _get_harvest_summary
to see what is active._set_harvest_control_data
to upsert the row with status, interval, and filters. Set initial_crawl_complete: false
to kick off the first crawl._get_harvest_summary
during rollout. Adjust interval or scope if load is higher than expected._remove_harvest_control_data
to stop future crawls. If needed, _remove_metadata_for_database
to clear stored metadata.refresh_interval
of 5 minutes for active environments and relax to 15 to 60 minutes where change is rare.INFORMATION_SCHEMA
unless needed.SHOW
and DESCRIBE
style operations.