PostgreSQL metadata collector
Provides read-only access to metadata from a PostgreSQL database.
The collector harvests metadata for PostgreSQL tables and columns, making them searchable and discoverable in the data catalog. Supports both self-hosted PostgreSQL instances and managed PostgreSQL instances, such as those hosted on AWS RDS.
Authentication supported
Username and password authentication and AWS IAM authentication.
Metadata cataloged
The PostgreSQL collector catalogs the following information.
| Object | Information cataloged |
|---|---|
| Columns | Name, Description, JDBC type, Column Type, Is Nullable, Default Value, Key type (Primary, foreign), column size, column index |
| Table | Name, description, primary key, schema |
| Table Index | Index Cardinality, Column name, Index Type, Index Name, is non Unique, Ordinal Position, Pages, Sort Sequence |
| Views | Name, description, SQL definition |
| Materialized View | Name, description, SQL definition |
| Schema | Identifier, Name |
| Database | Type, name, identifier, server, port, environment, JDBC URL |
| Function | Name, Description, Function Type |
| Stored Procedure | Name, Description, Stored Procedure Type |
| Object | Information cataloged |
|---|---|
| Column |
|
| Table |
|
Relationships between objects
Catalog pages show relationships between the following data asset types:
| Data asset page | Relationship |
|---|---|
| Table | Columns, Table Indexes |
| Columns | Table |
| Schema |
|
| View |
|
| Materialized View |
|
Lineage for PostgreSQL
The collector identifies column-level lineage for Views and Materialized Views, tracing data sources from view columns to source table columns across SQL expressions and subqueries.
Establishes relationships between Views and source Table columns that:
- Sort rows (SQL ORDER BY)
- Filter rows (SQL WHERE and HAVING clauses)
- Aggregate rows (SQL GROUP BY)