Add virtual tables on relational datasources

You can create a virtual table on certain datasources using SQL. This capability is useful if you want to create data quality checks on tables that you define in Lightup using SQL. Normally you would create views on the datasource to accomplish this, and let Lightup discover the view when it scans the datasource. However, sometimes you may either not have the permissions to create views on the datasource or you may not want to create views directly on the datasource. This capability gives you the flexibility to do that inside Lightup.

Create

  1. From the schema context menu
  1. From the Table Configuration tab under a schema

Once you enter the virtual table creation dialogue you can begin defining the virtual table, configure, preview and save it before you create data quality checks (metrics and monitors) on it.

Define and Validate SQL

  1. Add the name of the virtual table and choose its query scope. ("Incremental" or "Full Table")
  2. Write the SQL needed to create the virtual table. The requirements for creating the SQL for virtual tables are the same as documented on the SQL metrics page
    Example:
    1. SELECT
      	*
      from
      	(
      		SELECT
      			*,
      			"timestamp" :: date as "order_date"
      		from
      			"testdata"."customer_orders_fact"
      		WHERE
      			"timestamp" >= {start_ts}
      			AND "timestamp" < {end_ts}
      	) as q1
      WHERE
      	"order_date" >= {start_date}
      	AND "order_date" < {end_date}
      

  3. Validate the SQL using the "Validate Query" button. You cannot save the virtual table without successfully validating the SQL

Configure

You can optionally add the required configuration parameters before saving in order to finish setting up a virtual table. The configuration parameters are the same as those you would setup for tables as listed in the Configure tables page.

While this step is optional during virtual table creation, you will eventually need to configure the virtual table when you activate it (see the Activate section) before setting up data quality checks on it.

Note that certain configuration fields are required for you to preview a virtual table of incremental query scope (see the Preview section)

Preview

You can optionally preview a virtual table to see its column names, inferred data types as well as sample data retrieved by the SQL query you used to define the virtual table.

The following fields are needed for you to preview the data for incremental query scope

  • Aggregation interval, aggregation timezone, timestamp column name, timestamp column type, timestamp timezone

Activate

Once the virtual table has been saved, you need to finish setting up the configuration parameters (if you have not already done so as described in the Configure section) and activate the table from the schema "Table Configuration tab" by toggling the table on. Once this is done, the virtual table will appear in the Explorer panel under the chosen schema.

Setup metrics

The following auto metrics are supported on virtual tables

Metric typeMetric nameSupport
Table Auto metricsRow count/Byte countNot supported
Column activityNot supported
Update delayNot supported
Data delaySupported
Data volumeSupported
Column Auto metricsCategory activitySupported
DistributionSupported
Null percentSupported
Custom metricsRow by row metricsSupported
Aggregation metricsSupported
Compare aggregate metricsSupported
Conformity metricsSupported
Distribution metricsSupported
Null percent metricsSupported
SQL metricsSupported

Setup monitors

Once you have setup metrics on virtual tables, you can enable monitors as you would for other tables

Supported datasources

Virtual tables are supported on the following datasources

  • Oracle
  • Postgres
  • Incorta
  • Snowflake
  • Databricks