Histogram Diff
Histogram Diff compares the distribution of a numeric or date column, displayed as an overlay histogram chart.
A Histogram Diff can be generated in two ways.
Via the Explore Change button menu:
- Select the model from the Lineage DAG.
- Click the Explore Change button.
- Click Histogram Diff.
- Select a column to diff.
- Click Execute.
Via the column options menu:
- Select the model from the Lineage DAG.
- Hover over the column in the Node Details panel.
- Click the vertical 3 dots (...).
- Click Histogram Diff.
SQL Execution
Histogram Diff generates SQL queries to create distribution histograms for numeric and date columns. The queries use binning strategies to group values and count occurrences in each bin, supporting both integer and floating-point data types.
You can review the exact SQL generation functions in the HistogramDiffTask class.
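To make the binning idea concrete, here is a minimal Python sketch. It is not the SQL that HistogramDiffTask generates. It assumes equal-width bins computed over the combined value range of both sides, so that the two histograms share edges and can be overlaid. The function name histogram_counts, its parameters, and the sample values are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def histogram_counts(
    base: Sequence[float],
    current: Sequence[float],
    num_bins: int = 5,
) -> Tuple[List[float], List[int], List[int]]:
    """Bin two samples on shared edges so their histograms can be overlaid."""
    values = list(base) + list(current)
    if not values:
        return [], [], []
    lo, hi = min(values), max(values)
    # Fall back to a width of 1.0 when every value is identical (zero range).
    width = (hi - lo) / num_bins or 1.0
    edges = [lo + i * width for i in range(num_bins + 1)]

    def count(sample: Sequence[float]) -> List[int]:
        counts = [0] * num_bins
        for v in sample:
            # Clamp so the maximum value lands in the last bin.
            idx = min(int((v - lo) / width), num_bins - 1)
            counts[idx] += 1
        return counts

    return edges, count(base), count(current)


# Hypothetical sample data for the two sides being compared.
edges, base_counts, current_counts = histogram_counts(
    base=[1, 2, 2, 3, 10], current=[1, 5, 6, 9, 10], num_bins=3
)
```

Sharing one set of bin edges is the design choice that matters here: if each side were binned over its own range, the bars would not line up and the overlay would be misleading.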