Our organization has adopted both the type of and rate of change data splits of our satellites. In doing so, I’m wrestling with how you reconstruct the entire record for the end user to use instead of having to know which split satellite contains the data that they’re looking for?
After profiling my table data coming into staging, I have 4 split tables as follows:
- Table name ends with _pii;
- Table name ends with _slow;
- Table name ends with _intr; and
- Table name ends with _fast.
The consistent columns on all 4 tables are:
1) The hub hash key; and
2) The load datetime value.
Within the descriptive data, there are 2 columns that can act as indexed to organize the data coming in. They are the applied date column and a version number. Unfortunately, they both are included in the _fast table. Using the windowing function, I can locate the record for a date in question from the _fast table. However, I don’t have any columns in the other 3 tables where I can identify the row as belonging to the record in the _fast table. I’ve thought of including those 2 additional columns in all remaining tables and ONLY loading them when a row in the table is created with the current values on the record. Is this the correct solution and what would the query look like to accomplish it?
Thanks for your time
Clay