Implementing Data Vault Architecture & Model in both on-prem and on-Cloud (hybrid)

CEds · 11 December 2023 14:49

Hello,

I am new to this group, but I would like to ask for guidance, advice, or information (from those who have done it) on how to implement a data vault architecture and model where some parts of the raw data vault are on-prem and other parts are on a cloud platform.

For example, some Customer attributes can be in cloud data storage, but other Customer attributes must stay in on-prem storage.

If this is the case, what is the best practice? Could you kindly provide guidance on how to implement this?

Thanks!

AHenning · 13 December 2023 04:03

My suggestion is that you try a join between data held in two different locations. Measure the performance, then run the same join with all the data in one location and compare.
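A minimal sketch of that comparison, under loud assumptions: the table and column names (`hub_customer`, `sat_customer`, `hk`) are made up for illustration, and two in-memory SQLite connections stand in for the on-prem and cloud stores. There is no real network here, so the timings only show the shape of the test (one engine joining locally versus a client pulling rows from both sides and joining them itself). To get meaningful numbers, swap the connections for your actual platforms.

```python
import sqlite3
import time

N = 20_000  # illustrative row count, not a recommendation
hub_rows = [(i, f"CUST-{i}") for i in range(N)]
sat_rows = [(i, f"name-{i}") for i in range(N)]

HUB_DDL = "CREATE TABLE hub_customer (hk INTEGER PRIMARY KEY, customer_id TEXT)"
SAT_DDL = "CREATE TABLE sat_customer (hk INTEGER PRIMARY KEY, name TEXT)"

# Case 1: hub and satellite live in the same database.
single = sqlite3.connect(":memory:")
single.execute(HUB_DDL)
single.execute(SAT_DDL)
single.executemany("INSERT INTO hub_customer VALUES (?, ?)", hub_rows)
single.executemany("INSERT INTO sat_customer VALUES (?, ?)", sat_rows)

# Case 2: hub "on-prem", satellite "in the cloud" (two separate connections).
on_prem = sqlite3.connect(":memory:")
on_prem.execute(HUB_DDL)
on_prem.executemany("INSERT INTO hub_customer VALUES (?, ?)", hub_rows)

cloud = sqlite3.connect(":memory:")
cloud.execute(SAT_DDL)
cloud.executemany("INSERT INTO sat_customer VALUES (?, ?)", sat_rows)


def join_colocated():
    """Let the database engine perform the join."""
    sql = (
        "SELECT h.customer_id, s.name "
        "FROM hub_customer h JOIN sat_customer s ON s.hk = h.hk"
    )
    return single.execute(sql).fetchall()


def join_split():
    """Pull rows from both locations and join them on the client."""
    names = dict(cloud.execute("SELECT hk, name FROM sat_customer"))
    hub = on_prem.execute("SELECT hk, customer_id FROM hub_customer")
    return [(cid, names[hk]) for hk, cid in hub if hk in names]


def timed(fn):
    """Return the function result and the elapsed wall-clock seconds."""
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


colocated, t_colocated = timed(join_colocated)
split, t_split = timed(join_split)
print(f"co-located join: {t_colocated:.4f}s, split join: {t_split:.4f}s")
```

Both functions return the same rows, so any difference you measure against real systems comes from where the data sits and how much of it has to cross the wire.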

patrickcuba · 13 December 2023 20:12

Hello,

Sounds like two different vaults!

A DV has many joins, and if you’re joining data between on-prem and cloud, what is the cost of running those joins across locations?

There isn’t any guide because I don’t think anyone does this within a data vault. A DV includes historical data, and connecting a client to join data across the ether just sounds impractical!
