Multi hub from one source. Split the satellite ?

justinV · 28 January 2025 14:42

Hi everybody!

I need some advice regarding best practices in hub/satellite modelisation.
I have 2 hub :
hub_employee and hub_company

I have a source that give me data about employee AND company (a bool on each line say if the data is personal => about the employee or ‘generic’ => about the company).
Source :
firstName
famiyName
mail
phone
Company_ID

When it’s generic, names are null.

If I understand the best practice well, I should have one satellite per source. But if I put the satellite on the the hub_company, I loose information about which employee the data defines (as the relation is one to many).

Should I consider the source as 2 sources and split the satellite on the two hubs (company and employee)? Is it not a bad practice? Or should I consider another way to do it?

Many thanks for any advice!

Nat · 29 January 2025 10:59

So e.g. you may get the mail address of the company or the mail address of the employee depending on the boolean?

If so, you can add the company info to a sat off hub_company and the employee data off hub_employee. prefilter in staging. You can have whatever numbers of satellites from one source table if they mean different things.

Is this also a link source for employee to company? Feels like it. Don’t miss this part out.

justinV · 29 January 2025 15:17

Exactly! The same is true for the phone (can be the professional phone of the employee or the generic phone of the company).

I am a little bit relieved, it’s what seems logical to me and what I have done (2 hubs, 2 satellites and 1 link).

I am quite new to the data vault modelisation, and it seems that a satellite per source was a strong rule. I wanted to avoid rookie mistake if there is a strong reason for a strict one source => one satellite.

squash7733 · 30 January 2025 07:43

Agree with Nat. Use the same source to stage into two Hub- Sat’s and Link, if needed.

Incidentally, how does your source generate unique BK’s for Employee and Company, pls?

Seems like they would have different business domains i.e. EmployeeId is usually a Number (system-generated at HR-Onboarding) whereas CompanyID could be its ABN/ACN? - or some other way?

justinV · 31 January 2025 09:42

For the Company, the source give the national ID of the company in our country.

For the EmployeeID, we use the business mail of the employee (usually it is something like employeeName@companyName.com)

Topic		Replies	Views
How to design data vault in case of multiple source systems and many satellites for each source Data Vault 2.0 dbtvault	3	2097	5 February 2023
For JSON Source if there is a LINK why need HUB and SAT ? Data Vault 2.0 business-key , link , hub	3	357	6 February 2023
Restructuring Source Data Data Vault 2.0	1	52	11 March 2025
Hub with many multi-active satellites Data Vault 2.0 multi-active-sat	6	80	2 July 2025
Need some advice modeling a store hub and satellites Data Vault 2.0	3	466	17 January 2024

Multi hub from one source. Split the satellite ?

Related topics