Describe the two primary strategies for distributing data across multiple physical locations: Data Replication and Data Fragmentation, stating a key advantage of each.
Solution
Replication stores copies of the same data at multiple sites; fragmentation splits a relation into pieces and stores each piece at a different site.
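- Replication advantage: improved read availability. Any site holding a copy can serve the data, so reads still succeed if another site is down.
- Fragmentation advantage: locality. Each fragment can be stored at the site that uses it most, so most queries are answered by a nearby site holding the relevant data.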
- Horizontal fragmentation: select rows by predicate (e.g., region = 'Nepal').
- Vertical fragmentation: select columns by usage pattern (e.g., personal info vs financial info), keeping the primary key in every fragment so rows can be rebuilt by a join.
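
The strategies above can be sketched in a few lines of Python. This is only an illustrative sketch using in-memory lists of dictionaries, not a real distributed database: the `customers` table, its column names, the site names, and the helper functions `replicate`, `horizontal_fragment`, `vertical_fragment`, and `rejoin` are all assumptions made for the example.

```python
from copy import deepcopy

# Hypothetical customer table; column names and values are illustrative only.
customers = [
    {"id": 1, "name": "Asha", "region": "Nepal", "balance": 120.0},
    {"id": 2, "name": "Ben", "region": "India", "balance": 75.5},
    {"id": 3, "name": "Chen", "region": "Nepal", "balance": 310.0},
]


def replicate(rows, sites):
    """Replication: every site holds a full, independent copy of the rows."""
    return {site: deepcopy(rows) for site in sites}


def horizontal_fragment(rows, predicate):
    """Horizontal fragmentation: keep whole rows that satisfy the predicate."""
    return [row for row in rows if predicate(row)]


def vertical_fragment(rows, columns, key="id"):
    """Vertical fragmentation: keep the chosen columns plus the key column."""
    return [{key: row[key], **{c: row[c] for c in columns}} for row in rows]


def rejoin(left, right, key="id"):
    """Rebuild full rows from two vertical fragments by joining on the key."""
    right_by_key = {row[key]: row for row in right}
    return [{**row, **right_by_key[row[key]]} for row in left]


# Replication: two sites (hypothetical names), each with the whole table.
replicas = replicate(customers, ["kathmandu", "delhi"])

# Horizontal fragments: rows split by the predicate region = 'Nepal'.
nepal = horizontal_fragment(customers, lambda r: r["region"] == "Nepal")
others = horizontal_fragment(customers, lambda r: r["region"] != "Nepal")

# Vertical fragments: personal info vs financial info, both keyed by id.
personal = vertical_fragment(customers, ["name", "region"])
financial = vertical_fragment(customers, ["balance"])
```

In this sketch the original table is recoverable from either kind of fragmentation: the union of `nepal` and `others` contains every row exactly once, and `rejoin(personal, financial)` restores the full rows because both vertical fragments carry the `id` key.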