Automating the Schema Matching Process for Heterogeneous Data Warehouses

A. Tjoa, M. Banek, B. Vrdoljak, Z. Skočir:
"Automating the Schema Matching Process for Heterogeneous Data Warehouses";
Talk: 9th International Conference on Data Warehousing and Knowledge Discovery (DaWaK'07), Regensburg, Germany; September 3-7, 2007; in: "Data Warehousing and Knowledge Discovery", Springer, LNCS 4654 (2007), ISBN: 978-3-540-74552-5; pp. 45-54.

Abstract:


A federated data warehouse is a logical integration of data warehouses, applicable when physical integration is impossible due to privacy policy or legal restrictions. To enable the translation of queries in a federated approach, the schemas of the federated and the local warehouses must be matched. In this paper we present a procedure that enables the matching process for schema structures specific to the multidimensional model of data warehouses: facts, measures, dimensions, aggregation levels and dimensional attributes. Similarities between warehouse-specific structures are computed using linguistic and structural comparison, and the calculated values are used to create the necessary mappings. We present restriction rules and recommendations for aggregation level matching, which constitutes the most complex part of the process. A software implementation of the entire process is provided to verify it and to determine the appropriate selection metric for mapping different multidimensional structures.
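
The idea of combining linguistic and structural comparison to derive mappings can be illustrated with a minimal Python sketch. It is not the procedure from the paper: the function names, the use of difflib's string ratio as the linguistic measure, the attribute-based structural measure, the equal weighting, the 0.6 threshold and the greedy one-to-one selection are all assumptions made purely for illustration.

```python
from difflib import SequenceMatcher


def linguistic_similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1] (assumed stand-in measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def structural_similarity(attrs_a, attrs_b) -> float:
    """Average, over attrs_a, of the best name match found in attrs_b."""
    if not attrs_a or not attrs_b:
        return 0.0
    best = [max(linguistic_similarity(x, y) for y in attrs_b) for x in attrs_a]
    return sum(best) / len(best)


def match_levels(local, federated, weight=0.5, threshold=0.6):
    """Greedily map local aggregation levels to federated ones.

    Both arguments are dicts: level name -> list of dimensional attribute names.
    `weight` and `threshold` are hypothetical tuning parameters.
    """
    scored = []
    for l_name, l_attrs in local.items():
        for f_name, f_attrs in federated.items():
            score = (weight * linguistic_similarity(l_name, f_name)
                     + (1 - weight) * structural_similarity(l_attrs, f_attrs))
            scored.append((score, l_name, f_name))

    # Take the highest-scoring pairs first; each level is mapped at most once.
    scored.sort(reverse=True)
    mapping, used = {}, set()
    for score, l_name, f_name in scored:
        if score < threshold:
            break
        if l_name in mapping or f_name in used:
            continue
        mapping[l_name] = f_name
        used.add(f_name)
    return mapping


if __name__ == "__main__":
    local = {"city": ["city_name", "zip"], "country": ["country_name"]}
    federated = {"town": ["town_name", "zip_code"],
                 "country": ["country_name", "iso_code"]}
    print(match_levels(local, federated))
```

A greedy selection is only one possible selection metric; the abstract notes that choosing the proper metric for each kind of multidimensional structure is itself one of the questions the implementation is meant to answer.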