TY - GEN
T1 - Ontology guided data linkage framework for discovering meaningful data facts
AU - Gollapalli, Mohammed
AU - Li, Xue
AU - Wood, Ian
AU - Governatori, Guido
PY - 2011
Y1 - 2011
N2 - Making sensible queries on databases collected from different organizations presents a challenging task for linking semantic equivalent data facts. Current techniques primarily focused on performing pair-wise attribute matching and paid little attention towards discovering probabilistic structural dependencies by exploiting the ontological domain knowledge of tables, attributes and tuples to construct hierarchical cluster mapping trees. In this paper, we present Ontology Guided Data Linkage (OGDL) framework for self-organizing heterogeneous data sources into homogeneous ontological clusters through multi-faceted classification. Through the evaluation on real-world data, we demonstrate the robustness and accuracy of our system.
AB - Making sensible queries on databases collected from different organizations presents a challenging task for linking semantic equivalent data facts. Current techniques primarily focused on performing pair-wise attribute matching and paid little attention towards discovering probabilistic structural dependencies by exploiting the ontological domain knowledge of tables, attributes and tuples to construct hierarchical cluster mapping trees. In this paper, we present Ontology Guided Data Linkage (OGDL) framework for self-organizing heterogeneous data sources into homogeneous ontological clusters through multi-faceted classification. Through the evaluation on real-world data, we demonstrate the robustness and accuracy of our system.
KW - clustering
KW - Data linkage
KW - ontology matching
KW - table attributes
UR - http://www.scopus.com/inward/record.url?scp=84255186227&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84255186227&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-25856-5_19
DO - 10.1007/978-3-642-25856-5_19
M3 - Conference paper
AN - SCOPUS:84255186227
SN - 9783642258558
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 252
EP - 265
BT - Advanced Data Mining and Applications - 7th International Conference, ADMA 2011, Proceedings
T2 - 7th International Conference on Advanced Data Mining and Applications, ADMA 2011
Y2 - 17 December 2011 through 19 December 2011
ER -