Abstract
Data imputation addresses the challenge of imputing missing values in database instances, ensuring consistency with the overall semantics of the dataset. Although several heuristics which rely on statistical methods, and ad-hoc rules have been proposed. These do not generalise well and often lack data context. Consequently, they also lack explainability. The existing techniques also mostly focus on the relational data context making them unsuitable for wider application contexts such as in graph data. In this paper, we propose a graph data imputation approach called GIG which relies on graph differential dependencies (GDDs). GIG, learns the GDDs from a given knowledge graph, and uses these rules to train a transformer model which then predicts the value of missing data within the graph. By leveraging GDDs, GIG incoporates semantic knowledge into the data imputation process making it more reliable and explainable. Experimental results on seven real-world datasets highlight GIG’s effectiveness compared to existing state-of-the-art approaches.
| Original language | English |
|---|---|
| Title of host publication | Databases Theory and Applications |
| Subtitle of host publication | 35th Australasian Database Conference, ADC 2024, Proceedings |
| Editors | Tong Chen, Yang Cao, Quoc Viet Hung Nguyen, Thanh Tam Nguyen |
| Place of Publication | Singapore |
| Publisher | Springer |
| Pages | 347-358 |
| Number of pages | 12 |
| Volume | 15449 |
| ISBN (Electronic) | 9789819612420 |
| ISBN (Print) | 9789819612413 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 35th Australasian Database Conference, ADC 2024 - Griffith University, Gold Coast, Australia Duration: 16 Dec 2024 → 18 Dec 2024 https://adc-conference.github.io/2024/ (Conference website) https://adc-conference.github.io/2024/program/full-program (Program) |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 15449 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 35th Australasian Database Conference, ADC 2024 |
|---|---|
| Country/Territory | Australia |
| City | Gold Coast |
| Period | 16/12/24 → 18/12/24 |
| Other | The Australasian Database Conference (ADC) series is an annual forum for sharing the latest research progresses and novel applications of database systems, data management, data mining and data analytics for researchers and practitioners in these areas from Australia, New Zealand and in the world. The 35th edition of the Australasian Database Conference, ADC 2024, will be held in Gold Coast, Australia. We welcome contributions related to all aspects of database theory and foundation, techniques, and applications. |
| Internet address |
|
Fingerprint
Dive into the research topics of 'GIG: Graph data imputation with graph differential dependencies'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver