PXML: a probabilistic semistructured data model and algebra

Bibliographic Details
Published in: 2003 19th International Conference on Data Engineering, pp. 467-478
Main Authors: Hung, E.; Getoor, L.; Subrahmanian, V.S.
Format: Conference Proceeding
Language: English
Published: IEEE 2003
ISBN: 9780780376656, 078037665X
DOI: 10.1109/ICDE.2003.1260814

Summary: Despite the recent proliferation of work on semistructured data models, there has been little work to date on supporting uncertainty in these models. We propose a model for probabilistic semistructured data (PSD). The advantage of our approach is that it supports a flexible representation that allows the specification of a wide class of distributions over semistructured instances. We provide two semantics for the model and show that the semantics are probabilistically coherent. Next, we develop an extension of the relational algebra to handle probabilistic semistructured data and describe efficient algorithms for answering queries that use this algebra. Finally, we present experimental results showing the efficiency of our algorithms.