Monday 26 March 2012

DocBase: Design, Implementation and Evaluation of a Document Database for XML

an article by Arijit Sengupta (Wright State University, USA) and Ramesh Venkataraman (Indiana University, USA) published in Journal of Database Management Volume 22 Issue 4 (2011)

Abstract

This article introduces a complete storage and retrieval architecture for a database environment for XML documents.

DocBase, a prototype system based on this architecture, uses a flexible storage and indexing technique to allow highly expressive queries without the necessity of mapping documents to other database formats. DocBase is an integration of several techniques that include:

  1. a formal model called Heterogeneous Nested Relations (HNR),
  2. a conceptual model XER (Extensible Entity Relationship),
  3. formal query languages (Document Algebra and Calculus),
  4. a practical query language (Document SQL or DSQL),
  5. a visual query formulation method with QBT (Query By Templates), and 
  6. the DocBase query processing architecture.
This paper focuses on the overall architecture of DocBase including implementation details, describes the details of the query-processing framework, and presents results from various performance tests.

The paper summarizes experimental and usability analyses to demonstrate its feasibility as a general architecture for native as well as embedded document manipulation methods.


No comments: