XML Data Management: Native XML and XML-Enabled Database Systems
     


by Akmal B. Chaudhri, Awais Rashid, Roberto Zicari
 

ISBN-10: 0201844524

ISBN-13: 9780201844528

Pub. Date: 03/07/2003

Publisher: Addison-Wesley

Overview

"This is an excellent book that combines a practical and analytical look at the subject."

—Leo Korman, Principal Software Engineer, KANA Software

As organizations begin to employ XML within their information-management and exchange strategies, data management issues pertaining to storage, retrieval, querying, indexing, and manipulation increasingly arise, along with new information-modeling challenges. XML Data Management, with contributions from experts at the forefront of the XML field, addresses these issues and challenges. It offers insights into the advantages and drawbacks of various XML solutions, best practices for modeling information with XML, and guidance on developing custom, in-house solutions.

In this book, you will find discussions of the newest native XML databases, along with information on working with XML-enabled relational database systems. In addition, XML Data Management thoroughly examines benchmarks and analysis techniques for evaluating the performance of XML databases.

Topics covered include:

  • The power of good grammar and style in modeling information to alleviate the need for redundant domain knowledge
  • Tamino's XML storage, indexing, querying, and data access features
  • The features and APIs of open source eXist
  • Berkeley DB XML's ability to store XML documents natively
  • IBM's DB2 Universal Database and its support for XML applications
  • Xperanto's method of addressing information integration requirements
  • Oracle's XMLType for managing document-centric XML documents
  • Microsoft SQL Server 2000's support for exporting and importing XML data
  • A generic architecture for storing XML documents in a relational database
  • XOO7, XMach-1, XMark, and other benchmarks for evaluating XML database performance
  • Numerous case studies demonstrate real-world problems, industry-tested solutions, and creative applications of XML data management solutions.

    Written for both XML and relational database professionals, XML Data Management provides a promising new approach to data management, one that is sure to positively impact the way organizations manage and exchange information.

    Product Details

    ISBN-13: 9780201844528
    Publisher: Addison-Wesley
    Publication date: 03/07/2003
    Pages: 688
    Product dimensions: 7.32(w) x 8.97(h) x 1.65(d)

    Table of Contents

    Preface.

    Acknowledgments.

    I. WHAT IS XML?

    1. Information Modeling with XML.

    Introduction.

    XML as an Information Domain.

    How XML Expresses Information.

    Patterns in XML.

    Common XML Information-Modeling Pitfalls.

    Attributes Used as Data Elements.

    Data Elements Used as Metadata.

    Inadequate Use of Tags.

    A Very Simple Way to Design XML.

    Conclusion.

    II. NATIVE XML DATABASES.

    2. Tamino: Software AG's Native XML Server.

    Introduction.

    Tamino Architecture and APIs.

    XML Storage.

    Collections and Doctypes.

    Schemas.

    Access to Other Databases: Tamino X-Node.

    Mapping Data to Functions: Tamino X-Tension.

    Internationalization Issues.

    Indexing.

    Organization on Disk.

    Querying XML.

    Query Language: Tamino X-Query.

    Sessions and Transactions.

    Handling of Results.

    Query Execution.

    Tools.

    Database Browsing.

    Schema Editing.

    WebDAV Access.

    X-Application.

    Full Database Functionality.

    Conclusion.

    3. eXist Native XML Database.

    Introduction.

    Features.

    Schema-less XML Data Store.

    Collections.

    Index-Based Query Processing.

    Extensions for Full-Text Searching.

    System Architecture Overview.

    Pluggable Storage Backends.

    Deployment.

    Application Development.

    Getting Started.

    Query Language Extensions.

    Specifying the Input Document Set.

    Querying Text.

    Outstanding Features.

    Application Development.

    Programming Java Applications with the XML:DB API.

    Accessing eXist with SOAP.

    Integration with Cocoon.

    Technical Background.

    Approaches to Query Execution.

    Indexing Scheme.

    Index and Storage Implementation.

    Query Language Processing.

    Query Performance.

    Conclusion.

    4. Embedded XML Databases.

    Introduction.

    A Primer on Embedded Databases.

    Embedded XML Databases.

    Building Applications for Embedded XML Databases.

    Overview of Berkeley DB XML.

    Configuration.

    Indexing and Index Types.

    XPath Query Processing.

    Programming for Transactions.

    Two-Phase Locking and Deadlocks.

    Reducing Contention.

    Checkpoints.

    Recovery Processing after Failures.

    Conclusion.

    III. XML AND RELATIONAL DATABASES.

    5. IBM XML-Enabled Data Management Product Architecture and Technology.

    Introduction.

    Product and Technology Offering Summaries.

    DB2 Universal Database.

    Information Integration Technology.

    Current Architecture and Technology.

    Shared Architecture and Technology.

    XML Extender Architecture.

    XML Extender Technology.

    Using Both XML Collections and XML Columns.

    Transforming XML Data.

    Searching, Parsing, and Validating XML Data.

    XML Extender Federated Support.

    SQL XML Support Architecture.

    SQL XML Support Technology.

    Data Management Web Services Architecture.

    Data Management Web Services Technology.

    Information Integration-Specific Architecture and Technology.

    Future Architecture and Technology.

    The Vision.

    Application Interface, Data Type, and API Goals.

    Storage, Engine, and Data Manager Goals.

    Why Support Both XML and Relational Storage in One System?

    Why Not Object-Relational Long Term?

    Impacted Technology Areas.

    Conclusion.

    Notices.

    6. Supporting XML in Oracle9i.

    Introduction.

    Storing XML as CLOB.

    Using CLOB and the OracleText Cartridge.

    Search Predicates in OracleText.

    XML-Specific Functionality.

    Prerequisites.

    XMLType.

    Object Type XMLType.

    Processing of XMLType in Java.

    Using XSU for Fine-Grained Storage.

    Canonical Mapping.

    Retrieval.

    Modifications.

    Building XML Documents from Relational Data.

    SQL Functions existsNode and extract.

    The SQL Function SYS_XMLGen.

    The SQL Function SYS_XMLAgg.

    PL/SQL Package DBMS_XMLGen.

    Web Access to the Database.

    The Principle of XSQL.

    Posting XML Data into the Database.

    Parameterization.

    Servlet Invocations.

    Special Oracle Features.

    URI Support.

    Parsers.

    Class Generator.

    Special Java Beans.

    Conclusion.

    7. XML Support in Microsoft SQL Server 2000.

    Introduction.

    XML and Relational Data.

    XML Access to SQL Server.

    Access via HTTP.

    Using the XML Features through SQLOLEDB, ADO, and .NET.

    Serializing SQL Query Results into XML.

    The Raw Mode.

    The Auto and Nested Modes.

    The Explicit Mode.

    Providing Relational Views over XML.

    SQLXML Templates.

    Providing XML Views over Relational Data.

    Annotated Schemata.

    Querying Using XPath.

    Updating Using Updategrams.

    Bulk Loading.

    Conclusion.

    8. A Generic Architecture for Storing XML Documents in a Relational Database.

    Introduction.

    System Architecture.

    Installing Xerces.

    The Data Model.

    DOM Storage in Relational Databases.

    The Nested Sets Model.

    Creating the Database.

    The Physical Data Model.

    Creating User-Defined Data Types.

    Creating the Tables.

    Serializing a Document out of the Repository.

    Building an XML Document Manually.

    Connecting to the Repository.

    The xmlrepDB Class.

    Uploading XML Documents.

    The xmlrepSAX Class.

    Stored Procedures for Data Entry.

    The uploadXML Class.

    The extractXML Class.

    Querying the Repository.

    Ad Hoc SQL Queries.

    Searching for Text.

    Some More Stored Procedures.

    Generating XPath Expressions.

    Further Enhancements.

    Conclusion.

    9. An Object-Relational Approach to Building a High-Performance XML Repository.

    Introduction.

    Overview of XML Use-Case Scenario.

    High-Level System Architecture.

    Detailed Design Descriptions.

    Conclusion.

    IV. APPLICATIONS OF XML.

    10. Knowledge Management in Bioinformatics.

    Introduction.

    A Brief Molecular Biology Background.

    Life Sciences Are Turning to XML to Model Their Information.

    A Genetic Information Model.

    NeoCore XMS.

    Integration of BLAST into NeoCore XMS.

    Sequence Search Types.

    Conclusion.

    11. Case Studies of XML Used with IBM DB2 Universal Database.

    Introduction.

    Case Study 1: “Our Most Valued Customers Come First”.

    Company Scenario.

    How This Business Problem Is Addressed.

    Future Extensions.

    Case Study 2: “Improve Cash Flow”.

    Company Scenario.

    How This Business Problem Is Addressed.

    Future Extensions.

    Conclusion.

    Notices.

    12. The Design and Implementation of an Engineering Data Management System Using XML and J2EE.

    Introduction.

    Background and Requirements.

    Overview.

    Security Service.

    Query Service.

    Image Query Service.

    Print Service.

    Design Choices.

    Using XML in OAI.

    Conversion of XML Input into Objects.

    Conversion of Database Data into XML.

    Conversion of Image Data into XML.

    Database Access.

    Validation.

    Future Directions.

    XSLT.

    Web Services.

    Mass Transfer Capability.

    Messaging.

    Conclusion.

    13. Geographical Data Interchange Using XML-Enabled Technology within the GIDB System.

    Introduction.

    GIDB METOC Data Integration.

    Background.

    Implementation.

    GIDB Web Map Service Implementation.

    GIDB GML Import and Export.

    Conclusion.

    14. Space Wide Web by Adapters in Distributed Systems Configuration from Reusable Components.

    Introduction.

    Advanced Concept Description: The Research Problem.

    Future Supporting Communications Satellites Constellations.

    Integration of Components with Architecture.

    Example.

    Future Generation NASA Institute for Advanced Concepts, Space Wide Web Research, and Boundaries.

    Advanced Concept Development.

    The Research Approach.

    The Research Tasks.

    Conclusion.

    15. XML as a Unifying Framework for Inductive Databases.

    Introduction.

    Past Work.

    Extracting and Evaluating Association Rules.

    Classifying Data.

    Inductive Databases.

    PMML.

    The Proposed Data Model: XDM.

    Basic Concepts.

    Classification with XDM.

    Association Rules with XDM.

    Benefits of XDM.

    Toward Flexible and Open Systems.

    Related Work.

    Conclusion.

    16. Designing and Managing an XML Warehouse.

    Introduction.

    Why a View Mechanism for XML?

    Contributions.

    Outline.

    Architecture.

    Data Warehouse Specification.

    View Model for XML Documents.

    Graphic Tool for Data Warehouse Specification.

    Managing the Metadata.

    Data Warehouse.

    View Definition.

    Mediated Schema Definition.

    Storage and Management of the Data Warehouse.

    The Different Approaches to Storing XML Data.

    Mapping XML to Relational.

    View Storage.

    Extraction of Data.

    DAWAX: A Graphic Tool for the Specification and Management of a Data Warehouse.

    Data Warehouse Manager.

    The Different DAWAX Packages.

    Related Work.

    Query Languages for XML.

    Storing XML Data.

    Systems for XML Data Integration.

    Conclusion.

    V. PERFORMANCE AND BENCHMARKS.

    17. XML Management System Benchmarks.

    Introduction.

    Benchmark Specification.

    Benchmark Data Set.

    Benchmark Queries.

    Existing Benchmarks for XML.

    The XOO7 Benchmark.

    The XMach-1 Benchmark.

    The XMark Benchmark.

    Conclusion.

    18. The Michigan Benchmark: A Micro-Benchmark for XML Query Performance Diagnostics.

    Introduction.

    Related Work.

    Benchmark Data Set.

    A Discussion of the Data Characteristics.

    Schema of Benchmark Data.

    Generating the String Attributes and Element Content.

    Benchmark Queries.

    Selection.

    Value-Based Join.

    Pointer-Based Join.

    Aggregation.

    Updates.

    Using the Benchmark.

    Conclusion.

    19. A Comparison of Database Approaches for Storing XML Documents.

    Introduction.

    Data Models for XML Documents.

    The Nontyped DOM Implementation.

    The Typed DOM Implementation.

    Databases for Storing XML Documents.

    Relational Databases.

    Object-Oriented Databases.

    Directory Servers.

    Native XML Databases.

    Benchmarking Specification.

    Benchmarking a Relational Database.

    Benchmarking an Object-Oriented Database.

    Benchmarking a Directory Server.

    Benchmarking a Native XML Database.

    Test Results.

    Evaluation of Performance.

    Evaluation of Space.

    Conclusion.

    Related Work.

    Studies in Storing and Retrieving XML Documents.

    XML and Relational Databases.

    XML and Object-Relational Databases.

    XML and Object-Oriented Databases.

    XML and Directory Servers.

    Benchmarks for XML Databases.

    Guidelines for Benchmarking XML Databases.

    Summary.

    20. Performance Analysis between an XML-Enabled Database and a Native XML Database.

    Introduction.

    Related Work.

    Methodology.

    Database Design.

    Discussion.

    Experiment Result.

    Database Size.

    SQL Operations (Single Record).

    SQL Operations (Mass Records).

    Reporting.

    Conclusion.

    21. Conclusion.

    References.

    Contributors.

    Editors.

    Chapter 1: Information Modeling with XML.

    Chapter 2: Tamino: Software AG's Native XML Server.

    Chapter 3: eXist Native XML Database.

    Chapter 4: Embedded XML Databases.

    Chapter 5: IBM XML-Enabled Data Management Product Architecture and Technology.

    Chapter 6: Supporting XML in Oracle9i.

    Chapter 7: XML Support in Microsoft SQL Server 2000.

    Chapter 8: A Generic Architecture for Storing XML Documents in a Relational Database.

    Chapter 9: An Object-Relational Approach to Building a High-Performance XML Repository.

    Chapter 10: Knowledge Management in Bioinformatics.

    Chapter 11: Case Studies of XML Used with IBM DB2 Universal Database.

    Chapter 12: The Design and Implementation of an Engineering Data Management System Using XML and J2EE.

    Chapter 13: Geographical Data Interchange Using XML-Enabled Technology within the GIDB System.

    Chapter 14: Space Wide Web by Adapters in Distributed Systems Configuration from Reusable Components.

    Chapter 15: XML as a Unifying Framework for Inductive Databases.

    Chapter 16: Designing and Managing an XML Warehouse.

    Chapter 17: XML Management System Benchmarks.

    Chapter 18: The Michigan Benchmark: A Micro-Benchmark for XML Query Performance Diagnostics.

    Chapter 19: A Comparison of Database Approaches for Storing XML Documents.

    Chapter 20: Performance Analysis between an XML-Enabled Database and a Native XML Database.

    Index.
