Principles of Distributed Database Systems

The fourth edition of this classic textbook provides major updates. This edition has completely new chapters on Big Data Platforms (distributed storage systems, MapReduce, Spark, data stream processing, graph analytics) and on NoSQL, NewSQL and polystore systems. It also includes an updated web data management chapter that includes RDF and semantic web discussion, an integrated database integration chapter focusing both on schema integration and querying over these systems. The peer-to-peer computing chapter has been updated with a discussion of blockchains. The chapters that describe classical distributed and parallel database technology have all been updated.

The new edition covers the breadth and depth of the field from a modern viewpoint. Graduate students, as well as senior undergraduate students studying computer science and other related fields will use this book as a primary textbook. Researchers working in computer science will also find this textbook useful.

This textbook has a companion web site that includes background information on relational database fundamentals, query processing, transaction management, and computer networks for those who might need this background. The web site also includes all the figures and presentation slides as well as solutions to exercises (restricted to instructors).

Principles of Distributed Database Systems

89.99 In Stock

Principles of Distributed Database Systems

Add to Wishlist

Principles of Distributed Database Systems

Hardcover(Fourth Edition 2020)

$89.99

Hardcover(Fourth Edition 2020)
$89.99

SHIP THIS ITEM

In stock. Ships in 1-2 days.
PICK UP IN STORE

Your local store may have stock of this item.

Available within 2 business hours

Want it Today?
Check Store Availability

Related collections and offers

Overview

Product Details

ISBN-13:	9783030262525
Publisher:	Springer International Publishing
Publication date:	12/03/2019
Edition description:	Fourth Edition 2020
Pages:	674
Product dimensions:	6.10(w) x 9.25(h) x (d)

About the Author

M. Tamer Özsu is a University Professor at Cheriton School of Computer Science at University of Waterloo, Canada. He has been conducting research in distributed data management for thirty years. He is a Fellow of the Royal Society of Canada, American Association for the Advancement of Science (AAAS), Association for Computing Machinery (ACM) and Institute of Electrical and Electronics Engineers (IEEE). He is an elected member of the Science Academy, Turkey, and a member of Sigma Xi. He has received the CS-Can/Info-Can (Canadian Computer Science Society) Lifetime Achievement Award in 2019, ACM SIGMOD Test-of-Time Award in 2015, ACM SIGMOD Contributions Award in 2008 and the Ohio State University College of Engineering Distinguished Alumnus Award in 2008 and has two best paper awards and one honourable mention for his publications. He serves on the editorial boards of many journals and book series, and is also the co-editor-in-chief, with Ling Liu, of the Encyclopedia of Database Systems.

Patrick Valduriez is a senior scientist at Inria, France. He has also been a professor of computer science at University Pierre et Marie Curie (UPMC) in Paris (2000-2002) and a researcher at Microelectronics and Computer Technology Corp. in Austin, Texas (1985-1989). Since 2019, he is the scientific advisor of the LeanXcale startup.
He is currently the head of the Zenith team (between Inria and University of Montpellier, LIRMM) that focuses on data science, in particular data management in large-scale distributed and parallel systems and scientific data management. He currently serves as associate editor of several journals, including the VLDB Journal, Distributed and Parallel Databases, and Internet and Databases. He has served as PC chair of major conferences such as SIGMOD and VLDB. He was the general chair of SIGMOD 2004, EDBT 2008 and VLDB 2009 Conferences. He obtained several best paper awards, including at VLDB 2000.He was the recipient of the 1993 IBM scientific prize in Computer Science in France and the 2014 Innovation Award from Inria – French Academy of Science – Dassault Systems. He is an ACM Fellow.

Read an Excerpt

PREFACE: PREFACE TO THE SECOND EDITION
Many things have changed since the publication of the first edition of this book in 1991. At the time, we reported projections that, by 1998, centralized database managers (DBMSs) would be an \antique curiosity" and most organizations would move towards distributed database managers. Distribution was slowly starting and "client/server" had just started to enter our daily jargon. These systems were generally multiple client/single server systems in which the distribution was mostly in terms of functionality, not data. If multiple servers were used, clients were responsible for managing the connections to these servers. Thus, transparency of access was not widely supported, and each client had to "know" the location of the required data. The distribution of data among multiple servers was very primitive; systems did not support fragmentation or replication of data. Systems of the time were \homogeneous" in that each system could manage only data that were stored in its own database, with no linkage to other repositories.

Things have changed dramatically since then. Many vendors are much closer to achieving true distribution in their development cycle. Client/server systems remain the preferred solution in many cases, but they are much more sophisticated. For example, today's client/server systems provide signicant transparency in accessing data from multiple servers, support distributed transactions to facilitate transparency, and execute queries over (horizontally) fragmented data. Further, new systems implement both synchronous and asynchronous replication protocols, and many vendors have introduced gateways to accessother databases. In addition, significant achievements have taken place in the development and deployment of parallel database servers. Object database managers have entered the marketplace and have found a niche market in some classes of applications which are inherently distributed.

In parallel with these developments in the database system front, there have been phenomenal changes in the computer networking infrastructure that supports these systems. The relatively slow (10Mbit/sec) Ethernet has been replaced as the de facto local area network standard by much faster networks (FDDI or switched Ethernet) operating at around 100Mbit/sec, and broadband networks (particularly the ATM technology) have been deployed for both local area and wide area networking. These networks, coupled with very low overhead networking protocols, such as SCI, reduce the differences between local area and wide area networks (other than latency considerations) and potentially eliminate the network as the major performance bottleneck. This, in turn, requires us to review our system development assumptions and performance tuning criteria. Use of the Internet which is basically a heterogeneous network with links of varying capacities and capabilities has exploded.

There is clearly a technology push/application pull in effect with respect to distributed DBMS development: new applications are requiring changes in DBMS capabilities, and new technological developments are making these changes possible. With these developments, it was time to prepare a revised second edition of the book. In the process, we have retained the fundamental characteristics and key features of the book as outlined in the Preface to the first edition. However, the material has been heavily edited. Every chapter has been revised some in fundamental ways, others more superficially. The major changes are the following:

1. The query processing/optimization chapters (Chapters 7{9) have been revised to focus on the techniques employed in commercial systems. New algorithms, such as randomized search strategies, are now included.

2. The transaction management chapters (Chapters 10{12) now include material on advanced transaction models and work flows.

3. Chapter 13, which focused on the relationship of distributed DBMSs and distributed operating systems, has been dropped and some of the material is incorporated into the relevant chapters.

4. The first edition contained a chapter (Chapter 15) which discussed current issues at the time parallel DBMSs, distributed knowledge-base systems (mainly deductive DBMSs), and distributed object DBMSs. In the intervening years, two of these topics have matured and become major forces in their own rights, while the third (deductive databases) has not achieved the same prominence. In this edition, we devote full chapters to parallel DBMSs (Chapter 13) and distributed object DBMSs (Chapter 14), and have dropped deductive DBMSs.

5. Following the same approach, we introduce a new chapter devoted to current issues (Chapter 16). This chapter now includes sections on data warehousing (from a distributed data management perspective), World Wide Web and databases, push-based technologies, and mobile DBMSs.

6. The chapter on multidatabase systems (Chapter 15 in the current edition) has been revised to include a discussion of general interoperability issues and distributed object platforms such as OMA/CORBA and DCOM/OLE.

We are quite satisfied with the result, which represents a compromise between our desire to address new and emerging issues, and maintain the main characteristic of the book in addressing the principles of distributed data management. Certain chapters, in particular Chapters 15 and 16, require further depth, but those will be topics of future editions.

The guide to reading the book, introduced in the Preface to the first edition, is still valid in general terms. However, we now discuss, in Chapter 3, the relationship between distributed DBMSs and the new networking technologies. Thus, this chapter no longer serves simply as background and should be read (at least the relevant sections) following Chapter 1.

We have set up a Web site to communicate with our readers. The site is at ...

1 Introduction.- 2 Distributed and Parallel Database Design.- 3 Distributed Data Control.- 4 Distributed Query Processing.- 5 Distributed Transaction Processing.- 6 Data Replication.- 7 Database Integration – Multidatabase Systems.- 8 Parallel Database Systems.- 9 Peer-to-Peer Data Management.- 10 Big Data Processing.- 11 NoSQL, NewSQL and Polystores.- 12 Web Data Management.- Appendices.- A Overview of Relational DBMS.- B Centralized Query Processing.- C Transaction Processing Fundamentals.- D Review of Computer Networks.- References.- Index.

Preface

PREFACE: PREFACE TO THE SECOND EDITION
Many things have changed since the publication of the first edition of this book in 1991. At the time, we reported projections that, by 1998, centralized database managers (DBMSs) would be an \\antique curiosity" and most organizations would move towards distributed database managers. Distribution was slowly starting and "client/server" had just started to enter our daily jargon. These systems were generally multiple client/single server systems in which the distribution was mostly in terms of functionality, not data. If multiple servers were used, clients were responsible for managing the connections to these servers. Thus, transparency of access was not widely supported, and each client had to "know" the location of the required data. The distribution of data among multiple servers was very primitive; systems did not support fragmentation or replication of data. Systems of the time were \\homogeneous" in that each system could manage only data that were stored in its own database, with no linkage to other repositories.

Things have changed dramatically since then. Many vendors are much closer to achieving true distribution in their development cycle. Client/server systems remain the preferred solution in many cases, but they are much more sophisticated. For example, today's client/server systems provide signicant transparency in accessing data from multiple servers, support distributed transactions to facilitate transparency, and execute queries over (horizontally) fragmented data. Further, new systems implement both synchronous and asynchronous replication protocols, and many vendors have introduced gateways toaccessother databases. In addition, significant achievements have taken place in the development and deployment of parallel database servers. Object database managers have entered the marketplace and have found a niche market in some classes of applications which are inherently distributed.

In parallel with these developments in the database system front, there have been phenomenal changes in the computer networking infrastructure that supports these systems. The relatively slow (10Mbit/sec) Ethernet has been replaced as the de facto local area network standard by much faster networks (FDDI or switched Ethernet) operating at around 100Mbit/sec, and broadband networks (particularly the ATM technology) have been deployed for both local area and wide area networking. These networks, coupled with very low overhead networking protocols, such as SCI, reduce the differences between local area and wide area networks (other than latency considerations) and potentially eliminate the network as the major performance bottleneck. This, in turn, requires us to review our system development assumptions and performance tuning criteria. Use of the Internet which is basically a heterogeneous network with links of varying capacities and capabilities has exploded.

There is clearly a technology push/application pull in effect with respect to distributed DBMS development: new applications are requiring changes in DBMS capabilities, and new technological developments are making these changes possible. With these developments, it was time to prepare a revised second edition of the book. In the process, we have retained the fundamental characteristics and key features of the book as outlined in the Preface to the first edition. However, the material has been heavily edited. Every chapter has been revised some in fundamental ways, others more superficially. The major changes are the following:

1. The query processing/optimization chapters (Chapters 7{9) have been revised to focus on the techniques employed in commercial systems. New algorithms, such as randomized search strategies, are now included.

2. The transaction management chapters (Chapters 10{12) now include material on advanced transaction models and work flows.

3. Chapter 13, which focused on the relationship of distributed DBMSs and distributed operating systems, has been dropped and some of the material is incorporated into the relevant chapters.

4. The first edition contained a chapter (Chapter 15) which discussed current issues at the time parallel DBMSs, distributed knowledge-base systems (mainly deductive DBMSs), and distributed object DBMSs. In the intervening years, two of these topics have matured and become major forces in their own rights, while the third (deductive databases) has not achieved the same prominence. In this edition, we devote full chapters to parallel DBMSs (Chapter 13) and distributed object DBMSs (Chapter 14), and have dropped deductive DBMSs.

5. Following the same approach, we introduce a new chapter devoted to current issues (Chapter 16). This chapter now includes sections on data warehousing (from a distributed data management perspective), World Wide Web and databases, push-based technologies, and mobile DBMSs.

6. The chapter on multidatabase systems (Chapter 15 in the current edition) has been revised to include a discussion of general interoperability issues and distributed object platforms such as OMA/CORBA and DCOM/OLE.

We are quite satisfied with the result, which represents a compromise between our desire to address new and emerging issues, and maintain the main characteristic of the book in addressing the principles of distributed data management. Certain chapters, in particular Chapters 15 and 16, require further depth, but those will be topics of future editions.

The guide to reading the book, introduced in the Preface to the first edition, is still valid in general terms. However, we now discuss, in Chapter 3, the relationship between distributed DBMSs and the new networking technologies. Thus, this chapter no longer serves simply as background and should be read (at least the relevant sections) following Chapter 1.

We have set up a Web site to communicate with our readers. The site is at ...

From the B&N Reads Blog

Page 1 of

Editorial Reviews

Introduces and explains the theory, algorithms, and methods that underlie distributed DBMS, emphasizing the principles that guide the design of such systems more than their use. Useful as a text for a one- or two-semester graduate-level course. The bibliography is extensive. Annotation c. Book News, Inc., Portland, OR (booknews.com)

Booknews

From the reviews of the third edition:

“This is an excellent book that provides an in-depth overview of all issues related to distributed data management. … Each chapter … ends with a conclusion that provides a nice summary and additional reflections, as well as bibliographic notes. … The book ends with an extensive list of references and an index. … Instructor's of advanced database courses could use this book as a textbook. It would also interest researchers on topics related to distributed data management. I highly recommend this excellent book.” (Sergio Ilarri, ACM Computing Reviews, August, 2011)

From the Publisher

Principles of Distributed Database Systems

Principles of Distributed Database Systems

Hardcover(Fourth Edition 2020)

Hardcover(Fourth Edition 2020)

Related collections and offers

Overview

Product Details

About the Author

Read an Excerpt

Table of Contents

Preface

Customer Reviews

Related collections and offers

Overview

Product Details

About the Author

Read an Excerpt

Table of Contents

Preface

Related Subjects

Customer Reviews