SKIP THE SHIPPING
Use code NOSHIP during checkout to save 40% on eligible eBooks, now through January 5. Shop now.
Register your product to gain access to bonus material or receive a coupon.
Offers a readable and engaging presentation.
Teaches about this important first step of the planning process.
Provides a more extensive treatment of query processing than other books on the market.
Gives students practical advice on integrating state-of-the-art technologies with databases.
Includes discussions of views, integrity constraints, assertions, triggers, transactions, authorization, and recursion in SQL:1999.
Reflects how these plans are developed in the real world.
Shows students how to build efficient database management systems.
Shows students how to fine tune database systems to improve performance.
Shows how to design complex database systems that can handle real-world business applications.
Exposes readers to cutting edge technology used in business applications.
Provides students with the opportunity to practice and apply the concepts they've learned in each chapter.
Hector Garcia-Molina, Jeffrey D. Ullman, and Jennifer Widom, well-known computer scientists at Stanford University, have written an introduction to database systems with a comprehensive approach. The first half of the book provides in-depth coverage of databases from the point of view of the database designer, user, and application programmer. It covers the latest database standards SQL-1999, SQL/PSM, SQL/CLI, JDBC, ODL, and XML, with broader coverage of SQL than most other texts. The second half of the book provides in-depth coverage of databases from the point of view of the DBMS implementor. It focuses on management, covering the principal techniques in these areas with broader coverage of query optimization than most other texts. Advanced topics include multidimensional and bitmap indexes, distributed transactions, and information integration techniques. This comprehensive book is valuable either as an academic textbook or as a professional reference book.
NOTEWORTHY FEATURES(NOTE: Each chapter concludes with Summary and References sections.)
1. The Worlds of Database Systems.
The Evolution of Database Systems. Overview of a Database Management System. Outline of Database-System Studies.
Elements of the E/R Model. Design Principles. The Modeling of Constraints. Weak Entity Sets.
Basics of the Relational Model. From E/R Diagrams to Relational Designs. Converting Subclass Structures to Relations. Functional Dependencies. Rules About Functional Dependencies. Design of Relational Database Schemas. Multivalued Dependencies.
Review of Object-Oriented Concepts. Introduction to ODL. Additional ODL Concepts. From ODL Designs to Relational Designs. The Object-Relational Model. Semistructured Data. XML and Its Data Model.
An Example Database Schema. An Algebra of Relational Operations. Relational Operations on Bags. Extended Operators of Relational Algebra. Constraints on Relations.
Simple Queries in SQL. Queries Involving More Than One Relation. Subqueries. Full-Relation Operations. Database Modifications. Defining a Relation Schema in SQL. View Definitions.
Keys and Foreign Keys. Constraints on Attributes and Tuples. Modification of Constraints. Schema-Level Constraints and Triggers.
SQL in a Programming Environment. Procedures Stored in the Schema. The SQL Environment. Using a Call-Level Interface. Java Database Connectivity. Transactions in SQL. Security and User Authorization in SQL.
Introduction to OQL. Additional Forms of OQL Expressions. Object Assignment and Creation in OQL. User-Defined Types in SQL. Operations on Object-Relational Data.
A Logic for Relations. From Relational Algebra to Datalog. Recursive Programming in Datalog. Recursion in SQL.
The “Megatron 2002” Database System. The Memory Hierarchy. Disks. Using Secondary Storage Effectively. Accelerating Access to Secondary Storage. Disk Failures. Recovery from Disk Crashes.
Data Elements and Fields. Records. Representing Block and Record Addresses. Variable-Length Data and Records. Record Modifications.
Indexes on Sequential Files. Secondary Indexes. B-Trees. Hash Tables.
Applications Needing Multiple Dimensions. Hash-Like Structures for Multidimensional Data. Tree-Like Structures for Multidimensional Data. Bitmap Indexes.
Introduction to Physical-Query-Plan Operators. One-Pass Algorithms for Database Operations. Nested-Loop Joins. Two-Pass Algorithms Based on Sorting. Two-Pass Algorithms Based on Hashing. Index-Based Algorithms. Buffer Management. Algorithms Using More Than Two Passes. Parallel Algorithms for Relational Operations.
Parsing. Algebraic Laws for Improving Query Plans. From Parse Trees to Logical Query Plans. Estimating the Cost of Operations. Introduction to Cost-Based Plan Selection. Choosing an Order for Joins. Completing the Physical-Query-Plan.
Issues and Models for Resilient Operation. Undo Logging. Redo Logging. Undo/Redo Logging. Protecting Against Media Failures.
Serial and Serializable Schedules. Conflict-Serializability. Enforcing Serializability by Locks. Locking Systems With Several Lock Modes. An Architecture for a Locking Scheduler. Managing Hierarchies of Database Elements. The Tree Protocol. Concurrency Control by Timestamps. Concurrency Control by Validation.
Serializability and Recoverability. View Serializability. Resolving Deadlocks. Distributed Databases. Distributed Commit. Distributed Locking. Long-Duration Transactions.
Modes of Information Integration. Wrappers in Mediator-Based Systems. Capability-Based Optimization in Mediators. On-Line Analytic Processing. Data Cubes. Data Mining.
At Stanford, we are on the quarter system, and as a result, our introductory database instruction is divided into two courses. The first, CS145, is designed for students who will use database systems but not necessarily take a job implementing a DBMS. It is a prerequisite for CS245, which is the introduction to DBMS implementation. Students wishing to go further in the database field then take CS345 (theory), CS346 (DBMS implementation project), and CS347 (transaction processing and distributed databases).
Starting in 1997, we published a pair of books. <I>A First Course in Database Systems</I> was designed for CS145, and <I>Database System Implementation</I> was for CS245 and parts of CS346. Because many schools are on the semester system or combine the two kinds of database instruction into one introductory course, we felt that there was a need to produce the two books as a single volume. At the same time, the evolution of database systems has made a number of new topics imperative for a modern course. Thus, we have added, mostly to the application-programming area, topics such as object-relational data, SQL/PSM (stored programs), SQL/CLI (the emerging standard for the C/SQL interface), and JDBC (the same for Java/SQL).
Use of the BookWe recommend that two quarters be devoted to the material in this book. If you follow the Stanford approach, you would cover the first ten chapters in the first quarter and the last ten in the second quarter. Should you wish to cover the material in a single semester, then there will have to be some omitted portions. In general, we suggest that Chapters 2-7, 11-13, and 17-18 should be given highest priority, but there are pieces from each of these chapters that can be skipped.
If, as we do in CS145, you give students a substantial database-application design and implementation project, then you may have to reorder the material somewhat, so that SQL instruction occurs earlier in the Book. You may wish to defer material such as dependencies, although students need normalization for design.
PrerequisitesWe have used the book at the "mezzanine" level, in courses taken both by undergraduates and beginning graduate students. The formal prerequisites for the courses are Sophomore-level treatments of: (1) Data structures, algorithms, and discrete math, and (2) Software systems, software engineering, and programming languages. Of this material, it is important that students have at least a rudimentary understanding of such topics as: algebraic expressions and laws, logic, basic data structures such as search trees and graphs, object-oriented programming concepts, and programming environments. However, we believe that adequate background is surely acquired by the end of the Junior year in a. typical computer science program.
ExercisesThe book contains extensive exercises, with some for almost every section. We indicate harder exercises or parts of exercises with an exclamation point. The hardest exercises have a double exclamation point.
Some of the exercises or parts are marked with a star. For these exercises, we shall endeavor to maintain solutions accessible through the book's web page. These solutions are publicly available and should be used for self-testing. Note that in a few cases, one exercise B asks for modification or adaptation of your solution to another exercise A. If certain parts of A have solutions, then you should expect the corresponding parts of B to have solutions as well.
Support on the World Wide Web The book's home page is
http://www-db.stanford.edu/~ullman/dscb.html
Here are solutions to starred exercises, errata as we learn of them, and backup materials. We are making available the notes for each offering of CS145 and CS245 as we teach them, including homeworks, projects and exams.