| Database Systems: The Complete
Book
|
Welcome to the home page for Database Systems: The Complete Book
(DS:CB),
by Hector
Garcia-Molina, Jeff Ullman,
and Jennifer Widom.
The second edition of this book was published on June 9, 2008.
Some material on this page is also relevant to A
First Course in Database Systems, 3nd Edition.
The Gradiance contract with Pearson (Addison-Wesley + Prentice-Hall) has terminated, and
we have decided to turn Gradiance into a FREE service.
If you are an instructor who wants to use the system, start
by creating an account for yourself at www.gradiance.com/services
NOT at the Pearson site.
Note: passwords are >= 10 letters+digits,
with at least one of each. Also, we cannot make an account be an
instructor account for a book if the same account has registered
as a student for a course using the same materials.
Then, email your chosen login,
with the book whose materials you want, to support@gradiance.com
We'll enable you to create a class using those materials.
There are manuals at www.gradiance.com/info.html
that should enable
you to use the system without problems, but feel free to email
support@gradiance.com if you encounter difficulties.
In addition, we have created eleven free "omnibus classes" covering Databases,
Automata, Compilers, Operating Systems, Introductory Java, Data Structures,
and Data Mining. Students wishing to join either one of these classes
will find the Student Directions useful.
Index
About the Book
Table of Contents and Sample Chapters
New Features
-
We begin with a discussion of relational design, rather than E/R design.
There are a number of technical improvements to the discussion of functional
and multivalued dependencies, including the synthesis algorithm for 3NF and
the use of the "chase" algorithm to make inferences of dependencies.
-
We have added coverage of high-level design using UML.
-
Coverage of index selection and use of materialized views has been added.
-
There is new coverage of the database aspects of PHP.
-
There are two new chapters on XML-related material, including XML Schema,
XPath, XQuery, and XSLT.
-
Coverage of information integration has been expanded. There is new coverage of
algorithms for processing mediator queries. Also added is a treatment of
"local-as-view" mediation and the techniques necessary to process queries in this setting.
-
There is greatly expanded coverage of data mining for large-scale data.
Improvements on the a-priori algorithm for association rules are explained.
We also added material on similarity search (shingling, minhashing,
and locality-sensitive hashing) and clustering of large-scale, high-dimensional data.
-
The PageRank algorithm is explained, along with generalizations to topic-specific PageRank
and TrustRank for detecting link spam.
-
Entity resolution (merging of similar records) is covered.
-
Material on peer-to-peer databases has been added, including an algorithm for distributed hashing.
Support Materials Available
The materials below are
available for use by others.
Instructors are welcome to use them in their own
courses, download them to their own class' web
site, or modify them to suit.
However, you must acknowledge the source of the original and not
attempt to place your own copyright on this material.
Note: If you are creating your own materials for a course based
on the book and would like to share them
with the world, we would be happy to create a link to them.
Projects
Solutions to Exercises
Because the Gradiance (GOAL) automatic homework system is now available, we are not
updating the on-line solutions. Prentice-Hall is preparing a complete solution
manual available to instructors only.
On-line solutions to selected
exercises from the first edition of DS:CB.
Errata
Our list is growing!
Send us a correction to ullman aT gmail DoT com
and see yourself acknowledged on the errata page.
Slides and Lecture Notes
-
Jeff's Slides for CS145, Fall,
2007.
-
Jennifer's Notes
for CS145, Spring, 2006.
-
Jeff's Slides for CS145, Fall,
2004.
-
Jeff's Slides for CS145, Fall,
2002.
These are available in Powerpoint (with voiceover), or in GIF without
the voiceover.
-
Jennifer's Notes
for CS145, Spring, 2002.
- Hector's
Slides for CS245, Winter, 2002.
-
Jeff's Slides for CS145, Fall, 2001.
They are available on-line as Postscript and PDF only.
-
Slides in PowerPoint and other formats, by Arthur Keller, similar to
the above.
- Jeff's Slides for CS245A,
Winter, 1998.
Documentation for Oracle DMBS
The CS145 Oracle Guide.
Handouts, Homeworks, and Exams
CS145
This material is based approximately on Ch. 1-12 of DS:CB, or the entire
book FCDB.
Usually, Jeff teaches CS145 in the Fall and Jennifer in the Spring.
-
Fall,
1995.
-
Fall,
1996.
-
Spring,
1997.
-
Fall, 1997.
-
Spring, 1998.
-
Fall, 1998.
-
Spring 1999 (Taught by Jun Yang).
-
Fall, 1999.
-
Spring, 2000.
-
Fall, 2000.
-
Spring, 2001.
-
Fall, 2001.
-
Spring, 2002.
-
Fall, 2002.
-
Spring, 2003.
-
Fall, 2003.
-
Spring, 2004.
-
Fall, 2004.
-
Fall, 2006.
-
Fall, 2007.
CS245
This material is based approximately on Ch. 13-21 of DS:CB.
CS345A
CS345A is occasionally taught as Web-mining. When it is, Chapters 22 and 23
of DS:CB are covered, along with other material. You can find lecture notes
for past editions of CS345A, taught by Anand Rajaraman
and Jeff Ullman, in Data-Mining Notes.
CS346
CS346 is the database project course.
It covers query optimization, as in Ch. 15-16, in more detail than does
CS245.
Materials from First Edition Deleted in Second
-
Object-Oriented Query Languages (old Sections 4.1, 9.1, 9.2, and 9.3).
-
Recursive Datalog (old Section 10.3).
-
View Serializability (old Section 19.2).
Also Check Out
Ordering Information