CS346 Lecture Notes
Review of Metadata Management,
Overview of SM and EX Component

Example relation: Student(ID, name, major, GPA)

Example query:

   Select ID, name, major
   From   Student
   Where  GPA < 3.0

Query validity

System must check:

Student relation exists
Student relation has attributes ID, name, major, GPA
GPA has type compatible with float

Query plan

   1. open file for Student           // what is file name?
   2. scan records                    // index on GPA?
         if GPA < 3.0                 // where is GPA in record?
         then fetch ID, name, major   // where are ID, name, major in record?
   3. close file for Student

Metadata

All information about the data needed to process queries and updates over it

Metadata must be persistent, reliable, multi-user, etc., exactly like regular data
=> Store in database

Question: Is it a paradox to store (meta-)data about the data as data in the database?

Question: Can we use metadata to describe itself?

Terminology:

   metadata
   ~= data dictionary
   ~= schema information
   ~= system catalogs
   ~= system relations

Some of these terms are narrower -- "metadata" usually includes statistics, indexes, etc.

System structure of a real DBMS (simplified)

Query compiler touches metadata, no data

Query executor touches data, no metadata

RedBase SM component

RedBase metadata

Two catalogs: relcat and attrcat (more for certain extensions)

relcat: one tuple per relation (see SM document for details)

attrcat: one tuple per attribute per relation (see SM document for details)

Question: Can you store tuples for relcat and attrcat in relcat and attrcat?

Question: Is it useful to put indexes on catalogs?

Command-line utilities

Stand-alone programs

dbcreate <DBname> (can have multiple databases)
- create subdirectory
- create initial relcat & attrcat for this database
dbdestroy <DBname>
- delete subdirectory
redbase <DBname>
- initialize system: create XX_Managers
- call SM_Manager::OpenDb (cd DBname, open relcat & attrcat)
- call RBparse -- command loop
- call SM_Manager::CloseDb (close relcat & attrcat)

DDL commands

Similar to SQL syntax

One SM_Manager method per command

create table
- update system catalogs
- create file for relation
drop table
- destroy relation file
- destroy index files
- delete info from system catalogs
create index
- update system catalog
- create index
- build index for current relation contents
drop index
- update system catalog
- destroy index

System utilities

load
File format:
```
A1, A2, ..., An \n
A1, A2, ..., An \n
```
with attributes in same order as Create Table
for each tuple:
- create record in file
- create index entries
help [<relname>]
Without <relname> -- print list of all relation names (Question: How?)
With <relname> -- print attribute info for relation (Question: How?)
print <relname>
Scan relation, print all tuples
set <param> = <value>
For your convenience only, for example:
set tracing = "0"
set result = "verbose"

Read document carefully, lots of things not covered:

The command parser
Forcing catalog info
Restrictions on catalogs
Case sensitivity
Printing package and requirements
Statistics commands
Error handling
Many other details...

EX Component

Proposal due Wednesday May 11
Strongly encouraged to attend office hours to discuss
EX should be approximately the same amount of work as RM
Must decide if willing to:
- Modify parser
- Modify PF component
In proposal:
- Exhibit knowledge of RedBase
- Exhibit knowledge of DBMS's
- Have good idea for extension
- Exhibit that you've thought it out carefully
- Present everything clearly in 3-5 pages

CS346 Lecture Notes Review of Metadata Management, Overview of SM and EX Component