EECS-6421A: Advanced Database Systems
York University
Fall 2017
Class Homepage
Instructor: Parke Godfrey
Office: #2050 Lassonde
Office Hours: Th 3–5pm
& by appointment / availability
Ph#: 416-736-2100 x66671
e-mail: godfrey@yorku.ca
 
Term: fall 2017
Time: Mo & We 4:00pm–5:30pm
Place: BC #228
Textbook: Ramakrishnan & Gerhke
Database Management Systems
Third Edition, 2003
McGraw Hill
ISBN: 0-07-246563-8
URL: http://www.cs.wisc.edu/~dbbook
Readings: Posted throughout term.
Class URL: http://www.cse.yorku.ca/course/6421/

Welcome to EECS-6421A, Advanced Database Systems, for fall term 2017.

Materials, instructions, and notices for the course will accumulate here over the semester.

 
  Class Bulletin Board

Course notices will accumulate here over the term.

Be certain to check here regularly. Refresh this page via your browser when you visit to ensure that you are not looking at an old, cached copy. Otherwise, you can miss the latest message.

  • Exam (PDF) is up.
    Due by 11:59pm Friday 15 December 2017.

  • Assignment #4, TFIDF, on MapReduce is posted.
    Due Monday 4 December 2017.

  • Readings F & G on NoSQL are up.

  • Readings D & E on query optimization are up.

  • Assignment #3 (PDF) is posted.
    Due Wednesday 8 November 2017.

  • 2017-10-11: Reading C posted, Bridging the Gap between OLAP and SQL.

    (We won't manage to be as aggressive with readings as I originally intended, as marked in the initial schedule.)

  • Assignment #2, Querying the StL Database, is posted.
    Due Wednesday 25 October 2017.

  • For Assignment #1, turn it in by e-mail in PDF format.

    • to: Parke Godfrey <godfrey@yorku.ca>
    • Subject: EECS-6421 Assignment #1

    Or hardcopy by hand. (If you must!)

  • Readings A & B are up. Check the schedule.

  • Assignment #1 PDF is posted.

    Due Wednesday 4 October 2017.

  • The first class is Monday 11 September. Welcome!

 
  Materials
Syllabus
  1. About the Course
  2. Grading Criteria & Course Requirements
  3. Schedule
  4. Policies
Lecture Notes
  1. Introduction
  2. Logic
    1. semantics
      1. logic primer & datalog (36 pages): PDF, PDF (2-per)
      2. prolog diversion (14 pages): PDF, PDF (2-per)
      3. datalog w/ negation (9 pages): PDF, PDF (2-per)
      4. well-founded & stable model semantics (8 pages): PDF, PDF (2-per)
    2. query languages:
      1. datalog examples: ...
      2. relational algebra & calculus
        • relational algebra [chapter 3]*: PDF
        • miscellaneous exercises: PDF, PDF (w/ answers)
          See #14 & #15 for relational algebra and calculus examples, respectively.
      3. SQL
        • the relational model [chapter 3]*, for review (34 pages): PDF, PDF (2-per)
          Includes basic constraints, DDL / data definition language (e.g., create table), & DML / data manipulation language (e.g., insert).
        • queries, constraints, & triggers [chapter 5]* (35 pages): PDF, PDF (2-per)
        • SQL examples: PDF, PDF (2-per)
  3. System
    1. RDBMS architecture chart (1 page): PDF
    2. the physical database
      1. overview of storage & indexes [chapter 8]* (32 pages): PDF, PDF (2-per)
      2. storing data: disks and files [chapter 9]* (29 pages): PDF, PDF (2-per)
      3. tree-structured indexing [chapter 10]* (26 pages): PDF, PDF (2-per)
      4. hash-based indexing [chapter 10]* (19 pages): PDF, PDF (2-per)
    3. query optimization
      1. external sort [chapter 13]* (19 pages): PDF
      2. evaluation of joins [chapter 14-A]* (16 pages): PDF
      3. evaluation of other operators [chapter 14-B]* (11 pages): PDF
      4. the query optimizer (System-R) [chapter 15]* (19 pages): PDF
      5. query planning class examples (4 pages): PDF & sketch-through (PDF)
  4. Beyond Relational (“NoSQL”)
    1. NoSQL
      1. Variety: NoSQL
      2. XML & XQuery: examples
      3. Graph Databases: RDF & SPARQL
    2. scaling out
  5. Directions
    1. information integration
    2. data visualization
    3. student topics
*Derived from the textbook slides by Ramakrishnan and Gehrke.
Readings (You have to authenticate to reach here. This is to respect copyright on materials.)
References
 
  Components

  • Assignments
    1. Assignment #1: PDF
      Due Wednesday 4 October 2017.
    2. Assignment #2: Querying the StL Database.
      Due Wednesday 25 October 2017.
    3. Assignment #3: PDF
      Due Wednesday 8 November 2017.
    4. Assignment #4: TFIDF
      (Worth 10%; counts for A-4 & A-5 in course outline.)
      Due Monday 4 December 2017.
  • Presentations
  • Reports
  • Exam
    • PDF
      Due by 11:59pm Friday 15 December 2017.
      Deliver to my mailbox in Lassonde #1012, to me in person, or scan and email to me.
      (If you need more time, arrange with me.)