CS346 Advanced Databases
CS346 15 CATS (7.5 ECTS) Term 1
Availability
Option - CS, CSE, CBS, DM, DS
Prerequisites
CS258
Academic Aims
The aim of the module is to introduce students to current techniques, methods and results from the active field of database systems and data management. Typical topics include query planning and optimization; transaction processing and concurrency control; big data management; data warehousing and OLAP; theory of databases.
Learning Outcomes
By the end of the module the student should be able to:
- Demonstrate understanding of issues surrounding concurrency control and parallelism in data management.
- Express queries in different forms (relational algebra, SQL, etc).
- Devise appropriate ways to store and index data.
- Show understanding of modern data processing paradigms such as NoSQL and MapReduce/PigLatin
- Explain methods suitable for particular types of data such as temporal, multimedia or spatial data.
Content
The topics will be drawn from core conceptual topics in advanced databases, and current ideas in database systems. This will be drawn from:
- Refresher on databases and modelling
- Relational algebra, tuple relational calculus, SQL, and equivalences between them
- Query planning, evaluation and optimization
- Transaction processing, concurrency, ACID rules, OLTP
- Online analytical processing (OLAP), data warehouses
- Data storage and indexing, B-trees and hashing
- NoSQL to relax ACID rules; consistency, availability, partition tolerance
- Database security and privacy, including anonymisation and release
- Parallel databases, hardware and software
- Big data, MapReduce, Pig Latin
- Special purpose databases, e.g. temporal, spatial, or multimedia databases
Books
- Korth and Silberschatz, Database System Concepts (6th ed.), McGraw-Hill, 1991.
- Elmasri and Navathe, Fundamentals of Database Systems (6th ed.)
Assessment
Two-hour examination (70%), coursework (30%)
Teaching
30 one-hour lectures plus 8 one-hour seminars