Difference between revisions of "Course: Advanced Databases"
Jump to navigation
Jump to search
(11 intermediate revisions by the same user not shown) | |||
Line 9: | Line 9: | ||
== News == | == News == | ||
February 10th, 2014: | |||
* Wiki is now up-to-date | * Wiki is now up-to-date | ||
* Added research papers for reading assignment | * Added research papers for reading assignment | ||
Line 17: | Line 19: | ||
Here is the list of selected papers for the reading assignment: | Here is the list of selected papers for the reading assignment: | ||
# [http://gsl.azurewebsites.net/Portals/0/Users/dewitt/Papers/paralleldb/PDIS93.pdf | # [http://gsl.azurewebsites.net/Portals/0/Users/dewitt/Papers/paralleldb/PDIS93.pdf Nested loops revisited. D. J. DeWitt, J. F. Naughton, and J. Burger. 1993, January. In Proceedings of the Second International Conference on Parallel and Distributed Information Systems, (pp. 230-242).] | ||
# [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.6493&rep=rep1&type=pdf G. N. Paulley and Per-Åke Larson. 1994 | # [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.6493&rep=rep1&type=pdf Exploiting Uniqueness in Query Optimization. G. N. Paulley and Per-Åke Larson. 1994. In Proceedings of the Tenth International Conference on Data Engineering. IEEE Computer Society, Washington, DC, USA, 68-79.] (Assigned to Group 1) | ||
# [http:// | # [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.331.8616&rep=rep1&type=pdf Accelerating XPath location steps. Torsten Grust. Proceedings of the 2002 ACM SIGMOD international conference on Management of data.] | ||
# [http://www.vldb.org/conf/2003/papers/S11P03.pdf | # [http://www.vldb.org/conf/2003/papers/S11P03.pdf AQuery: query language for ordered data, optimization techniques, and experiments. A. Lerner and D. Shasha. In Proc. Int. Conf. on Very Large Data Bases (VLDB), pages 345–356, 2003.] | ||
# [http://homepages.inf.ed.ac.uk/libkin/papers/sigmod96b.pdf | # [http://homepages.inf.ed.ac.uk/libkin/papers/sigmod96b.pdf Algorithms for deferred view maintenance. Latha Colby, Timothy Griffin, Leonid Libkin, Inderpal Mumick and Howard Trickey. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'96), pages 469-480.] | ||
# [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.33.1999&rep=rep1&type=pdf | # [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.33.1999&rep=rep1&type=pdf Optimizing Queries with Materialized Views. Surajit Chaudhuri, Ravi Krishnamurthy, Spyros Potamianos, and Kyuseok Shim. Data Engineering 11 (1995): 190.] (Assigned to group 6). | ||
# [http://www.vldb.org/conf/2002/S17P02.pdf | # [http://www.vldb.org/conf/2002/S17P02.pdf Translating web data. L. Popa, Y. Velegrakis, M. A. Hernández, R. J. Miller, and R. Fagin. (In Proceedings of the 28th international conference on Very Large Data Bases (pp. 598-609). VLDB Endowment. August 2002. ] (Assigned Group 4). | ||
# [http://ilpubs.stanford.edu:8090/262/1/1997-49.pdf | # [http://ilpubs.stanford.edu:8090/262/1/1997-49.pdf Optimizing Queries across Diverse Data Sources. Laura M. Haas, Donald Kossmann, Edward L. Wimmers and Jun Yangy. Proceedings of the International Conference on Very Large Data Bases. Vol. 23. Morgan Kaufmann Pub, 1997.] (Assigned Group 3). | ||
# [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.34.9263&rep=rep1&type=pdf | # [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.34.9263&rep=rep1&type=pdf WebOQL: Restructuring documents, databases and webs. Gustavo O. Arocena, and Alberto O. Mendelzon. 14th International Conference on Data Engineering. IEEE, 1998.] (Assigned Group 2). | ||
# [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.42.1232&rep=rep1&type=pdf Peter Buneman, Susan B. Davidson, Kyle Hart, G. Christian Overton, and Limsoon Wong. 1995 | # [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.42.1232&rep=rep1&type=pdf A Data Transformation System for Biological Data Sources. Peter Buneman, Susan B. Davidson, Kyle Hart, G. Christian Overton, and Limsoon Wong. 1995. In Proceedings of the 21th International Conference on Very Large Data Bases (VLDB '95)] (Assigned Group 7.). | ||
# [http://www.ambuehler.ethz.ch/CDstore/www10/papers/pdf/p220.pdf Freire | # [http://www.ambuehler.ethz.ch/CDstore/www10/papers/pdf/p220.pdf WebViews: accessing personalized web content and services. Juliana Freire, Bharat Kumar, and Daniel Lieuwen. Proceedings of the 10th international conference on World Wide Web. ACM, 2001.] | ||
# [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.30.2620&rep=rep1&type=pdf Using schema matching to simplify heterogeneous data translation. Tova Milo and Sagit Zohar. VLDB. Vol. 98. 1998.] | |||
== Week 1: Tuesday | == Week 1: Tuesday February 4th - Course Overview == | ||
* Course overview and introduction | * Course overview and introduction | ||
Line 48: | Line 51: | ||
* [http://en.wikipedia.org/wiki/Enterprise_Information_Integration Enterprise Information Integration (Wikipedia)] | * [http://en.wikipedia.org/wiki/Enterprise_Information_Integration Enterprise Information Integration (Wikipedia)] | ||
== Week 2: Tuesday February | == Week 2: Tuesday February 11th - Query Compilation 1 == | ||
* | * Query Compilation 1. Indexing and Storage | ||
* Lecture notes: | * Lecture notes: | ||
=== Required Reading === | === Required Reading === | ||
Line 66: | Line 61: | ||
* [http://research.google.com/archive/mapreduce.html original google map-reduce paper] | * [http://research.google.com/archive/mapreduce.html original google map-reduce paper] | ||
== Week 3: | == Week 3: Tuesday February 18th - Query Compilation 2 == | ||
* Query Compilation and Rewriting |
Latest revision as of 17:27, 20 February 2014
NYU School of Engineering. CS6093: Spring 2014
Advanced Database Systems (CS6093) Syllabus for this semester: Syllabus (pdf)
This schedule is tentative and subject to change
Make sure to check my.poly.edu for course announcements
News
February 10th, 2014:
- Wiki is now up-to-date
- Added research papers for reading assignment
- Added slides for lecture 1 & 2
Reading Assignment
Here is the list of selected papers for the reading assignment:
- Nested loops revisited. D. J. DeWitt, J. F. Naughton, and J. Burger. 1993, January. In Proceedings of the Second International Conference on Parallel and Distributed Information Systems, (pp. 230-242).
- Exploiting Uniqueness in Query Optimization. G. N. Paulley and Per-Åke Larson. 1994. In Proceedings of the Tenth International Conference on Data Engineering. IEEE Computer Society, Washington, DC, USA, 68-79. (Assigned to Group 1)
- Accelerating XPath location steps. Torsten Grust. Proceedings of the 2002 ACM SIGMOD international conference on Management of data.
- AQuery: query language for ordered data, optimization techniques, and experiments. A. Lerner and D. Shasha. In Proc. Int. Conf. on Very Large Data Bases (VLDB), pages 345–356, 2003.
- Algorithms for deferred view maintenance. Latha Colby, Timothy Griffin, Leonid Libkin, Inderpal Mumick and Howard Trickey. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD'96), pages 469-480.
- Optimizing Queries with Materialized Views. Surajit Chaudhuri, Ravi Krishnamurthy, Spyros Potamianos, and Kyuseok Shim. Data Engineering 11 (1995): 190. (Assigned to group 6).
- Translating web data. L. Popa, Y. Velegrakis, M. A. Hernández, R. J. Miller, and R. Fagin. (In Proceedings of the 28th international conference on Very Large Data Bases (pp. 598-609). VLDB Endowment. August 2002. (Assigned Group 4).
- Optimizing Queries across Diverse Data Sources. Laura M. Haas, Donald Kossmann, Edward L. Wimmers and Jun Yangy. Proceedings of the International Conference on Very Large Data Bases. Vol. 23. Morgan Kaufmann Pub, 1997. (Assigned Group 3).
- WebOQL: Restructuring documents, databases and webs. Gustavo O. Arocena, and Alberto O. Mendelzon. 14th International Conference on Data Engineering. IEEE, 1998. (Assigned Group 2).
- A Data Transformation System for Biological Data Sources. Peter Buneman, Susan B. Davidson, Kyle Hart, G. Christian Overton, and Limsoon Wong. 1995. In Proceedings of the 21th International Conference on Very Large Data Bases (VLDB '95) (Assigned Group 7.).
- WebViews: accessing personalized web content and services. Juliana Freire, Bharat Kumar, and Daniel Lieuwen. Proceedings of the 10th international conference on World Wide Web. ACM, 2001.
- Using schema matching to simplify heterogeneous data translation. Tova Milo and Sagit Zohar. VLDB. Vol. 98. 1998.
Week 1: Tuesday February 4th - Course Overview
- Course overview and introduction
- Lecture notes: http://www.vistrails.org/images/ADB-Intro-Class1.pdf
- Student survey -- to be filled out today!
Textbooks
- Database Management Systems, by Raghu Ramakrishnan and Johannes Gehrke Database Management Systems
- Database Systems: The Complete Book, by Hector Garcia-Molina, Jeff Ullman, and Jennifer Widom, see the Database Systems: The Complete Book
- Guido Moerkotte's free book on query compilation and optimization: Query Compilers
- Principles of Data Integration by AnHai Doan, Alon Halevy, and Zachary Ives. Reference at: Principles of Data Integration
Additional References
- Graefe, Goetz. "Query evaluation techniques for large databases." ACM Computing Surveys (CSUR) 25.2 (1993): 73-169.. A classic database survey, and a must read for anyone serious about data processing.
- Data Integration (Wikipedia)
- Enterprise Information Integration (Wikipedia)
Week 2: Tuesday February 11th - Query Compilation 1
- Query Compilation 1. Indexing and Storage
- Lecture notes:
Required Reading
- Mining of Massive Datasets, Chapter 2
- Data-Intensive Text Processing with MapReduce, Chapter 2 and Chapter 3
- original google map-reduce paper
Week 3: Tuesday February 18th - Query Compilation 2
- Query Compilation and Rewriting