Difference between revisions of "CS6093/Lectures"

Line 113: Line 113:
* [http://infolab.stanford.edu/~usriv/papers/pig-latin.pdf Pig Latin: a not-so-foreign language for data processing]. C Olston, B Reed, U Srivastava, R Kumar, A Tomkins. SIGMOD 2008.
** Presenters: Dmitriy Gromov, Xiang Liu, Yuan Ding
**Rebuttal:  
**Rebuttal: Nivan Ferreira,  Shoshana Gottesman


=== Additional suggested reading ===
Line 139: Line 139:


* [http://cs-www.cs.yale.edu/homes/dna/papers/split-execution-hadoopdb.pdf Efficient Processing of Data Warehousing Queries in a Split Execution Environment.] Bajda-Pawlikowski et al., SIGMOD 2011
** Presenters: Julie Odongo, Majed Hakami
** Rebuttal:  
** Rebuttal: Fernando Seabra, Dmitriy Gromov


For additional suggested readings, see http://www.vistrails.org/index.php?title=CS6093/Selected_Papers_and_Topics
Line 164: Line 165:
* [http://portal.acm.org/citation.cfm?id=1132863.1132872&coll=GUIDE&dl=GUIDE Automatic complex schema matching across Web query interfaces] Bin He, Kevin Chuan Chang, ACM Trans. Database Syst. 2006
** Presenters: Joe Miller, Vineet Meghani
** Rebuttal:
** Rebuttal: Yuan Ding,  Chunqing Jiang


=== Additional Reading ===
Line 182: Line 183:
* [http://vgc.poly.edu/~juliana/courses/cs6093/Readings/bizer-web-sem2009..pdf DBpedia - A crystallization point for the Web of Data] Bizer et al., Web Semantics 2009.
** Presenters: Sergey Nepomnyachiy, Shoshana Gottesman, Haibo Zeng
** Rebuttal:
** Rebuttal: Wei Jiang,  Maneli Kadkhodazadeh


=== Additional Reading ===
Line 203: Line 204:
* [http://turing.cs.washington.edu/papers/kdd08.pdf Information Extraction From Wikipedia:  Moving Down the Long Tail] Fei Wu, Raphael Hoffmann, Daniel S. Weld
** Presenters: Chunqing Jiang, Bhaktavatsalam Nallanthighal,  Sameer More
** Rebuttal:  
** Rebuttal: Xiang Liu, May Thazin, Haibo Zeng


=== Additional Reading ===
Line 222: Line 223:
* [http://vgc.poly.edu/wiki/vgc/index.php/File:TrackingTrends.pdf Tracking Trends: Incorporating Term Volume into Temporal Topic Models.] KDD 2011
** Presenters:  Maneli Kadkhodazadeh, Wei Jiang
** Rebuttal:  
** Rebuttal: Sameer More, Bhaktavatsalam Nallanthighal, Julie Odongo


=== Additional reading ===
Line 242: Line 243:
* [http://www.vldb.org/conf/2002/S33P11.pdf BANKS: Browsing and Keyword Searching in Relational Databases] Aditya et al., VLDB 2002
** Presenters:  May Thazin,  Tehila Minkus
** Rebuttal:
** Rebuttal: Vineet Meghani,  Tehila Minkus


== Week 15 - May 1 ==
Project presentation

Revision as of 22:27, 13 February 2012

Make sure to check my.poly.edu for course announcements

Every week, you must write position papers for the papers in the Required Readings list

Week 1 - Jan 24

  • Course overview (First day of classes!)

http://vgc.poly.edu/~juliana/courses/cs6093/Lectures/lecture1.pdf

  • Provenance and Workflows

http://vgc.poly.edu/~juliana/courses/cs6093/Lectures/provenance-workflows.pdf

Readings

  • Querying and Creating Visualizations by Analogy. Carlos E. Scheidegger, Huy T. Vo, David Koop, Juliana Freire and Claudio T. Silva. IEEE Transactions on Visualization and Computer Graphics, 13(6), pp. 1560-1567, 2007. Best paper in IEEE Visualization 2007.

Week 2 - Jan 31

  • Provenance and Workflows (cont.)

http://vgc.poly.edu/~juliana/courses/cs6093/Lectures/provenance-workflows.pdf

  • Discussion about literature search

Readings

Same as last week

Week 3 - Feb 7

  • Information extraction: survey

http://vgc.poly.edu/~juliana/courses/cs6093/Lectures/information-extraction.pdf

Announcements

  • The topic winners were: Information Extraction, Deep Web, Relational Data on the Web, Web Schema Matching, NoSQL DB, Provenance in DB, Graph Indexing, Usable query interfaces
  • I will email you the preliminary assignments tomorrow

Assignment

  • Write a position paper for the article: ONDUX: on-demand unsupervised learning for information extraction

Readings

Some history and perspective:

Week 4 - Feb 14

  • Provenance and Databases
  • Graph Indexing

Assignment

  • Write 2 position papers, one for each of the articles in this week's required reading (see below)


Required Reading

  • Peter Buneman, Sanjeev Khanna, Wang Chiew Tan: Why and Where: A Characterization of Data Provenance. ICDT 2001: 316-330 http://db.cis.upenn.edu/DL/whywhere.pdf
    • Presenter: Fernando Seabra
    • Rebuttal: Joe Miller (tentative)

Additional Suggested Reading

  • A. Das Sarma, M. Theobald, and J. Widom. LIVE: A Lineage-Supported Versioned DBMS. Proceedings of the 22nd International Conference on Scientific and Statistical Database Management, Heidelberg, Germany, June 2010.

http://ilpubs.stanford.edu:8090/926/1/versioning-TR.pdf

  • Total Recall | Oracle Database

http://www.oracle.com/technetwork/database/focus-areas/storage/total-recall-whitepaper-171749.pdf

  • Answering pattern match queries in large graph databases via graph embedding

Lei Zou, Lei Chen, M. Tamer Özsu and Dongyan Zhao http://vgc.poly.edu/~juliana/courses/cs6093/Readings/graph-matching-vldbj2011

  • Chenghui Ren, Eric Lo, Ben Kao, Xinjie Zhu, Reynold Cheng: On Querying Historical Evolving Graph Sequences. PVLDB 4(11): 726-737 (2011)

http://vgc.poly.edu/~juliana/courses/cs6093/Readings/evolving-graphs-vldb11.pdf

Week 5 - Feb 21

  • NoSQL databases

Assignment

  • Write a position paper for each of the required papers

Required Reading

  • Parallel data processing with MapReduce: a survey. Lee et al, SIGMOD Record 2011

http://vgc.poly.edu/~juliana/courses/cs6093/Readings/lee-sigrec2011.pdf

Additional suggested reading

For additional suggested readings, see http://www.vistrails.org/index.php?title=CS6093/Selected_Papers_and_Topics

Week 6 - Feb 28

TBD

Week 7 - March 6

  • NoSQL Databases

Assignment

  • Write a position paper for each of the required papers

Required Reading

    • Presenters: Julie Odongo, Majed Hakami
    • Rebuttal: Fernando Seabra, Dmitriy Gromov

For additional suggested readings, see http://www.vistrails.org/index.php?title=CS6093/Selected_Papers_and_Topics

Week 8 - March 13

Spring break - no class

Week 9 - March 20

TBD

Week 10 - March 27

  • Web information integration

Assignment

  • Write a position paper for each of the required papers

Required Reading

Additional Reading

Week 11 - April 3

  • Wikipedia

Assignment

  • Write a position paper for each of the required papers

Required Reading

Additional Reading

Week 12 - April 10

  • Information extraction

Assignment

  • Write a position paper for each of the required papers

Required Reading

Additional Reading

Week 13 - April 17

  • Twitter and News: finding entities and trends

Assignment

  • Write a position paper for each of the required papers

Required Reading

Additional reading

Week 14 - April 24

  • Keyword queries over relational data

Assignment

  • Write a position paper for each of the required papers

Required Reading

Week 15 - May 1

Project presentation