Difference between revisions of "Tutorials/Vis2009"

Revision as of 12:21, 23 July 2009

I organized this first part to contain the information required for the program.

The webpage for last year's tutorials is here: http://vis.computer.org/VisWeek2008/session/tutorials.html

Provenance-Enabled Data Exploration and Visualization

Level

Beginner to advanced.

Presenters

Cláudio Silva, University of Utah
Juliana Freire, University of Utah
Emanuele Santos, University of Utah
Erik Anderson, University of Utah

Abstract

Scientists now face an enormous volume of data to analyze. To explore and understand the data, they need to assemble complex workflows (pipelines) that manipulate the data and create insightful visual representations. Provenance is essential in this process. The provenance of a digital artifact contains information about the process and data used to derive the artifact. This information is essential for preserving the data, for determining the data's quality and authorship, and for reproducing and validating results, all important elements of the scientific process. Provenance has been shown to be particularly useful for enabling comparative visualization and data analysis. This tutorial will inform computational and visualization scientists, users, and developers about different approaches to provenance and the trade-offs among them. Using the VisTrails project as a basis, we will cover different approaches to acquiring and reusing provenance, including techniques that attendees can use to provenance-enable their own tools. The tutorial will also discuss uses of provenance that go beyond the ability to reproduce and share results.

Working Outline

After discussion, the tutorial seems best split into two distinct parts: a user-centric part focused on using VisTrails, and a power-user or developer-centric part focused on plug-ins, mashups, etc.

Introduction to Provenance and VisTrails (1h 30m) - Presenter: Claudio/Erik/Juliana/Manu (?)

  • Brief view of plugins and released features of VisTrails
  • Define provenance, explain why it is important for data analysis and visualization
  • Introduce VisTrails, explain the differences and similarities between VisTrails and other vis and scientific workflow systems
    • Say it is open source
    • Multi-platform: written in Python and Qt
    • Downloaded thousands of times
    • Shipped with packages: XXXYYYZZZ
    • User-contributed packages: XXXXXXX -- you can add your own! (more later)
  • Explain the version tree and the action-based provenance model
  • Show features that are enabled by the action-based provenance, and how they can help in the process of analyzing and visualizing data
    • Going back and forth on the tree
    • Automatic capture of changes
    • Compare different workflows and their results
    • Exploring parameter spaces
    • Querying workflows by example
    • Refining workflows by analogy
  • Discuss how the action-based provenance model can be combined with interactive tools
    • Give a brief overview of the plugins
    • Show videos for a couple of them and point to the detailed tutorial of how to build a plugin later.
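
As a rough illustration of the action-based model named in the outline above, the sketch below stores each version as a parent pointer plus one change action, and rebuilds a workflow by replaying the actions from the root. It is a toy written for this outline, not VisTrails code: the `VersionTree` class, the action tuples, and the module names are all hypothetical.

```python
class VersionTree:
    """Toy action-based version tree (hypothetical, not the VisTrails API)."""

    def __init__(self):
        # Version 0 is the empty workflow; it has no parent and no action.
        self.parent = {0: None}
        self.action = {0: None}
        self._next_id = 1

    def add_version(self, parent_id, action):
        """Record one action applied to parent_id; return the new version id."""
        if parent_id not in self.parent:
            raise KeyError(f"unknown version {parent_id}")
        vid = self._next_id
        self._next_id += 1
        self.parent[vid] = parent_id
        self.action[vid] = action
        return vid

    def actions_to(self, vid):
        """Actions on the path from the root to vid, in application order."""
        path = []
        while self.parent[vid] is not None:
            path.append(self.action[vid])
            vid = self.parent[vid]
        return list(reversed(path))

    def materialize(self, vid):
        """Rebuild the workflow at vid by replaying its actions."""
        workflow = {}  # module name -> dict of parameters
        for kind, module, *rest in self.actions_to(vid):
            if kind == "add_module":
                workflow[module] = {}
            elif kind == "delete_module":
                del workflow[module]
            elif kind == "set_param":
                name, value = rest
                workflow[module][name] = value
            else:
                raise ValueError(f"unknown action {kind!r}")
        return workflow


def diff(tree, a, b):
    """Map (module, parameter) -> (value in a, value in b) where they differ."""
    wa, wb = tree.materialize(a), tree.materialize(b)
    changes = {}
    for module in set(wa) | set(wb):
        pa, pb = wa.get(module, {}), wb.get(module, {})
        for name in set(pa) | set(pb):
            if pa.get(name) != pb.get(name):
                changes[(module, name)] = (pa.get(name), pb.get(name))
    return changes


tree = VersionTree()
v1 = tree.add_version(0, ("add_module", "Reader"))
v2 = tree.add_version(v1, ("add_module", "Isosurface"))
v3 = tree.add_version(v2, ("set_param", "Isosurface", "value", 0.5))
v4 = tree.add_version(v2, ("set_param", "Isosurface", "value", 0.8))  # sibling branch
```

Because every version keeps only its parent and a single action, going back and forth on the tree is just a matter of materializing a different node, and `diff(tree, v3, v4)` compares two branches without either one overwriting the other.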


Basic hands-on usage of VisTrails (1h 30m) - Presenter: Manu

  • Building Simple Pipelines
  • Using PythonSource
  • Connecting to databases
  • VisTrails as a Server

Advanced hands-on usage of VisTrails (1h 30m) - Presenter: Erik

  • This will all be in the context of one specific example. My first thought was to use Gil's data, which has many features that build a nice story, but problems with releasing that data are pushing me towards a CMOP example.
  • Organizational principles of the History Tree - QBE, Analogies, Parameter Exploration -> New Version
  • vtkInteractionHandlers - Slicing and dicing data
  • PythonSource as callback definition - Picking specific elements
  • The workflow debugger
  • Transforming VTK pipelines into VisTrails workflows: use a modified version of Dave's script to convert all the VTK examples into a .vt file, and show how to query/search it.
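
Parameter exploration, listed above, amounts to running one workflow over the cross product of several parameter ranges. A minimal sketch of that idea in plain Python follows; `explore`, `run_workflow`, and the parameter names are hypothetical stand-ins rather than VisTrails functions.

```python
from itertools import product


def explore(run_workflow, ranges):
    """Run run_workflow once per combination of the given parameter values."""
    names = list(ranges)
    results = []
    for values in product(*(ranges[n] for n in names)):
        params = dict(zip(names, values))
        results.append((params, run_workflow(**params)))
    return results


def run_workflow(isovalue, opacity):
    # Stand-in for executing a pipeline; returns a label instead of an image.
    return f"iso={isovalue}, opacity={opacity}"


cells = explore(run_workflow, {"isovalue": [0.2, 0.5], "opacity": [0.3, 1.0]})
```

Each entry in `cells` pairs one parameter combination with its result, which is the information needed to lay the outcomes out side by side for comparison.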

Developing a plug-in (30-40m) - Presenter: Manu

  • Plug-in architecture overview
  • Different implementation strategies
  • ParaView Plug-in demo

New Extensions to VisTrails (50-60m) - Presenter: Claudio/Erik/Juliana/Manu

  • Display Wall
  • Provenance Rich Publications
  • Mashups
  • Repository
  • Web-based execution (?)