Querying the Version Tree¶
VisTrails is designed for manipulating collections of workflows, and
an integral part of this design is the ability to quickly search
through these collections. VisTrails provides two methods for
querying vistrails and workflows. The first is a Query by Example
interface which allows you to build query workflows and search for
those with similar structures and parameters. The second is a
textual interface with a straightforward syntax. For each interface,
the results are visual: each matching version is
highlighted in the History
view, and if the query
involves specific workflow characteristics, any matching entities are
also highlighted in the Pipeline
view for the current
version.
Query By Example¶
One of the problems faced when trying to query a collection of
workflows is the fact that structure is important. Suppose that you
want to find only workflows where two modules are used in sequence.
Instead of trying to translate this into a text-based syntax, it is
easier to construct this relationship visually. VisTrails provides such an
interface which mirrors the Pipeline
view, allowing
you to construct a (partial) workflow to serve as the search criteria.
To use the Query by Example interface, click on the
Search
button on the toolbar. This view is extremely
similar to the Pipeline
view and pipelines can be built
in a similar manner. Just like the Pipeline
view,
modules are added by dragging them from the list on the left side of
the window, connections are added by clicking and dragging from a port
on one module to a corresponding port on another module. Setting module parameters in this view will narrow the search to matching modules whose parameters fall within the specified range of values. Figure Example pipeline in Search mode shows an example pipeline that has been built in the query builder.
The next step is to decide whether to search the Current Vistrail
, Current Workflow
, or all Open Vistrails
. The results of the first option are displayed on a version tree as well as in the Workspace
panel. Non-matching versions in the version tree will be grayed out while matching versions will be displayed in the tree as normal. In contrast, the Workspace
panel will display matching results and omit versions with no matches. Double-clicking a version from the Workspace's
results will bring up the associated pipeline with matching modules highlighted. See Figures Workspace... and Pipeline....
The remaining two options are Current Workflow
and Open Vistrails
. The Current Workflow
option is the simplest and will display the pipeline with matching modules highlighted. The Open Vistrails
option will put all of its results in the Workspace
panel, listing open vistrails with their matching versions. From here, double-clicking on a vistrail will bring up a version tree which emphasizes matching versions, or double-clicking on a version will bring up the associated pipeline with matching modules highlighted.
After constructing a pipeline and selecting the appropriate search option, click the Execute
button to begin the query. This button will be available as long as the query window is not empty. However, you may need to press the Back to Search
button to return to the query window to re-execute.
Note
You may leave the Query
either through use of the toolbar or by pressing the Edit
button. However, the search results will persist until the search is cleared (press Clear Search
), returning the workspace to its normal form.
Try it now!
Let’s practice making a simple query. Open the offscreen.vt example vistrail. Click on the Search
button to enter Search
mode. Create a query like the one shown in Figure Example pipeline in Search mode by dragging the modules SheetReference
, CellLocation
, and RichTextCell
onto the Search canvas. (These modules can be found under the “VisTrails Spreadsheet” header in the Modules panel.) Connect the input and output ports of the modules as shown, then click the Execute
button to perform the query. VisTrails will automatically switch to the History
view, with all matching versions highlighted (Figure History...). Notice that the query results are also displayed in the Workspace
tab. Double-click on the html version in the workspace to open the results in the pipeline view.
Note that Query by Example provides the capability to iteratively
refine searches by adding more criteria. For example, if you were
interested in workflows that contain a certain module, you may find that
such a query returns too many results. You could then refine the query
to find only those workflows where the given module has a parameter with
a specific value. This is done by specifying parameter values in the
Inputs
panel on the right side of the window.
Textual Queries¶
There are many ways to search for versions in the version tree using textual queries, but they all rely on a simple
text box for input. Begin a search by selecting Search
on the toolbar. The search box is at the top of the center panel and has a magnifying glass icon next to it. If you enter query text, VisTrails
will attempt to match logical categories, but if your query is more
specific, VisTrails has special syntax to markup the query.
To execute a query, simply press the ‘Enter’ key after typing your query.
Search Type | Syntax |
---|---|
User name | user: user name |
Annotation | notes: phrase |
Tag | name: version tag |
Any | any: match user/notes/name (the default) |
Module | module: module name |
Date | before: date | relative time |
after: date | relative time |
Note
Since we allow regular expressions in our search box, question marks are treated as meta-characters. Thus, searching for ”?” returns everything and “abc?” will return everything containing “abc”. You need to use “\?” instead to search for ”?”. So the search for ”??” would be “\?\?”.
Table Syntax for querying specific information using textual queries. lists the different ways to markup a
query. Note that you can search by user name to see which changes a
particular user has made, and also by date to see which changes were
made in a specific time frame. When searching by date, you can search
for all changes before or after a given date or an amount of time
relative to the present. If searching for changes before or after a
specific date, the date can be entered in a variety of formats. The
simplest is ‘day month year,’ but if the
year is omitted, the current year is used. The month may be specified
by either its full name or an abbreviation. For example, 'before: 18 November 2004'
and 'after: 20 Dec'
are both valid queries. If searching by relative time,
you can prepend the amount of time relative to the present including
the units to ‘ago’. An example of this type of query is
'after: 30 minutes ago'
. The available units are seconds,
minutes, hours, days, months, or years.
You can concatenate simple search statements to create a compound
query to search across different criteria or for a specific range.
For example, to search for workflows whose tag includes
'brain'
and were created by the user 'johnsmith'
,
the query would be 'name: brain user: johnsmith'
. To search
for all workflows created between April 1 and June 1, the query would
be 'after: April 1 before: June 1'
.
Try it now!
Open the terminator.vt example file, and enter Search
mode. Let’s look for all workflows that were created after November 24, 2010. In the search box in the Search
panel, type 'after: 24 nov 2010'
and press ‘Enter’. The expected result is shown in Figure Results of a query to find any changes make after November 24, 2010.
In addition, VisTrails keeps track of the most recent textual queries, and repeating or clearing these queries can be accomplished by selecting the recent query from the dropdown menu attached to the search box. Finally, the ‘X’ button next to the search box will reset the query.
Refining the Results¶
While in the Search
view, you can select two different
ways of viewing search results. The magnifying glass icon to the left of the textual search box contains a dropdown menu with two options: “Search” and “Refine” (Figure Clicking the button to the left...). The first displays results by simply highlighting the matching nodes while the second condenses the tree to show only the versions that match. For large vistrails, this second method can help you determine relationships between the matching versions more easily.