Querying the corpus

AVAA is all about querying and filtering annotations. Complex queries can be expressed to extract only specific annotations. This is done via the select block, whose attributes can be combined to make a curated selection of annotations:

select

  • value: select annotations with a specific value
  • value-contains: select annotations with a value containing a specific string
  • tier: select annotations of a tier
  • participant: select annotations of a participant
  • group: select annotations belonging to a group
  • value-match: select annotations whose value matches a regular expression pattern
  • tier-match: select annotations whose tier matches a regular expression pattern
  • group-match: select annotations whose group matches a regular expression pattern
  • participant-match: select annotations whose participant matches a pattern
  • file-match: select annotations whose filename matches a regular expression pattern
  • file-tag-match: select annotations whose file has a tag matching the pattern
  • tag-match: select annotations which have a tag matching the pattern
  • lambda: a custom function to select the annotations
  • sort: sort the resulting list in ascending or descending order, based on start time
  • mode: mode used for completing an existing selection
  • limit: maximum number of annotations to select
  • limit-per-file: maximum number of annotations to select in one particular file

When multiple attributes are used, only annotations fulfilling all the constraints will be selected.
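
The way constraints combine can be illustrated with a small Python model. This is only a sketch of the filtering logic, not AVAA's implementation or its script syntax: the Annotation class, the select function, its keyword names, and the sample data are all hypothetical, and the case-sensitive substring test is an assumption.

```python
import re
from dataclasses import dataclass


@dataclass
class Annotation:
    # Hypothetical, simplified stand-in for a corpus annotation.
    file: str
    tier: str
    value: str
    start: float  # start time in seconds


def select(annotations, value_contains=None, tier=None, value_match=None, sort=None):
    """Keep only annotations that satisfy every constraint that was given."""
    result = []
    for a in annotations:
        # Each constraint is optional; an annotation is dropped as soon as
        # it fails one of the constraints that were actually supplied.
        if value_contains is not None and value_contains not in a.value:
            continue
        if tier is not None and a.tier != tier:
            continue
        if value_match is not None and not re.search(value_match, a.value):
            continue
        result.append(a)
    # Optional ordering by start time, ascending or descending.
    if sort in ("asc", "desc"):
        result.sort(key=lambda a: a.start, reverse=(sort == "desc"))
    return result


corpus = [
    Annotation("a.eaf", "speech", "bonjour tout le monde", 2.0),
    Annotation("a.eaf", "gesture", "bonjour", 1.0),
    Annotation("b.eaf", "speech", "au revoir", 0.5),
    Annotation("b.eaf", "speech", "Ah, bonjour !", 0.2),
]

# Both constraints must hold: the value contains "bonjour" AND the tier is "speech".
hits = select(corpus, value_contains="bonjour", tier="speech", sort="asc")
for a in hits:
    print(a.file, a.start, a.value)
```

With a single constraint (value_contains="bonjour") three of the sample annotations are kept; adding the tier constraint narrows the result to two, because the annotation on the "gesture" tier no longer fulfils all the constraints.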

For instance, to select all annotations containing the word "bonjour":

select

After running a Preview on the select block, the editor displays the result of the query:

select

Now that we know how to select annotations, we can use views to display them.