Querying linguistic annotations

Sumukh Ghodke, Steven Bird

Research output: Chapter in Book/Report/Conference proceeding, Conference Paper published in Proceedings, peer-review


Over the past decade, a variety of expressive linguistic query languages have been developed. The most scalable of these have been implemented on top of an existing database engine. However, with the arrival of efficient, wide-coverage parsers, it is feasible to parse text on a scale that is several orders of magnitude larger. We show that the existing database approach will not scale up, and speculate on a new approach that leverages proximity search in the context of an IR engine. We also propose a simple syntax for querying linguistic annotations, avoiding the usability problems with existing tree query languages.

Original language: English
Title of host publication: ADCS 2008
Subtitle of host publication: Proceedings of the Thirteenth Australasian Document Computing Symposium
Editors: Rob McArthur, Paul Thomas, Andrew Turpin, Mingfang Wu
Publisher: RMIT University Press
Number of pages: 4
ISBN (Print): 9781921426216
Publication status: Published - 1 Dec 2008
Externally published: Yes
Event: 13th Australasian Document Computing Symposium, ADCS 2008 - Hobart, TAS, Australia
Duration: 8 Dec 2008 - 8 Dec 2008


Conference: 13th Australasian Document Computing Symposium, ADCS 2008
City: Hobart, TAS

