Designing and evaluating an XPath dialect for linguistic queries

Steven Bird, Yi Chen, Susan B. Davidson, Haejoong Lee, Yifeng Zheng

Research output: Chapter in Book/Report/Conference proceedingConference Paper published in Proceedings

Abstract

Linguistic research and natural language processing employ large repositories of ordered trees. XML, a standard ordered tree model, and XPath, its associated language, are natural choices for linguistic data and queries. However, several important expressive features required for linguistic queries are missing or hard to express in XPath. In this paper, we motivate and illustrate these features with a variety of linguistic queries. Then we propose extensions to XPath to support linguistic queries, and design an efficient query engine based on a novel labeling scheme. Experiments demonstrate that our language is not only sufficiently expressive for linguistic trees but also efficient for practical usage.

Original languageEnglish
Title of host publicationProceedings of the 22nd International Conference on Data Engineering, ICDE '06
Number of pages10
DOIs
Publication statusPublished - 17 Oct 2006
Externally publishedYes
Event22nd International Conference on Data Engineering, ICDE '06 - Atlanta, GA, United States
Duration: 3 Apr 20067 Apr 2006

Publication series

NameProceedings - International Conference on Data Engineering
Volume2006
ISSN (Print)1084-4627

Conference

Conference22nd International Conference on Data Engineering, ICDE '06
CountryUnited States
CityAtlanta, GA
Period3/04/067/04/06

Fingerprint Dive into the research topics of 'Designing and evaluating an XPath dialect for linguistic queries'. Together they form a unique fingerprint.

  • Cite this

    Bird, S., Chen, Y., Davidson, S. B., Lee, H., & Zheng, Y. (2006). Designing and evaluating an XPath dialect for linguistic queries. In Proceedings of the 22nd International Conference on Data Engineering, ICDE '06 [1617420] (Proceedings - International Conference on Data Engineering; Vol. 2006). https://doi.org/10.1109/ICDE.2006.48