TY - GEN
T1 - Cross-lingual transfer for unsupervised dependency parsing without parallel data
AU - Duong, Long
AU - Cohn, Trevor
AU - Bird, Steven
AU - Cook, Paul
PY - 2015/7
Y1 - 2015/7
N2 - Cross-lingual transfer has been shown to produce good results for dependency parsing of resource-poor languages. Although this avoids the need for a target language treebank, most approaches have still used large parallel corpora. However, parallel data is scarce for low-resource languages, and we report a new method that does not need parallel data. Our method learns syntactic word embeddings that generalise over the syntactic contexts of a bilingual vocabulary, and incorporates these into a neural network parser. We show empirical improvements over a baseline delexicalised parser on both the CoNLL and Universal Dependency Treebank datasets. We analyse the importance of the source languages, and show that combining multiple source-languages leads to a substantial improvement.
AB - Cross-lingual transfer has been shown to produce good results for dependency parsing of resource-poor languages. Although this avoids the need for a target language treebank, most approaches have still used large parallel corpora. However, parallel data is scarce for low-resource languages, and we report a new method that does not need parallel data. Our method learns syntactic word embeddings that generalise over the syntactic contexts of a bilingual vocabulary, and incorporates these into a neural network parser. We show empirical improvements over a baseline delexicalised parser on both the CoNLL and Universal Dependency Treebank datasets. We analyse the importance of the source languages, and show that combining multiple source-languages leads to a substantial improvement.
UR - http://www.scopus.com/inward/record.url?scp=84959872405&partnerID=8YFLogxK
U2 - 10.18653/v1/K15-1012
DO - 10.18653/v1/K15-1012
M3 - Conference Paper published in Proceedings
T3 - CoNLL 2015 - 19th Conference on Computational Natural Language Learning, Proceedings
SP - 113
EP - 122
BT - CoNLL 2015 - 19th Conference on Computational Natural Language Learning, Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - 19th Conference on Computational Natural Language Learning, CoNLL 2015
Y2 - 30 July 2015 through 31 July 2015
ER -