Date of this Version
The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Latent-variable PCFGs (L-PCFGs) are a highly successful model for natural language parsing. Recent work (Cohen et al., 2012) has introduced a spectral algorithm for parameter estimation of L-PCFGs, which - unlike the EM algorithm - is guaranteed to give consistent parameter estimates (it has PAC-style guarantees of sample complexity). This paper describes experiments using the spectral algorithm. We show that the algorithm provides models with the same accuracy as EM, but is an order of magnitude more efficient. We describe a number of key steps used to obtain this level of performance; these should be relevant to other work on the application of spectral learning algorithms. We view our results as strong empirical evidence for the viability of spectral methods as an alternative to EM.
Cohen, S. B., Stratos, K., Collins, M., Foster, D. P., & Ungar, L. H. (2013). Experiments With Spectral Learning of Latent-Variable PCFGs. The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Retrieved from https://repository.upenn.edu/statistics_papers/104
Date Posted: 27 November 2017