Evaluating a pivot-based approach for bilingual lexicon extraction.
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2015;
2015:434153. [PMID:
25983745 PMCID:
PMC4423015 DOI:
10.1155/2015/434153]
[Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2014] [Revised: 03/26/2015] [Accepted: 04/09/2015] [Indexed: 11/17/2022]
Abstract
A pivot-based approach for bilingual lexicon extraction is based on the similarity of context vectors represented by words in a pivot language like English. In this paper, in order to show validity and usability of the pivot-based approach, we evaluate the approach in company with two different methods for estimating context vectors: one estimates them from two parallel corpora based on word association between source words (resp., target words) and pivot words and the other estimates them from two parallel corpora based on word alignment tools for statistical machine translation. Empirical results on two language pairs (e.g., Korean-Spanish and Korean-French) have shown that the pivot-based approach is very promising for resource-poor languages and this approach observes its validity and usability. Furthermore, for words with low frequency, our method is also well performed.
Collapse