Jump to ContentJump to Main Navigation
Learning Machine Translation$
Users without a subscription are not able to see the full content.

Cyril Goutte, Nicola Cancedda, Marc Dymetman, and George Foster

Print publication date: 2008

Print ISBN-13: 9780262072977

Published to MIT Press Scholarship Online: August 2013

DOI: 10.7551/mitpress/9780262072977.001.0001

Show Summary Details
Page of

PRINTED FROM MIT PRESS SCHOLARSHIP ONLINE (www.mitpress.universitypressscholarship.com). (c) Copyright The MIT Press, 2021. All Rights Reserved. An individual user may print out a PDF of a single chapter of a monograph in MITSO for personal use.date: 30 July 2021

Named Entity Transliteration and Discovery in Multilingual Corpora

Named Entity Transliteration and Discovery in Multilingual Corpora

Chapter:
(p.79) 4 Named Entity Transliteration and Discovery in Multilingual Corpora
Source:
Learning Machine Translation
Author(s):

Alexandre Klementiev

Dan Roth

Publisher:
The MIT Press
DOI:10.7551/mitpress/9780262072977.003.0004

This chapter presents a novel algorithm for cross-lingual multiword name entity (NE) discovery in a bilingual weakly temporally aligned corpus. It shows that using two independent sources of information (transliteration and temporal similarity) together to guide NE extraction yields better performance than using them alone. The algorithm requires almost no supervision or linguistic knowledge. The algorithm was evaluated on an English-Russian corpus, and showed a high level of NE discovery in Russian.

Keywords:   algorithm, name recognition, transliteration, temporal similarity, English, Russian, machine learning

MIT Press Scholarship Online requires a subscription or purchase to access the full text of books within the service. Public users can however freely search the site and view the abstracts and keywords for each book and chapter.

Please, subscribe or login to access full text content.

If you think you should have access to this title, please contact your librarian.

To troubleshoot, please check our FAQs, and if you can't find the answer there, please contact us.