- Shopping Bag ( 0 items )
-
All (5) from $277.87
-
New (5) from $277.87
With the rising importance of multilingualism in language industries, brought about by global markets and world-wide information exchange, parallel corpora, i.e. corpora of texts accompanied by their translation, have become key resources in the development of natural language processing tools. The applications based upon parallel corpora are numerous and growing in number: multilingual lexicography and terminology, machine and human translation, cross-language information retrieval, language learning, etc.
The book's chapters have been commissioned from major figures in the field of parallel corpus building and exploitation, with the aim of showing the state of the art in parallel text alignment and use ten to fifteen years after the first parallel-text alignment techniques were developed. Within the book, the following broad themes are addressed: (i) techniques for the alignment of parallel texts at various levels such as sentence, clause, and word; (ii) the use of parallel texts in fields as diverse as translation, lexicography, and information retrieval; (iii) available corpus resources and the evaluation of alignment methods.
The book will be of interest to researchers and advanced students of computational linguistics, terminology, lexicography and translation, both in academia and industry.
| Foreword | xi | |
| Terminological note | xiii | |
| Preface | xv | |
| Contributors | xxi | |
| Introduction | ||
| 1. | From the Rosetta stone to the information society: A survey of parallel text processing | 1 |
| Alignment Methodology | ||
| 2. | Pattern recognition for mapping bitext correspondence | 25 |
| 3. | Multilingual text alignment: Aligning three or more versions of a text | 49 |
| 4. | A comprehensive bilingual word alignment system Application to disparate languages: Hebrew and English | 69 |
| 5. | A knowledge-lite approach to word alignment | 97 |
| 6. | From sentences to words and clauses | 117 |
| 7. | Bracketing and aligning words and constituents in parallel text using Stochastic Inversion Transduction Grammars | 139 |
| 8. | The translation network A model for a fine-grained description of translations | 169 |
| 9. | Parallel text alignment using crosslingual information retrieval techniques | 187 |
| 10. | Parallel alignment of structured documents | 201 |
| Applications | ||
| 11. | A statistical view on bilingual lexicon extraction From parallel corpora to non-parallel corpora | 219 |
| 12. | Terminology extraction from parallel technical texts | 237 |
| 13. | Term alignment in use Machine-aided human translation | 253 |
| 14. | Automatic dictionary extraction for cross-language information retrieval | 275 |
| 15. | Parallel texts in computer-assisted language learning | 299 |
| Resources and Evaluation | ||
| 16. | Japanese-English aligned bilingual corpora | 313 |
| 17. | Building a parallel corpus of English/Panjabi | 335 |
| 18. | Sharing of translation memory databases derived from aligned parallel text | 347 |
| 19. | Evaluation of parallel text alignment systems The ARCADE project | 369 |
| Index of terms | 389 | |
| Index of authors | 395 | |
| Index of languages and writing systems | 401 |
Overview
With the rising importance of multilingualism in language industries, brought about by global markets and world-wide information exchange, parallel corpora, i.e. corpora of texts accompanied by their translation, have become key resources in the development of natural language processing tools. The applications based upon parallel corpora are numerous and growing in number: multilingual lexicography and terminology, machine and human translation, cross-language information ...