Corpus Linguistics / Edition 1

Corpus Linguistics / Edition 1

by Geoffrey Sampson
     
 

Corpus Linguistics seeks to provide a comprehensive sampling of real-life usage in a given language, and to use these empirical data to test language hypotheses. Modern corpus linguistics began fifty years ago, but the subject has seen explosive growth since the early 1990s. These days corpora are being used to advance virtually every aspect of language study, from

See more details below

Overview

Corpus Linguistics seeks to provide a comprehensive sampling of real-life usage in a given language, and to use these empirical data to test language hypotheses. Modern corpus linguistics began fifty years ago, but the subject has seen explosive growth since the early 1990s. These days corpora are being used to advance virtually every aspect of language study, from computer processing techniques such as machine translation, to literary stylistics, social aspects of language use, and improved language-teaching methods.

Because corpus linguistics has grown fast from small beginnings, newcomers to the field often find it hard to get their bearings. Important papers can be difficult to track down. This volume reprints forty-two articles on corpus linguistics by an international selection of authors, which comprehensively illustrate the directions in which the subject is developing. It includes articles that are already recognized as classics, and others which deserve to become so, supplemented with editorial introductions relating the individual contributions to the field as a whole.

This collection of readings will be useful to students of corpus linguistics at both undergraduate and postgraduate level, as well as academics researching this fascinating area of linguistics.

Read More

Product Details

ISBN-13:
9780826488039
Publisher:
Bloomsbury Academic
Publication date:
10/01/2005
Series:
Open Linguistics Series
Edition description:
New Edition
Pages:
542
Product dimensions:
7.44(w) x 9.69(h) x 1.09(d)

Related Subjects

Meet the Author


Geoffrey Sampson is a former Professor of Natural Language Computing at the School of Informatics, University of Sussex. He is now a Research Fellow at the University of South Africa.

Table of Contents

1Introduction1
2From The structure of English (1952)9
3A standard corpus of edited present-day American English (1965)27
4On the distribution of noun-phrase types in English clause-structure (1971)35
5Predicting text segmentation into tone units (1986)49
6Typicality and meaning potentials (1986)58
7Historical drift in three English genres (1987)67
8Corpus creation (1987)78
9Cleft and pseudo-cleft constructions in English spoken and written discourse (1987)85
10What is wrong with adding one? (1989)95
11A statistical approach to machine translation (1990)103
12A point of verb syntax in South-Western British English : an analysis of a dialect continuum (1991)113
13Using corpus data in the Swedish Academy grammar (1991)122
14On the history of that zero as object clause links in English (1991)137
15Encoding the British National Corpus (1992)149
16Computer corpora - what do they tell us about culture? (1992)160
17Representativeness in corpus design (1992)174
18A corpus-driven approach to grammar : principles, methods, and examples (1993)198
19Structural ambiguity and lexical relations (1993)212
20Irony in the text or insincerity in the writer? : the diagnostic potential of semantic prosodies (1993)229
21Building a large annotated corpus of English : the Penn Treebank (1993)242
22Automatically extracting collocations from corpora for language learning (1994)258
23Developing and evaluating a probabilistic LR parser of part-of-speech and punctuation labels (1995)267
24Why a Fiji corpus? (1996)276
25Treebank grammars (1996)285
26English corpus linguistics and the foreign-language teaching syllabus (1996)293
27Data-oriented language processing : an overview (1996)304
28Conflict talk : a comparison of the verbal disputes between adolescent females in two corpora (1996)326
29Assessing agreement on classification tasks : the kappa statistic (1996)335
30Linguistic and interactional features of internet Relay Chat (1996)340
31Distinguishing systems and distinguishing senses : new evaluation methods for word-sense disambiguation (1997)353
32Qualification and certainty in L1 and L2 students writing (1997)371
33Analysing and predicting patterns of DAMSL utterance tags (1998)387
34Assessing claims about language use with corpus data - swearing and abuse (1998)396
35The syntax of disfluency in spontaneous spoken language (1998)404
36The use of large text corpora for evaluating text-to-speech systems (1998)421
37The Prague Dependency Treebank : how much of the underlying syntactic structure can be tagged automatically? (1999)427
38Reflections of a dendrographer (1999)434
39A generic approach to software support for linguistic annotation using XML (2000)449
40Europe's ignored languages (2001)460
41Semi-automatic tagging of intonation in French spoken corpora (2001)462
42Web as corpus (2001)471
43Intonational variation in the British Isles (2002)474

Read More

Customer Reviews

Average Review:

Write a Review

and post it to your social network

     

Most Helpful Customer Reviews

See all customer reviews >