- Shopping Bag ( 0 items )
Ships from: Grand Rapids, MI
Usually ships in 1-2 business days
Ships from: Hereford, United Kingdom
Usually ships in 1-2 business days
A corpus is a collection of specimens of a language as used in real life, in writing and/or speech. Corpus lingustics is research, carried out in university linguistics departments and computing departments (and nowadays in industrial research labs too), which uses corpora as crucial sources of evidence on the structure and properties of languages. Because corpus linguistics has grown fast from small beginnings, newcomers to teh filed often find it hard to get their bearings. Important papers can be difficult to track down. This volume reprints 42 corpus linguistics articles which first appeared at dates ranging from 1952 to 2002, and which between them illustrate all the main directions in which the sunject is developing. It includes articles that are already recognized as classics, and others which deserve to become so , supplemented with editorial introductions relating the individual contributors to teh field as a while.
2. From The Sstructure of English (1952) Charles Carpenter Fries
3. A Standard Corpus of Edited Present -day American English (1965) W. Nelson Francis
4. On the Distribution of Noun-phrase Types in English Clause-structure (1971) F.G.A.M. Aarts
5. Predicting Text Segmentation into Tone Units (1986) Bengt Altenberg
6. Typicality and Meaning Potentials (1986) Patrick Hanks
7. Historical Drift in Three English Genres (1987) Douglas Biber and Edward Finegan
8. Corpus Creation (1987) John Sinclair
9. Cleft and Pseudo-cleft Constructions in English Spoken and Written Discourse (1987) Peter C. Collins
10. What is Wrong with Adding One? (1989) William Gale and Kenneth Church
11. A Statistical Approach to Machine Translation (1990) Peter F. Brown, et al.
12. A Point of Verb Syntax in South-western British English: An Analysis of a Dialect Continuum (1991) Ossi Ihalainen
13. Using Corpus Data in the Swedish Academy Grammar (1991) Staffan Hellberg
14. On the History of That/Zero as Object Clause Links in English (1991) Matti Rissanen
15. Encoding the British National Corpus (1992) Gavin Burnage and Dominic Dunlop
16. Computer Corpora - What Do They Tell Us about Culture? (1992) Geoffrey Leech and Roger Fallon
17. Representativeness in Corpus Design (1992) Douglas Biber
18. A Corpus-driven Approach to Grammar: Principles, Methods and Examples (1993) Gill Francis
19. Structural Ambiguity and Lexical Relations (1993) Donald Hindle and Mats Rooth
20. Irony in the Text or Insincerity in the Writer? The Diagnostic Potential of Semantic Prosodies (1993) William Louw
21. Building a Large Annotated Corpus of English: The Penn Treebank (1993) Mitchell P. Marcus, et al.
22. Automatically Extracting Collocations from Corpora for Language Learning (1994) Kenji Kita, et al.
23. Developing and Evaluating a Probabilistic LR Parser of Part-of-Speech and Punctuation Labels (1995) E.J. Briscoe and J.A. Carroll
24. Why a Fiji Corpus? (1996) Jan Tent and France Mugler
25. Treebank Grammars (1996) Eugene Charniak
26. English Corpus Linguistics and the Foreign Language Teaching Syllabus (1996) Dieter Mindt
27. Data-oriented Language Processing: An Overview (1996) L.W.M. Bod and R.J.H. Scha
28. Conflict Talk: A Comparison of the Verbal Disputes between Adolescent Females in Two Corpora (1996) Ingrid Kristine Hasund and Anna-Brita Stenstrom
29. Assessing Agreement on Classification Tasks: The Kappa Statistic (1996) Jean Carletta
30. Linguistic and Interactional Features of Internet Relay Chat (1996) Christopher C. Werry
31. Distinguishing Systems and Distinguishing Senses: New Evaluation Methods for Word Sense Disambiguation (1997) Philip Resnik and David Yarowsky
32. Qualification and Certainty in L1 and L2 Students' Writing (1997) Kenneth Hyland and John Milton
33. Analysing and Predicting Patterns of DAMSL Utterance Tags (1998) Mark G. Core
34. Assessing Claims about Language Use with Corpus Data - Swearing and Abuse (1998) Anthony McEnery, et al.
35. The Syntax of Disfluency in Spontaneous Spoken Language (1998) David McKelvie
36. The Use of Large Text Corpora for Evaluating Text-to-Text Speech Systems (1998) Louis C.W. Pols et al.
37. The Prague Dependency Treebank: How Much of the Underlying Syntactic Structure can be Tagged Automaticaly? (1999) Alena Bohmova and Eva Hajicova
38. Reflections of a Dendrographer (1999) Geoffrey Sampson
39. A Generic Approach to Software Support for Linguistic Annotation Using XML (2000) Jean Carletta, et al.
40. Europe's Ignored Languages (2001) Anthony McEnery
41. Semi-automatic Tagging of Intonation in French Spoken Corpora (2001) Estelle Campione and Jean Veronis
42. Web as Corpus (2001) Adam Kilgarriff
43. Intonational Variation in the British Isles (2002) Esther Grabe and Brechtje Post URL List.