Formulaic language is not all the same: comparing the frequency of idiomatic phrases, collocations, lexical bundles, and phrasal verbs
Author(s) - Laura Vilkaitė-Lozdienė
Publication year - 2016
Publication title - Taikomoji kalbotyra
Language(s) - English
Resource type - Journals
ISSN - 2029-8935
DOI - 10.15388/tk.2016.17505
Subject(s) - linguistics, computer science, natural language processing, philosophy
Formulaic language is widely acknowledged to be a central part of a language. However, it is heterogeneous in nature, made up of various formulaic categories with their own characteristics and behaviour. A first step towards systematically describing the relationship between these categories is to describe their distribution in language. This study investigated the frequency of occurrence of four categories of formulaic sequences: collocations, phrasal verbs, idiomatic phrases, and lexical bundles. Together the four categories made up about 41% of English, with lexical bundles being by far the most common, followed by collocations, idiomatic phrases and phrasal verbs. There were differences in the frequencies of each category in the overall corpus, and also in the four registers analysed (academic prose, fiction, newspaper language, and spoken conversation). Language mode (spoken/written) had a substantial effect on the frequency distribution of the categories as well.