
Discovering Language Properties through Corpus-Based Dictionary Data Analysis
Author(s) -
Paul A. Lyddon
Publication year - 2017
Publication title -
vocabulary learning and instruction
Language(s) - English
Resource type - Journals
eISSN - 2187-2759
pISSN - 2187-2767
DOI - 10.7820/vli.v06.2.lyddon
Subject(s) - computer science , natural language processing , raw data , symbol (formal) , artificial intelligence , corpus linguistics , linguistics , natural language , philosophy , programming language
To reveal underlying patterns in real language use, linguists have increasingly come to rely on corpus analyses, involving the evaluation of statistical frequencies in generally sizable bodies of natural linguistic data. However, accessing and analyzing large samples of raw language is neither always practical nor even truly necessary, especially in cases pertaining to structural characteristics. In fact, the requisite data can oftentimes be gleaned from a state-of-the-art (i.e., corpus-based) dictionary. Moreover, given the widespread availability of easily searchable electronic dictionaries nowadays, almost any language teacher or learner can use one to answer a number of these types of queries. This paper illustrates this claim with a step-by-step analysis of corpus-based dictionary data for the purpose of formulating the sound-symbol relations in English words with vowels preceding –gh.