Letter Frequency Analysis of Lithuanian and Other Languages Using the Latin Alphabet
Author(s) -
Gintautas Grigas,
Anita Juškevičienė
Publication year - 2015
Publication title -
santalka filologija edulokogija
Language(s) - English
Resource type - Journals
eISSN - 2351-714X
pISSN - 2335-7711
DOI - 10.3846/cpe.2015.271
Subject(s) - lithuanian , alphabet , computer science , space (punctuation) , linguistics , natural language processing , the internet , portuguese , boundary (topology) , artificial intelligence , mathematics , world wide web , philosophy , operating system , mathematical analysis
It is important to evaluate specificities of alphabets, particularly the letter frequencies while designing keyboards, analyzing texts, designing games based on alphabets, and doing some text mining. In order to adequately compare lettter frequences of Lithuanian language to other languages in the Internet space, Wikipedia source was selected which content is common to different languages. The method of letter frequency jumps is used. The main attention is paid to the analysis of letter frequencies at the boundary between native letters and foreign letters used in Lithuanian and other languages
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom