z-logo
open-access-imgOpen Access
Od nagrania do korpusu, czyli o metodzie archiwizowania języka mówionego mieszkańców wsi z wykorzystaniem narzędzi lingwistyki cyfrowej
Author(s) -
Helena GrocholaSzczepanek
Publication year - 2021
Publication title -
annales universitatis paedagogicae cracoviensis. studia linguistica
Language(s) - English
Resource type - Journals
ISSN - 2083-1765
DOI - 10.24917/20831765.16.5
Subject(s) - transcription (linguistics) , spoken language , computer science , code (set theory) , natural language processing , linguistics , speech recognition , programming language , philosophy , set (abstract data type)
The article presents the method of archiving of the rural speech during the development of the electronic language corpus. Attention is focused on how to get spoken data and transcription of non-standard dialect code. It also presents the problems and limitations resulting from nonnormative spoken data and the solutions applied. The recording and converting of spoken language data for corpus is a complex and multi-phase process. The data is obtained from recorded interviews with respondents. The developed system of spoken data transcription combines the properties of non-standard code, the capabilities of tools and needs of corpus.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here