Premium
Design and analysis of a hyphenation procedure
Author(s) -
Moitra Abha,
Mudur S. P.,
Narwekar A. W.
Publication year - 1979
Publication title -
software: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.437
H-Index - 70
eISSN - 1097-024X
pISSN - 0038-0644
DOI - 10.1002/spe.4380090406
Subject(s) - prefix , computer science , table (database) , flexibility (engineering) , probabilistic logic , trie , natural language processing , suffix , value (mathematics) , arithmetic , algorithm , artificial intelligence , data mining , data structure , mathematics , machine learning , programming language , statistics , linguistics , philosophy
A hyphenation procedure is described wherein the search list includes exceptional words, prefixes, suffixes as well as a probabilistic Break‐Value‐Table. The list of prefixes and suffixes is augmented with what are termed as root words to achieve greater flexibility and accuracy. Importance is given to a number of ways whereby the overall algorithm can be speeded up; in this connection a number of rejection rules are formulated so that only the likely candidates are processed. The order of searching of the various data tables is also considered. A further refinement is tried wherein the common suffixes and common prefixes are given preferential treatment. The algorithm developed was tested on approximately 2,700 common English technical words and an attempt is made to analyse the incorrectly handled words.