z-logo
open-access-imgOpen Access
Selectivity Estimation for String Predicates Based on Modified Pruned Count‐Suffix Tree
Author(s) -
Li Dong,
Zhang Qixu,
Liang Xiaochong,
Guan Jida,
Xu Yang
Publication year - 2015
Publication title -
chinese journal of electronics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.267
H-Index - 25
eISSN - 2075-5597
pISSN - 1022-4653
DOI - 10.1049/cje.2015.01.013
Subject(s) - string (physics) , suffix , computer science , suffix tree , tree (set theory) , generalized suffix tree , algorithm , mathematics , combinatorics , data structure , programming language , linguistics , philosophy , mathematical physics
The accuracy of predicates selectivity estimation is one of the important factors affecting query optimization performance. State‐of‐art selectivity estimation algorithms for string predicates based on Pruned countsuffix tree (PST) often suffer severe underestimating and overestimating problems, thus their average relative errors are not good. We analyze the main causes of the underestimating and overestimating problems, propose a novel Restricted pruned count‐suffix tree (RPST) and a new pruning strategy. Based on these, we present the EKVI algorithm and the EMO algorithm which are extended from the KVI algorithm and the MO algorithm respectively. The experiments compare the EKVI algorithm and the EMO algorithm with the traditional KVI algorithm and the MO algorithm, and the results show that the average relative errors of our selectivity estimation algorithms are significantly better than the traditional selectivity estimation algorithms. The EMO algorithm is the best among these algorithms from the overall view.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here