z-logo
open-access-imgOpen Access
The minimizer Jaccard estimator is biased and inconsistent
Author(s) -
Mahdi Belbasi,
Antonio Blanca,
Robert S. Harris,
David Koslicki,
Paul Medvedev
Publication year - 2022
Publication title -
bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.599
H-Index - 390
eISSN - 1367-4811
pISSN - 1367-4803
DOI - 10.1093/bioinformatics/btac244
Subject(s) - jaccard index , estimator , computer science , statistics , mathematics , econometrics , algorithm , artificial intelligence , pattern recognition (psychology)
Sketching is now widely used in bioinformatics to reduce data size and increase data processing speed. Sketching approaches entice with improved scalability but also carry the danger of decreased accuracy and added bias. In this article, we investigate the minimizer sketch and its use to estimate the Jaccard similarity between two sequences.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here