z-logo
Premium
Sketched Stochastic Dictionary Learning for large‐scale data and application to high‐throughput mass spectrometry
Author(s) -
Permiakova Olga,
Burger Thomas
Publication year - 2022
Publication title -
statistical analysis and data mining: the asa data science journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.381
H-Index - 33
eISSN - 1932-1872
pISSN - 1932-1864
DOI - 10.1002/sam.11542
Subject(s) - sketch , computer science , factorization , mass spectrometry , function (biology) , sampling (signal processing) , column (typography) , algorithm , chromatography , chemistry , computer vision , filter (signal processing) , evolutionary biology , biology , telecommunications , frame (networking)
Factorization of large data corpora has emerged as an essential technique to extract dictionaries (sets of patterns that are meaningful for sparse encoding). Following this line, we present a novel algorithm based on compressive learning theory. In this framework, the (arbitrarily large) dataset of interest is replaced by a fixed‐size sketch resulting from a random sampling of the data distribution characteristic function. We apply our algorithm to the extraction of chromatographic elution profiles in mass spectrometry data, where it demonstrates its efficiency and interest compared to other related algorithms.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here