Research Library

open-access-imgOpen AccessIdentification of similarities and clusters of bread baking recipes based on data of ingredients
Author(s)
Anlauf Stefan,
Dorl Sebastian,
Hirz Theresa,
Lasslberger Melanie,
Grassmann Rudolf,
Himmelbauer Johannes,
Winkler Stephan
Publication year2024
Publication title
international journal of food engineering
Resource typeJournals
PublisherDe Gruyter
We define the similarity of bakery recipes using different distance calculations and identify groups of similar recipes using different clustering algorithms. Our analyses are based on the relative amounts of ingredients included in the recipes. We compare different clustering algorithms (k-means, k-medoid, and hierarchical clustering) to find the optimal number of clusters. Besides the standard distance calculation (euclidean distance), we test three other distance metrics (hamming distance, manhattan distance, and cosine similarity). Additionally, we reduce the impact of raw materials used in large quantities by applying two different data transformations, namely the logarithm of the original data and the binarization of the original data. Clustering recipes based on their ingredients can improve the search for similar recipes and therefore help with the time-consuming process of developing new recipes. Using the hierarchical clustering on the logarithm of the original data, we can separate 704 recipes into three different clusters, achieving a Silhouette Score of 0.531. We visualize our results via dendrograms representing the recipes’ hierarchical separation into individual groups and sub-groups.
Keyword(s)machine learning, clustering, ingredient, baking recipes
Language(s)English
SCImago Journal Rank0.362
H-Index26
eISSN1556-3758
DOI10.1515/ijfe-2023-0032

Seeing content that should not be on Zendy? Contact us.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here