Open AccessIdentification of similarities and clusters of bread baking recipes based on data of ingredientsOpen Access
Author(s)
Anlauf Stefan,
Dorl Sebastian,
Hirz Theresa,
Lasslberger Melanie,
Grassmann Rudolf,
Himmelbauer Johannes,
Winkler Stephan
Publication year2024
Publication title
international journal of food engineering
Resource typeJournals
PublisherDe Gruyter
We define the similarity of bakery recipes using different distance calculations and identify groups of similar recipes using different clustering algorithms. Our analyses are based on the relative amounts of ingredients included in the recipes. We compare different clustering algorithms (k-means, k-medoid, and hierarchical clustering) to find the optimal number of clusters. Besides the standard distance calculation (euclidean distance), we test three other distance metrics (hamming distance, manhattan distance, and cosine similarity). Additionally, we reduce the impact of raw materials used in large quantities by applying two different data transformations, namely the logarithm of the original data and the binarization of the original data. Clustering recipes based on their ingredients can improve the search for similar recipes and therefore help with the time-consuming process of developing new recipes. Using the hierarchical clustering on the logarithm of the original data, we can separate 704 recipes into three different clusters, achieving a Silhouette Score of 0.531. We visualize our results via dendrograms representing the recipes’ hierarchical separation into individual groups and sub-groups.
Keyword(s)machine learning, clustering, ingredient, baking recipes
Language(s)English
SCImago Journal Rank0.362
H-Index26
eISSN1556-3758
DOI10.1515/ijfe-2023-0032
Seeing content that should not be on Zendy? Contact us.
To access your conversation history and unlimited prompts, please
Prompt 0/10