Prediction the Probability of Purchases Recommended Items
Author(s) -
P. G. Parfenov,
A.A. Timofeeva,
G.B. Sologub,
A.S. Alekseychuk
Publication year - 2020
Publication title -
modelling and data analysis
Language(s) - English
Resource type - Journals
eISSN - 2311-9454
pISSN - 2219-3758
DOI - 10.17759/mda.2020100402
Subject(s) - similarity (geometry) , collaborative filtering , recall , cosine similarity , computer science , quality (philosophy) , precision and recall , random forest , data mining , recommender system , artificial intelligence , machine learning , statistics , pattern recognition (psychology) , mathematics , image (mathematics) , linguistics , philosophy , epistemology
This paper discusses various methods for improving recommendation systems. A comparative analysis of two models for solving classification problems is performed: random forest and CatBoostClassifier. The research was performed on the data of the purchase history of Ozon customers. Standard methods that are often used in recommendation systems were used. We implemented collaborative filtering methods, cosine similarity of products from customer views per site visit, and similarity of text data. To evaluate the results, we used special metrics that evaluate the quality of predictions of the first k objects from the recommendations: Mean average precision (map@K) and Recall at K (recall@k). When generating additional features based on various methods that reveal the similarity of objects, an increase in the quality of model forecasts is noted. The CatBoostClassifier model showed the best results.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom