
Predictive Analysis Of Breast Cancer Using Machine Learning Techniques
Author(s) - Rashmi Agrawal
Publication year - 2019
Publication title - Ingeniería Solidaria / Revista Ingeniería Solidaria
Language(s) - English
Resource type - Journals
eISSN - 2357-6014
pISSN - 1900-3102
DOI - 10.16925/2357-6014.2019.03.01
Subject(s) - breast cancer, machine learning, artificial intelligence, computer science, naive bayes classifier, support vector machine, predictive modelling, linear discriminant analysis, field (mathematics), cancer, data mining, medicine, mathematics, pure mathematics
This paper is a product of the research project “Predictive Analysis Of Breast Cancer Using Machine Learning Techniques”, carried out at Manav Rachna International Institute of Research and Studies, Faridabad, in 2018.
Introduction: The present article is part of the effort to predict breast cancer, which is a serious concern for women’s health.
Problem: Breast cancer is the most common type of cancer among women and has always been a threat to women’s lives. Early diagnosis requires an effective prediction method that allows physicians to distinguish benign from malignant tumors. Researchers and scientists have long been searching for innovative methods to predict cancer.
Objective: The objective of this paper is the predictive analysis of breast cancer using several machine learning techniques: the Naïve Bayes method, Linear Discriminant Analysis, K-Nearest Neighbors, and the Support Vector Machine method.
Methodology: Predictive data mining has become an instrument for scientists and researchers in the medical field. Predicting breast cancer at an early stage improves the prospects for treatment and cure. KDD (Knowledge Discovery in Databases) is one of the most popular data mining methods used by medical researchers to identify patterns and relationships between variables; it also helps predict the outcome of a disease based on historical data.
Results: To select the best model for cancer prediction, the accuracy of every model is estimated and the model with the highest accuracy is selected.
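The selection step described in the Results can be sketched as follows. This is only an illustration under stated assumptions: the study itself was carried out in R, the function names `accuracy` and `select_best_model` are hypothetical, and the labels and predictions are made-up toy values, not results from the paper.

```python
# Illustrative sketch only: the study itself was carried out in R.
# All labels and predictions below are made-up toy values, not results from the paper.


def accuracy(y_true, y_pred):
    """Return the fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def select_best_model(y_true, predictions):
    """Return (name, accuracy) for the model with the highest accuracy.

    `predictions` maps a model name to its list of predicted labels.
    If several models tie, the first one in the mapping is returned.
    """
    scores = {name: accuracy(y_true, y_pred) for name, y_pred in predictions.items()}
    best = max(scores, key=scores.get)
    return best, scores[best]


# Hypothetical held-out labels: "B" = benign, "M" = malignant.
y_true = ["B", "M", "B", "B", "M", "M", "B", "M"]

# Hypothetical predictions from the four techniques named in the abstract.
toy_predictions = {
    "naive_bayes": ["B", "M", "B", "M", "M", "M", "B", "B"],
    "lda":         ["B", "M", "B", "B", "M", "B", "B", "M"],
    "knn":         ["B", "B", "B", "B", "M", "M", "B", "B"],
    "svm":         ["M", "M", "B", "B", "M", "M", "B", "B"],
}

best_name, best_acc = select_best_model(y_true, toy_predictions)
print(best_name, best_acc)
```

Accuracy here is the plain proportion of correctly classified cases on a held-out set; in practice it would usually be estimated with cross-validation rather than a single split, and the ranking produced by these toy values says nothing about how the techniques actually compare.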
Conclusion: This work seeks to identify the technique that predicts breast cancer with the highest accuracy.
Originality: This research was performed using R and a dataset taken from the UCI Machine Learning Repository.
Limitations: The lack of exact information provided by the data.