z-logo
open-access-imgOpen Access
Resume Classification System using Natural Language Processing and Machine Learning Techniques
Author(s) -
Irfan Ali,
AUTHOR_ID,
Nimra Mughal,
Zahid Hussain Khan,
Javed Ahmed,
Ghulam Mujtaba,
AUTHOR_ID,
AUTHOR_ID,
AUTHOR_ID,
AUTHOR_ID
Publication year - 2022
Publication title -
mehran university research journal of engineering and technology
Language(s) - English
Resource type - Journals
eISSN - 2413-7219
pISSN - 0254-7821
DOI - 10.22581/muet1982.2201.07
Subject(s) - artificial intelligence , computer science , machine learning , support vector machine , confusion matrix , naive bayes classifier , random forest , feature selection , natural language processing
The selection of a suitable job applicant from the pool of thousands applications is often daunting job for an employer. The categorization of job applications submitted in form of Resumes against available vacancy(s) takes significant time and efforts of an employer. Thus, Resume Classification System (RCS) using the Natural Language Processing (NLP) and Machine Learning (ML) techniques could automate this tedious process. Moreover, the automation of this process can significantly expedite and transparent the applicants’ screening process with mere human involvement. This experimental study presents an automated NLP and ML-based RCS that classifies the Resumes according to job categories with performance guarantees. This study employs various ML algorithms and NLP techniques to measure the accuracy of RCS and proposes a solution with better accuracy and reliability in different settings. To demonstrate the significance of NLP and ML techniques for RCS, the extracted features were evaluated on nine ML classification models namely Support Vector Machine - SVM (Linear, SGD, SVC and NuSVC), Naïve Bayes (Bernoulli, Multinomial & Gaussian), K-Nearest Neighbor (KNN), and Logistic Regression (LR). The Term-Frequency-Inverse-Document-Frequency (TF-IDF) feature representation scheme was proved suitable for RCS. The developed models were evaluated using the Confusion Matrix, F-Score, Recall, Precision, and overall Accuracy. The experimental results indicate that using the One-Vs-Rest-Classification strategy for this multi-class Resume classification task, the SVM class of Machine Learning classifiers performed better on the study dataset of over nine hundred sixty plus parsed resumes with more than 96% accuracy. The promising results suggest that NLP and ML techniques employed in this study could be used for developing an efficient RCS.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here