Machine Learning Sms Spam Detection Model
Abstract
Millions of shillings are lost by mobile phone users every year in Kenya due to SMs Spam,
a social engineering skill attempting to obtain sensitive information such as passwords,
Personal identification numbers and other details by masquerading as a trustworthy entity
in an electronic commerce. The design of efficient fraud detection algorithm and
techniques is key to reducing these losses. Fraud detection using machine learning is a new
approach of detecting fraud especially in Mobile commerce. The design of fraud detection
techniques in a mobile platform is challenging due to the non-stationary distribution of the
data. Most machine learning techniques especially in SMs Spam deal with one language. It
is in this background that the study will focus on a client side SMs Spam detection in
Kenya’s mobile using machine learning. Naive’s Bayes algorithm was used for this
purpose because it is highly scalable in text classification. The contributors of Corpus
include mobile service providers in Kenya and selected mobile phone users. Machine
learning and data mining experiments were conducted using WEKA .The results and
discussions are presented in form of descriptive statistics and detection metrics, the model
registered an overall classification accuracy of 96.1039% .