Automatic classification of news stories – A machine learning approach

dc.contributor.authorKwawu, Anthony Kafui
dc.date.accessioned2017-05-04T11:35:49Z
dc.date.available2017-05-04T11:35:49Z
dc.date.issued2016-04
dc.descriptionThesis submitted to the Department of Computer Science, Ashesi University College, in partial fulfillment of Bachelor of Science degree in Computer Science, April 2016en_US
dc.description.abstractHumans are good at classifying things because our brains are adept at understanding contextual nuances. Machines, however, need to be fed the right features to achieve reasonably good levels of classification. Classifying text manually is a time-consuming and expensive process especially in the information age where a combination of the success of cloud computing, big data and the resurgent trend of the internet of things as well as unprecedented population growth have led to an explosion in the amount of data that we have to deal with – approximately 2.5 quintillion bytes every 24 hours (Walker, 2015). This Thesis explores the efficiency of two well-known machine learning classification algorithms; Naïve Bayes and Support Vector Machines in classifying news stories - an important subset of the global repositories of information. The findings in this study report that using machine learning to classify news stories is not easy but is feasible and if done properly can yield accuracy rates of at least 70%. These results translate into significant time savings that cannot be achieved by manual classification and are a precursor to other machine learning techniques such as recommendation, clustering and sentiment analysis.en_US
dc.description.sponsorshipAshesi University Collegeen_US
dc.identifier.urihttp://hdl.handle.net/20.500.11988/179
dc.language.isoen_USen_US
dc.subjectmachine learningen_US
dc.subjectclassification algorithmsen_US
dc.subjectNaïve Bayes classification algorithmen_US
dc.subjectSupport Vector Machines classification algorithmen_US
dc.titleAutomatic classification of news stories – A machine learning approachen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Anthony Kwawu_27349_assignsubmission_file_Final Submission.pdf
Size:
852.93 KB
Format:
Adobe Portable Document Format
Description: