Perbandingan Metode Naïve Bayes Classifier dan Support Vector Machine untuk Klasifikasi Cyber Harassment pada Twitter

Rezi, Iwardani Saputri (2021) Perbandingan Metode Naïve Bayes Classifier dan Support Vector Machine untuk Klasifikasi Cyber Harassment pada Twitter. Undergraduate Thesis thesis, Institut Teknologi Telkom Purwokerto.

	Text cover.pdf Download (420kB)
	Text Abstract.pdf Download (68kB)
	Text Abstrak.pdf Download (139kB)
	Text Bab I.pdf Download (220kB)
	Text Bab II.pdf Download (469kB)
	Text Bab III.pdf Download (378kB)
	Text Bab IV.pdf Restricted to Registered users only Download (750kB) \| Request a copy
	Text Bab V.pdf Download (139kB)
	Text Daftar Pustaka.pdf Download (160kB)
	Text Lampiran.pdf Restricted to Registered users only Download (229kB) \| Request a copy

Abstract

At this time technology has developed rapidly, more and more are interested in technology, especially social media. With the ease of being able to access social media, it is not uncommon for those who use it to commit crimes. Cyber harassment also known as online harassment can be in the form of threatening or harassing via email, instant message, social media or, posting information online. This case often occurs on social media, such as Twitter. For that, we need an appropriate classification method to solve the Cyber Harassment case from Twitter data, namely in the form of a collection of tweets obtained from crawling using API and Rapidminer. Tweet data consists of two classes, namely those containing harassment (class 1) and not containing harassment (class 0). The data is preprocessed to manage text or features. After that, feature selection is carried out, namely weighing the features with TF-IDF. This study uses the Python programming language and uses two methods, namely the Naïve Bayes Classifier and the Support Vector Machine to compare methods that have good accuracy and determine the performance of each method. The Naïve Bayes Classifier method uses the Complement Naïve Bayes model and the Support Vector Machine uses the Support Vector Classification (SVC) model. The results of the performance of each method by dividing the training data and testing data, namely 80%: 20%, indicate the Naïve Bayes Classifier method with an accuracy of 86.30%, precision of 84.51%, recall of 87.21% and, f1 score 85.84%. and Support Vector Machine with 89.56% accuracy, 83.62% precision, 94.5% recall and, f1 score 88.73%. Thus the Support Vector Machine method is better than the Naïve Bayes Classifier method and can be implemented for the Cyber Harassment case on Twitter. Keywords: Cyber Harassment, Twitter, Python, Naïve Bayes Classifier and, Support Vector Machine.

Item Type:	Thesis (Undergraduate Thesis)
Subjects:	T Technology > T Technology (General)
Divisions:	Faculty of Informatics > Informatics Engineering
Depositing User:	pustakawan ittp
Date Deposited:	24 Sep 2021 07:31
Last Modified:	24 Sep 2021 07:31
URI:	http://repository.ittelkom-pwt.ac.id/id/eprint/6450

Actions (login required)

View Item