Please use this identifier to cite or link to this item:
http://hdl.handle.net/11452/33039
Title: | Sentiment analysis with term weighting and word vectors |
Authors: | Köktas, Haldun Bursa Uludağ Üniversitesi/Mühendislik Fakültesi/Bilgisayar Mühendisliği/Bilgisayar Yazılımı Bölümü. Bilgin, Metin AAH-2049-2021 57198185260 |
Keywords: | Computer science Engineering Word2vec Doc2vec Sentiment analysis Machine learning Natural language processing Classification |
Issue Date: | Sep-2019 |
Publisher: | Zarka Private University |
Citation: | Bilgin, M. ve Köktas, H. (2019). ''Sentiment analysis with term weighting and word vectors''. International Arab Journal of Information Technology, 16(5), 953-959. |
Abstract: | It is the sentiment analysis with which it is fried to predict the sentiment being told in the texts in an area where Natural Language Processing (NLP) studies are being frequently used in recent years. In this study sentiment extraction has been made from Turkish texts and performances of methods that are used in text representation have been compared. In the study being conducted, besides Bag of Words (BoW) method which is traditionally used for the representation of texts, Word2Vec, which is word vector algorithm being developed in recent years and Doc2Vec, being document vector algorithm, have been used. For the study 5 different Machine Learning (ML) algorithms have been used to classify the texts being represented in 5 different ways on 3000 pieces of labeled tweets belonging to a telecom company. As a conclusion it was seen that Word2Vec, being among text representation methods and Random Forest, being among ML algorithms were most successful and most applicable ones. It is important as it is the first study with which BoW and word vectors have been compared for sentiment analysis in Turkish texts. |
URI: | http://hdl.handle.net/11452/33039 |
ISSN: | 1683-3198 |
Appears in Collections: | Scopus Web of Science |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.