Ανάπτυξη εργαλείων για αυτόματη εκτίμηση διάθεσης Ελληνικών κειμένων με χρήση Ημι-Επιβλεπόμενων Αναδρομικών Αυτοσυσχετιστών (Master thesis)
Κοτρότσιος, Κωνσταντίνος
In this master thesis our goal is to develop tools in order to be used in sentiment
analysis applications for texts in Greek language. We have developed a Greek language
dictionary using MySQL and MongoDB database systems, which contains words in
different numbers, genders, tenses etc. A dataset, from users’ opinions about technology
products characterized according to sentiment, has been developed. Two applications, one
for automatic spelling correction and another for lemmatization have also been developed.
A framework of semi-supervised recursive autoencoders, which are trained in vector space
representations for multi-word phrases, is used. These representations in automatic
assessment of text provision outperform other state-of–the-art approaches on commonly
used datasets. This model is applied in on our users’ opinions dataset and the results were
examined to see how they were affected by using our tools use, are tested. Using these tools
for the pretreatment of the dataset, we have improved the results up to 24%.
Institution and School/Department of submitter: | Σχολή Τεχνολογικών Εφαρμογών. Τμήμα Μηχανικών Πληροφορικής |
Keywords: | Αυτόματη εκτίμηση διάθεσης;Ημι-Επιβλεπόμενοι αναδρομικοι αυτοσυσχετιστές;Βαθιά μάθηση;Λημματοποίηση;Sentiment Analysis;Semi-supervised Recursive Autoencoders;Deep Learning;Lemmatization |
Description: | Μεταπτυχιακή Εργασία -- Σχολή Τεχνολογικών Εφαρμογών -- Τμήμα Μηχανικών Πληροφορικής, 2015 (α/α 6977) |
URI: | http://195.251.240.227/jspui/handle/123456789/13687 |
Appears in Collections: | Μεταπτυχιακές Διατριβές |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Kotrotsios_Konstantinos.pdf | 2.45 MB | Adobe PDF | View/Open |
Please use this identifier to cite or link to this item:
This item is a favorite for 0 people.
http://195.251.240.227/jspui/handle/123456789/13687
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.