Enhancement of Deep Neural Networks and Their Application to Text Mining

File
Publisher
Florida Atlantic University
Date Issued
2018
EDTF Date Created
2018
Description
Many current application domains of machine learning and arti cial intelligence
involve knowledge discovery from text, such as sentiment analysis, document
ontology, and spam detection. Humans have years of experience and training with
language, enabling them to understand complicated, nuanced text passages with relative
ease. A text classi er attempts to emulate or replicate this knowledge so that
computers can discriminate between concepts encountered in text; however, learning
high-level concepts from text, such as those found in many applications of text classi-
cation, is a challenging task due to the many challenges associated with text mining
and classi cation. Recently, classi ers trained using arti cial neural networks have
been shown to be e ective for a variety of text mining tasks. Convolutional neural
networks have been trained to classify text from character-level input, automatically
learn high-level abstract representations and avoiding the need for human engineered
features.
This dissertation proposes two new techniques for character-level learning,
log(m) character embedding and convolutional window classi cation. Log(m) embedding
is a new character-vector representation for text data that is more compact and memory e cient than previous embedding vectors. Convolutional window classi
cation is a technique for classifying long documents, i.e. documents with lengths
exceeding the input dimension of the neural network. Additionally, we investigate the
performance of convolutional neural networks combined with long short-term memory
networks, explore how document length impacts classi cation performance and
compare performance of neural networks against non-neural network-based learners
in text classi cation tasks.
Note

Includes bibliography.

Language
Type
Extent
156 p.
Identifier
FA00005959
Additional Information
Includes bibliography.
Dissertation (Ph.D.)--Florida Atlantic University, 2018.
FAU Electronic Theses and Dissertations Collection
Date Backup
2018
Date Created Backup
2018
Date Text
2018
Date Created (EDTF)
2018
Date Issued (EDTF)
2018
Extension


FAU

IID
FA00005959
Person Preferred Name

Prusa, Joseph Daniel

author

Graduate College
Physical Description

application/pdf
156 p.
Title Plain
Enhancement of Deep Neural Networks and Their Application to Text Mining
Use and Reproduction
Copyright © is held by the author, with permission granted to Florida Atlantic University to digitize, archive and distribute this item for non-profit research and educational purposes. Any reuse of this item in excess of fair use or other copyright exemptions requires permission of the copyright holder.
http://rightsstatements.org/vocab/InC/1.0/
Origin Information

2018
2018
Florida Atlantic University

Boca Raton, Fla.

Physical Location
Florida Atlantic University Libraries
Place

Boca Raton, Fla.
Sub Location
Digital Library
Title
Enhancement of Deep Neural Networks and Their Application to Text Mining
Other Title Info

Enhancement of Deep Neural Networks and Their Application to Text Mining