Corpus Analysis Using Relaxed Conjugate Gradient Neural Network Training Algorithm

Corpus analysis is one of the most powerful methods in text mining, data discovery, and finding relationships among documents. In linguistics, a corpus (plural corpora) is a large and structured set of texts which should to be classified by artificial intelligence systems. The performance of convent...

Full description

Saved in:
Bibliographic Details
Published inNeural processing letters Vol. 50; no. 1; pp. 839 - 849
Main Author Borhani, Mostafa
Format Journal Article
LanguageEnglish
Published New York Springer US 01.08.2019
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1370-4621
1573-773X
DOI10.1007/s11063-018-9948-8

Cover

More Information
Summary:Corpus analysis is one of the most powerful methods in text mining, data discovery, and finding relationships among documents. In linguistics, a corpus (plural corpora) is a large and structured set of texts which should to be classified by artificial intelligence systems. The performance of conventional text classifiers on corpora is usually unsatisfying. In this paper, a novel text classifier for corpus analysis is proposed by using advanced numerical unconstrained nonlinear optimization in collaboration with neural networks. The proposed approach, the relaxed conjugate gradient (RCG) trained artificial neural network, classifies each document using n-gram token filter by TF score multiplied by its IDF score. The proposed updating formula for training of neural networks combines the good numerical performance of Polak–Ribière technique and the wonderful global convergence properties of Fletcher–Reeves method and also it inherits some adaption from Hestenes–Stiefel, and Dai–Yuan conjugate gradient updating procedures by using the relaxation equation. The our proposed algorithm was evaluated on verses of Holy Quran and its outcomes were compared with results of its competitors such as the classical gradient descent algorithm, the modified quickprop algorithm, the conjugate gradient algorithm with Hestenes–Stiefel update, the conjugate gradient algorithm with Polak–Ribiere update, the conjugate gradient algorithm with Fletcher–Reeves updates, the scaled conjugate gradient algorithm, the variable memory Broyden–Fletcher–Goldfarb–Shanno update, and smoothed regularized conjugate gradient method. Based on these experiments, the proposed RCG is able to accurately classify text corpus with low computational cost.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1370-4621
1573-773X
DOI:10.1007/s11063-018-9948-8