Corpus Analysis Using Relaxed Conjugate Gradient Neural Network Training Algorithm
| Published in | Neural processing letters Vol. 50; no. 1; pp. 839 - 849 |
|---|---|
| Format | Journal Article |
| Language | English |
| Published | New York, Springer US, 01.08.2019, Springer Nature B.V |
| ISSN | 1370-4621 1573-773X |
| DOI | 10.1007/s11063-018-9948-8 |
| Summary: | Corpus analysis is one of the most powerful methods in text mining, data discovery, and finding relationships among documents. In linguistics, a corpus (plural corpora) is a large and structured set of texts, which here is to be classified by artificial intelligence systems. The performance of conventional text classifiers on corpora is usually unsatisfactory. In this paper, a novel text classifier for corpus analysis is proposed that combines advanced numerical unconstrained nonlinear optimization with neural networks. The proposed approach, an artificial neural network trained with the relaxed conjugate gradient (RCG) algorithm, classifies each document using an n-gram token filter in which each token is weighted by its TF score multiplied by its IDF score. The proposed updating formula for neural network training combines the good numerical performance of the Polak–Ribière technique with the strong global convergence properties of the Fletcher–Reeves method, and it also inherits some adaptations from the Hestenes–Stiefel and Dai–Yuan conjugate gradient updating procedures through the relaxation equation. The proposed algorithm was evaluated on verses of the Holy Quran, and its outcomes were compared with those of competitors such as the classical gradient descent algorithm, the modified quickprop algorithm, the conjugate gradient algorithm with Hestenes–Stiefel update, the conjugate gradient algorithm with Polak–Ribière update, the conjugate gradient algorithm with Fletcher–Reeves update, the scaled conjugate gradient algorithm, the variable memory Broyden–Fletcher–Goldfarb–Shanno update, and the smoothed regularized conjugate gradient method. Based on these experiments, the proposed RCG is able to classify a text corpus accurately at low computational cost. |
|---|---|
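
The summary names four classical conjugate gradient update rules (Fletcher–Reeves, Polak–Ribière, Hestenes–Stiefel, Dai–Yuan) that the RCG formula draws on. The relaxation equation itself is not stated in this record, so it is not reproduced here. As a minimal sketch, assuming the standard textbook definitions of the four coefficients, the following Python computes each one from the previous gradient, the new gradient, and the previous search direction. The function names `cg_betas` and `next_direction` are hypothetical and chosen only for illustration.

```python
def dot(u, v):
    """Inner product of two equal-length sequences of floats."""
    return sum(a * b for a, b in zip(u, v))


def cg_betas(g_prev, g_new, d_prev):
    """Return the four classical conjugate gradient coefficients.

    g_prev: gradient at the previous iterate
    g_new:  gradient at the new iterate
    d_prev: search direction used at the previous iterate
    Assumes g_prev is nonzero and d_prev . (g_new - g_prev) is nonzero.
    """
    y = [gn - gp for gn, gp in zip(g_new, g_prev)]  # gradient change
    gg_prev = dot(g_prev, g_prev)
    gg_new = dot(g_new, g_new)
    dy = dot(d_prev, y)
    return {
        "FR": gg_new / gg_prev,          # Fletcher-Reeves
        "PR": dot(g_new, y) / gg_prev,   # Polak-Ribiere
        "HS": dot(g_new, y) / dy,        # Hestenes-Stiefel
        "DY": gg_new / dy,               # Dai-Yuan
    }


def next_direction(g_new, d_prev, beta):
    """New search direction: negative gradient plus beta times old direction."""
    return [-gn + beta * dp for gn, dp in zip(g_new, d_prev)]


if __name__ == "__main__":
    g_prev = [1.0, 0.0]
    d_prev = [-1.0, 0.0]   # first step taken along steepest descent
    g_new = [0.5, 2.0]     # inexact line search: g_new not orthogonal to d_prev
    betas = cg_betas(g_prev, g_new, d_prev)
    for name, beta in betas.items():
        print(name, beta, next_direction(g_new, d_prev, beta))
```

When the first direction is steepest descent and the line search is exact (the new gradient is orthogonal to the previous direction), all four coefficients coincide. They differ only under inexact line searches, which is the setting where a combined update rule has room to choose among them.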