문장 레벨 그래프 회선 신경망을 통한 텍스트 분류

텍스트 분류는 자연어처리 분야의 전통적인 문제이다. 기존의 RNN 및 CNN 기반 텍스트 분류 모델들은 순차적인 단어 구조에 의존하기 때문에 인접하지 않지만 관련성이 높은 단어 간의 관계를 유추하기 어렵다는 문제점이 있다. 반면 GCN(Graph Convolutional Network)은 그래프의 형태로 데이터를 입력받기 때문에 문장의 순차적 구조에 대한 의존도를 줄일 수 있다. 본 논문에서는 문서의 비순차적인 관계를 그래프로 담아내어 더욱 효과적으로 파악하고 분류하는 인공신경망 모델을 제안한다. 문서를 그래프로 표현하기 위해 각...

Full description

Saved in:

Bibliographic Details
Published in	정보과학회 컴퓨팅의 실제 논문지 Vol. 25; no. 8; pp. 397 - 401
Main Authors	이민우(Minwoo Lee), 김양훈(Yanghoon Kim), 정교민(Kyomin Jung)
Format	Journal Article
Language	Korean
Published	Korean Institute of Information Scientists and Engineers 01.08.2019 한국정보과학회
Subjects	컴퓨터학 텍스트그래프 자연어 처리 텍스트 분류 natural language processing textgraph graph classification neural networks 그래프 회선 신경망 text classification graph convolutional networks 인공 신경망 그래프 분류
Online Access	Get full text
ISSN	2383-6318 2383-6326
DOI	10.5626/KTCP.2019.25.8.397

Cover

Abstract	텍스트 분류는 자연어처리 분야의 전통적인 문제이다. 기존의 RNN 및 CNN 기반 텍스트 분류 모델들은 순차적인 단어 구조에 의존하기 때문에 인접하지 않지만 관련성이 높은 단어 간의 관계를 유추하기 어렵다는 문제점이 있다. 반면 GCN(Graph Convolutional Network)은 그래프의 형태로 데이터를 입력받기 때문에 문장의 순차적 구조에 대한 의존도를 줄일 수 있다. 본 논문에서는 문서의 비순차적인 관계를 그래프로 담아내어 더욱 효과적으로 파악하고 분류하는 인공신경망 모델을 제안한다. 문서를 그래프로 표현하기 위해 각 단어를 그래프의 노드로 변환하고, 단어 간의 관계를 계산해 엣지로 정의한다. 최근에 제시된 GCN 구조를 통해 단어 간의 관계가 반영된 단어 벡터를 계산한 뒤, 어텐션 기반 요약 함수를 통해 문단을 주어진 클래스로 분류하는 방법을 제시한다. 실험 결과, 새롭게 제시된 모델이 RNN 및 CNN 기반 텍스트 분류 모델보다 좋은 성능을 보였다. Text classification is an important task in natural language processing, and most of the recent approaches employ neural networks to learn and classify the texts. RNN and CNN based models, which are widely used for solving the task, involve reading and processing the text in a sequential manner. This creates inefficiency in learning dependencies between far-apart words. On the contrary, Graph Convolutional Network (GCN) architecture is capable of processing more complex graph-structured data, thus having potential to recognize and learn from complex linguistic structures. In the present work, we transform text sequences into graphs by assigning each word in the text as a node and representing the relationship between words as edges. We then propose a method for solving text classification that uses recent GCN architectures to take the transformed text-graph as input, learn hidden representations, and output a single hidden representation for classification. In our experiments, our proposed model outperformed RNN and CNN based models with regards to various text classification tasks. KCI Citation Count: 0
AbstractList	텍스트 분류는 자연어처리 분야의 전통적인 문제이다. 기존의 RNN 및 CNN 기반 텍스트 분류 모델들은 순차적인 단어 구조에 의존하기 때문에 인접하지 않지만 관련성이 높은 단어 간의 관계를 유추하기 어렵다는 문제점이 있다. 반면 GCN(Graph Convolutional Network)은 그래프의 형태로 데이터를 입력받기 때문에 문장의 순차적 구조에 대한 의존도를 줄일 수 있다. 본 논문에서는 문서의 비순차적인 관계를 그래프로 담아내어 더욱 효과적으로 파악하고 분류하는 인공신경망 모델을 제안한다. 문서를 그래프로 표현하기 위해 각 단어를 그래프의 노드로 변환하고, 단어 간의 관계를 계산해 엣지로 정의한다. 최근에 제시된 GCN 구조를 통해 단어 간의 관계가 반영된 단어 벡터를 계산한 뒤, 어텐션 기반 요약 함수를 통해 문단을 주어진 클래스로 분류하는 방법을 제시한다. 실험 결과, 새롭게 제시된 모델이 RNN 및 CNN 기반 텍스트 분류 모델보다 좋은 성능을 보였다. Text classification is an important task in natural language processing, and most of the recent approaches employ neural networks to learn and classify the texts. RNN and CNN based models, which are widely used for solving the task, involve reading and processing the text in a sequential manner. This creates inefficiency in learning dependencies between far-apart words. On the contrary, Graph Convolutional Network (GCN) architecture is capable of processing more complex graph-structured data, thus having potential to recognize and learn from complex linguistic structures. In the present work, we transform text sequences into graphs by assigning each word in the text as a node and representing the relationship between words as edges. We then propose a method for solving text classification that uses recent GCN architectures to take the transformed text-graph as input, learn hidden representations, and output a single hidden representation for classification. In our experiments, our proposed model outperformed RNN and CNN based models with regards to various text classification tasks. KCI Citation Count: 0
Author	정교민(Kyomin Jung) 이민우(Minwoo Lee) 김양훈(Yanghoon Kim)
Author_xml	– sequence: 1 fullname: 이민우(Minwoo Lee) – sequence: 2 fullname: 김양훈(Yanghoon Kim) – sequence: 3 fullname: 정교민(Kyomin Jung)
BackLink	https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002491902$$DAccess content in National Research Foundation of Korea (NRF)
BookMark	eNo9kD9Lw1AUxR9SwVr7BZyyODgkvndf8vIylli1WKxI9kf-PQnVVFIc3GuRqptCEVs6WHRwKqhd_EJN8h0MWrzL-XE4HLhnHZXiThwitEmwZjBgO4eOfawBJpYGhsY1apkrqAyUU5VRYKV_JnwNVbvdyMNAOOjcImVkp-_zbDxV0slNOntTFl_zdDzMH3pK_nSX9SZKdjtZzL7T11E2Krz-R_74rOTX99ngJR_MlfSzl06HG2hVumfdsLrUCnL26o59oDZb-w271lRjphOV6pIBMEv3fJ8ySZiFJYA0cUCpjwvmATCfAAXuezL0i_MCI7SoNDkYLKAVtP1XGydStP1IdNzoV087op2I2onTEIalF5_hIru1zF4m0XkYRK64KMBNrsRRa7eOuVkMB4T-AOsJcGQ
ContentType	Journal Article
DBID	DBRKI TDB ACYCR
DOI	10.5626/KTCP.2019.25.8.397
DatabaseName	DBPIA - 디비피아 누리미디어 DBpia Korean Citation Index
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
DocumentTitleAlternate	Text Classification via Sentence-level Graph Convolutional Networks
DocumentTitle_FL	Text Classification via Sentence-level Graph Convolutional Networks
EISSN	2383-6326
EndPage	401
ExternalDocumentID	oai_kci_go_kr_ARTI_5948240 NODE08762621
GroupedDBID	.UV ALMA_UNASSIGNED_HOLDINGS DBRKI TDB ACYCR
ID	FETCH-LOGICAL-n641-34f622694bcc36f1690f22f70d33c00f28d26c12328cbfeccccbd5e93f78256d3
ISSN	2383-6318
IngestDate	Sun Mar 09 07:50:45 EDT 2025 Thu Feb 06 13:28:17 EST 2025
IsPeerReviewed	false
IsScholarly	false
Issue	8
Keywords	텍스트그래프 자연어 처리 텍스트 분류 natural language processing textgraph graph classification neural networks 그래프 회선 신경망 text classification graph convolutional networks 인공 신경망 그래프 분류
Language	Korean
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-n641-34f622694bcc36f1690f22f70d33c00f28d26c12328cbfeccccbd5e93f78256d3
PageCount	5
ParticipantIDs	nrf_kci_oai_kci_go_kr_ARTI_5948240 nurimedia_primary_NODE08762621
PublicationCentury	2000
PublicationDate	2019-08
PublicationDateYYYYMMDD	2019-08-01
PublicationDate_xml	– month: 08 year: 2019 text: 2019-08
PublicationDecade	2010
PublicationTitle	정보과학회 컴퓨팅의 실제 논문지
PublicationYear	2019
Publisher	Korean Institute of Information Scientists and Engineers 한국정보과학회
Publisher_xml	– name: Korean Institute of Information Scientists and Engineers – name: 한국정보과학회
SSID	ssib021824891 ssib044742771 ssib053377435 ssib019653237
Score	1.7084358
Snippet	텍스트 분류는 자연어처리 분야의 전통적인 문제이다. 기존의 RNN 및 CNN 기반 텍스트 분류 모델들은 순차적인 단어 구조에 의존하기 때문에 인접하지 않지만 관련성이...
SourceID	nrf nurimedia
SourceType	Open Website Publisher
StartPage	397
SubjectTerms	컴퓨터학
Title	문장 레벨 그래프 회선 신경망을 통한 텍스트 분류
URI	https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE08762621 https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002491902
Volume	25
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
ispartofPNX	정보과학회 컴퓨팅의 실제 논문지, 2019, 25(8), , pp.397-401
journalDatabaseRights	– providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2383-6326 dateEnd: 99991231 omitProxy: true ssIdentifier: ssib044742771 issn: 2383-6318 databaseCode: M~E dateStart: 20140101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnR3LbtNA0CrlABcEAkR5VBZiT5FD4l2vd4-2k6q0auEQpHKy7E1cogoHRe0BDpwKQgVuIFWIVj1QwYELlYBe-KEm-Qdm1o7jAhKPS7KazM7MerK7M-uZWcO4weO2IyiTlkpUYjE7diwwjlwriiKHRlE7SXTW-9Iyn7_LFlaclakTn0tRSxvrcVU9_m1eyf9oFWCgV8yS_QfNFkQBAG3QL3yChuHzr3RMmj7xAuIL0gyIbBLPqWhQjQiBDd8mngCQR3xXY_mIJaHRIBKMSFbRLY-IACkIBl0ruqWJYD-b-A1N0yWyodk0xv0EJ76jCThEBhnIIUJjCY94TEOAhtBS-RwZIClAF2WjGDsAOyCDaJT4TLOGRjAmLwtBtXy-r5EARGGEmk-AzDP5pCgGwca0Ay0DoCDN0lODcYnijU3e38_EnNO4AbL1a2CIL3UxQKmiQ5fkpAv8KjFeBDE5kXUtlo8asMW9KF2934MJpi-tlmU--YA9fIayNmFoi8VHvQfdtLKgl2FZPpbBTDBRPpZZ7PXxTcqxiI88wSzbCXS6K8zkrBL3uPhk6YAWbClqcZrvTp0yLKsxMN6_ssTxfJ6K0mZEs8jn3K5hmXQ_b5lg_-LpzWIruIOBjrJqO1VRLboeK0W-fLvRxBKGNsfSDSdtl3O8NGTpSXO8hGN9Sloqf4T3AzAxKcfEmMtsd-KBgLMB_oe-BrcYb5bahnLd_FUqMAHTPliOp9INvP4C1tCSOdg6a5zJ_TjTyyblOWNqrXfeCAafDoe7--Zg7_ng4KN59O1wsLs9er1pjt6-HG7umcMXe0cH3wcfdoY7AHv2ZfTmnTl6-mq49X60dWgOvm4O9rcvGK25ZiuYt_JbSqyUs7pFWcJtTAePlaI8wbfOiW0nbq1NqapBW7RtrtBxESpOYMFUCpbHjqQJ2OYOb9OLxnTaSzuXDDOuK-G4dXDwI8VEnYqEOdSNY6k64CWwaMa4DmMP11Q3xKLw-L3aC9f6Ibi-t0KsuwTm-YwxWzya8GFWsSYsK-7ynxCuGKcnf-irxvR6f6NzDSzv9XhW6_oHEzOxXQ
linkProvider	ISSN International Centre
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=%EB%AC%B8%EC%9E%A5+%EB%A0%88%EB%B2%A8+%EA%B7%B8%EB%9E%98%ED%94%84+%ED%9A%8C%EC%84%A0+%EC%8B%A0%EA%B2%BD%EB%A7%9D%EC%9D%84+%ED%86%B5%ED%95%9C+%ED%85%8D%EC%8A%A4%ED%8A%B8+%EB%B6%84%EB%A5%98&rft.jtitle=%EC%A0%95%EB%B3%B4%EA%B3%BC%ED%95%99%ED%9A%8C+%EC%BB%B4%ED%93%A8%ED%8C%85%EC%9D%98+%EC%8B%A4%EC%A0%9C+%EB%85%BC%EB%AC%B8%EC%A7%80&rft.au=%EC%9D%B4%EB%AF%BC%EC%9A%B0%28Minwoo+Lee%29&rft.au=%EA%B9%80%EC%96%91%ED%9B%88%28Yanghoon+Kim%29&rft.au=%EC%A0%95%EA%B5%90%EB%AF%BC%28Kyomin+Jung%29&rft.date=2019-08-01&rft.pub=Korean+Institute+of+Information+Scientists+and+Engineers&rft.issn=2383-6318&rft.eissn=2383-6326&rft.volume=25&rft.issue=8&rft.spage=397&rft.epage=401&rft_id=info:doi/10.5626%2FKTCP.2019.25.8.397&rft.externalDocID=NODE08762621
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2383-6318&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2383-6318&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2383-6318&client=summon