Sentence structure-based summarization for Indonesian news articles

Bibliographic Details
Published in 2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA) pp. 1 - 6
Main Authors Reztaputra, Raihannur; Khodra, Masayu Leylia
Format Conference Proceeding
Language English
Published IEEE 01.08.2017
DOI 10.1109/ICAICTA.2017.8090983

Abstract Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer creates a concise summary containing the important information from a collection of articles, so that readers need to read only one text to gain information from multiple text sources. Building on previous research, we propose an automatic summarization system that uses sentence structure information (subject, predicate, object, complement). The system consists of four main components: preprocessing and feature extraction, sentence structure information extraction, sentence clustering and fusion, and sentence selection. The system extracts the necessary information using a dependency tree, clusters sentences using Density-Based Spatial Clustering of Applications with Noise (DBSCAN), fuses sentences with a sentence structure information graph, and selects sentences using Maximal Marginal Relevance (MMR). The evaluation shows that the proposed system achieves an average ROUGE-2 score of 0.276, leaving considerable room for improvement. The sentence structure extractor achieves an F1 score of 0.75.
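The sentence selection step named in the abstract, Maximal Marginal Relevance, can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenizer, the bag-of-words cosine similarity, the use of a query string as the relevance target, the parameter names `k` and `lam`, and the example sentences are all assumptions made for illustration. MMR greedily picks the candidate that maximizes `lam * relevance - (1 - lam) * redundancy`, where redundancy is the highest similarity to any sentence already selected.

```python
import math
from collections import Counter


def bow(text):
    """Lowercased bag-of-words vector (whitespace tokenization is an assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr_select(sentences, query, k=2, lam=0.7):
    """Greedy MMR: trade relevance to the query against redundancy
    with the sentences that have already been selected."""
    vecs = [bow(s) for s in sentences]
    q = bow(query)
    selected = []                      # indices of chosen sentences, in order
    candidates = list(range(len(sentences)))

    def score(i):
        relevance = cosine(vecs[i], q)
        redundancy = max((cosine(vecs[i], vecs[j]) for j in selected), default=0.0)
        return lam * relevance - (1 - lam) * redundancy

    while candidates and len(selected) < k:
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return [sentences[i] for i in selected]


# Hypothetical example sentences (illustrative only, not from the paper's data).
sentences = [
    "gempa mengguncang padang",
    "gempa mengguncang padang",
    "warga mengungsi ke tempat aman",
]
summary = mmr_select(sentences, "gempa padang warga mengungsi", k=2, lam=0.7)
print(summary)
```

With `lam` below 1, the duplicated sentence is penalized once its twin has been selected, so the second pick is the sentence about evacuation. Setting `lam=1.0` reduces the procedure to ranking by relevance alone and lets the duplicate through.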
Author Reztaputra, Raihannur
Khodra, Masayu Leylia
Author_xml – sequence: 1
  givenname: Raihannur
  surname: Reztaputra
  fullname: Reztaputra, Raihannur
  email: raihannurr@gmail.com
  organization: School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung, Indonesia
– sequence: 2
  givenname: Masayu Leylia
  surname: Khodra
  fullname: Khodra, Masayu Leylia
  email: masayu@stei.itb.ac.id
  organization: School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung, Indonesia
Discipline Geology
EISBN 153863001X
9781538630013
EndPage 6
ExternalDocumentID 8090983
Genre orig-research
IsPeerReviewed false
IsScholarly false
Language English
PageCount 6
PublicationCentury 2000
PublicationDate 2017-Aug.
PublicationDateYYYYMMDD 2017-08-01
PublicationDate_xml – month: 08
  year: 2017
  text: 2017-Aug.
PublicationDecade 2010
PublicationTitle 2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA)
PublicationTitleAbbrev ICAICTA
PublicationYear 2017
Publisher IEEE
Publisher_xml – name: IEEE
StartPage 1
SubjectTerms clustering
Data mining
dependency tree
Earthquakes
extraction
Feature extraction
Fuses
Geology
Information retrieval
maximal marginal relevance (MMR)
part-of-speech (POS) tag
Redundancy
selection
sentence structure
summarization
Title Sentence structure-based summarization for Indonesian news articles
URI https://ieeexplore.ieee.org/document/8090983