Sentence structure-based summarization for Indonesian news articles
Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer create a concise summary containing important information from a collection of articles, enabling readers to read only one text to gain information from multiple text...
Saved in:
| Published in | 2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA) pp. 1 - 6 |
|---|---|
| Main Authors | , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
01.08.2017
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.1109/ICAICTA.2017.8090983 |
Cover
| Abstract | Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer create a concise summary containing important information from a collection of articles, enabling readers to read only one text to gain information from multiple text sources. Reflecting on previous researches, we propose an automatic summarization system using sentence structure information (subject, object, predicate, complement). The system consists of four main components, preprocessing and feature extraction, sentence structure information extraction, sentence clustering and fusion, and sentence selection. The system will extract the necessary information using dependency tree, cluster sentences using Density Based Spatial Clustering for Application with Noise (DBSCAN), fuse sentences with sentence structure information graph, and select sentences using Maximal Marginal Relevance (MMR). The evaluation shows that the proposed system performs with 0.276 average ROUGE-2, with many chances of improvements. Sentence structure extractor has 0.75 f1-measure score. |
|---|---|
| AbstractList | Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer create a concise summary containing important information from a collection of articles, enabling readers to read only one text to gain information from multiple text sources. Reflecting on previous researches, we propose an automatic summarization system using sentence structure information (subject, object, predicate, complement). The system consists of four main components, preprocessing and feature extraction, sentence structure information extraction, sentence clustering and fusion, and sentence selection. The system will extract the necessary information using dependency tree, cluster sentences using Density Based Spatial Clustering for Application with Noise (DBSCAN), fuse sentences with sentence structure information graph, and select sentences using Maximal Marginal Relevance (MMR). The evaluation shows that the proposed system performs with 0.276 average ROUGE-2, with many chances of improvements. Sentence structure extractor has 0.75 f1-measure score. |
| Author | Reztaputra, Raihannur Khodra, Masayu Leylia |
| Author_xml | – sequence: 1 givenname: Raihannur surname: Reztaputra fullname: Reztaputra, Raihannur email: raihannurr@gmail.com organization: School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung, Indonesia – sequence: 2 givenname: Masayu Leylia surname: Khodra fullname: Khodra, Masayu Leylia email: masayu@stei.itb.ac.id organization: School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung, Indonesia |
| BookMark | eNotj8tKxDAYRiPowhl9Al3kBVr_JJO0WQ7FS2HAhV24G3L5A4GZVJIU0ae34Ky-1Tmcb0Ou05yQkEcGLWOgn8ZhPw7TvuXAurYHDboXV2TDpOiVAGCft2T4wFQxOaSl5sXVJWNjTUFPy3I-mxx_TY1zomHOdEx-9ZdoEk34XajJNboTljtyE8yp4P1lt2R6eZ6Gt-bw_romHJqooTYiqCA99wE7zjAE0XFnjbY7qZ3eBeAW1mhvASRDJRQ6b1GylWBcW2XEljz8ayMiHr9yXPN-jpdb4g-yoUlY |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/ICAICTA.2017.8090983 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Geology |
| EISBN | 153863001X 9781538630013 |
| EndPage | 6 |
| ExternalDocumentID | 8090983 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-LOGICAL-i90t-3f6f5d2dfe721eff372cba9b459c94f02b0110db0051e636ecdbe51d2d129b6a3 |
| IEDL.DBID | RIE |
| IngestDate | Thu Jun 29 18:39:06 EDT 2023 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i90t-3f6f5d2dfe721eff372cba9b459c94f02b0110db0051e636ecdbe51d2d129b6a3 |
| PageCount | 6 |
| ParticipantIDs | ieee_primary_8090983 |
| PublicationCentury | 2000 |
| PublicationDate | 2017-Aug. |
| PublicationDateYYYYMMDD | 2017-08-01 |
| PublicationDate_xml | – month: 08 year: 2017 text: 2017-Aug. |
| PublicationDecade | 2010 |
| PublicationTitle | 2017 International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA) |
| PublicationTitleAbbrev | ICAICTA |
| PublicationYear | 2017 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 1.652789 |
| Snippet | Automatic multi-document summarization may help news readers retrieve information from digital news media efficiently. The summarizer create a concise summary... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 1 |
| SubjectTerms | clustering Data mining dependency tree Earthquakes extraction Feature extraction Fuses Geology Information retrieval maximal marginal relevance (MMR) part-of-speech (POS) tag Redundancy selection sentence structure summarization |
| Title | Sentence structure-based summarization for Indonesian news articles |
| URI | https://ieeexplore.ieee.org/document/8090983 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LS8MwGP_YBoInH5v4JgePpktfaXIcxbkJE8EJu42m-QIiTJH1oH-9-dpuonjwFkpC2oTy-9L-HgBXqAtydlJcuDDiiT9C8EJlvhWiBysXSkxJKDy7l5On5G6RLjpwvdXCIGJNPsOAmvW_fPtaVvSpbKiEFlrFXehmSjZarVYNFwo9nOajaT4fEV0rC9quPzJTasgY78FsM1nDFHkJqrUJys9fPoz_vZt9GHyL89jDFnYOoIOrQ9i5rRN6P_qQP5LLJnVqrGGrd-QEVZY1OrVWd8l8scqmFOWBJKNkVF2zDUtuAPPxzTyf8DYpgT9rseaxky61kXXoz3PoXJxFpSm0SVJd6sSJyBDK2_oNRBlLLK3BNPQjPNobWcRH0Fv5-Y6BWYVaaoqu8bWVjIwyaJSQJixihaWJTqBPK7F8a7wwlu0inP59-Qx2aTcawtw59Pxz44UH8bW5rHfvC5FPnvo |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3NS8MwFH_MiejJj038tgePtutHkiXHUZyrbkOwwm6jaV9AhCnSHfSvN6_tJooHb6GkpLxQfi_t7wPgClVGzk7S9U0QusweIdxM9u0oQAtWJhDISSg8mYrRE7ub8VkLrtdaGESsyGfo0bD6l1-85kv6VNaTvvKVjDZgkzPGeK3WavRwga96STxI4nRAhK2-10z-kZpSgcZwFyar5WquyIu3LLWXf_5yYvzv8-xB91ue5zysgWcfWrg4gK3bKqP3owPxI_ls0qTaHHb5ji6BVeHUSrVGeenYdtVJKMwDSUjpUH_trHhyXUiHN2k8cpusBPdZ-aUbGWF4ERYG7YkOjYn6Ya4zpRlXuWLGDzXhfFG9gygigXmhkQf2Dov3WmTRIbQXdr0jcAqJSigKr7HdlQi11KilL3SQRRJzHR5Dhyoxf6vdMOZNEU7-vnwJ26N0Mp6Pk-n9KezQztT0uTNo2xrguYX0Ul9UO_kFEgOiRw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2017+International+Conference+on+Advanced+Informatics%2C+Concepts%2C+Theory%2C+and+Applications+%28ICAICTA%29&rft.atitle=Sentence+structure-based+summarization+for+Indonesian+news+articles&rft.au=Reztaputra%2C+Raihannur&rft.au=Khodra%2C+Masayu+Leylia&rft.date=2017-08-01&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FICAICTA.2017.8090983&rft.externalDocID=8090983 |