Automatic news-roundup generation using clustering, extraction, and presentation

Bibliographic Details
Published in Multimedia systems Vol. 26; no. 2; pp. 201 - 221
Main Authors Utomo, Vincent; Leu, Jenq-Shiou
Format Journal Article
Language English
Published Berlin/Heidelberg Springer Berlin Heidelberg 01.04.2020
Springer Nature B.V
ISSN 0942-4962
1432-1882
DOI 10.1007/s00530-019-00638-4

Abstract As the internet has grown, the amount of published information has increased exponentially. This flood of information causes a problem called “information overload”, which makes it harder for internet users to find the key information they need. To address this, this paper proposes an application that helps users easily find trending news related to their query or interest. The challenges include how to determine the trending subtopics, how to extract only the content of each webpage, and how to present the data to the user. The study therefore uses three core modules: clustering, extraction, and presentation. Several clustering methods are tested, including naïve, manual-thresholding, and heuristic methods. The results show that hierarchical clustering using tf–idf word weighting, cosine similarity as the distance measure, and heuristic termination by elbow-point analysis achieves the best result, at 50.84% Acc and 61.96% NMI. One challenge commonly faced by extraction algorithms is a tendency to become less effective over time. In this paper, the extraction algorithm uses a subject/keyword known in advance to help the content extraction process. A second stage of noise removal is also introduced to remove noise that remains within the content block. The evaluation shows a score improvement of 7.48%. Respondents rated the final application 4.18 out of 5 for helpfulness and 4.35 out of 5 for effectiveness, indicating that the proposed application can help users find information and mitigate the information overload problem.
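The clustering configuration named in the abstract (tf–idf weighting, cosine distance, hierarchical clustering stopped at an elbow point) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the whitespace tokenizer, the smoothed idf formula, the use of average linkage, the "largest jump in merge distance" reading of the elbow point, and the function names `tfidf_vectors`, `cosine_distance`, and `cluster_with_elbow`.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse tf-idf vector (dict: term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]  # naive tokenizer (assumption)
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            # term frequency times a smoothed inverse document frequency
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0)
            for term, count in counts.items()
        })
    return vectors


def cosine_distance(a, b):
    """1 - cosine similarity of two sparse vectors; 1.0 if either is empty."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def cluster_with_elbow(docs):
    """Average-linkage agglomerative clustering, cut at the largest distance jump."""
    vectors = tfidf_vectors(docs)
    n = len(vectors)
    dist = [[cosine_distance(vectors[i], vectors[j]) for j in range(n)]
            for i in range(n)]
    clusters = [[i] for i in range(n)]
    history = [[list(c) for c in clusters]]  # partition before any merge
    merge_dists = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # average pairwise distance between the two clusters
                total = sum(dist[i][j] for i in clusters[a] for j in clusters[b])
                d = total / (len(clusters[a]) * len(clusters[b]))
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merged = clusters[a] + clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
        merge_dists.append(d)
        history.append([list(c) for c in clusters])
    if len(merge_dists) < 2:
        return history[0]  # too few merges to locate an elbow
    # Elbow heuristic (assumption): stop just before the merge whose distance
    # rises the most compared with the previous merge.
    jumps = [merge_dists[i + 1] - merge_dists[i]
             for i in range(len(merge_dists) - 1)]
    elbow = max(range(len(jumps)), key=jumps.__getitem__)
    return history[elbow + 1]
```

Calling `cluster_with_elbow` on a list of headline strings returns a list of clusters, each a list of document indices. The exhaustive pair search is only suitable for small inputs; a production system would more likely rely on an optimized library implementation.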
Author Utomo, Vincent (Department of Electronic and Computer Engineering, National Taiwan University of Science and Technology)
Leu, Jenq-Shiou (Department of Electronic and Computer Engineering, National Taiwan University of Science and Technology; ORCID 0000-0001-7197-9912; jsleu@mail.ntust.edu.tw)
ContentType Journal Article
Copyright Springer-Verlag GmbH Germany, part of Springer Nature 2019
Discipline Computer Science
EISSN 1432-1882
IsPeerReviewed true
IsScholarly true
Issue 2
Keywords Search result clustering
Information retrieval
Information overload
User query
Subtopic discovery
Second-stage noise removal
SubjectTerms Algorithms
Cluster analysis
Clustering
Computer Communication Networks
Computer Graphics
Computer Science
Cryptology
Data Storage Representation
Distance measurement
Heuristic methods
Information flow
Information overload
Internet
News
Operating Systems
Regular Paper
URI https://link.springer.com/article/10.1007/s00530-019-00638-4
https://www.proquest.com/docview/2383542206