A New Approach for Semi-External Topological Sorting on Big Graphs

This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies <inline-formula><tex-math notation="LaTeX">u</tex-math></inline-form...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on knowledge and data engineering Vol. 35; no. 12; pp. 1 - 14
Main Authors Gao, Tianpeng, Li, Jianzhong, Ma, Hengzhao
Format Journal Article
LanguageEnglish
Published New York IEEE 01.12.2023
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN1041-4347
1558-2191
DOI10.1109/TKDE.2023.3274528

Cover

Abstract This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies <inline-formula><tex-math notation="LaTeX">u</tex-math></inline-formula> precedes <inline-formula><tex-math notation="LaTeX">v</tex-math></inline-formula> in the ordering for each edge <inline-formula><tex-math notation="LaTeX">(u,v)</tex-math></inline-formula> in DAG. Topological sorting is an important subroutine for scheduling and other external graph algorithms. But, the internal topological sorting algorithm cannot handle big DAGs and the I/O complexity of total external topological sorting is too high for practical applications. Therefore, we pay attention to the semi-external topological sorting for big DAGs in this paper. We find that the existing semi-external topological sorting algorithm is mainly based on constructing a DFS-Tree in internal memory. However, this DFS-based algorithm is natively more difficult than topological sorting, because DFS-Tree determines a strict total order, while topological order is only a partial order. Therefore, a partial order level order is proposed in this paper. Based on the level order , we propose a new semi-external topological sorting algorithm. Next, two optimizations, NodeRemove and EdgeRemove , are proposed to reduce the CPU and I/O cost. In addition, we also propose a batch algorithm. Finally, we perform experimental studies using real and synthetic datasets to confirm the efficiency of our approach. According to the experimental results, our algorithms are better than the previous DFS-based algorithms.
AbstractList This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies [Formula Omitted] precedes [Formula Omitted] in the ordering for each edge [Formula Omitted] in DAG. Topological sorting is an important subroutine for scheduling and other external graph algorithms. But, the internal topological sorting algorithm cannot handle big DAGs and the I/O complexity of total external topological sorting is too high for practical applications. Therefore, we pay attention to the semi-external topological sorting for big DAGs in this paper. We find that the existing semi-external topological sorting algorithm is mainly based on constructing a DFS-Tree in internal memory. However, this DFS-based algorithm is natively more difficult than topological sorting, because DFS-Tree determines a strict total order, while topological order is only a partial order. Therefore, a partial order level order is proposed in this paper. Based on the level order , we propose a new semi-external topological sorting algorithm. Next, two optimizations, NodeRemove and EdgeRemove , are proposed to reduce the CPU and I/O cost. In addition, we also propose a batch algorithm. Finally, we perform experimental studies using real and synthetic datasets to confirm the efficiency of our approach. According to the experimental results, our algorithms are better than the previous DFS-based algorithms.
This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies <inline-formula><tex-math notation="LaTeX">u</tex-math></inline-formula> precedes <inline-formula><tex-math notation="LaTeX">v</tex-math></inline-formula> in the ordering for each edge <inline-formula><tex-math notation="LaTeX">(u,v)</tex-math></inline-formula> in DAG. Topological sorting is an important subroutine for scheduling and other external graph algorithms. But, the internal topological sorting algorithm cannot handle big DAGs and the I/O complexity of total external topological sorting is too high for practical applications. Therefore, we pay attention to the semi-external topological sorting for big DAGs in this paper. We find that the existing semi-external topological sorting algorithm is mainly based on constructing a DFS-Tree in internal memory. However, this DFS-based algorithm is natively more difficult than topological sorting, because DFS-Tree determines a strict total order, while topological order is only a partial order. Therefore, a partial order level order is proposed in this paper. Based on the level order , we propose a new semi-external topological sorting algorithm. Next, two optimizations, NodeRemove and EdgeRemove , are proposed to reduce the CPU and I/O cost. In addition, we also propose a batch algorithm. Finally, we perform experimental studies using real and synthetic datasets to confirm the efficiency of our approach. According to the experimental results, our algorithms are better than the previous DFS-based algorithms.
Author Li, Jianzhong
Ma, Hengzhao
Gao, Tianpeng
Author_xml – sequence: 1
  givenname: Tianpeng
  orcidid: 0000-0002-1542-5322
  surname: Gao
  fullname: Gao, Tianpeng
  organization: School of Computer Science and Technology, Harbin Institute of Technology, China
– sequence: 2
  givenname: Jianzhong
  orcidid: 0000-0002-4119-0571
  surname: Li
  fullname: Li, Jianzhong
  organization: Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
– sequence: 3
  givenname: Hengzhao
  orcidid: 0000-0002-2769-6138
  surname: Ma
  fullname: Ma, Hengzhao
  organization: Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China
BookMark eNp9kMFOAjEQhhujiYA-gImHJp4XO-122z0CIhqJHsBz0y1dKIHt2i5R397dwMF48DLzH_5vMvn66LzylUXoBsgQgOT3y5eH6ZASyoaMipRTeYZ6wLlMKORw3maSQpKyVFyifoxbQogUEnpoPMKv9hOP6jp4bTa49AEv7N4l06_Ghkrv8NLXfufXzrR54UPjqjX2FR67NZ4FXW_iFboo9S7a69MeoPfH6XLylMzfZs-T0TwxNE-bhGdCZMJwJnOeyYxksqTaFjmFUlBhKNfM8HbS1crqUmeEF4VYES6KXOc0k2yA7o5321c_DjY2ausP3YtRUSkFQMoEtC1xbJngYwy2VMY1unG-aoJ2OwVEdcJUJ0x1wtRJWEvCH7IObq_D97_M7ZFx1tpffWi1E85-AP2mdjo
CODEN ITKEEH
CitedBy_id crossref_primary_10_1016_j_future_2024_04_004
Cites_doi 10.1007/978-3-319-91452-7_14
10.1145/988672.988752
10.1145/2740908.2742839
10.1145/1963405.1963488
10.1145/48529.48535
10.1145/1401890.1402008
10.1109/ICDE.2014.6816652
10.1007/978-3-642-02094-0_1
10.1016/j.cor.2008.02.002
10.1145/564870.564917
10.1007/978-3-642-10631-6_116
10.1145/2213836.2213899
10.1137/1.9781611975482.124
10.1007/s00778-014-0372-z
10.1145/2723372.2723740
10.1137/1.9781611973068.120
10.1145/2133803.2330083
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023
DBID 97E
RIA
RIE
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
DOI 10.1109/TKDE.2023.3274528
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005–Present
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Xplore
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList Technology Research Database

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Xplore
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1558-2191
EndPage 14
ExternalDocumentID 10_1109_TKDE_2023_3274528
10121905
Genre orig-research
GroupedDBID -~X
.DC
0R~
29I
4.4
5GY
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABQJQ
ABVLG
ACGFO
ACIWK
AENEX
AGQYO
AHBIQ
AKJIK
AKQYR
ALMA_UNASSIGNED_HOLDINGS
ASUFR
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EBS
EJD
F5P
HZ~
IEDLZ
IFIPE
IPLJI
JAVBF
LAI
M43
MS~
O9-
OCL
P2P
PQQKQ
RIA
RIE
RNS
RXW
TAE
TN5
UHB
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c294t-567767c53895686068f2aeb921f727c25a3c525a2ddeafa605bb7d057b9a92683
IEDL.DBID RIE
ISSN 1041-4347
IngestDate Mon Jun 30 02:22:09 EDT 2025
Wed Oct 01 02:06:29 EDT 2025
Thu Apr 24 23:01:23 EDT 2025
Wed Aug 27 02:18:22 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 12
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
https://doi.org/10.15223/policy-029
https://doi.org/10.15223/policy-037
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c294t-567767c53895686068f2aeb921f727c25a3c525a2ddeafa605bb7d057b9a92683
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0002-1542-5322
0000-0002-2769-6138
0000-0002-4119-0571
PQID 2887114371
PQPubID 85438
PageCount 14
ParticipantIDs crossref_citationtrail_10_1109_TKDE_2023_3274528
crossref_primary_10_1109_TKDE_2023_3274528
ieee_primary_10121905
proquest_journals_2887114371
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2023-12-01
PublicationDateYYYYMMDD 2023-12-01
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-12-01
  day: 01
PublicationDecade 2020
PublicationPlace New York
PublicationPlace_xml – name: New York
PublicationTitle IEEE transactions on knowledge and data engineering
PublicationTitleAbbrev TKDE
PublicationYear 2023
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References ref13
ref12
ref15
ref20
ref11
ref10
ref21
ref2
ref1
ref17
ref19
ref18
ref8
buchsbaum (ref7) 2000
ref9
leskovec (ref22) 2014
ref4
ref3
fasino (ref23) 2019
ref5
wan (ref14) 2020
(ref16) 2014
chiang (ref6) 1995
References_xml – ident: ref12
  doi: 10.1007/978-3-319-91452-7_14
– ident: ref17
  doi: 10.1145/988672.988752
– year: 2014
  ident: ref16
  article-title: Deep first search
– ident: ref20
  doi: 10.1145/2740908.2742839
– ident: ref18
  doi: 10.1145/1963405.1963488
– ident: ref4
  doi: 10.1145/48529.48535
– ident: ref19
  doi: 10.1145/1401890.1402008
– ident: ref10
  doi: 10.1109/ICDE.2014.6816652
– start-page: 859
  year: 2000
  ident: ref7
  article-title: On external memory graph traversal
  publication-title: Proc 11th Annu ACM-SIAM Symp Discrete Algorithms
– ident: ref21
  doi: 10.1007/978-3-642-02094-0_1
– year: 2020
  ident: ref14
  article-title: Efficient semi-external depth-first search
– year: 2019
  ident: ref23
  article-title: Generating large scale-free networks with the chung-lu random graph model
– ident: ref2
  doi: 10.1016/j.cor.2008.02.002
– ident: ref11
  doi: 10.1145/564870.564917
– start-page: 139
  year: 1995
  ident: ref6
  article-title: External-memory graph algorithms
  publication-title: Proc 6th Annu ACM-SIAM Symp Discrete Algorithms
– ident: ref1
  doi: 10.1007/978-3-642-10631-6_116
– ident: ref5
  doi: 10.1145/2213836.2213899
– ident: ref3
  doi: 10.1137/1.9781611975482.124
– ident: ref9
  doi: 10.1007/s00778-014-0372-z
– ident: ref13
  doi: 10.1145/2723372.2723740
– year: 2014
  ident: ref22
  article-title: SNAP datasets: Stanford large network dataset collection
– ident: ref15
  doi: 10.1137/1.9781611973068.120
– ident: ref8
  doi: 10.1145/2133803.2330083
SSID ssj0008781
Score 2.4217048
Snippet This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an...
SourceID proquest
crossref
ieee
SourceType Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 1
SubjectTerms Algorithms
Complexity theory
Costs
DAG
Memory management
Optimization
partial order
Scheduling
semi-external algorithm
Sorting
Sorting algorithms
Synthetic data
topological sorting
Topology
Transforms
Title A New Approach for Semi-External Topological Sorting on Big Graphs
URI https://ieeexplore.ieee.org/document/10121905
https://www.proquest.com/docview/2887114371
Volume 35
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Xplore
  customDbUrl:
  eissn: 1558-2191
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0008781
  issn: 1041-4347
  databaseCode: RIE
  dateStart: 19890101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED4BEwyURxGFgjwwISU0dl4eW2hBIFjaSmxR_EhVAQ2CduHXc5e4qAKBWKIMdmT57O_uYt_3AZwFQnMtufFUgdsNPQbiYCCNZwQ3RRAZlRoqcL5_iG_G4e1j9OiK1ataGGttdfnM-vRaneWbUi_oV9kFcVGhA4vWYT1J47pY6wt206RSJMX0ApMiESbuCDPoyIvR3VXfJ51wX2ASFpHy-ooTqlRVfkBx5V8GDXhYjqy-VvLkL-bK1x_fSBv_PfQd2HaRJuvWS2MX1uxsDxpLFQfmNvUebK1QEu5Dr8sQ91jXUY0zjGnZ0L5Mvb6ji2ajWlaBjMuGJZEQTFg5Y73phF0T-_V7E8aD_ujyxnM6C57mMpx7UUyUPhqhj2oHMaNJC55bJXlQYHSjeZQLHeGTIxTmRY4JkFKJwUBPyVzyOBUHsDErZ_YQWB5KVUSJJp2_UKRG5gYdXiASVcQ5t0kLOsuJz7QjISctjOesSkY6MiNbZWSrzNmqBedfXV5rBo6_Gjdp7lca1tPegvbSvJnbpO8ZR4DFdFAkwdEv3Y5hk75eX19pw8b8bWFPMAiZq9Nq8X0CdKfUwA
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED5BGYCBN6I8PTAhJTR23MRjgUJ5daFIbFH8CEJAi2i78Ou5S1xUgUAsUQZbsXz2d3ex7_sADiNhuFHcBrrA7YYeA3EwUjawgtsiklanlgqcb7vNzn189SAffLF6WQvjnCsvn7mQXsuzfDswY_pVdkxcVOjA5CzMyTiOZVWu9QW8aVJqkmKCgWmRiBN_iBk11HHv-qwdklJ4KDANk6S9PuWGSl2VH2BcepjzZehOxlZdLHkOxyMdmo9vtI3_HvwKLPlYk7WqxbEKM66_BssTHQfmt_UaLE6REq7DSYsh8rGWJxtnGNWyO_f6FLQ9YTTrVcIKZF52NyAagkc26LOTp0d2QfzXww24P2_3TjuBV1oIDFfxKJBNIvUxCH5UPYg5TVrw3GnFowLjG8NlLozEJ0cwzIscUyCtE4uhnla54s1UbEKtP-i7LWB5rHQhE0NKf7FIrcoturxIJLpo5twldWhMJj4znoac1DBesjIdaaiMbJWRrTJvqzocfXV5qzg4_mq8QXM_1bCa9jrsTsyb-W06zDhCLCaEIom2f-l2APOd3u1NdnPZvd6BBfpSdZllF2qj97Hbw5BkpPfLhfgJxwrYDQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+New+Approach+for+Semi-External+Topological+Sorting+on+Big+Graphs&rft.jtitle=IEEE+transactions+on+knowledge+and+data+engineering&rft.au=Gao%2C+Tianpeng&rft.au=Li%2C+Jianzhong&rft.au=Ma%2C+Hengzhao&rft.date=2023-12-01&rft.pub=IEEE&rft.issn=1041-4347&rft.spage=1&rft.epage=14&rft_id=info:doi/10.1109%2FTKDE.2023.3274528&rft.externalDocID=10121905
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1041-4347&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1041-4347&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1041-4347&client=summon