A New Approach for Semi-External Topological Sorting on Big Graphs
This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies <inline-formula><tex-math notation="LaTeX">u</tex-math></inline-form...
Saved in:
| Published in | IEEE transactions on knowledge and data engineering Vol. 35; no. 12; pp. 1 - 14 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
New York
IEEE
01.12.2023
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Subjects | |
| Online Access | Get full text |
| ISSN | 1041-4347 1558-2191 |
| DOI | 10.1109/TKDE.2023.3274528 |
Cover
| Abstract | This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies <inline-formula><tex-math notation="LaTeX">u</tex-math></inline-formula> precedes <inline-formula><tex-math notation="LaTeX">v</tex-math></inline-formula> in the ordering for each edge <inline-formula><tex-math notation="LaTeX">(u,v)</tex-math></inline-formula> in DAG. Topological sorting is an important subroutine for scheduling and other external graph algorithms. But, the internal topological sorting algorithm cannot handle big DAGs and the I/O complexity of total external topological sorting is too high for practical applications. Therefore, we pay attention to the semi-external topological sorting for big DAGs in this paper. We find that the existing semi-external topological sorting algorithm is mainly based on constructing a DFS-Tree in internal memory. However, this DFS-based algorithm is natively more difficult than topological sorting, because DFS-Tree determines a strict total order, while topological order is only a partial order. Therefore, a partial order level order is proposed in this paper. Based on the level order , we propose a new semi-external topological sorting algorithm. Next, two optimizations, NodeRemove and EdgeRemove , are proposed to reduce the CPU and I/O cost. In addition, we also propose a batch algorithm. Finally, we perform experimental studies using real and synthetic datasets to confirm the efficiency of our approach. According to the experimental results, our algorithms are better than the previous DFS-based algorithms. |
|---|---|
| AbstractList | This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies [Formula Omitted] precedes [Formula Omitted] in the ordering for each edge [Formula Omitted] in DAG. Topological sorting is an important subroutine for scheduling and other external graph algorithms. But, the internal topological sorting algorithm cannot handle big DAGs and the I/O complexity of total external topological sorting is too high for practical applications. Therefore, we pay attention to the semi-external topological sorting for big DAGs in this paper. We find that the existing semi-external topological sorting algorithm is mainly based on constructing a DFS-Tree in internal memory. However, this DFS-based algorithm is natively more difficult than topological sorting, because DFS-Tree determines a strict total order, while topological order is only a partial order. Therefore, a partial order level order is proposed in this paper. Based on the level order , we propose a new semi-external topological sorting algorithm. Next, two optimizations, NodeRemove and EdgeRemove , are proposed to reduce the CPU and I/O cost. In addition, we also propose a batch algorithm. Finally, we perform experimental studies using real and synthetic datasets to confirm the efficiency of our approach. According to the experimental results, our algorithms are better than the previous DFS-based algorithms. This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an ordering of each node in DAG, which satisfies <inline-formula><tex-math notation="LaTeX">u</tex-math></inline-formula> precedes <inline-formula><tex-math notation="LaTeX">v</tex-math></inline-formula> in the ordering for each edge <inline-formula><tex-math notation="LaTeX">(u,v)</tex-math></inline-formula> in DAG. Topological sorting is an important subroutine for scheduling and other external graph algorithms. But, the internal topological sorting algorithm cannot handle big DAGs and the I/O complexity of total external topological sorting is too high for practical applications. Therefore, we pay attention to the semi-external topological sorting for big DAGs in this paper. We find that the existing semi-external topological sorting algorithm is mainly based on constructing a DFS-Tree in internal memory. However, this DFS-based algorithm is natively more difficult than topological sorting, because DFS-Tree determines a strict total order, while topological order is only a partial order. Therefore, a partial order level order is proposed in this paper. Based on the level order , we propose a new semi-external topological sorting algorithm. Next, two optimizations, NodeRemove and EdgeRemove , are proposed to reduce the CPU and I/O cost. In addition, we also propose a batch algorithm. Finally, we perform experimental studies using real and synthetic datasets to confirm the efficiency of our approach. According to the experimental results, our algorithms are better than the previous DFS-based algorithms. |
| Author | Li, Jianzhong Ma, Hengzhao Gao, Tianpeng |
| Author_xml | – sequence: 1 givenname: Tianpeng orcidid: 0000-0002-1542-5322 surname: Gao fullname: Gao, Tianpeng organization: School of Computer Science and Technology, Harbin Institute of Technology, China – sequence: 2 givenname: Jianzhong orcidid: 0000-0002-4119-0571 surname: Li fullname: Li, Jianzhong organization: Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China – sequence: 3 givenname: Hengzhao orcidid: 0000-0002-2769-6138 surname: Ma fullname: Ma, Hengzhao organization: Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, China |
| BookMark | eNp9kMFOAjEQhhujiYA-gImHJp4XO-122z0CIhqJHsBz0y1dKIHt2i5R397dwMF48DLzH_5vMvn66LzylUXoBsgQgOT3y5eH6ZASyoaMipRTeYZ6wLlMKORw3maSQpKyVFyifoxbQogUEnpoPMKv9hOP6jp4bTa49AEv7N4l06_Ghkrv8NLXfufXzrR54UPjqjX2FR67NZ4FXW_iFboo9S7a69MeoPfH6XLylMzfZs-T0TwxNE-bhGdCZMJwJnOeyYxksqTaFjmFUlBhKNfM8HbS1crqUmeEF4VYES6KXOc0k2yA7o5321c_DjY2ausP3YtRUSkFQMoEtC1xbJngYwy2VMY1unG-aoJ2OwVEdcJUJ0x1wtRJWEvCH7IObq_D97_M7ZFx1tpffWi1E85-AP2mdjo |
| CODEN | ITKEEH |
| CitedBy_id | crossref_primary_10_1016_j_future_2024_04_004 |
| Cites_doi | 10.1007/978-3-319-91452-7_14 10.1145/988672.988752 10.1145/2740908.2742839 10.1145/1963405.1963488 10.1145/48529.48535 10.1145/1401890.1402008 10.1109/ICDE.2014.6816652 10.1007/978-3-642-02094-0_1 10.1016/j.cor.2008.02.002 10.1145/564870.564917 10.1007/978-3-642-10631-6_116 10.1145/2213836.2213899 10.1137/1.9781611975482.124 10.1007/s00778-014-0372-z 10.1145/2723372.2723740 10.1137/1.9781611973068.120 10.1145/2133803.2330083 |
| ContentType | Journal Article |
| Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023 |
| Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023 |
| DBID | 97E RIA RIE AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D |
| DOI | 10.1109/TKDE.2023.3274528 |
| DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Xplore CrossRef Computer and Information Systems Abstracts Electronics & Communications Abstracts Technology Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional |
| DatabaseTitle | CrossRef Technology Research Database Computer and Information Systems Abstracts – Academic Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Professional |
| DatabaseTitleList | Technology Research Database |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Xplore url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering Computer Science |
| EISSN | 1558-2191 |
| EndPage | 14 |
| ExternalDocumentID | 10_1109_TKDE_2023_3274528 10121905 |
| Genre | orig-research |
| GroupedDBID | -~X .DC 0R~ 29I 4.4 5GY 6IK 97E AAJGR AARMG AASAJ AAWTH ABAZT ABQJQ ABVLG ACGFO ACIWK AENEX AGQYO AHBIQ AKJIK AKQYR ALMA_UNASSIGNED_HOLDINGS ASUFR ATWAV BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EBS EJD F5P HZ~ IEDLZ IFIPE IPLJI JAVBF LAI M43 MS~ O9- OCL P2P PQQKQ RIA RIE RNS RXW TAE TN5 UHB AAYXX CITATION 7SC 7SP 8FD JQ2 L7M L~C L~D |
| ID | FETCH-LOGICAL-c294t-567767c53895686068f2aeb921f727c25a3c525a2ddeafa605bb7d057b9a92683 |
| IEDL.DBID | RIE |
| ISSN | 1041-4347 |
| IngestDate | Mon Jun 30 02:22:09 EDT 2025 Wed Oct 01 02:06:29 EDT 2025 Thu Apr 24 23:01:23 EDT 2025 Wed Aug 27 02:18:22 EDT 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 12 |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c294t-567767c53895686068f2aeb921f727c25a3c525a2ddeafa605bb7d057b9a92683 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ORCID | 0000-0002-1542-5322 0000-0002-2769-6138 0000-0002-4119-0571 |
| PQID | 2887114371 |
| PQPubID | 85438 |
| PageCount | 14 |
| ParticipantIDs | crossref_citationtrail_10_1109_TKDE_2023_3274528 crossref_primary_10_1109_TKDE_2023_3274528 ieee_primary_10121905 proquest_journals_2887114371 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2023-12-01 |
| PublicationDateYYYYMMDD | 2023-12-01 |
| PublicationDate_xml | – month: 12 year: 2023 text: 2023-12-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationPlace | New York |
| PublicationPlace_xml | – name: New York |
| PublicationTitle | IEEE transactions on knowledge and data engineering |
| PublicationTitleAbbrev | TKDE |
| PublicationYear | 2023 |
| Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| References | ref13 ref12 ref15 ref20 ref11 ref10 ref21 ref2 ref1 ref17 ref19 ref18 ref8 buchsbaum (ref7) 2000 ref9 leskovec (ref22) 2014 ref4 ref3 fasino (ref23) 2019 ref5 wan (ref14) 2020 (ref16) 2014 chiang (ref6) 1995 |
| References_xml | – ident: ref12 doi: 10.1007/978-3-319-91452-7_14 – ident: ref17 doi: 10.1145/988672.988752 – year: 2014 ident: ref16 article-title: Deep first search – ident: ref20 doi: 10.1145/2740908.2742839 – ident: ref18 doi: 10.1145/1963405.1963488 – ident: ref4 doi: 10.1145/48529.48535 – ident: ref19 doi: 10.1145/1401890.1402008 – ident: ref10 doi: 10.1109/ICDE.2014.6816652 – start-page: 859 year: 2000 ident: ref7 article-title: On external memory graph traversal publication-title: Proc 11th Annu ACM-SIAM Symp Discrete Algorithms – ident: ref21 doi: 10.1007/978-3-642-02094-0_1 – year: 2020 ident: ref14 article-title: Efficient semi-external depth-first search – year: 2019 ident: ref23 article-title: Generating large scale-free networks with the chung-lu random graph model – ident: ref2 doi: 10.1016/j.cor.2008.02.002 – ident: ref11 doi: 10.1145/564870.564917 – start-page: 139 year: 1995 ident: ref6 article-title: External-memory graph algorithms publication-title: Proc 6th Annu ACM-SIAM Symp Discrete Algorithms – ident: ref1 doi: 10.1007/978-3-642-10631-6_116 – ident: ref5 doi: 10.1145/2213836.2213899 – ident: ref3 doi: 10.1137/1.9781611975482.124 – ident: ref9 doi: 10.1007/s00778-014-0372-z – ident: ref13 doi: 10.1145/2723372.2723740 – year: 2014 ident: ref22 article-title: SNAP datasets: Stanford large network dataset collection – ident: ref15 doi: 10.1137/1.9781611973068.120 – ident: ref8 doi: 10.1145/2133803.2330083 |
| SSID | ssj0008781 |
| Score | 2.4217048 |
| Snippet | This paper presents a new approach for semi-external topological sorting algorithm on big directed acyclic graph(DAG). Topological sorting aims to find an... |
| SourceID | proquest crossref ieee |
| SourceType | Aggregation Database Enrichment Source Index Database Publisher |
| StartPage | 1 |
| SubjectTerms | Algorithms Complexity theory Costs DAG Memory management Optimization partial order Scheduling semi-external algorithm Sorting Sorting algorithms Synthetic data topological sorting Topology Transforms |
| Title | A New Approach for Semi-External Topological Sorting on Big Graphs |
| URI | https://ieeexplore.ieee.org/document/10121905 https://www.proquest.com/docview/2887114371 |
| Volume | 35 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVIEE databaseName: IEEE Xplore customDbUrl: eissn: 1558-2191 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0008781 issn: 1041-4347 databaseCode: RIE dateStart: 19890101 isFulltext: true titleUrlDefault: https://ieeexplore.ieee.org/ providerName: IEEE |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED4BEwyURxGFgjwwISU0dl4eW2hBIFjaSmxR_EhVAQ2CduHXc5e4qAKBWKIMdmT57O_uYt_3AZwFQnMtufFUgdsNPQbiYCCNZwQ3RRAZlRoqcL5_iG_G4e1j9OiK1ataGGttdfnM-vRaneWbUi_oV9kFcVGhA4vWYT1J47pY6wt206RSJMX0ApMiESbuCDPoyIvR3VXfJ51wX2ASFpHy-ooTqlRVfkBx5V8GDXhYjqy-VvLkL-bK1x_fSBv_PfQd2HaRJuvWS2MX1uxsDxpLFQfmNvUebK1QEu5Dr8sQ91jXUY0zjGnZ0L5Mvb6ji2ajWlaBjMuGJZEQTFg5Y73phF0T-_V7E8aD_ujyxnM6C57mMpx7UUyUPhqhj2oHMaNJC55bJXlQYHSjeZQLHeGTIxTmRY4JkFKJwUBPyVzyOBUHsDErZ_YQWB5KVUSJJp2_UKRG5gYdXiASVcQ5t0kLOsuJz7QjISctjOesSkY6MiNbZWSrzNmqBedfXV5rBo6_Gjdp7lca1tPegvbSvJnbpO8ZR4DFdFAkwdEv3Y5hk75eX19pw8b8bWFPMAiZq9Nq8X0CdKfUwA |
| linkProvider | IEEE |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED5BGYCBN6I8PTAhJTR23MRjgUJ5daFIbFH8CEJAi2i78Ou5S1xUgUAsUQZbsXz2d3ex7_sADiNhuFHcBrrA7YYeA3EwUjawgtsiklanlgqcb7vNzn189SAffLF6WQvjnCsvn7mQXsuzfDswY_pVdkxcVOjA5CzMyTiOZVWu9QW8aVJqkmKCgWmRiBN_iBk11HHv-qwdklJ4KDANk6S9PuWGSl2VH2BcepjzZehOxlZdLHkOxyMdmo9vtI3_HvwKLPlYk7WqxbEKM66_BssTHQfmt_UaLE6REq7DSYsh8rGWJxtnGNWyO_f6FLQ9YTTrVcIKZF52NyAagkc26LOTp0d2QfzXww24P2_3TjuBV1oIDFfxKJBNIvUxCH5UPYg5TVrw3GnFowLjG8NlLozEJ0cwzIscUyCtE4uhnla54s1UbEKtP-i7LWB5rHQhE0NKf7FIrcoturxIJLpo5twldWhMJj4znoac1DBesjIdaaiMbJWRrTJvqzocfXV5qzg4_mq8QXM_1bCa9jrsTsyb-W06zDhCLCaEIom2f-l2APOd3u1NdnPZvd6BBfpSdZllF2qj97Hbw5BkpPfLhfgJxwrYDQ |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+New+Approach+for+Semi-External+Topological+Sorting+on+Big+Graphs&rft.jtitle=IEEE+transactions+on+knowledge+and+data+engineering&rft.au=Gao%2C+Tianpeng&rft.au=Li%2C+Jianzhong&rft.au=Ma%2C+Hengzhao&rft.date=2023-12-01&rft.pub=IEEE&rft.issn=1041-4347&rft.spage=1&rft.epage=14&rft_id=info:doi/10.1109%2FTKDE.2023.3274528&rft.externalDocID=10121905 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1041-4347&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1041-4347&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1041-4347&client=summon |