VGrADS: enabling e-Science workflows on grids and clouds with fault tolerance

Bibliographic Details
Published in Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, pp. 1-12
Main Authors Ramakrishnan, Lavanya; Koelbel, Charles; Kee, Yang-Suk; Wolski, Rich; Nurmi, Daniel; Gannon, Dennis; Obertelli, Graziano; YarKhan, Asim; Mandal, Anirban; Huang, T. Mark; Thyagaraja, Kiran; Zagorodnov, Dmitrii
Format Conference Proceeding
Language English
Published New York, NY, USA: ACM, 14.11.2009
Series ACM Conferences
ISBN 1605587443
9781605587448
ISSN 2167-4329
DOI 10.1145/1654059.1654107


Abstract Today's scientific workflows use distributed heterogeneous resources through diverse grid and cloud interfaces that are often hard to program. In addition, especially for time-sensitive critical applications, predictable quality of service is necessary across these distributed resources. VGrADS' virtual grid execution system (vgES) provides a uniform qualitative resource abstraction over grid and cloud systems. We apply vgES to schedule a set of deadline-sensitive weather forecasting workflows. Specifically, this paper reports on our experiences with (1) virtualized reservations for batch-queue systems, (2) coordinated usage of TeraGrid (batch queue), Amazon EC2 (cloud), our own clusters (batch queue) and Eucalyptus (cloud) resources, and (3) fault tolerance through automated task replication. The combined effect of these techniques was to enable a new workflow planning method that balances performance, reliability and cost considerations. The results point toward improved resource selection and execution management support for a variety of e-Science applications over grid and cloud systems.
Author_xml – sequence: 1
  givenname: Lavanya
  surname: Ramakrishnan
  fullname: Ramakrishnan, Lavanya
  organization: Indiana University, Bloomington
– sequence: 2
  givenname: Charles
  surname: Koelbel
  fullname: Koelbel, Charles
  organization: Rice University
– sequence: 3
  givenname: Yang-Suk
  surname: Kee
  fullname: Kee, Yang-Suk
  organization: Oracle US Inc
– sequence: 4
  givenname: Rich
  surname: Wolski
  fullname: Wolski, Rich
  organization: University of California, Santa Barbara
– sequence: 5
  givenname: Daniel
  surname: Nurmi
  fullname: Nurmi, Daniel
  organization: University of California, Santa Barbara
– sequence: 6
  givenname: Dennis
  surname: Gannon
  fullname: Gannon, Dennis
  organization: Microsoft Research
– sequence: 7
  givenname: Graziano
  surname: Obertelli
  fullname: Obertelli, Graziano
  organization: University of California, Santa Barbara
– sequence: 8
  givenname: Asim
  surname: YarKhan
  fullname: YarKhan, Asim
  organization: University of Tennessee, Knoxville
– sequence: 9
  givenname: Anirban
  surname: Mandal
  fullname: Mandal, Anirban
  organization: Renaissance Computing Institute
– sequence: 10
  givenname: T. Mark
  surname: Huang
  fullname: Huang, T. Mark
  organization: University of Houston
– sequence: 11
  givenname: Kiran
  surname: Thyagaraja
  fullname: Thyagaraja, Kiran
  organization: Rice University
– sequence: 12
  givenname: Dmitrii
  surname: Zagorodnov
  fullname: Zagorodnov, Dmitrii
  organization: University of California, Santa Barbara
CODEN IEEPAD
ContentType Conference Proceeding
Copyright 2009 ACM
Discipline Computer Science
EISBN 1605587443
9781605587448
EndPage 12
ExternalDocumentID 6375523
Genre orig-research
IsPeerReviewed false
IsScholarly false
Language English
License Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org
MeetingName SC '09: International Conference for High Performance Computing, Networking, Storage and Analysis
PageCount 12
PublicationCentury 2000
PublicationDate 20091114
2009-Nov.
PublicationDateYYYYMMDD 2009-11-14
2009-11-01
PublicationDate_xml – month: 11
  year: 2009
  text: 20091114
  day: 14
PublicationDecade 2000
PublicationPlace New York, NY, USA
PublicationPlace_xml – name: New York, NY, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
PublicationTitleAbbrev SUPERC
PublicationYear 2009
Publisher ACM
Publisher_xml – name: ACM
SourceID ieee
acm
SourceType Publisher
StartPage 1
SubjectTerms Cloud computing
Computational modeling
Engines
Fault tolerance
Fault tolerant systems
General and reference -- Cross-computing tools and techniques -- Performance
Lead
Portals
Schedules
Software and its engineering -- Software organization and properties -- Contextual software domains -- Operating systems -- Process management
Software and its engineering -- Software organization and properties -- Extra-functional properties -- Software performance
Software and its engineering -- Software organization and properties -- Software system structures -- Distributed systems organizing principles
Virtual machines
Weather forecasting
Subtitle enabling e-Science workflows on grids and clouds with fault tolerance
Title VGrADS
URI https://ieeexplore.ieee.org/document/6375523