Auto-locating and fix-propagating for HTML validation errors to PHP server-side code

Bibliographic Details
Published in 2011 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011), pp. 13-22
Main Authors Nguyen, Hung Viet; Nguyen, Hoan Anh; Nguyen, Tung Thanh; Nguyen, Tien N.
Format Conference Proceeding
Language English
Published Washington, DC, USA: IEEE Computer Society, 06.11.2011
IEEE
Series ACM Conferences
ISBN 1457716380
9781457716386
ISSN 1938-4300
DOI 10.1109/ASE.2011.6100047

Abstract Checking and correcting HTML validation errors in Web pages helps Web developers find and fix bugs. However, existing validating/fixing tools work well only on static HTML pages. When validation errors are found in a dynamically generated page, they do not help fix the server code that produced it. We propose PhpSync, a novel automatic locating/fixing tool for HTML validation errors in PHP-based Web applications. Given an HTML page produced by a server-side PHP program, PhpSync uses Tidy, an HTML validating/correcting tool, to find the validation errors in that page. If errors are detected, it takes the fixes that Tidy makes to the given HTML page and propagates them to the corresponding location(s) in the PHP code. Our core solutions include 1) a symbolic execution algorithm that runs on the given PHP program to produce a single tree-based model, called the D-model, which approximately represents its possible client page outputs, 2) an algorithm that maps any text in the given HTML page to the text(s) in the node(s) of the D-model and then to the PHP code, and 3) a fix-propagating algorithm that carries the fixes in the HTML page to the PHP code via the D-model and the mapping algorithm. Our empirical evaluation shows that, on average, PhpSync achieves 96.7% accuracy in locating the corresponding locations in PHP code from client pages, and 95% accuracy in propagating the fixes to the server-side code.
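The mapping step described in the abstract can be illustrated with a toy sketch. The code below is not PhpSync's algorithm: the node types (`Literal`, `Concat`, `Select`), the greedy matcher without backtracking, and the PHP offsets in the example are all assumptions made for illustration. It only shows the general idea of a tree that approximates possible outputs and of tracing a position in a concrete HTML page back to a position in the PHP source.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple, Union


@dataclass
class Literal:
    """A string literal from the PHP source (hypothetical node type)."""
    text: str
    php_offset: int  # assumed position of the literal's first character


@dataclass
class Concat:
    """Output parts emitted one after another."""
    parts: List["Node"]


@dataclass
class Select:
    """Alternative outputs of a branch whose condition is unknown."""
    options: List["Node"]


Node = Union[Literal, Concat, Select]
Trace = List[Tuple[int, Literal]]  # (start offset in the HTML page, literal)


def match(node: Node, html: str, pos: int) -> Optional[Tuple[int, Trace]]:
    """Greedily match `node` against `html` from `pos` (no backtracking)."""
    if isinstance(node, Literal):
        if html.startswith(node.text, pos):
            return pos + len(node.text), [(pos, node)]
        return None
    if isinstance(node, Concat):
        trace: Trace = []
        for part in node.parts:
            result = match(part, html, pos)
            if result is None:
                return None
            pos, sub_trace = result
            trace.extend(sub_trace)
        return pos, trace
    for option in node.options:  # Select: take the first option that fits
        result = match(option, html, pos)
        if result is not None:
            return result
    return None


def locate(model: Node, html: str, html_offset: int) -> Optional[int]:
    """Map an offset in the HTML page to an offset in the PHP source."""
    result = match(model, html, 0)
    if result is None or result[0] != len(html):
        return None  # the page is not an output this model accounts for
    for start, literal in result[1]:
        if start <= html_offset < start + len(literal.text):
            return literal.php_offset + (html_offset - start)
    return None


# Toy model of a program that prints "<b>", then "Hi" or "Bye", then "</i>".
# The PHP offsets (6, 26, 42, 54) are made up for the example.
model = Concat([
    Literal("<b>", 6),
    Select([Literal("Hi", 26), Literal("Bye", 42)]),
    Literal("</i>", 54),
])
html = "<b>Bye</i>"
```

In this toy page the mismatched closing tag `</i>` starts at HTML offset 6, and `locate(model, html, 6)` returns the assumed PHP offset of the literal that produced it. A fix reported at that HTML position could then be applied at the returned source position, which is the intuition behind propagating a fix through the model.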
Author Nguyen, Hoan Anh
Nguyen, Tien N.
Nguyen, Hung Viet
Nguyen, Tung Thanh
Author_xml – sequence: 1
  givenname: Hung Viet
  surname: Nguyen
  fullname: Nguyen, Hung Viet
  organization: Electrical and Computer Engineering Department, Iowa State University, USA
– sequence: 2
  givenname: Hoan Anh
  surname: Nguyen
  fullname: Nguyen, Hoan Anh
  organization: Electrical and Computer Engineering Department, Iowa State University, USA
– sequence: 3
  givenname: Tung Thanh
  surname: Nguyen
  fullname: Nguyen, Tung Thanh
  organization: Electrical and Computer Engineering Department, Iowa State University, USA
– sequence: 4
  givenname: Tien N.
  surname: Nguyen
  fullname: Nguyen, Tien N.
  organization: Electrical and Computer Engineering Department, Iowa State University, USA
ContentType Conference Proceeding
DOI 10.1109/ASE.2011.6100047
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
Discipline Computer Science
EISBN 1457716399
9781457716393
EndPage 22
ExternalDocumentID 6100047
Genre orig-research
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MeetingName ASE '11: Automated Software Engineering
PageCount 10
PublicationCentury 2000
PublicationDate 20111106
2011-Nov.
PublicationDateYYYYMMDD 2011-11-06
2011-11-01
PublicationDate_xml – month: 11
  year: 2011
  text: 20111106
  day: 06
PublicationDecade 2010
PublicationPlace Washington, DC, USA
PublicationPlace_xml – name: Washington, DC, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle 2011 26th IEEE/ACM International Conference on Automated Software Engineering (ASE 2011)
PublicationTitleAbbrev ASE
PublicationYear 2011
Publisher IEEE Computer Society
IEEE
Publisher_xml – name: IEEE Computer Society
– name: IEEE
SourceID ieee
acm
SourceType Publisher
StartPage 13
SubjectTerms Accuracy
Approximation algorithms
Browsers
Bug Localization
Fix Propagation
HTML
PHP Dynamic Web Applications
Servers
Three dimensional displays
Validation Errors
Web pages
Title Auto-locating and fix-propagating for HTML validation errors to PHP server-side code
URI https://ieeexplore.ieee.org/document/6100047