Capturing Implicit Structures in Unstructured Content of Construction Documents
In a paper-based paradigm, whether construction information is presented in structured or unstructured format is not a critical issue because human beings are good at processing both types of information. However, the paradigm shift from using paper-based media to electronic media requires better ha...
Saved in:
| Published in | Journal of computing in civil engineering Vol. 21; no. 3; pp. 220 - 227 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
Reston, VA
American Society of Civil Engineers
01.05.2007
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 0887-3801 1943-5487 |
| DOI | 10.1061/(ASCE)0887-3801(2007)21:3(220) |
Cover
| Summary: | In a paper-based paradigm, whether construction information is presented in structured or unstructured format is not a critical issue because human beings are good at processing both types of information. However, the paradigm shift from using paper-based media to electronic media requires better handling of unstructured information, as a computer is more adept at processing structured information. It is observed by the writers that many existing information systems in the construction industry simply mimic paper-based construction documents, not just in presentations but also in data structures, such as request for information (RFI) forms developed by many existing software vendors. The writers thus question whether enough attention has been put into understanding the impact of the paradigm shift on data management for construction documents. This paper argues that a construction document such as a RFI is usually used by construction professionals to serve particular business purposes, which restricts the content of such a document. Consequently, some types of unstructured information make references implicitly or explicitly to certain types of information that are often well structured such as building or project management information. Further, the occurrence of such cross references is constant and stable among different instances of the same type of documents. A major challenge is to properly capture, represent, and utilize such knowledge in order to improve information processing for construction professionals. Text analysis and statistical analysis are applied by this study to determine the patterns of the occurrences for certain types of concepts that are related to building components, contract documents, or project management in unstructured content. With the identified patterns, this paper further discusses the potential use of metadata models as a method to support processing unstructured content in those documents. |
|---|---|
| Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
| ISSN: | 0887-3801 1943-5487 |
| DOI: | 10.1061/(ASCE)0887-3801(2007)21:3(220) |