Ontology and rule-based natural language processing approach for interpreting textual regulations on underground utility infrastructure |
| |
Affiliation: | 1. Lyles School of Civil Engineering, Purdue University, 550 Stadium Mall Drive, West Lafayette, IN 47907, United States;2. Division of Construction Engineering & Management and Lyles School of Civil Engineering, Purdue University, 550 Stadium Mall Drive, West Lafayette, IN 47907, United States;1. Department of Civil & Environmental Engineering, National University of Singapore, Block E1A, #07-03, No.1 Engineering Drive 2, Singapore 117576, Singapore;2. Future Cities Laboratory, Singapore-ETH Centre, 1 CREATE Way, CREATE Tower, #06-01, Singapore 138602, Singapore;3. Applied Computing and Mechanics Laboratory (IMAC), School of Architecture, Civil and Environmental Engineering (ENAC), Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland;1. Applied Mechanics and Construction, University of Vigo, Spain;2. Chair of Computational Modelling and Simulation, Technical University of Munich, Germany;1. Faculty of Science, Agriculture, and Engineering, Newcastle University, Singapore 599493, Singapore;2. Xylem Inc, USA;3. Department of Civil, Architectural and Environmental Engineering, University of Texas at Austin, TX 78712, USA |
| |
Abstract: | The nation’s massive underground utility infrastructure must comply with a multitude of regulations. The regulatory compliance checking of underground utilities requires an objective and consistent interpretation of the regulations. However, utility regulations contain a variety of domain-specific terms and numerous spatial constraints regarding the location and clearance of underground utilities. It is challenging for the interpreters to understand both the domain and spatial semantics in utility regulations. To address the challenge, this paper adopts an ontology and rule-based Natural Language Processing (NLP) framework to automate the interpretation of utility regulations – the extraction of regulatory information and the subsequent transformation into logic clauses. Two new ontologies have been developed. The urban product ontology (UPO) is domain-specific to model domain concepts and capture domain semantics on top of heterogeneous terminologies in utility regulations. The spatial ontology (SO) consists of two layers of semantics – linguistic spatial expressions and formal spatial relations – for better understanding the spatial language in utility regulations. Pattern-matching rules defined on syntactic features (captured using common NLP techniques) and semantic features (captured using ontologies) were encoded for information extraction. The extracted information elements were then mapped to their semantic correspondences via ontologies and finally transformed into deontic logic (DL) clauses to achieve the semantic and logical formalization. The approach was tested on the spatial configuration-related requirements in utility accommodation policies. Results show it achieves a 98.2% precision and a 94.7% recall in information extraction, a 94.4% precision and a 90.1% recall in semantic formalization, and an 83% accuracy in logical formalization. |
| |
Keywords: | Ontology Natural language processing Pattern-matching rules Information extraction Information formalization Utility regulations |
本文献已被 ScienceDirect 等数据库收录! |
|