To read this content please select one of the options below:

Intelligent detection on construction project contract missing clauses based on deep learning and NLP

Hong Zhou (Xiamen University, Xiamen, China)
Binwei Gao (Xiamen University, Xiamen, China)
Shilong Tang (Xiamen University, Xiamen, China)
Bing Li (Xiamen University, Xiamen, China)
Shuyu Wang (Xiamen University, Xiamen, China)

Engineering, Construction and Architectural Management

ISSN: 0969-9988

Article publication date: 31 October 2023

264

Abstract

Purpose

The number of construction dispute cases has maintained a high growth trend in recent years. The effective exploration and management of construction contract risk can directly promote the overall performance of the project life cycle. The miss of clauses may result in a failure to match with standard contracts. If the contract, modified by the owner, omits key clauses, potential disputes may lead to contractors paying substantial compensation. Therefore, the identification of construction project contract missing clauses has heavily relied on the manual review technique, which is inefficient and highly restricted by personnel experience. The existing intelligent means only work for the contract query and storage. It is urgent to raise the level of intelligence for contract clause management. Therefore, this paper aims to propose an intelligent method to detect construction project contract missing clauses based on Natural Language Processing (NLP) and deep learning technology.

Design/methodology/approach

A complete classification scheme of contract clauses is designed based on NLP. First, construction contract texts are pre-processed and converted from unstructured natural language into structured digital vector form. Following the initial categorization, a multi-label classification of long text construction contract clauses is designed to preliminary identify whether the clause labels are missing. After the multi-label clause missing detection, the authors implement a clause similarity algorithm by creatively integrating the image detection thought, MatchPyramid model, with BERT to identify missing substantial content in the contract clauses.

Findings

1,322 construction project contracts were tested. Results showed that the accuracy of multi-label classification could reach 93%, the accuracy of similarity matching can reach 83%, and the recall rate and F1 mean of both can reach more than 0.7. The experimental results verify the feasibility of intelligently detecting contract risk through the NLP-based method to some extent.

Originality/value

NLP is adept at recognizing textual content and has shown promising results in some contract processing applications. However, the mostly used approaches of its utilization for risk detection in construction contract clauses predominantly are rule-based, which encounter challenges when handling intricate and lengthy engineering contracts. This paper introduces an NLP technique based on deep learning which reduces manual intervention and can autonomously identify and tag types of contractual deficiencies, aligning with the evolving complexities anticipated in future construction contracts. Moreover, this method achieves the recognition of extended contract clause texts. Ultimately, this approach boasts versatility; users simply need to adjust parameters such as segmentation based on language categories to detect omissions in contract clauses of diverse languages.

Keywords

Acknowledgements

This research was supported by the National Natural Science Foundation of China (Grant No. 71871192). The China Railway South Investment Group Co. Ltd, Major Science and Technology Planning 2016 “Xiamen Metro No. 3 Line Cross-sea Tunnel Construction Risk Integrated Control and System Development” (2016-Zhongda-08).

Citation

Zhou, H., Gao, B., Tang, S., Li, B. and Wang, S. (2023), "Intelligent detection on construction project contract missing clauses based on deep learning and NLP", Engineering, Construction and Architectural Management, Vol. ahead-of-print No. ahead-of-print. https://doi.org/10.1108/ECAM-02-2023-0172

Publisher

:

Emerald Publishing Limited

Copyright © 2023, Emerald Publishing Limited

Related articles