CODE-ACCORD: A corpus of building regulatory data for rule generation towards automatic compliance checking
Publication date
2025-01-29
ISSN
2052-4463
Abstract
Automatic Compliance Checking (ACC) within the Architecture, Engineering, and Construction (AEC) sector necessitates automating the interpretation of building regulations to achieve its full potential. Converting textual rules into machine-readable formats is challenging due to the complexities of natural language and the scarcity of resources for advanced Machine Learning (ML). Addressing these challenges, we introduce CODE-ACCORD, a dataset of 862 sentences from the building regulations of England and Finland. Only the self-contained sentences, which express complete rules without needing additional context, were considered as they are essential for ACC. Each sentence was manually annotated with entities and relations by a team of 12 annotators to facilitate machine-readable rule generation, followed by careful curation to ensure accuracy. The final dataset comprises 4,297 entities and 4,329 relations across various categories, serving as a robust ground truth. CODE-ACCORD supports a range of ML and Natural Language Processing (NLP) tasks, including text classification, entity recognition, and relation extraction. It enables applying recent trends, such as deep neural networks and large language models, to ACC.
Document type
Article
Document version
Published version
Language
English
Subjects (UDC)
62 - Engineering. Technology
620 - Materials testing. Commercial materials. Economics of energy
69 - Building materials. Building practices and procedures
72 - Architecture
Pages
14 p.
Published by
Springer Nature
Published in
Scientific Data, 12, 170 (2025)
Rights
© The author(s)
Except where otherwise noted, this item's license is described as http://creativecommons.org/licenses/by/4.0/


