SODA: A Large-scale Open Site Object Detection Dataset for Deep Learning in Construction

Automation in Construction, 2022

Citation: Duan, R., Deng, H., Tian, M., Deng, Y.C., Lin, J.R. (2022). SODA: A Large-scale Open Site Object Detection Dataset for Deep Learning in Construction. Automation in Construction, 142, 104499. doi: 10.1016/j.autcon.2022.104499 http://doi.org/10.1016/j.autcon.2022.104499

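Abstract

Large-scale image datasets are important to the architecture and construction industry for building deep learning object detection models and for testing algorithms. However, constructing such datasets is complex and demands substantial effort and knowledge from professionals. To this end, this study develops and publicly releases SODA (Site Object Detection dAtaset), a large-scale object detection image dataset dedicated to construction sites. The dataset covers 15 classes of physical objects on construction sites, including workers, materials, machines, and the environment; it spans different camera viewpoints, weather conditions, and construction phases, and contains more than 20,000 photos in total. Experimental analysis shows that the dataset has clear advantages in diversity and scale. Training and testing with YOLO deep learning object detection algorithms further demonstrate the dataset's strengths, with a maximum mAP of 81.47%. In summary, this study builds a large-scale open image dataset for the construction industry and construction scenes, laying an important foundation for the development, testing, and application of related algorithms.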

Comprehensive image datasets can benefit the construction industry by serving as the basis for building deep-learning-based object detection models and for testing the performance of object detection algorithms, but building such datasets is complex and requires extensive professional knowledge. This paper develops and publicly releases a new large-scale image dataset specifically collected and annotated for construction sites, called Site Object Detection dAtaset (SODA), which contains 15 object classes grouped into four categories: worker, material, machine, and layout. More than 20,000 images were collected from multiple construction sites in different situations, weather conditions, and construction phases, covering different angles and perspectives. Statistical analysis shows that the dataset is well developed in terms of diversity and volume. Further evaluation with two widely adopted deep-learning-based object detection algorithms also illustrates the feasibility of the dataset, achieving a maximum mAP of 81.47%. This research contributes a large-scale open image dataset for the construction industry and sets up a performance benchmark for further evaluation of relevant algorithms.

Graphical Abstract

Paper download link

Preprint download link

Dataset download link

The authors acknowledge support from the Guangdong Science Foundation (Grant No. 2022A1515010174); the State Key Lab of Subtropical Building Science, South China University of Technology (No. 2022ZB19); the Guangzhou Science and Technology Program (No. 202201010338); and the National Natural Science Foundation of China (Grant Nos. 72091512, 51908323).

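Related project funding:

2020.1-2022.12: A performance-based fire protection design review method integrating knowledge reasoning and performance simulation

2021.1-2025.12: Risk-oriented resilience assessment and management of urban infrastructure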


林佳瑞

