RailFOD23: A dataset for foreign object detection on railroad transmission lines.

Journal

Scientific data

ISSN: 2052-4463

Titre abrégé: Sci Data

Pays: England

ID NLM: 101640192

Informations de publication

Date de publication:
16 Jan 2024

Historique:

received: 25 09 2023

accepted: 04 01 2024

medline: 17 1 2024

pubmed: 17 1 2024

entrez: 16 1 2024

Statut: epublish

Résumé

Artificial intelligence models play a crucial role in monitoring and maintaining railroad infrastructure by analyzing image data of foreign objects on power transmission lines. However, the availability of publicly accessible datasets for railroad foreign objects is limited, and the rarity of anomalies in railroad image data, combined with restricted data sharing, poses challenges for training effective foreign object detection models. In this paper, the aim is to present a new dataset of foreign objects on railroad transmission lines, and evaluating the overall performance of mainstream detection models in this context. Taking a unique approach and leveraging large-scale models such as ChatGPT (Chat Generative Pre-trained Transformer) and text-to-image generation models, we synthesize a series of foreign object data. The dataset includes 14,615 images with 40,541 annotated objects, covering four common foreign objects on railroad power transmission lines. Through empirical research on this dataset, we validate the performance of various baseline models in foreign object detection, providing valuable insights for the monitoring and maintenance of railroad facilities.

Identifiants

DOI: 10.1038/s41597-024-02918-9 PMID: 38228610

pubmed: 38228610

doi: 10.1038/s41597-024-02918-9

pii: 10.1038/s41597-024-02918-9

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

Subventions

Organisme : National Natural Science Foundation of China (National Science Foundation of China)

ID : 62063009

Informations de copyright

Références

Shabbir, M. N. S. K., Wang, C., Liang, X. & Adajar, E. A novel toolbox for induced voltage prediction on rail tracks due to ac electromagnetic interference between railway and nearby power lines. IEEE Transactions on Industry Applications 59, 2772–2784 (2023).

doi: 10.1109/TIA.2023.3234935

Feng, Z., Yang, J., Chen, Z. & Kang, Z. Lrseg: An efficient railway region extraction method based on lightweight encoder and self-correcting decoder. Expert Systems with Applications 238, 122386 (2024).

doi: 10.1016/j.eswa.2023.122386

Addai, E. K., Tulashie, S. K., Annan, J.-S. & Yeboah, I. Trend of fire outbreaks in ghana and ways to prevent these incidents. Safety and Health at Work 7, 284–292 (2016).

doi: 10.1016/j.shaw.2016.02.004 pubmed: 27924230 pmcid: 5127973

Sahebi, M. T., Rahman, M. M. & Rahman, M. M. Fire risk situation analysis in the nimtoli area of old dhaka. Journal of the Asiatic Society of Bangladesh, Science 46, 91–102 (2020).

doi: 10.3329/jasbs.v46i1.54232

Su, J. et al. Epnet: Power lines foreign object detection with edge proposal network and data composition. Knowledge-Based Systems 249, 108857 (2022).

doi: 10.1016/j.knosys.2022.108857

Qiu, Z., Zhu, X., Liao, C., Qu, W. & Yu, Y. A lightweight yolov4-edam model for accurate and real-time detection of foreign objects suspended on power lines. IEEE Transactions on Power Delivery 38, 1329–1340 (2023).

doi: 10.1109/TPWRD.2022.3213598

Li, H., Dong, Y., Liu, Y. & Ai, J. Design and implementation of uavs for bird’s nest inspection on transmission lines based on deep learning. Drones 6, 252 (2022).

doi: 10.3390/drones6090252

Rorabaugh, J. et al. Resonant grounded isolation transformers to prevent ignitions from powerline faults. IEEE Transactions on Power Delivery 36, 2287–2297 (2021).

doi: 10.1109/TPWRD.2020.3030220

Jahn, W., Urban, J. L. & Rein, G. Powerlines and wildfires: Overview, perspectives, and climate change: Could there be more electricity blackouts in the future? IEEE Power and Energy Magazine 20, 16–27 (2022).

doi: 10.1109/MPE.2021.3122755

Chen, Z., Yang, J., Chen, L., Feng, Z. & Jia, L. Efficient railway track region segmentation algorithm based on lightweight neural network and cross-fusion decoder. Automation in Construction 155, 105069 (2023).

doi: 10.1016/j.autcon.2023.105069

Chen, Z. et al. Fast vehicle detection algorithm in traffic scene based on improved ssd. Measurement 201, 111655 (2022).

doi: 10.1016/j.measurement.2022.111655

Wu, Y. et al. Automatic railroad track components inspection using hybrid deep learning framework. IEEE Transactions on Instrumentation and Measurement 72, 1–15 (2023).

Keshun, Y. & Huizhong, L. Feature detection of mineral zoning in spiral slope flow under complex conditions based on improved yolov5 algorithm. Physica Scripta 99, 016001 (2023).

doi: 10.1088/1402-4896/ad0f7d

Keshun, Y., Guangqi, Q. & Yingkui, G. Optimizing prior distribution parameters for probabilistic prediction of remaining useful life using deep learning. Reliability Engineering & System Safety 242, 109793 (2024).

doi: 10.1016/j.ress.2023.109793

Keshun, Y., Guangqi, Q. & Yingkui, G. Remaining useful life prediction of lithium-ion batteries using em-pf-ssa-svr with gamma stochastic process. Measurement Science and Technology 35, 015015 (2023).

doi: 10.1088/1361-6501/acfbef

Chen, Z., Yang, J., Chen, L. & Jiao, H. Garbage classification system based on improved shufflenet v2. Resources, Conservation and Recycling 178, 106090 (2022).

doi: 10.1016/j.resconrec.2021.106090

Creswell, A. et al. Generative adversarial networks: An overview. IEEE Signal Processing Magazine 35, 53–65 (2018).

doi: 10.1109/MSP.2017.2765202

Cooper, P. S. et al. Standardised images of novel objects created with generative adversarial networks. Scientific Data 10, 575 (2023).

doi: 10.1038/s41597-023-02483-7 pubmed: 37660073 pmcid: 10475029

Jin, T., Ye, X. & Li, Z. Establishment and evaluation of conditional gan-based image dataset for semantic segmentation of structural cracks. Engineering Structures 285, 116058 (2023).

doi: 10.1016/j.engstruct.2023.116058

Ali, R. & Cha, Y.-J. Attention-based generative adversarial network with internal damage segmentation using thermography. Automation in Construction 141, 104412 (2022).

doi: 10.1016/j.autcon.2022.104412

Kang, D. H. & Cha, Y.-J. Efficient attention-based deep encoder and decoder for automatic crack segmentation. Structural Health Monitoring 21, 2190–2205 (2022).

doi: 10.1177/14759217211053776 pubmed: 36039173

Chen, Z., Yang, J., Feng, Z., Chen, L. & Li, L. Bishufflenext: A lightweight bi-path network for remote sensing scene classification. Measurement 209, 112537 (2023).

doi: 10.1016/j.measurement.2023.112537

Chen, Z., Yang, J., Feng, Z. & Chen, L. Rscnet: An efficient remote sensing scene classification model based on lightweight convolution neural networks. Electronics 11 (2022).

Chen, Z., Yang, J. & Yang, C. Brightsightnet: A lightweight progressive low-light image enhancement network and its application in “rainbow” maglev train. Journal of King Saud University - Computer and Information Sciences 35, 101814 (2023).

doi: 10.1016/j.jksuci.2023.101814

Yang, L. Conditional generative adversarial networks (cgan) for abnormal vibration of aero engine analysis. In 2020 IEEE 2nd International Conference on Civil Aviation Safety and Information Technology (ICCASIT, 724–728 (2020).

Sadeghi, M., Leglaive, S., Alameda-Pineda, X., Girin, L. & Horaud, R. Audio-visual speech enhancement using conditional variational auto-encoders. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1788–1800 (2020).

doi: 10.1109/TASLP.2020.3000593

Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. High-resolution image synthesis with latent diffusion models. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 10674–10685 (2022).

Zendel, O. et al. Railsem19: A dataset for semantic rail scene understanding. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 1221–1229 (2019).

Cong, W. et al. High-resolution image harmonization via collaborative dual transformations. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 18449–18458 (2022).

Wang, X. et al. Esrgan: Enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops, 63–79 (2018).

Russell, B. C., Torralba, A., Murphy, K. P. & Freeman, W. T. Labelme: a database and web-based tool for image annotation. International journal of computer vision 77, 157–173 (2008).

doi: 10.1007/s11263-007-0090-8

Chen, Z. Railfod23.zip. figshare https://doi.org/10.6084/m9.figshare.24180738.v3 (2023).

He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778 (2016).

Jiang, P., Ergu, D., Liu, F., Cai, Y. & Ma, B. A review of yolo algorithm developments. Procedia Computer Science 199, 1066–1073 (2022).

doi: 10.1016/j.procs.2022.01.135

Lin, T.-Y., Goyal, P., Girshick, R., He, K. & DollÃ¡r, P. Focal loss for dense object detection. In 2017 IEEE International Conference on Computer Vision (ICCV), 2999–3007 (2017).

Carion, N. et al. End-to-end object detection with transformers. In European conference on computer vision, 213–229 (Springer, 2020).

Ren, S., He, K., Girshick, R. & Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 1137–1149 (2017).

doi: 10.1109/TPAMI.2016.2577031 pubmed: 27295650

Pang, J. et al. Libra r-cnn: Towards balanced learning for object detection. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 821–830 (2019).

Sun, P. et al. Sparse r-cnn: End-to-end object detection with learnable proposals. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14449–14458 (2021).

Chen, K. et al. MMDetection: Open mmlab detection toolbox and benchmark. arXiv preprint arXiv:1906.07155 https://doi.org/10.48550/arXiv.1906.07155 (2019).

doi: 10.48550/arXiv.1906.07155

Xing, Z., Chen, X. & Pang, F. Dd-yolo: An object detection method combining knowledge distillation and differentiable architecture search. IET Computer Vision 16, 418–430 (2022).

doi: 10.1049/cvi2.12097

RailFOD23: A dataset for foreign object detection on railroad transmission lines.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Subventions

Informations de copyright

Références

Auteurs

Zhichao Chen (Z)

Jie Yang (J)

Zhicheng Feng (Z)

Hao Zhu (H)

Classifications MeSH