RailFOD23: A dataset for foreign object detection on railroad transmission lines.


Journal

Scientific data
ISSN: 2052-4463
Titre abrégé: Sci Data
Pays: England
ID NLM: 101640192

Informations de publication

Date de publication:
16 Jan 2024
Historique:
received: 25 09 2023
accepted: 04 01 2024
medline: 17 1 2024
pubmed: 17 1 2024
entrez: 16 1 2024
Statut: epublish

Résumé

Artificial intelligence models play a crucial role in monitoring and maintaining railroad infrastructure by analyzing image data of foreign objects on power transmission lines. However, the availability of publicly accessible datasets for railroad foreign objects is limited, and the rarity of anomalies in railroad image data, combined with restricted data sharing, poses challenges for training effective foreign object detection models. In this paper, the aim is to present a new dataset of foreign objects on railroad transmission lines, and evaluating the overall performance of mainstream detection models in this context. Taking a unique approach and leveraging large-scale models such as ChatGPT (Chat Generative Pre-trained Transformer) and text-to-image generation models, we synthesize a series of foreign object data. The dataset includes 14,615 images with 40,541 annotated objects, covering four common foreign objects on railroad power transmission lines. Through empirical research on this dataset, we validate the performance of various baseline models in foreign object detection, providing valuable insights for the monitoring and maintenance of railroad facilities.

Identifiants

pubmed: 38228610
doi: 10.1038/s41597-024-02918-9
pii: 10.1038/s41597-024-02918-9
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

72

Subventions

Organisme : National Natural Science Foundation of China (National Science Foundation of China)
ID : 62063009

Informations de copyright

© 2024. The Author(s).

Références

Shabbir, M. N. S. K., Wang, C., Liang, X. & Adajar, E. A novel toolbox for induced voltage prediction on rail tracks due to ac electromagnetic interference between railway and nearby power lines. IEEE Transactions on Industry Applications 59, 2772–2784 (2023).
doi: 10.1109/TIA.2023.3234935
Feng, Z., Yang, J., Chen, Z. & Kang, Z. Lrseg: An efficient railway region extraction method based on lightweight encoder and self-correcting decoder. Expert Systems with Applications 238, 122386 (2024).
doi: 10.1016/j.eswa.2023.122386
Addai, E. K., Tulashie, S. K., Annan, J.-S. & Yeboah, I. Trend of fire outbreaks in ghana and ways to prevent these incidents. Safety and Health at Work 7, 284–292 (2016).
doi: 10.1016/j.shaw.2016.02.004 pubmed: 27924230 pmcid: 5127973
Sahebi, M. T., Rahman, M. M. & Rahman, M. M. Fire risk situation analysis in the nimtoli area of old dhaka. Journal of the Asiatic Society of Bangladesh, Science 46, 91–102 (2020).
doi: 10.3329/jasbs.v46i1.54232
Su, J. et al. Epnet: Power lines foreign object detection with edge proposal network and data composition. Knowledge-Based Systems 249, 108857 (2022).
doi: 10.1016/j.knosys.2022.108857
Qiu, Z., Zhu, X., Liao, C., Qu, W. & Yu, Y. A lightweight yolov4-edam model for accurate and real-time detection of foreign objects suspended on power lines. IEEE Transactions on Power Delivery 38, 1329–1340 (2023).
doi: 10.1109/TPWRD.2022.3213598
Li, H., Dong, Y., Liu, Y. & Ai, J. Design and implementation of uavs for bird’s nest inspection on transmission lines based on deep learning. Drones 6, 252 (2022).
doi: 10.3390/drones6090252
Rorabaugh, J. et al. Resonant grounded isolation transformers to prevent ignitions from powerline faults. IEEE Transactions on Power Delivery 36, 2287–2297 (2021).
doi: 10.1109/TPWRD.2020.3030220
Jahn, W., Urban, J. L. & Rein, G. Powerlines and wildfires: Overview, perspectives, and climate change: Could there be more electricity blackouts in the future? IEEE Power and Energy Magazine 20, 16–27 (2022).
doi: 10.1109/MPE.2021.3122755
Chen, Z., Yang, J., Chen, L., Feng, Z. & Jia, L. Efficient railway track region segmentation algorithm based on lightweight neural network and cross-fusion decoder. Automation in Construction 155, 105069 (2023).
doi: 10.1016/j.autcon.2023.105069
Chen, Z. et al. Fast vehicle detection algorithm in traffic scene based on improved ssd. Measurement 201, 111655 (2022).
doi: 10.1016/j.measurement.2022.111655
Wu, Y. et al. Automatic railroad track components inspection using hybrid deep learning framework. IEEE Transactions on Instrumentation and Measurement 72, 1–15 (2023).
Keshun, Y. & Huizhong, L. Feature detection of mineral zoning in spiral slope flow under complex conditions based on improved yolov5 algorithm. Physica Scripta 99, 016001 (2023).
doi: 10.1088/1402-4896/ad0f7d
Keshun, Y., Guangqi, Q. & Yingkui, G. Optimizing prior distribution parameters for probabilistic prediction of remaining useful life using deep learning. Reliability Engineering & System Safety 242, 109793 (2024).
doi: 10.1016/j.ress.2023.109793
Keshun, Y., Guangqi, Q. & Yingkui, G. Remaining useful life prediction of lithium-ion batteries using em-pf-ssa-svr with gamma stochastic process. Measurement Science and Technology 35, 015015 (2023).
doi: 10.1088/1361-6501/acfbef
Chen, Z., Yang, J., Chen, L. & Jiao, H. Garbage classification system based on improved shufflenet v2. Resources, Conservation and Recycling 178, 106090 (2022).
doi: 10.1016/j.resconrec.2021.106090
Creswell, A. et al. Generative adversarial networks: An overview. IEEE Signal Processing Magazine 35, 53–65 (2018).
doi: 10.1109/MSP.2017.2765202
Cooper, P. S. et al. Standardised images of novel objects created with generative adversarial networks. Scientific Data 10, 575 (2023).
doi: 10.1038/s41597-023-02483-7 pubmed: 37660073 pmcid: 10475029
Jin, T., Ye, X. & Li, Z. Establishment and evaluation of conditional gan-based image dataset for semantic segmentation of structural cracks. Engineering Structures 285, 116058 (2023).
doi: 10.1016/j.engstruct.2023.116058
Ali, R. & Cha, Y.-J. Attention-based generative adversarial network with internal damage segmentation using thermography. Automation in Construction 141, 104412 (2022).
doi: 10.1016/j.autcon.2022.104412
Kang, D. H. & Cha, Y.-J. Efficient attention-based deep encoder and decoder for automatic crack segmentation. Structural Health Monitoring 21, 2190–2205 (2022).
doi: 10.1177/14759217211053776 pubmed: 36039173
Chen, Z., Yang, J., Feng, Z., Chen, L. & Li, L. Bishufflenext: A lightweight bi-path network for remote sensing scene classification. Measurement 209, 112537 (2023).
doi: 10.1016/j.measurement.2023.112537
Chen, Z., Yang, J., Feng, Z. & Chen, L. Rscnet: An efficient remote sensing scene classification model based on lightweight convolution neural networks. Electronics 11 (2022).
Chen, Z., Yang, J. & Yang, C. Brightsightnet: A lightweight progressive low-light image enhancement network and its application in “rainbow” maglev train. Journal of King Saud University - Computer and Information Sciences 35, 101814 (2023).
doi: 10.1016/j.jksuci.2023.101814
Yang, L. Conditional generative adversarial networks (cgan) for abnormal vibration of aero engine analysis. In 2020 IEEE 2nd International Conference on Civil Aviation Safety and Information Technology (ICCASIT, 724–728 (2020).
Sadeghi, M., Leglaive, S., Alameda-Pineda, X., Girin, L. & Horaud, R. Audio-visual speech enhancement using conditional variational auto-encoders. IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 1788–1800 (2020).
doi: 10.1109/TASLP.2020.3000593
Rombach, R., Blattmann, A., Lorenz, D., Esser, P. & Ommer, B. High-resolution image synthesis with latent diffusion models. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 10674–10685 (2022).
Zendel, O. et al. Railsem19: A dataset for semantic rail scene understanding. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 1221–1229 (2019).
Cong, W. et al. High-resolution image harmonization via collaborative dual transformations. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 18449–18458 (2022).
Wang, X. et al. Esrgan: Enhanced super-resolution generative adversarial networks. In Proceedings of the European conference on computer vision (ECCV) workshops, 63–79 (2018).
Russell, B. C., Torralba, A., Murphy, K. P. & Freeman, W. T. Labelme: a database and web-based tool for image annotation. International journal of computer vision 77, 157–173 (2008).
doi: 10.1007/s11263-007-0090-8
Chen, Z. Railfod23.zip. figshare https://doi.org/10.6084/m9.figshare.24180738.v3 (2023).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 770–778 (2016).
Jiang, P., Ergu, D., Liu, F., Cai, Y. & Ma, B. A review of yolo algorithm developments. Procedia Computer Science 199, 1066–1073 (2022).
doi: 10.1016/j.procs.2022.01.135
Lin, T.-Y., Goyal, P., Girshick, R., He, K. & Dollár, P. Focal loss for dense object detection. In 2017 IEEE International Conference on Computer Vision (ICCV), 2999–3007 (2017).
Carion, N. et al. End-to-end object detection with transformers. In European conference on computer vision, 213–229 (Springer, 2020).
Ren, S., He, K., Girshick, R. & Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 39, 1137–1149 (2017).
doi: 10.1109/TPAMI.2016.2577031 pubmed: 27295650
Pang, J. et al. Libra r-cnn: Towards balanced learning for object detection. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 821–830 (2019).
Sun, P. et al. Sparse r-cnn: End-to-end object detection with learnable proposals. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14449–14458 (2021).
Chen, K. et al. MMDetection: Open mmlab detection toolbox and benchmark. arXiv preprint arXiv:1906.07155 https://doi.org/10.48550/arXiv.1906.07155 (2019).
doi: 10.48550/arXiv.1906.07155
Xing, Z., Chen, X. & Pang, F. Dd-yolo: An object detection method combining knowledge distillation and differentiable architecture search. IET Computer Vision 16, 418–430 (2022).
doi: 10.1049/cvi2.12097

Auteurs

Zhichao Chen (Z)

Department of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi Province, 341000, China.
Jiangxi Provincial Key Laboratory of Maglev Technology, Ganzhou, Jiangxi Province, 341000, China.

Jie Yang (J)

Department of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi Province, 341000, China. yangjie@jxust.edu.cn.
Jiangxi Provincial Key Laboratory of Maglev Technology, Ganzhou, Jiangxi Province, 341000, China. yangjie@jxust.edu.cn.

Zhicheng Feng (Z)

Department of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi Province, 341000, China.
Jiangxi Provincial Key Laboratory of Maglev Technology, Ganzhou, Jiangxi Province, 341000, China.

Hao Zhu (H)

Department of Electrical Engineering and Automation, Jiangxi University of Science and Technology, Ganzhou, Jiangxi Province, 341000, China.

Classifications MeSH