Publicación:
Enhancing consistency in piping and instrumentation diagrams using DistilBERT and smart PID systems

dc.contributor.authorGómez-Vega, F.S.
dc.contributor.authorAcuña, O.
dc.contributor.authorCamargo, Andrea C.
dc.contributor.authorJimenez, Jeison D.
dc.contributor.authorGaleano, Sara M.
dc.contributor.authorFranco, Isabella E.
dc.contributor.authorLozano, Laura L.
dc.contributor.authorVásquez Aguilar, Jenifer Yoris
dc.contributor.authorPuertas Del Castillo, Edwin Alexander
dc.contributor.researchgroupGrupo de Investigación Tecnologías Aplicadas y Sistemas de Información (GRITAS)
dc.contributor.seedbedsSemillero de Investigación en Inteligencia Artificial
dc.date.accessioned2025-08-28T13:40:47Z
dc.date.issued2025-08-09
dc.descriptionContiene gráficos
dc.description.abstractThis study presents a novel approach utilizing DistilBERT, a lightweight variant of BERT, to identify inconsistencies in piping and instrumentation diagrams (P&IDs) within SmartPID systems. A structured dataset was constructed by extracting engineering design data from a SQL-based SmartPID database, monitoring all modifications and updates made throughout the design phase. The DistilBERT model was fine-tuned on this dataset to recognize inconsistencies in real-time, achieving an impressive F1 score of 99% and a loss of 0.04%. The model’s performance was validated by domain experts, who confirmed the detected inconsistencies as highly accurate. Our approach significantly reduces the manual effort required for P&ID review and improves design consistency, demonstrating the potential for enhanced safety and efficiency in complex industrial projects. Future work will focus on refining the model’s parameters and expanding its application across different industries.eng
dc.description.researchareaInteligencia artificial
dc.description.researchareaAnalítica de datos y Big Data
dc.format.extent9 páginas
dc.format.mimetypeapplication/pdf
dc.identifier.citationGómez-Vega, F. S., Acuña, O., Camargo, A. C., Jimenez, J. D., Galeano, S. M., Franco, I. E., ... & Puertas, E. (2025). Enhancing consistency in piping and instrumentation diagrams using DistilBERT and smart PID systems. Systems and Soft Computing, 200373.
dc.identifier.doihttps://doi.org/10.1016/j.sasc.2025.200373
dc.identifier.urihttps://hdl.handle.net/20.500.12585/14179
dc.language.isoeng
dc.publisherSystems and Soft Computing
dc.relation.referencesDevlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019, June). Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics: human language technologies, volume 1 (long and short papers) (pp. 4171-4186).
dc.relation.referencesSanh, V., Debut, L., Chaumond, J., & Wolf, T. (2019). DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
dc.relation.referencesVieira, E. L., da Costa, S. E. G., de Lima, E. P., & Ferreira, C. C. (2019). Application of the Proknow-C methodology in the search of literature on performance indicators for energy management in manufacturing and industry 4.0. Procedia Manufacturing, 39, 1259-1269.
dc.relation.referencesKrüger, S., & Borsato, M. (2019). Developing knowledge on digital manufacturing to digital twin: A bibliometric and systemic analysis. Procedia Manufacturing, 38, 1174-1180.
dc.relation.referencesGajbhiye, S. M., Bhamre, S. R., Tadepalli, L. T., Pillai, M. R., & Uplaonkar, D. (2023, November). Advancing P&ID digitization with YOLOv5. In 2023 International Conference on Integrated Intelligence and Communication Systems (ICIICS) (pp. 1-6). IEEE.
dc.relation.referencesDzhusupova, R., Banotra, R., Bosch, J., & Olsson, H. H. (2023). Using artificial intelligence to find design errors in the engineering drawings. Journal of Software: Evolution and Process, 35(12), e2543.
dc.relation.referencesSu, G., Zhao, S., Li, T., Liu, S., Li, Y., Zhao, G., & Li, Z. (2024). Image format pipeline and instrument diagram recognition method based on deep learning. Biomimetic Intelligence and Robotics, 4(1), 100142.
dc.relation.referencesIbrahim, Y. M., Lukins, T. C., Zhang, X., Trucco, E., & Kaka, A. P. (2009). Towards automated progress assessment of workpackage components in construction projects using computer vision. Advanced Engineering Informatics, 23(1), 93-103.
dc.relation.referencesYu, E. S., Cha, J. M., Lee, T., Kim, J., & Mun, D. (2019). Features recognition from piping and instrumentation diagrams in image format using a deep learning network. Energies, 12(23), 4425.
dc.relation.referencesGellaboina, M. K., & Venkoparao, V. G. (2009, February). Graphic symbol recognition using auto associative neural network model. In 2009 Seventh International Conference on Advances in Pattern Recognition (pp. 297-301). IEEE.
dc.relation.referencesRahul, R., Paliwal, S., Sharma, M., & Vig, L. (2019). Automatic information extraction from piping and instrumentation diagrams. arXiv preprint arXiv:1901.11383.
dc.relation.referencesOeing, J., Welscher, W., Krink, N., Jansen, L., Henke, F., & Kockmann, N. (2022). Using artificial intelligence to support the drawing of piping and instrumentation diagrams using DEXPI standard. Digital Chemical Engineering, 4, 100038.
dc.relation.referencesKim, H., Lee, W., Kim, M., Moon, Y., Lee, T., Cho, M., & Mun, D. (2021). Deep-learning-based recognition of symbols and texts at an industrially applicable level from images of high-density piping and instrumentation diagrams. Expert Systems with Applications, 183, 115337.
dc.relation.referencesGuermazi, Y., Sellami, S., & Boucelma, O. (2022, August). A roberta based approach for address validation. In European Conference on Advances in Databases and Information Systems (pp. 157-166). Cham: Springer International Publishing.
dc.relation.referencesVerpoort, P. C., MacDonald, P., & Conduit, G. J. (2018). Materials data validation and imputation with an artificial neural network. Computational Materials Science, 147, 176-185.
dc.relation.referencesGhosh, S., Zaboli, A., Hong, J., & Kwon, J. (2024, June). Object-focused Risk Evaluation of AI-driven Perception Systems in Autonomous Vehicles. In 2024 IEEE Transportation Electrification Conference and Expo (ITEC) (pp. 1-5). IEEE.
dc.relation.referencesKubola, K., Kongon, B., Boonmee, P., & Jitngernmadan, P. (2023, November). Optimal AI Model for Industrial AR Application based on Object Detection. In 2023 7th International Conference on Information Technology (InCIT) (pp. 38-42). IEEE.
dc.relation.referencesWozniak, A. L., Duong, N. Q., Benderitter, I., Leroy, S., Segura, S., & Mazo, R. (2023, July). Robustness testing of an industrial road object detection system. In 2023 IEEE International Conference On Artificial Intelligence Testing (AITest) (pp. 82-89). IEEE.
dc.relation.referencesSingla, S., & Ramachandra, N. (2020). Comparative analysis of transformer based pre-trained NLP Models. Int. J. Comput. Sci. Eng, 8(11), 40-44.
dc.relation.referencesQasim, R., Bangyal, W. H., Alqarni, M. A., & Ali Almazroi, A. (2022). A fine‐tuned BERT‐based transfer learning approach for text classification. Journal of healthcare engineering, 2022(1), 3498123.
dc.relation.referencesNagarajan, A., Sen, S., Stevens, J. R., & Raghunathan, A. (2020). Optimizing transformers with approximate computing for faster, smaller and more accurate NLP models.
dc.relation.referencesZhang, W., Joseph, J., Yin, Y., Xie, L., Furuhata, T., Yamakawa, S., ... & Kara, L. B. (2023). Component segmentation of engineering drawings using Graph Convolutional Networks. Computers in Industry, 147, 103885.
dc.relation.referencesPaliwal, S., Sharma, M., & Vig, L. (2021, July). OSSR-PID: one-shot symbol recognition in P&ID sheets using path sampling and GCN. In 2021 International joint conference on neural networks (IJCNN) (pp. 1-8). IEEE.
dc.relation.referencesSarkar, S., Pandey, P., & Kar, S. (2022). Automatic detection and classification of symbols in engineering drawings. arXiv preprint arXiv:2204.13277.
dc.relation.referencesPaliwal, S., Jain, A., Sharma, M., & Vig, L. (2021, May). Digitize-PID: Automatic digitization of piping and instrumentation diagrams. In Pacific-Asia Conference on Knowledge Discovery and Data Mining (pp. 168-180). Cham: Springer International Publishing.
dc.relation.referencesWu, S., Wang, Y., Yang, H., & Wang, P. (2022). Improved faster R-CNN for the detection method of industrial control logic graph recognition. Frontiers in Bioengineering and Biotechnology, 10, 944944.
dc.relation.referencesChandra, H. (2018, August). Analysis of change data capture method in heterogeneous data sources to support RTDW. In 2018 4th International Conference on Computer and Information Sciences (ICCOINS) (pp. 1-6). IEEE.
dc.relation.referencesHao, L., Jiang, T., Lin, Y., & Lu, Y. (2022, July). Methods for solving the change data capture problem. In The International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (pp. 781-788). Cham: Springer International Publishing.
dc.relation.referencesSalau, A. O., & Beyene, M. M. (2024). Software defined networking based network traffic classification using machine learning techniques. Scientific Reports, 14(1), 20060.
dc.relation.referencesSingh, P. (2023). Systematic review of data-centric approaches in artificial intelligence and machine learning. Data Science and Management, 6(3), 144-157.
dc.relation.referencesFoalem, P. L., Khomh, F., & Li, H. (2024). Studying logging practice in machine learning-based applications. Information and Software Technology, 170, 107450.
dc.relation.referencesChen, A., Chow, A., Davidson, A., DCunha, A., Ghodsi, A., Hong, S. A., ... & Zumar, C. (2020, June). Developments in mlflow: A system to accelerate the machine learning lifecycle. In Proceedings of the fourth international workshop on data management for end-to-end machine learning (pp. 1-4).
dc.rights.licenseAtribución-NoComercial-SinDerivadas 4.0 Internacional (CC BY-NC-ND 4.0)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.proposalDatabases
dc.subject.proposalP&ID
dc.subject.proposalConsistency
dc.subject.proposalDistilBERT
dc.subject.proposalMachine learning
dc.subject.proposalDesign software
dc.titleEnhancing consistency in piping and instrumentation diagrams using DistilBERT and smart PID systemseng
dc.typeArtículo de revista
dc.type.coarhttp://purl.org/coar/resource_type/c_18cf
dc.type.coarversionhttp://purl.org/coar/version/c_970fb48d4fbd8a85
dc.type.contentText
dc.type.driverinfo:eu-repo/semantics/article
dc.type.redcolhttp://purl.org/redcol/resource_type/ART
dc.type.versioninfo:eu-repo/semantics/publishedVersion
dcterms.audienceResearcheng
dspace.entity.typePublication
relation.isAuthorOfPublicationf9c6bfe7-a984-49c0-ae7f-5ec856364a76
relation.isAuthorOfPublication84e86005-e232-4d13-ab38-68f0f2b4aeb0
relation.isAuthorOfPublication.latestForDiscovery84e86005-e232-4d13-ab38-68f0f2b4aeb0

Archivos

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
aet.pdf
Tamaño:
1.71 MB
Formato:
Adobe Portable Document Format

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
14.49 KB
Formato:
Item-specific license agreed upon to submission
Descripción: