TY - JOUR
T1 - Digitization of chemical process flow diagrams using deep convolutional neural networks
AU - Theisen, Maximilian F.
AU - Flores, Kenji Nishizaki
AU - Schulze Balhorn, Lukas
AU - Schweidtmann, Artur M.
PY - 2023
Y1 - 2023
N2 - Advances in deep convolutional neural networks led to breakthroughs in many computer vision applications. In chemical engineering, a number of tools have been developed for the digitization of Process and Instrumentation Diagrams. However, there is no framework for the digitization of process flow diagrams (PFDs). PFDs are difficult to digitize because of the large variability in the data, e.g., there are multiple ways to depict unit operations in PFDs. We propose a two-step framework for digitizing PFDs: (i) unit operations are detected using a deep learning powered object detection model, (ii) the connectivities between unit operations are detected using a pixel-based search algorithm. To ensure robustness, we collect and label over 1000 PFDs from diversified sources including various scientific journals and books. To cope with the high intra-class variability in the data, we define 47 distinct classes that account for different drawing styles of unit operations. Our algorithm delivers accurate and robust results on an independent test set. We report promising results for line and unit operation detection with an Average Precision at 50 percent (AP50) of 88% and an Average Precision (AP) of 68% for the detection of unit operations.
AB - Advances in deep convolutional neural networks led to breakthroughs in many computer vision applications. In chemical engineering, a number of tools have been developed for the digitization of Process and Instrumentation Diagrams. However, there is no framework for the digitization of process flow diagrams (PFDs). PFDs are difficult to digitize because of the large variability in the data, e.g., there are multiple ways to depict unit operations in PFDs. We propose a two-step framework for digitizing PFDs: (i) unit operations are detected using a deep learning powered object detection model, (ii) the connectivities between unit operations are detected using a pixel-based search algorithm. To ensure robustness, we collect and label over 1000 PFDs from diversified sources including various scientific journals and books. To cope with the high intra-class variability in the data, we define 47 distinct classes that account for different drawing styles of unit operations. Our algorithm delivers accurate and robust results on an independent test set. We report promising results for line and unit operation detection with an Average Precision at 50 percent (AP50) of 88% and an Average Precision (AP) of 68% for the detection of unit operations.
KW - Deep convolutional neural network
KW - Digitalization
KW - Flowsheet digitization
KW - Machine learning
KW - Object detection
KW - Process flow diagrams (PFD)
UR - http://www.scopus.com/inward/record.url?scp=85148429110&partnerID=8YFLogxK
U2 - 10.1016/j.dche.2022.100072
DO - 10.1016/j.dche.2022.100072
M3 - Article
AN - SCOPUS:85148429110
SN - 2772-5081
VL - 6
SP - 11
JO - Digital Chemical Engineering
JF - Digital Chemical Engineering
M1 - 100072
ER -