TY - JOUR
T1 - Density-based clustering methods for unsupervised separation of partial discharge sources
AU - Castro Heredia, Luis Carlos
AU - Rodrigo Mor, Armando
PY - 2019
Y1 - 2019
N2 - The recognition of partial discharge (PD) sources is an important task of the monitoring and diagnostics of high-voltage components. Nowadays, digital PD measuring systems have the capability of extracting features and form scatter plots with such data sets. Part of an unsupervised PD analysis system is to discover clusters within the data sets and link them to particular PD sources. Due to the nature of PD data sets, clusters may appear very close to each other or even merged hindering the separation of sources. Clustering methods based on spatial density such as the density peak clustering (DPC) method and DBSCAN are suitable approaches to discover clusters within PD data sets. However, their accuracy can be reduced due to the proximity among clusters. In this paper, a new method is presented to improve the accuracy of the DPC method. Our method proposes to partition the data set and later pass the resulting subsets to the DPC method. The partitioning is based on the spatial density of data computed by a smoothed density method (SD). SD has the advantage of being fast and not requiring high computational power. As a final step, a routine is applied to group the sub clusters as per the DPC method having a threshold for the data contour distance as a criterion. This method proved higher accuracy to discover clusters in actual PD data sets. However, the threshold for the data contour distance still needs further research.
AB - The recognition of partial discharge (PD) sources is an important task of the monitoring and diagnostics of high-voltage components. Nowadays, digital PD measuring systems have the capability of extracting features and form scatter plots with such data sets. Part of an unsupervised PD analysis system is to discover clusters within the data sets and link them to particular PD sources. Due to the nature of PD data sets, clusters may appear very close to each other or even merged hindering the separation of sources. Clustering methods based on spatial density such as the density peak clustering (DPC) method and DBSCAN are suitable approaches to discover clusters within PD data sets. However, their accuracy can be reduced due to the proximity among clusters. In this paper, a new method is presented to improve the accuracy of the DPC method. Our method proposes to partition the data set and later pass the resulting subsets to the DPC method. The partitioning is based on the spatial density of data computed by a smoothed density method (SD). SD has the advantage of being fast and not requiring high computational power. As a final step, a routine is applied to group the sub clusters as per the DPC method having a threshold for the data contour distance as a criterion. This method proved higher accuracy to discover clusters in actual PD data sets. However, the threshold for the data contour distance still needs further research.
KW - Clustering
KW - DPC
KW - Partial discharge
KW - Spatial density
UR - http://www.scopus.com/inward/record.url?scp=85057788881&partnerID=8YFLogxK
U2 - 10.1016/j.ijepes.2018.11.015
DO - 10.1016/j.ijepes.2018.11.015
M3 - Article
AN - SCOPUS:85057788881
VL - 107
SP - 224
EP - 230
JO - International Journal of Electrical Power & Energy Systems
JF - International Journal of Electrical Power & Energy Systems
SN - 0142-0615
ER -