A Defect Classification Methodology for Sewer Image Sets with Convolutional Neural Networks

doi:10.1016/j.autcon.2019.04.013

Dirk Meijer^*, Lisa Scholten, Francois Clemens, Arno Knobbe

^*Corresponding author for this work

Sanitary Engineering

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Sewer pipes are commonly inspected in situ with CCTV equipment. The CCTV footage is then reviewed by human operators in order to classify defects in the pipes and make a recommendation on possible interventions. This process is both labor-intensive and error-prone. Other researchers have suggested machine learning techniques to (partially) automate the human review of this footage, but the automated classifiers are often validated in artificial testing setups, leading to biased results that do not translate directly to operational impact. In this work, we discuss suitable evaluation metrics for this specific classification task — most notably ‘specificity at sensitivity’ and ‘precision at recall’ — and the importance of using a validation setup that includes a realistic ratio of images with defects to images without defects, and a sufficiently large dataset. We also introduce ‘leave-two-inspections-out’ cross validation, designed to eliminate a data leakage bias that would otherwise cause an overestimation of classifier performance. We designed a convolutional neural network (CNN) and applied this validation methodology to automatically detect the twelve most common defect types in a dataset of over 2 million CCTV images. With this dataset and our validation methodology, our CNN outperforms the state-of-the-art. Classification performance was highest for intruding and defective connections and lowest for porous pipes. While the CNN is not capable of fully automated classification at sufficient performance levels, we determined that if we augment the human operator with the CNN, this may reduce the required human labor by up to 60.5%.

Original language	English
Pages (from-to)	281-298
Number of pages	18
Journal	Automation in Construction
Volume	104
DOIs	https://doi.org/10.1016/j.autcon.2019.04.013
Publication status	Published - 2019

Bibliographical note

Accepted Author Manuscript

Keywords

Automated classification
CCTV inspection
Classifier validation
Convolutional neural networks
Image processing
Sewer asset management

Access to Document

10.1016/j.autcon.2019.04.013

AUTCON_2818_meijer_defect_classification_methodology (Accepted author manuscript, 3.42 MB; Licence: CC BY-NC-ND)

Cite this

@article{084b065d495b441aa806abfc2ea8f27c,

title = "A Defect Classification Methodology for Sewer Image Sets with Convolutional Neural Networks",

abstract = "Sewer pipes are commonly inspected in situ with CCTV equipment. The CCTV footage is then reviewed by human operators in order to classify defects in the pipes and make a recommendation on possible interventions. This process is both labor-intensive and error-prone. Other researchers have suggested machine learning techniques to (partially) automate the human review of this footage, but the automated classifiers are often validated in artificial testing setups, leading to biased results that do not translate directly to operational impact. In this work, we discuss suitable evaluation metrics for this specific classification task --- most notably {\textquoteleft}specificity at sensitivity{\textquoteright} and {\textquoteleft}precision at recall{\textquoteright} --- and the importance of using a validation setup that includes a realistic ratio of images with defects to images without defects, and a sufficiently large dataset. We also introduce {\textquoteleft}leave-two-inspections-out{\textquoteright} cross validation, designed to eliminate a data leakage bias that would otherwise cause an overestimation of classifier performance. We designed a convolutional neural network (CNN) and applied this validation methodology to automatically detect the twelve most common defect types in a dataset of over 2 million CCTV images. With this dataset and our validation methodology, our CNN outperforms the state-of-the-art. Classification performance was highest for intruding and defective connections and lowest for porous pipes. While the CNN is not capable of fully automated classification at sufficient performance levels, we determined that if we augment the human operator with the CNN, this may reduce the required human labor by up to 60.5\%.",

keywords = "Automated classification, CCTV inspection, Classifier validation, Convolutional neural networks, Image processing, Sewer asset management",

author = "Dirk Meijer and Lisa Scholten and Francois Clemens and Arno Knobbe",

note = "Accepted Author Manuscript",

year = "2019",

doi = "10.1016/j.autcon.2019.04.013",

language = "English",

volume = "104",

pages = "281--298",

journal = "Automation in Construction",

issn = "0926-5805",

publisher = "Elsevier",

}

TY - JOUR

T1 - A Defect Classification Methodology for Sewer Image Sets with Convolutional Neural Networks

AU - Meijer, Dirk

AU - Scholten, Lisa

AU - Clemens, Francois

AU - Knobbe, Arno

N1 - Accepted Author Manuscript

PY - 2019

Y1 - 2019

AB - Sewer pipes are commonly inspected in situ with CCTV equipment. The CCTV footage is then reviewed by human operators in order to classify defects in the pipes and make a recommendation on possible interventions. This process is both labor-intensive and error-prone. Other researchers have suggested machine learning techniques to (partially) automate the human review of this footage, but the automated classifiers are often validated in artificial testing setups, leading to biased results that do not translate directly to operational impact. In this work, we discuss suitable evaluation metrics for this specific classification task — most notably ‘specificity at sensitivity’ and ‘precision at recall’ — and the importance of using a validation setup that includes a realistic ratio of images with defects to images without defects, and a sufficiently large dataset. We also introduce ‘leave-two-inspections-out’ cross validation, designed to eliminate a data leakage bias that would otherwise cause an overestimation of classifier performance. We designed a convolutional neural network (CNN) and applied this validation methodology to automatically detect the twelve most common defect types in a dataset of over 2 million CCTV images. With this dataset and our validation methodology, our CNN outperforms the state-of-the-art. Classification performance was highest for intruding and defective connections and lowest for porous pipes. While the CNN is not capable of fully automated classification at sufficient performance levels, we determined that if we augment the human operator with the CNN, this may reduce the required human labor by up to 60.5%.

KW - Automated classification

KW - CCTV inspection

KW - Classifier validation

KW - Convolutional neural networks

KW - Image processing

KW - Sewer asset management

UR - http://www.scopus.com/inward/record.url?scp=85064923640&partnerID=8YFLogxK

U2 - 10.1016/j.autcon.2019.04.013

DO - 10.1016/j.autcon.2019.04.013

M3 - Article

AN - SCOPUS:85064923640

SN - 0926-5805

VL - 104

SP - 281

EP - 298

JO - Automation in Construction

JF - Automation in Construction

ER -
