Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning

Helga Haberfehlner; Shankara S. van de Ven; Sven A. van der Burg; Florian Huber; Sonja Georgievska; Ignazio Aleo; Jaap Harlaar; Laura A. Bonouvrié; Marjolein M. van der Krogt; Annemieke I. Buizer

doi:10.3389/frobt.2023.1108114

Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning

Helga Haberfehlner^*, Shankara S. van de Ven, Sven A. van der Burg, Florian Huber, Sonja Georgievska, Ignazio Aleo, Jaap Harlaar, Laura A. Bonouvrié, Marjolein M. van der Krogt, Annemieke I. Buizer

^*Corresponding author for this work

Biomechatronics & Human-Machine Control

Research output: Contribution to journal › Article › Scientific › peer-review

39 Downloads (Pure)

Abstract

Introduction: Video-based clinical rating plays an important role in assessing dystonia and monitoring the effect of treatment in dyskinetic cerebral palsy (CP). However, evaluation by clinicians is time-consuming, and the quality of rating is dependent on experience. The aim of the current study is to provide a proof-of-concept for a machine learning approach to automatically assess scoring of dystonia using 2D stick figures extracted from videos. Model performance was compared to human performance. Methods: A total of 187 video sequences of 34 individuals with dyskinetic CP (8–23 years, all non-ambulatory) were filmed at rest during lying and supported sitting. Videos were scored by three raters according to the Dyskinesia Impairment Scale (DIS) for arm and leg dystonia (normalized scores ranging from 0–1). Coordinates in pixels of the left and right wrist, elbow, shoulder, hip, knee and ankle were extracted using DeepLabCut, an open source toolbox that builds on a pose estimation algorithm. Within a subset, tracking accuracy was assessed for a pretrained human model and for models trained with an increasing number of manually labeled frames. The mean absolute error (MAE) between DeepLabCut’s prediction of the position of body points and manual labels was calculated. Subsequently, movement and position features were calculated from extracted body point coordinates. These features were fed into a Random Forest Regressor to train a model to predict the clinical scores. The model performance trained with data from one rater evaluated by MAEs (model-rater) was compared to inter-rater accuracy. Results: A tracking accuracy of 4.5 pixels (approximately 1.5 cm) could be achieved by adding 15–20 manually labeled frames per video. The MAEs for the trained models ranged from 0.21 ± 0.15 for arm dystonia to 0.14 ± 0.10 for leg dystonia (normalized DIS scores). The inter-rater MAEs were 0.21 ± 0.22 and 0.16 ± 0.20, respectively. Conclusion: This proof-of-concept study shows the potential of using stick figures extracted from common videos in a machine learning approach to automatically assess dystonia. Sufficient tracking accuracy can be reached by manually adding labels within 15–20 frames per video. With a relatively small data set, it is possible to train a model that can automatically assess dystonia with a performance comparable to human scoring.

Original language	English
Article number	1108114
Number of pages	9
Journal	Frontiers In Robotics and AI
Volume	10
DOIs	https://doi.org/10.3389/frobt.2023.1108114
Publication status	Published - 2023

Keywords

cerebral palsy
human pose estimation
machine learning
markerless skeleton tracking
motion capture
movement disorders

Access to Document

10.3389/frobt.2023.1108114

frobt-10-1108114Final published version, 1.22 MBLicence: CC BY

Cite this

Haberfehlner, H., van de Ven, S. S., van der Burg, S. A., Huber, F., Georgievska, S., Aleo, I., Harlaar, J., Bonouvrié, L. A., van der Krogt, M. M., & Buizer, A. I. (2023). Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning. Frontiers In Robotics and AI, 10, Article 1108114. https://doi.org/10.3389/frobt.2023.1108114

@article{e493116563d441379b381d19339b3de2,

title = "Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning",

abstract = "Introduction: Video-based clinical rating plays an important role in assessing dystonia and monitoring the effect of treatment in dyskinetic cerebral palsy (CP). However, evaluation by clinicians is time-consuming, and the quality of rating is dependent on experience. The aim of the current study is to provide a proof-of-concept for a machine learning approach to automatically assess scoring of dystonia using 2D stick figures extracted from videos. Model performance was compared to human performance. Methods: A total of 187 video sequences of 34 individuals with dyskinetic CP (8–23 years, all non-ambulatory) were filmed at rest during lying and supported sitting. Videos were scored by three raters according to the Dyskinesia Impairment Scale (DIS) for arm and leg dystonia (normalized scores ranging from 0–1). Coordinates in pixels of the left and right wrist, elbow, shoulder, hip, knee and ankle were extracted using DeepLabCut, an open source toolbox that builds on a pose estimation algorithm. Within a subset, tracking accuracy was assessed for a pretrained human model and for models trained with an increasing number of manually labeled frames. The mean absolute error (MAE) between DeepLabCut{\textquoteright}s prediction of the position of body points and manual labels was calculated. Subsequently, movement and position features were calculated from extracted body point coordinates. These features were fed into a Random Forest Regressor to train a model to predict the clinical scores. The model performance trained with data from one rater evaluated by MAEs (model-rater) was compared to inter-rater accuracy. Results: A tracking accuracy of 4.5 pixels (approximately 1.5 cm) could be achieved by adding 15–20 manually labeled frames per video. The MAEs for the trained models ranged from 0.21 ± 0.15 for arm dystonia to 0.14 ± 0.10 for leg dystonia (normalized DIS scores). The inter-rater MAEs were 0.21 ± 0.22 and 0.16 ± 0.20, respectively. Conclusion: This proof-of-concept study shows the potential of using stick figures extracted from common videos in a machine learning approach to automatically assess dystonia. Sufficient tracking accuracy can be reached by manually adding labels within 15–20 frames per video. With a relatively small data set, it is possible to train a model that can automatically assess dystonia with a performance comparable to human scoring.",

keywords = "cerebral palsy, human pose estimation, machine learning, markerless skeleton tracking, motion capture, movement disorders",

author = "Helga Haberfehlner and {van de Ven}, {Shankara S.} and {van der Burg}, {Sven A.} and Florian Huber and Sonja Georgievska and Ignazio Aleo and Jaap Harlaar and Bonouvri{\'e}, {Laura A.} and {van der Krogt}, {Marjolein M.} and Buizer, {Annemieke I.}",

year = "2023",

doi = "10.3389/frobt.2023.1108114",

language = "English",

volume = "10",

journal = "Frontiers In Robotics and AI",

issn = "2296-9144",

publisher = "Frontiers Media",

}

Haberfehlner, H, van de Ven, SS, van der Burg, SA, Huber, F, Georgievska, S, Aleo, I, Harlaar, J, Bonouvrié, LA, van der Krogt, MM & Buizer, AI 2023, 'Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning', Frontiers In Robotics and AI, vol. 10, 1108114. https://doi.org/10.3389/frobt.2023.1108114

Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning. / Haberfehlner, Helga; van de Ven, Shankara S.; van der Burg, Sven A. et al.
In: Frontiers In Robotics and AI, Vol. 10, 1108114, 2023.

Research output: Contribution to journal › Article › Scientific › peer-review

TY - JOUR

T1 - Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy

T2 - A novel approach using markerless motion tracking and machine learning

AU - Haberfehlner, Helga

AU - van de Ven, Shankara S.

AU - van der Burg, Sven A.

AU - Huber, Florian

AU - Georgievska, Sonja

AU - Aleo, Ignazio

AU - Harlaar, Jaap

AU - Bonouvrié, Laura A.

AU - van der Krogt, Marjolein M.

AU - Buizer, Annemieke I.

PY - 2023

Y1 - 2023

N2 - Introduction: Video-based clinical rating plays an important role in assessing dystonia and monitoring the effect of treatment in dyskinetic cerebral palsy (CP). However, evaluation by clinicians is time-consuming, and the quality of rating is dependent on experience. The aim of the current study is to provide a proof-of-concept for a machine learning approach to automatically assess scoring of dystonia using 2D stick figures extracted from videos. Model performance was compared to human performance. Methods: A total of 187 video sequences of 34 individuals with dyskinetic CP (8–23 years, all non-ambulatory) were filmed at rest during lying and supported sitting. Videos were scored by three raters according to the Dyskinesia Impairment Scale (DIS) for arm and leg dystonia (normalized scores ranging from 0–1). Coordinates in pixels of the left and right wrist, elbow, shoulder, hip, knee and ankle were extracted using DeepLabCut, an open source toolbox that builds on a pose estimation algorithm. Within a subset, tracking accuracy was assessed for a pretrained human model and for models trained with an increasing number of manually labeled frames. The mean absolute error (MAE) between DeepLabCut’s prediction of the position of body points and manual labels was calculated. Subsequently, movement and position features were calculated from extracted body point coordinates. These features were fed into a Random Forest Regressor to train a model to predict the clinical scores. The model performance trained with data from one rater evaluated by MAEs (model-rater) was compared to inter-rater accuracy. Results: A tracking accuracy of 4.5 pixels (approximately 1.5 cm) could be achieved by adding 15–20 manually labeled frames per video. The MAEs for the trained models ranged from 0.21 ± 0.15 for arm dystonia to 0.14 ± 0.10 for leg dystonia (normalized DIS scores). The inter-rater MAEs were 0.21 ± 0.22 and 0.16 ± 0.20, respectively. Conclusion: This proof-of-concept study shows the potential of using stick figures extracted from common videos in a machine learning approach to automatically assess dystonia. Sufficient tracking accuracy can be reached by manually adding labels within 15–20 frames per video. With a relatively small data set, it is possible to train a model that can automatically assess dystonia with a performance comparable to human scoring.

AB - Introduction: Video-based clinical rating plays an important role in assessing dystonia and monitoring the effect of treatment in dyskinetic cerebral palsy (CP). However, evaluation by clinicians is time-consuming, and the quality of rating is dependent on experience. The aim of the current study is to provide a proof-of-concept for a machine learning approach to automatically assess scoring of dystonia using 2D stick figures extracted from videos. Model performance was compared to human performance. Methods: A total of 187 video sequences of 34 individuals with dyskinetic CP (8–23 years, all non-ambulatory) were filmed at rest during lying and supported sitting. Videos were scored by three raters according to the Dyskinesia Impairment Scale (DIS) for arm and leg dystonia (normalized scores ranging from 0–1). Coordinates in pixels of the left and right wrist, elbow, shoulder, hip, knee and ankle were extracted using DeepLabCut, an open source toolbox that builds on a pose estimation algorithm. Within a subset, tracking accuracy was assessed for a pretrained human model and for models trained with an increasing number of manually labeled frames. The mean absolute error (MAE) between DeepLabCut’s prediction of the position of body points and manual labels was calculated. Subsequently, movement and position features were calculated from extracted body point coordinates. These features were fed into a Random Forest Regressor to train a model to predict the clinical scores. The model performance trained with data from one rater evaluated by MAEs (model-rater) was compared to inter-rater accuracy. Results: A tracking accuracy of 4.5 pixels (approximately 1.5 cm) could be achieved by adding 15–20 manually labeled frames per video. The MAEs for the trained models ranged from 0.21 ± 0.15 for arm dystonia to 0.14 ± 0.10 for leg dystonia (normalized DIS scores). The inter-rater MAEs were 0.21 ± 0.22 and 0.16 ± 0.20, respectively. Conclusion: This proof-of-concept study shows the potential of using stick figures extracted from common videos in a machine learning approach to automatically assess dystonia. Sufficient tracking accuracy can be reached by manually adding labels within 15–20 frames per video. With a relatively small data set, it is possible to train a model that can automatically assess dystonia with a performance comparable to human scoring.

KW - cerebral palsy

KW - human pose estimation

KW - machine learning

KW - markerless skeleton tracking

KW - motion capture

KW - movement disorders

UR - http://www.scopus.com/inward/record.url?scp=85150366436&partnerID=8YFLogxK

U2 - 10.3389/frobt.2023.1108114

DO - 10.3389/frobt.2023.1108114

M3 - Article

AN - SCOPUS:85150366436

SN - 2296-9144

VL - 10

JO - Frontiers In Robotics and AI

JF - Frontiers In Robotics and AI

M1 - 1108114

ER -

Towards automated video-based assessment of dystonia in dyskinetic cerebral palsy: A novel approach using markerless motion tracking and machine learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this