Assistant Professor Shirley Ryan AbilityLab / Northwestern Chicago, Illinois, United States
Research Objectives: While computer vision algorithms for human pose estimation (HPE) are advancing rapidly, both in terms of accuracy and the types of understanding they enable, using these cutting edge approaches for clinical research presents many challenges including the range of algorithms, algorithmic interdependencies and data management. In order to advance the use of HPE in clinical research, we have developed an open-source pipeline based that reduces these challenges.
Design: retrospective observation study on the use of HPE pipeline on videos of people involved in rehabilitation.
Setting: videos acquired in both inpatient and outpatient contexts.
Participants: convenience sample of videos including from a clinical gait analysis laboratory and newly collected data including gait from people with a range of diagnoses
Interventions: None
Main Outcome Measures: ability to efficiently analyze large volumes of videos acquired in clinic and qualitative review of the outputs.
Results: We have used the pipeline to analyze tens of thousands of videos acquired in clinical settings. If includes wrappers for numerous specific implementations of classes of algorithms, making exploratory analysis and comparison feasible. Among the classes of algorithms are tracking algorithms, 2D keypoint detection, 3D lifting layers, and body model parameter estimation. The movement of specific subjects of interest is isolated using tracking algorithms, which perform well even with other visible people. However, it occasionally swaps identities between patients and therapists and detection failures are more common for people using assistive devices. Identifying 2D keypoints is fairly reliable, but struggles with some prosthetic limbs. Lifting methods perform well when lifting these keypoints into 3D joint locations. Approaches that estimate the parameters of a body mesh are promising, but often brittle and produce implausible estimates. A custom neural network allows accurately detecting temporal gait parameters of rehabilitation participants.
Conclusions: The performance of HPE is advancing rapidly, but more tools will be developed to widely disseminate this to the clinical and research communities. Additionally, these algorithms need validation on clinical populations, who can be different than the population the algorithms are trained upon.
Author(s) Disclosures: None
Learning Objectives:
Enumerate common classes of human pose estimation algorithms
Describe how these algorithms interact and depend on each other
Be able to describe situations where different algorithms fail to generalize to rehabilitation populations