PlumX Metrics
Embed PlumX Metrics

Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISSN: 1990-9772, Page: 1405-1409
2024
  • 1
    Citations
  • 0
    Usage
  • 20
    Captures
  • 0
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

Conference Paper Description

This work is concerned with devising a robust Parkinson's (PD) disease detector from speech in real-world operating conditions using (i) foundational models, and (ii) speech enhancement (SE) methods. To this end, we first fine-tune several foundational-based models on the standard PC-GITA (s-PC-GITA) clean data. Our results demonstrate superior performance to previously proposed models. Second, we assess the generalization capability of the PD models on the extended PC-GITA (e-PC-GITA) recordings, collected in real-world operative conditions, and observe a severe drop in performance moving from ideal to real-world conditions. Third, we align training and testing conditions applaying off-the-shelf SE techniques on e-PC-GITA, and a significant boost in performance is observed only for the foundational-based models. Finally, combining the two best foundational-based models trained on s-PC-GITA, namely WavLM Base and Hubert Base, yielded top performance on the enhanced e-PC-GITA.

Bibliographic Details

Moreno La Quatra; Maria Francesca Turco; Torbjørn Svendsen; Giampiero Salvi; Sabato Marco Siniscalchi; Juan Rafael Orozco-Arroyave

International Speech Communication Association

Arts and Humanities; Computer Science; Mathematics

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know