(Open Access) Characterisation of voice quality of Parkinson’s disease using differential phonological posterior features (2017) | Milos Cernak

Q1. What are the contributions in "Characterisation of voice quality of parkinson’s disease using differential phonological posterior features" ?

A goal of this paper is to characterize a VQ spectrum – the composition of non-modal phonations – of voice in PD. The paper relates non-modal healthy phonations: breathy, creaky, tense, falsetto and harsh, with disordered phonation in PD. In addition, the proposed features were applied for prediction ∗Corresponding author Email address: milos.

Q2. What future works have the authors mentioned in the paper "Characterisation of voice quality of parkinson’s disease using differential phonological posterior features" ?

In future, the authors plan to obtain PD data with labeled VQ, and validate the VQ characterisation on individual patients, looking for example for regression of the perceptual scores.

Q3. What type of phonation type was used for the read-VQ database?

The read-VQ database contains 4 speakers (2 males and 2 females) who were asked to read 17 sentences in six different phonation types: modal, breathy, tense, harsh, creaky, and falsetto.

Q4. What equipment was used to record the audio of the read-VQ database?

Audio of the read-VQ database was recorded at 44.1 kHz using high quality recording equipment: a B&K 4191 free-field microphone and a B&K 7749 pre-amplifier.

Q5. What is the common clinical assessment method for disordered vocal quality?

Auditory-perceptual evaluation of disordered VQ is the most commonly used clinical assessment method, and is considered by clinicians as the “gold standard” for documenting voice impairment severity (Kreiman et al., 1993).

Q6. What is the definition of hypokinetic dysarthria?

Speech of hypokinetic dysarthria in Parkinson’s disease (PD) is characterised by hypokinesia (rigid, less motion describing decreased range and frequency of movement) of the vocal folds and articulators.

Q7. How many frames were used for the training?

The temporal context from 7 to 11 successive frames was tested with no particular performance increase, so the temporal context of 9 frames was used for the training.

Q8. Who produced the prototype voice quality examples?

Participants were given prototype voice quality examples, produced by John Laver and John Kane, and were asked to practise producing them before coming to the recording session.

Q9. What is the biggest open-source database for PD?

To the authors’ knowledge, the used PD database is the biggest open-source database available, containing both isolated and connected speech, and was selected primarily for its size.

Q10. What is the phonological analysis and synthesis?

the platform is based on cascaded speech analysis and synthesis that works internally with the phonological speech representation.

Q11. How many sentences were chosen to obtain a wide phonetic coverage?

451 sentences were chosen to obtain a wide phonetic coverage, as it is likely that it can be very difficult for speakers to maintain a constant type of phonation over a long utterance.

Q12. What are the improvements for the monologue and reading speech tasks?

Improvements are obtained for the monologue and reading speech tasks, of 3% and 16% respectively, whereas no improvement is obtained with the pataka speech task.

Q13. How are the results of the proposed characterisation of speech quality?

The results obtained by DPP features have been validated by matching the obtained most significant, non-modal phonation types with evaluating parameters of the perceptual assessments.

Characterisation of voice quality of Parkinson’s disease using differential phonological posterior features

Figures

Citations

Detecting Parkinson's Disease with Sustained Phonation and Speech Signals using Machine Learning Techniques

Multimodal Assessment of Parkinson's Disease: A Deep Learning Approach

Parkinson's disease: Cause factors, measurable indicators, and early diagnosis.

Use of radiomic features and support vector machine to distinguish Parkinson's disease cases from normal controls.

A novel octopus based Parkinson’s disease and gender recognition method using vowels

References

A fast learning algorithm for deep belief nets

[Parkinson's disease].

The Sound Pattern of English

The Kaldi Speech Recognition Toolkit

A course in phonetics

Related Papers (5)

Novel Speech Signal Processing Algorithms for High-Accuracy Classification of Parkinson's Disease

Movement Disorder Society-sponsored revision of the Unified Parkinson's Disease Rating Scale (MDS-UPDRS): scale presentation and clinimetric testing results.

Speech impairment in a large sample of patients with Parkinson's disease

Suitability of Dysphonia Measurements for Telemonitoring of Parkinson's Disease

Frequency and Cooccurrence of Vocal Tract Dysfunctions in the Speech of a Large Sample of Parkinson Patients

Frequently Asked Questions (13)