Q2. What future works have the authors mentioned in the paper "Characterisation of voice quality of parkinson’s disease using differential phonological posterior features" ?
In future, the authors plan to obtain PD data with labeled VQ, and validate the VQ characterisation on individual patients, looking for example for regression of the perceptual scores.
Q3. What type of phonation type was used for the read-VQ database?
The read-VQ database contains 4 speakers (2 males and 2 females) who were asked to read 17 sentences in six different phonation types: modal, breathy, tense, harsh, creaky, and falsetto.
Q4. What equipment was used to record the audio of the read-VQ database?
Audio of the read-VQ database was recorded at 44.1 kHz using high quality recording equipment: a B&K 4191 free-field microphone and a B&K 7749 pre-amplifier.
Q5. What is the common clinical assessment method for disordered vocal quality?
Auditory-perceptual evaluation of disordered VQ is the most commonly used clinical assessment method, and is considered by clinicians as the “gold standard” for documenting voice impairment severity (Kreiman et al., 1993).
Q6. What is the definition of hypokinetic dysarthria?
Speech of hypokinetic dysarthria in Parkinson’s disease (PD) is characterised by hypokinesia (rigid, less motion describing decreased range and frequency of movement) of the vocal folds and articulators.
Q7. How many frames were used for the training?
The temporal context from 7 to 11 successive frames was tested with no particular performance increase, so the temporal context of 9 frames was used for the training.
Q8. Who produced the prototype voice quality examples?
Participants were given prototype voice quality examples, produced by John Laver and John Kane, and were asked to practise producing them before coming to the recording session.
Q9. What is the biggest open-source database for PD?
To the authors’ knowledge, the used PD database is the biggest open-source database available, containing both isolated and connected speech, and was selected primarily for its size.
Q10. What is the phonological analysis and synthesis?
the platform is based on cascaded speech analysis and synthesis that works internally with the phonological speech representation.
Q11. How many sentences were chosen to obtain a wide phonetic coverage?
451 sentences were chosen to obtain a wide phonetic coverage, as it is likely that it can be very difficult for speakers to maintain a constant type of phonation over a long utterance.
Q12. What are the improvements for the monologue and reading speech tasks?
Improvements are obtained for the monologue and reading speech tasks, of 3% and 16% respectively, whereas no improvement is obtained with the pataka speech task.
Q13. How are the results of the proposed characterisation of speech quality?
The results obtained by DPP features have been validated by matching the obtained most significant, non-modal phonation types with evaluating parameters of the perceptual assessments.