Abstract
Detecting signs of disorder from listening to spoken words is a core method in psychiatry. Traditionally the interpretation of speech depends on inherently subjective processes. By contrast, digital technology can be leveraged to detect and analyze what words are spoken, timestamp when they are uttered and quantify the manner in which they are expressed. With the use of mobile communication technology, digital speech processing tools are possible to use outside of traditional laboratory settings. This thesis argues that the necessary infrastructure to move speech processing into clinical practice is currently available. To examine this claim, a mobile application for remote mental state assessments was developed that implemented speech-based neuropsychological testing in 353 participants in two countries. It was possible to collect speech data in ecologically valid settings, but future larger scale implementations must solve technical, legal and cultural challenges by interdisciplinary teamwork. The findings of spoken responses on the classic Stroop color-word test from 57 patients with substance use disorders and 86 healthy participants showed that the production of single-word speech utterances could be measured with a high level of temporal precision. The classic Stroop task response latency interference was replicated and the scope of measurements was extended with novel speech characteristics. The audio files from 59 participants naming words in a category fluency task could be analyzed for both the temporal dynamics of response-word sequences and the semantic relatedness between words. Finally, the story recall ability in 25 patients with serious mental illness and 79 healthy participants was examined, and automated measurements of their ability to retell a story was computed using both simple word-count procedures and more advanced estimates of distances in a semantic vector space. In conclusion, it is technologically feasible to develop instruments for measuring multiple aspects of how patients with psychiatric disorders speak, and traditional speech-based neuropsychological tests can be employed outside of a laboratory setting provided the digital infrastructure is able to ensure the privacy of the users.
Has part(s)
Paper I: Holmlund, T.B., Foltz, P.W., Cohen, A.S., Johansen, H.D., Sigurdsen, R., Fugelli, P. … Elvevåg, B. (2019). Moving psychological assessment out of the controlled laboratory setting: Practical challenges. Psychological Assessment, 31(3), 292-303. Available in the file “thesis_entire.pdf”. Also available at https://doi.org/10.1037/pas0000647. Accepted manuscript available in Munin at https://hdl.handle.net/10037/16823.
Paper II: Holmlund, T.B., Cheng, J., Foltz, P.W., Cohen, A.S., Bernstein, J., Rosenfeld, E. … Elvevåg, B. Using automated speech processing for repeated measurements of attentional bias and control. (Manuscript). Available in the file “thesis.pdf”.
Paper III: Holmlund, T.B., Cheng, J., Foltz, P.W., Cohen, A.S. & Elvevåg, B. (2019). Updating verbal fluency analysis for the 21st century: Applications for psychiatry. Psychiatry Research, 273, 767-769. Also available in Munin at https://hdl.handle.net/10037/17113.
Paper IV: Holmlund, T.B., Chandler, C., Foltz, P.W., Cohen, A.S., Cheng, J., Bernstein, J. … Elvevåg, B. Applying speech technologies to assess verbal memory in patients with serious mental illness. (Manuscript). Available in the file “thesis.pdf”. Final version published in npj Digital Medicine available in Munin at https://hdl.handle.net/10037/17993.