Multimodal Deep Learning for the Diagnosis and Assessment of Alzheimer's Disease
Study Details
Study Description
Brief Summary
Alzheimer's disease (AD) is the most common dementia and has been one of the most expensive diseases with the highest lethality. With the rapid increase of the aging population, more and more burdens will be posed on society and economics. The manifestations of AD are the progressive loss of memory, language and visuospatial function, executive and daily living abilities, and so forth. The Pathophysiological changes of AD occur 10-20 years before the clinical symptoms, while there is still a lack of effective strategy for early diagnosis. Mild cognitive impairment (MCI) is considered to be a transitional state between healthy aging and the clinical diagnosis of dementia and has received increasing attention as a separate diagnostic entity.
To make the diagnosis, doctors ought to compressively consider the multimodal medical information including clinical symptoms, neuroimages, neuropsychological tests, laboratory examinations, etc. Multimodal deep learning has risen to this challenge, which could integrate the various modalities of biological information and capture the relationships among them contributing to higher accuracy and efficiency. It has been widely applied in imaging, tumor pathology, genomics, etc. Recently, the studies on AD based on deep learning still mainly focused on multimodal neuroimaging, while multimodal medical information requires comprehensive integration and intellectual analysis. Moreover, studies reveal that some imperceptible symptoms in MCI and the early stage of AD may also play an effective role in diagnosis and assessment, such as gait disorder, facial expression identification dysfunction, and speech and language impairment. However, doctors could hardly detect the slight and complex changes, which could rely on the full mining of the video and audio information by multimodal deep learning.
In conclusion, we aim to explore the features of gait disorder, facial expression identification dysfunction, and speech and language impairment in MCI and AD, and analyze their diagnostic efficiency. We would identify the different degrees of dependency on multimodal medical information in diagnosis and finally build an optimal multimodal diagnostic method utilizing the most convenient and economical information. Besides, based on follow-up observations on the changes in multimodal medical information with the progress of AD and MCI, we expect to establish an effective and convenient diagnostic strategy.
Condition or Disease | Intervention/Treatment | Phase |
---|---|---|
|
Detailed Description
Our objective is to make the early diagnosis and assessment of AD and MCI based on multimodal deep learning. Initially, Gait disorder, facial expression identification dysfunction, and speech and language impairment are of great significance in the occurrence and development of AD and MCI. However, due to the high complexity and stealthiness of these clinical symptoms, no uniform conclusions have been made. Hence, we attempt to apply machine learning methods to recognize the video and audio information. In this way, we will explore the changing characteristics of gait, expression, and language in AD and MCI, and analyze their diagnostic effectiveness as diagnostic markers, finally providing new ideas and experimental data for the diagnosis of AD and MCI. Secondly, multimodal medical information needs to be integrated and comprehensively analyzed. We aim to propose an optimal diagnostic strategy referring to the different degrees of dependency on multimodal medical information in diagnosis. Moreover, observing the changes in multimodal medical information with the progress of AD and MCI, we expect to build a predicting model of AD diagnosis and prognosis.
The methods are as follows:
-
Collecting multimodal medical information A variety of multimodal medical information would be carefully collected including the baseline demographic data, chief complaint and medical history, peripheral organ function assessment, laboratory examination, imaging examination, neuroelectrophysiological examination, neurocognitive and psychological examination, information on gait, expression, and language, and biological samples, etc.
-
Revealing the changes of gait, expression, and language in patients with AD and MCI, and verifying their diagnostic efficacy.
For multimodal medical information on gait, OpenPose model was used to extract human key points and construct a human skeleton structure diagram. Based on graph neural networks and convolutional neural networks, instantaneous action analysis of single-frame images is carried out. And then utilizing the Transformer model, gait sequence analysis is carried out by integrating multi-frame video.For multimodal medical information on facial expression, the Dlib algorithm will be used to extract facial key points, combined with facial expression images, and the spatiotemporal Transformer model will be used for facial expression analysis. For multimodal medical information on language, ASRT model will be used for speech recognition and text content extraction. Simultaneously, the frequency domain Fourier transform and wavelet transform will be applied to extract frequency domain information and analyze the speech features by integrating language content, voice intonation, speech speed, and other information. Based on the attention model, the gait, expression, and language analysis results of AD and MCI will be compared with those of the control group to reveal the features of AD and MCI and provide evidence for disease diagnosis.
-
Analyzing the different degrees of dependency on multimodal information in the diagnosis of AD and MCI diseases, and establishing an optimal diagnosis strategy In the supervised learning process, the attention mechanism-based method will be used to analyze the influence of multimodal information on the final results. At the same time, based on the knowledge map, the patient's blood biochemical indicators, genomic information and other fields of knowledge would be added to the model. Based on Bayesian probability inference and causal inference theory, the causal programming method will be used to model the causal analysis of information and diagnosis results of different modes. Based on AutoML method, multimodal information will be combined and optimized, and a reliable optimal diagnosis strategy will be established according to experimental results.
-
Exploring the changes of multimodal medical information with the progression of the disease, and build a predicting model for early diagnosis and disease progression of AD.
Viewing multimodal medical information as the control condition, the Transformer model will be used to model time sequence information, and the conditional diffusion model will be used to generate patients' MRI image changes and other disease progression-related information, providing the basis for disease progression prediction. Based on the large multimodal model technology, the output of the model will be interfered with and adjusted referring to the judgment and description of professional doctors, to generate the prediction in line with the judgment of professional doctors, and finally construct the interpretable early diagnosis and disease progression prediction model.
Study Design
Arms and Interventions
Arm | Intervention/Treatment |
---|---|
Alzheimer's disease the diagnosis of AD is according to the recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for AD. |
Diagnostic Test: gait video; speech video; facial expression video;
The videos of participants' gait, facial expression, and speech will be recorded and analyzed further. Other routine diagnostic tests will also be performed such as imaging of MRI, cognitive scales, etc.
Other Names:
|
Mild cognitive impairment the diagnosis of MCI refers to the criteria defined by Peterson in 2004. |
Diagnostic Test: gait video; speech video; facial expression video;
The videos of participants' gait, facial expression, and speech will be recorded and analyzed further. Other routine diagnostic tests will also be performed such as imaging of MRI, cognitive scales, etc.
Other Names:
|
Control participants who are age-matched with AD and MCI participants, without cognitive impairment. |
Diagnostic Test: gait video; speech video; facial expression video;
The videos of participants' gait, facial expression, and speech will be recorded and analyzed further. Other routine diagnostic tests will also be performed such as imaging of MRI, cognitive scales, etc.
Other Names:
|
Outcome Measures
Primary Outcome Measures
- The diagnostic efficiency of multimodal deep learning diagnostic strategy [The outcome will be measured and analyzed once all the baseline multimodal medical information has been collected.]
The diagnostic efficiency will be measured by the area under curve(AUC)of receiver operating characteristic(ROC)curve.
Secondary Outcome Measures
- The prognostic efficiency of multimodal deep learning prognostic strategy [The outcome will be measured and analyzed once all two-year follow-up multimodal medical information has been collected.]
The prognostic efficiency will be measured by the area under curve(AUC)of receiver operating characteristic(ROC)curve.
Eligibility Criteria
Criteria
Inclusion Criteria:
-
. Participants' age is between 50 and 85 years old, male or female;
-
. Participants graduated from primary school or above, with normal hearing, vision, and pronunciation, using Chinese as their mother tongue and Mandarin as their daily language;
-
. The diagnosis of AD and MCI participants conform to the corresponding diagnostic criteria mentioned above;
-
. The scores of MMSE are between 10 and 28, and the scores of CDR are no more than 2.
-
. Patients or family members agree to sign informed consent.
Exclusion Criteria:
-
. Participants suffer from neurological disorders that could cause dysfunction of the brain, such as depression, tumors, Parkinson's disease, metabolic encephalopathy, encephalitis, multiple sclerosis, epilepsy, brain trauma, normal cranial pressure hydrocephalus, and so forth;
-
. Participants suffer from systematic diseases that could cause cognitive impairment, such as liver insufficiency, renal insufficiency, thyroid dysfunction, severe anemia, folic acid or vitamin B12 deficiency, syphilis, HIV infection, alcohol and drug abuse, and so forth;
-
. Participants suffer from diseases that are unable to cooperate with the examinations;
-
. Participants cannot take magnetic resonance imaging;
-
. Participants suffer from mental and neurodevelopmental retardation;
-
. Participants refuse to sign informed consent.
Contacts and Locations
Locations
No locations specified.Sponsors and Collaborators
- First Hospital of China Medical University
Investigators
- Study Chair: Huayan Liu, the first affiliated hospital of China medical university, neurology department
Study Documents (Full-Text)
None provided.More Information
Publications
- 2023 Alzheimer's disease facts and figures. Alzheimers Dement. 2023 Apr;19(4):1598-1695. doi: 10.1002/alz.13016. Epub 2023 Mar 14.
- Adolphs R, Tranel D. Impaired judgments of sadness but not happiness following bilateral amygdala damage. J Cogn Neurosci. 2004 Apr;16(3):453-62. doi: 10.1162/089892904322926782.
- Ahmed S, Haigh AM, de Jager CA, Garrard P. Connected speech as a marker of disease progression in autopsy-proven Alzheimer's disease. Brain. 2013 Dec;136(Pt 12):3727-37. doi: 10.1093/brain/awt269. Epub 2013 Oct 18.
- Beauchet O, Launay CP, Annweiler C, Allali G. Hippocampal volume, early cognitive decline and gait variability: which association? Exp Gerontol. 2015 Jan;61:98-104. doi: 10.1016/j.exger.2014.11.002. Epub 2014 Nov 6.
- Ceresetti R, Rouch I, Laurent B, Getenet JC, Pommier M, de Chalvron S, Chainay H, Borg C. Processing of Facial Expressions of Emotions and Pain in Alzheimer's Disease. J Alzheimers Dis. 2022;89(1):389-398. doi: 10.3233/JAD-220236.
- Chen X, Wang X, Zhang K, Fung KM, Thai TC, Moore K, Mannel RS, Liu H, Zheng B, Qiu Y. Recent advances and clinical applications of deep learning in medical image analysis. Med Image Anal. 2022 Jul;79:102444. doi: 10.1016/j.media.2022.102444. Epub 2022 Apr 4.
- Eyigoz E, Mathur S, Santamaria M, Cecchi G, Naylor M. Linguistic markers predict onset of Alzheimer's disease. EClinicalMedicine. 2020 Oct 22;28:100583. doi: 10.1016/j.eclinm.2020.100583. eCollection 2020 Nov.
- Green S, Reivonen S, Rutter LM, Nouzova E, Duncan N, Clarke C, MacLullich AMJ, Tieges Z. Investigating speech and language impairments in delirium: A preliminary case-control study. PLoS One. 2018 Nov 26;13(11):e0207527. doi: 10.1371/journal.pone.0207527. eCollection 2018.
- Hu X, Fernie AR, Yan J. Deep learning in regulatory genomics: from identification to design. Curr Opin Biotechnol. 2023 Feb;79:102887. doi: 10.1016/j.copbio.2022.102887. Epub 2023 Jan 12.
- Jia J, Wei C, Chen S, Li F, Tang Y, Qin W, Zhao L, Jin H, Xu H, Wang F, Zhou A, Zuo X, Wu L, Han Y, Han Y, Huang L, Wang Q, Li D, Chu C, Shi L, Gong M, Du Y, Zhang J, Zhang J, Zhou C, Lv J, Lv Y, Xie H, Ji Y, Li F, Yu E, Luo B, Wang Y, Yang S, Qu Q, Guo Q, Liang F, Zhang J, Tan L, Shen L, Zhang K, Zhang J, Peng D, Tang M, Lv P, Fang B, Chu L, Jia L, Gauthier S. The cost of Alzheimer's disease in China and re-estimation of costs worldwide. Alzheimers Dement. 2018 Apr;14(4):483-491. doi: 10.1016/j.jalz.2017.12.006. Epub 2018 Feb 9.
- Jiang Y, Yang M, Wang S, Li X, Sun Y. Emerging role of deep learning-based artificial intelligence in tumor pathology. Cancer Commun (Lond). 2020 Apr;40(4):154-166. doi: 10.1002/cac2.12012. Epub 2020 Apr 11.
- Klein-Koerkamp Y, Beaudoin M, Baciu M, Hot P. Emotional decoding abilities in Alzheimer's disease: a meta-analysis. J Alzheimers Dis. 2012;32(1):109-25. doi: 10.3233/JAD-2012-120553.
- Knopman DS, Amieva H, Petersen RC, Chetelat G, Holtzman DM, Hyman BT, Nixon RA, Jones DT. Alzheimer disease. Nat Rev Dis Primers. 2021 May 13;7(1):33. doi: 10.1038/s41572-021-00269-y.
- Pakhomov SV, Hemmy LS. A computational linguistic measure of clustering behavior on semantic verbal fluency task predicts risk of future dementia in the nun study. Cortex. 2014 Jun;55:97-106. doi: 10.1016/j.cortex.2013.05.009. Epub 2013 Jun 14.
- Rosen HJ, Wilson MR, Schauer GF, Allison S, Gorno-Tempini ML, Pace-Savitsky C, Kramer JH, Levenson RW, Weiner M, Miller BL. Neuroanatomical correlates of impaired recognition of emotion in dementia. Neuropsychologia. 2006;44(3):365-73. doi: 10.1016/j.neuropsychologia.2005.06.012. Epub 2005 Sep 9.
- Scheltens P, De Strooper B, Kivipelto M, Holstege H, Chetelat G, Teunissen CE, Cummings J, van der Flier WM. Alzheimer's disease. Lancet. 2021 Apr 24;397(10284):1577-1590. doi: 10.1016/S0140-6736(20)32205-4. Epub 2021 Mar 2.
- Wilson J, Allcock L, Mc Ardle R, Taylor JP, Rochester L. The neural correlates of discrete gait characteristics in ageing: A structured review. Neurosci Biobehav Rev. 2019 May;100:344-369. doi: 10.1016/j.neubiorev.2018.12.017. Epub 2018 Dec 13.
- Wilson SM, Eriksson DK, Schneck SM, Lucanie JM. A quick aphasia battery for efficient, reliable, and multidimensional assessment of language function. PLoS One. 2018 Feb 9;13(2):e0192773. doi: 10.1371/journal.pone.0192773. eCollection 2018. Erratum In: PLoS One. 2018 Jun 15;13(6):e0199469.
- Yamaguchi T, Maki Y, Yamaguchi H. Yamaguchi Facial Expression-Making Task in Alzheimer's Disease: A Novel and Enjoyable Make-a-Face Game. Dement Geriatr Cogn Dis Extra. 2012 Jan;2(1):248-57. doi: 10.1159/000339425. Epub 2012 Jun 20.
- LHuayan