Konstantina Skolariki, Graciella Muniz Terrera, Samuel O. Danso
Alzheimer’s disease (AD) is an irreversible and progressive neurodegenerative disease as well as the most common form of dementia. It usually affects the older population, but early onset AD is still possible(Ritchie et al., 2015). Recent studies propose that AD is a middle-life disease (Ritchie et al.,2015). Regardless the onset of the disease,it is important to note that it takes years for the symptoms to manifest. Specifically, it is believed that AD begins 20 years before the onset of symptoms. AD broadly includes three stages: preclinical AD, mild cognitive impairment (MCI) and dementia (Grassi et al., 2019). Researchers find it challenging to classify the MCI stage. This is partly because although MCI patients appear to have neurological deficits, their symptoms are not advanced enough to meet the AD criteria.MCI is also known as the stage between normal cognitive ageing and dementia and is often thought of as the prodromal stage of AD (Grassi et al., 2019). MCI patients can either remain stable at this stage of the disease or convert to AD. Approximatively 20-40% of MCI patients convert to AD (MCI converters-MCIc; Grassi et al., 2019). Like any other disease, early diagnosis is important.Therefore, identifying subtle brain changes that occur during the MCI-to-AD conversion as early as possible could be the key to the development of more effective treatment plans.
The majority of current approaches aim to help patients manage behavioral symptoms and impede others such as memory loss and cognitive decline. Because of the complexity of the disease, one specific drug or treatment intervention appears unlikely to successfully treat the disease. Predicting the exact point when patients convert from the prodromal stage of the disease (MCI) to AD would be extremely beneficial in identifying novel mechanisms of disease prevention.
In the era of big data, analysis of large volumes of data requires progressive approaches. Advances in neuroimaging techniques such as magnetic resonance imaging (MRI) and positron emission tomography (PET) scans enable scientists to search for AD patterns in the entire brain.Because of AD complexity, image analysis in combination with pre-existingin vivobiomarkers (amyloid-β (Aβ), tau, etc.) is a more reliable diagnostic tool (Westman et al., 2011). Machine learning methodologies are often utilized for the examination of said high dimensional data (Cuingnet et al., 2011). Models based on machine learning provide a promising opportunity for developing tools that can detect disease progression. A variety of methodologies have been suggested for patient classification(AD and/or MCI; Cuingnet et al., 2011). The majority of methods work by reducing the dimensionality of the feature space. This approach relies on the extraction of different features types, clustering and/or selection methods (Cuingnet et al., 2011).
Machine learning techniques have been used for classification of MCI patients who convert to AD (MCIc) and MCI patients who remain stable at this stage (MCInc;Additional Table 1).Our previous work was set to create predictive models in order to identify any emerging patterns of conversion from MCI to AD (Skolariki et al., 2020). For the establishment of such models, we relied on machine learning for analysis of multivariate data. The supervised machine learning algorithms that were utilized for classification tasks include: i) support vector machine (SVM) for which a linear C-SVM algorithm was applied, ii) decision trees, for which a Java open source implementation of the C4.5 algorithm (the J48 algorithm) was used and iii) the Naive Bayes (NB) classifier,which even though is considered a fairly simple approach oftentimes outperforms more complex classification methods. The aforementioned filters are available in the WEKA 3.9.2 software.
Features based on cortical thickness (CTH)and hippocampal volumes (HCV) extracted from brain scans were used to train the learning algorithms. CTH was utilized as a classification feature seeing as literature shows that cortical thickness characterizes atrophy manifestation, making it a potential AD diagnostic biomarker (Thompson et al.,2001). HCV was chosen as a feature because the hippocampus is a region of the brain found to be associated with early stages of AD making it an early AD marker (Schuff et al., 2008).
All data used to develop the models was obtained from Alzheimer’s Disease Neuroimaging Initiative (ADNI) (http://www.adni-info.org). Participants consisted of AD patients, subjects in several stages of MCI,and healthy controls (55-90 years old). For this study, subjects were divided into healthy controls, MCI converters (MCIc), MCI nonconverters (MCInc) and AD.
Using the WEKA filter for random data division, a “train” file was produced that included data from the three diagnostic groups (AD, healthy control and MCIc) and a “test” file that included data from MCIc(Figure 1). Utilization of the “train” file resulted in the six predictive models for MCIc identification (Additional Table 2).The test set was used for the training of the algorithms and model development.
As a result, six models were created CTHSVM, CTH-J48, CTH-NB, HCV-SVM, HCV-J48 and HCV-NB (Additional Table 2). The best model, which is SVM trained by CTH-based features (CTH-SVM), accurately identified 99% of MCI patients that converted to AD(Additional Table 2). Additionally, the CTHbased models consistently outperformed the HCV-based models (Additional Table 2).Based on the study, evidence suggests that multivariate methods (SVM, J48 and NB) are highly promising for group differentiation(MCIvs. AD) that take into consideration the synchronized involvement of the input features.
The aforementioned predictive models could prove effective in the identification of MCIto-AD inhibitory mechanisms, leading, thus,to a prolonged MCI stage allowing patients to live an increasingly moderate life at the prodromal stage of the disease.
Our current work includes validating the previous predictive models using a larger data set to establish their accuracy and performance as precisely as possible. This research will also allow us to determine whether our predictions are more dependent on the feature, the sample size or the models themselves. Next steps involve the inclusion of additional classification features in order to create an all-inclusive predictive model.These features should contain but not be limited to: apolipoprotein-E, cerebrospinal fluid protein levels (tau, Aβ), neurofilament light chain (NFL), plasma protein levels (tau,Aβ, NFL), electroencephalograph markers and volumetric differences in mapped hippocampal regions, MRI (used to analyze certain regions of interest), structural MRI(used to classify brain regions affected by AD at a voxel scale) and PET scans (Gupta et al., 2019). These features are established AD markers. Therefore, inclusion of a wider combination of AD indicators would increase model accuracy. Future research will utilize more sophisticated machine learning approaches, such as ensemble selection that would allow for the combination of several different classifiers in order to accomplish higher decision boundaries (Aguilar et al,2013).
Figure 1|Flow chart of subject inclusion.AD: Alzheimer’s disease; CTH: cortical thickness; HC: healthy control; HCV: hippocampal volume; MCI:mild cognitive impairment; MCIc: MCI converters; MCInc: MCI non-converters. Reprinted from Skolariki et al. (2020).
Healthy control data were included in this study and the sample size for all three groups (healthy control, MCI and AD) was proportionate in order to represent the broader population. Future work should focus on exploring a similar research aim with the inclusion of additional features that would be obtained from the same subjects.The utilization of ADNI data encompasses a key restriction regarding sample subjects,seeing as it is still difficult to acquire data for all of the required features from the same participants. Therefore, the respective discriminative power of the different approaches could not be evaluated.
Based on the complexity and heterogeneity of AD, we conclude that an advanced machine learning predictive model that includes a panel of features would offer far greater insights into precise diagnostic and prognostic approaches in an unbiased manner. Thus, the overall aim of scientists is to develop therapeutic methods that incorporate all types of biological mechanisms (genetic, molecular, cellular,etc.) in order for AD to be prevented.
Konstantina Skolariki*,Graciella Muniz Terrera,Samuel O. Danso
Bioinformatics and Human Electrophysiology Laboratory, Ionian University, Corfu, Greece(Skolariki K)
Centre for Dementia Prevention, University of Edinburgh, UK (Terrera GM)
Division of Neuroimaging, Centre for Clinical Brain Sciences and Edinburgh Imaging, University of Edinburgh, Scotland, UK (Terrera GM, Danso SO)
*Correspondence to:Konstantina Skolariki, MScR,kskolariki@hotmail.com.
https://orcid.org/0000-0002-7704-5368(Konstantina Skolariki)
Date of submission:September 10, 2020
Date of decision:October 26, 2020
Date of acceptance:December 9, 2020
Date of web publication:January 25, 2021
https://doi.org/10.4103/1673-5374.306071
How to cite this article:Skolariki K, Terrera GM,Danso SO (2021) Predictive models for mild cognitive imрairment to Alzheimer’s disease conversion. Neural Regen Res 16(9):1766-1767.
Copyright license agreement:The Coрyright License Agreement has been signed by all authors before рublication.
Plagiarism check:Checked twice by iThenticate.
Peer review:Externally рeer reviewed.
Open access statement:This is an oрen access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build uрon the work non-commercially, as long as aррroрriate credit is given and the new creations are licensed under the identical terms.
Additional files:
Additional Table 1:Machine learning techniques used for MCIc and MCInc classification.
Additional Table 2:Predictions acquired using six different models.