Reliability of P3 Event-Related Potential during Working Memory Across the Spectrum of Cognitive Aging ====================================================================================================== * Hannes Devos * Jeffrey Burns * Ke Liao * Pedram Ahmadnezhad * Jonathan D. Mahnken * William M. Brooks * Kathleen Gustafson ## Abstract Event-related potentials (ERPs) offer unparalleled temporal resolution in tracing distinct electrophysiological processes related to normal and pathological cognitive aging. The stability of ERPs in older individuals, who inherently show more intraindividual variability in cognitive functions, has not been established. In this test-retest reliability study, 39 older individuals (age 74.10 (5.4) years; 23 (59%) women; 15 non β-amyloid elevated, 16 β-amyloid elevated, 8 cognitively impaired) with scores on the Montreal Cognitive Assessment (MOCA) ranging between 3 and 30 completed a working memory (n-back) test with three levels of difficulty at baseline and two-week follow-up. Stability of the ERP was evaluated on grand averaged task effects for both visits. P3 peak amplitude and latency were measured in frontal channels. P3 peak amplitude at Fz, our main outcome variable, showed excellent reliability in 0-back (intraclass correlation coefficient (ICC), 95% confidence interval = 0.82 (0.67 – 0.90) and 1-back (ICC = 0.87 (0.76 – 0.93), however, only fair reliability in 2-back (ICC = 0.53 (0.09 – 0.75). Reliability of P3 peak latencies was substantially lower, with ICCs ranging between 0.17 for 2-back and 0.54 for 0-back. Generalized linear mixed models showed no confounding effect of age, group, or task difficulty on stability of P3 amplitude and latency of Fz. By contrast, MOCA scores tended to negatively correlate with P3 amplitude of Fz (p=0.07). We conclude that P3 peak amplitude and latency provide a stable measure of electrophysiological processes in older individuals. However, impaired cognition may affect the stability of the ERP response. Keywords * event-related potentials * electro-encephalography * reliability * working memory * older adults * mild cognitive impairment * Alzheimer disease * preclinical ## 1 Introduction The aging process is characterized by gradual decline in physical, neurobiological and cognitive functions that may impact instrumental activities of daily living (iADL) such as driving, doing household chores, managing finances, medication adherence, or grocery shopping.(1, 2) Deterioration in these iADL becomes more apparent with age-related neurodegeneration such as mild cognitive impairment (MCI) and Alzheimer’s disease (AD).(3) Executive functions in particular are paramount in carrying out numerous iADL, but are also vulnerable to the effects of normal and pathological cognitive aging.(4, 5) Working memory is one core executive function that relates to the ability to temporarily store, process, and manipulate the information necessary for higher order cognitive tasks such as decision making, learning, and reasoning.(6) Working memory stems from the interaction between attention, short-term retention and manipulation of information, carried out by the coordinated activation of many brain regions.(7) The prefrontal cortex has particularly been associated with working memory.(8) Consequently, the prefrontal cortex is highly susceptible to the effects of aging and early neurodegeneration.(9, 10) A recent meta-analysis pooling functional magnetic resonance imaging studies suggested a gradual and linear decline in prefrontal cortex engagement in older individuals.(11) Similarly, electrophysiological processes also decline with age. The P3, a positive peak that appears with a latency between 250 to 500ms in the event-related potential (ERP), has been implicated in attention and working memory processes across the lifespan.(12) A previous study showed reduced positivity in P3 central-frontal and parietal ERPs in older adults,(13) whereas others demonstrated frontal hyperactivity in P3 coupled with parietal or posterior hypoactivity.(14, 15). Despite the ambiguity in ERP findings, most studies conclude that the abnormal ERP response in older individuals reflects inefficient or compensatory use of neural resources due to frontal cortex dysfunction.(13, 14) Therefore, electrophysiological responses to working memory tasks are convenient measures to test hypotheses related to frontal cortex function, normal cognitive aging, and early neurodegeneration. The ability to distinguish natural variability and measurement error from biologically relevant cognitive changes due to aging or early neurodegeneration is valuable to provide informed decisions on diagnosis, monitoring, and treatment of cognitive impairments.(16) However, older adults show more intraindividual variability in performance measures of working memory compared to younger adults. The age-related changes in intraindividual variability of performance measures become even more apparent with increasing cognitive demand.(9) This increased intraindividual may also stem from the heterogeneity of cognitive profiles in older individuals, especially when patients with MCI and AD are included.(17) The intraindividual variability observed in performance measures is believed to be linked to frontal cortex dysfunction,(9) which may therefore also affect intraindividual variability of the ERP response in older adults.(18) To date, few studies have investigated test-retest reliability of P3 ERP in healthy older adults(19-21) The test-retest reliability of the P3 ERP in older individuals with a heterogeneous cognitive profile has yet to be established. The aims were to (1) characterize test-retest reliability of P3 amplitude and latency, particularly in the frontal-midline channels; and (2) investigate the impact of age, disease groups, cognitive status, and task difficulty P3 amplitude and latency. ## 2 Materials and Methods ### 2.1 Participants This test-retest reliability study included 39 right-handed participants recruited from the KU Alzheimer’s Disease Center between 05/03/2018 and 03/10/2020. Inclusion criteria were (1) age older than 65; (2) having undergone a previous amyloid PET scan of the brain; (3) ability to understand the instructions in English; and (4) informed consent. Exclusion criteria were: (1) currently taking steroids, benzodiazepines, or neuroleptics; (2) history of any substance abuse; and (3) history of a neurological disorder other than MCI or AD. Sixteen were cognitively normal older adults with no elevated amyloid PET scans (Aβ-), 15 were cognitive normal with elevated amyloid PET scans (Aβ+), and eight had a clinical diagnosis of mild cognitive impairment with positive amyloid PET scans. Participants completed their two-week follow-up session 16 ± 8 days after the first session. Each session lasted about 60 minutes including rest breaks. ### 2.2 Procedure #### 2.2.1 Demographic and Clinical Information Age, sex, and education were recorded. General cognitive functions were evaluated with the Montreal Cognitive Assessment (MOCA).(22) Scores on the MOCA range between 0 and 30. #### 2.2.2 N-back Test In the n-back test, participants are shown a series of letters and are instructed to press a button when the current stimulus is the same as the item presented n-positions back. The cognitive demand of the n-back task increases with each number, while the perceptual and motor demands remain constant. Whereas in the 0-back and 1-back the stimulus on screen is held in the focus of attention, the more difficult versions of the n-back tests require constant switching from the focus of attention to short- term memory.(23) Higher levels of difficulty require continuous mental effort to update information of new stimuli and maintain representations of recently presented stimuli.(24) In this study, the 0-back, 1-back, and 2-back tests were administered. Participants sat in a comfortable chair at 26 inches in front of the computer screen with the center of the screen at eye level. White letters appeared on a black screen. Prior to each test, participants were given a practice trial consisting of 7 non-targets and 3 targets. The practice trials were repeated until the participant felt comfortable with the instructions. Each test comprised 180 trials, including 60 trials that needed a response (target, 33.3%) and 120 trials for which a response was not required (nontarget, 66.7%). Each letter was presented for 500 ms on the computer screen followed by a blank interstimulus interval for 1700 ms, with a random jitter of +/-50 ms. The maximum time to accept the response was 2150 ms. The total task time was ∼7 minutes. In the 0-back test, participants were instructed to press the left mouse button as soon as the letter “X” (target) appeared on the screen while ignoring the other letters (nontarget). In the 1-back test, participants were instructed to press the button if the current letter on the screen was the same as the letter previously shown (target). In the 2-back test, participants were instructed to press the button when the current letter was the same as the one presented two places before (target). The number of hits (accuracy) and response times to the hits were the main behavioral performance outcome measures. #### 2.2.3 P3 ERP Continuous electro-encephalogram (EEG) was acquired using a Philips EGI high-density system from 256 scalp electrodes, digitized at 1,000 Hz. Data were filtered from 0.50 to 30 Hz using EGI software. Data were online referenced to Cz and offline rereferenced to the left and right mastoid. All other EEG processing was done in EEGLab(25) and in ERPLab.(26) Various artifacts unrelated to cognitive functions, including ocular and muscular movement, cardiovascular signal were identified and removed using independent component analysis (ICA). Signal from bad electrodes were interpolated using surrounding electrode data. Stimulus-locked ERPs were extracted from the n-back tests and segmented into epochs of 100 ms before to 1000ms after stimulus onset, and baseline corrected using the prestimulus interval. Scalp locations and measurement windows for the P3 component were based on their spatial extent and latency after inspection of grand average waveforms (collapsed across the two sessions). P3 peak amplitude and peak latency of the task effect were considered the main electrophysiological outcome measures. The difference waveform was calculated by subtracting the average ERP elicited from the targets from the average ERP elicited by non-targets for each participant. The P3 component time window was established between 200 ms and 400 ms for all three tests. Because of the prefrontal cortex involvement in working memory, we identified a priori the Fz channel as the main channel, but also calculated reliability of other pre- identified electrode locations, i.e., Cz, Pz, F3, and F4. Cz was interpolated using the surrounding five channels. No participants were removed from the analyses because of artifacts. However, one participant disengaged during the 2-back test and was therefore excluded from the analyses. ### 2.3 Data Analysis Descriptive analysis including mean (standard deviation) and frequency count of participants’ general, performance measures, and ERP data were performed as appropriate. Intra-class correlation coefficients (ICC) were used to calculate test-retest reliability of performance measures and P3 amplitude and latency. ICCs reflect the consistency of a measure taking into account variance related to the time of testing.(27) ICC values less than 0.40 were considered poor; values between 0.40 and 0.59 fair, values between 0.60 and 0.74 good, and values between 0.75 and 1.00 excellent.(28). Bland-Altman plots were used to visualize the measurement precision of amplitude and latency across the test moments.(29) Intersubject stability according to subject rankings was calculated using the Pearson r correlation coefficient. Generalized linear mixed models were employed to evaluate the effect of age, diagnosis (Aβ-; Aβ+; MCI/AD), MOCA scores, and task difficulty on stability of the P3 amplitude and latency. Stability of P3 amplitude (latency) was calculated as the squared difference of P3 amplitude (latency) at follow-up and baseline. The Kolmogorov–Smirnov test was employed to test the normality of our data distribution in addition to visualization of Q-Q plots. All analyses were done using SAS 9.4 software. The threshold of significance was set at p = 0.05. ## 3 Results ### 3.1 Participant Characteristics Participants (n = 39) were on average 74.05 (5.37) years old and scored 26.44 (4.76) on the MOCA scale. However, MOCA scores ranged from 3 to 30, indicating a wide range of severe to normal cognitive functions. About 60% (n = 23) were women. ### 3.2 Test-Retest Reliability of Performance Measures All ICC values of hits (accuracy) and response times of each n-back test demonstrated excellent reliability (Supplementary Table 1). ICCs of hits ranged between 0.92 (1- and 2-back) and 0.99 (0- back) and were slightly higher than the ICCs of response times, ranging between 0.76 (2-back) and 0.89 (1-back). Pearson r correlations ranged from 0.65 (0-back response time) to 0.99 (0-back hits). ### 3.3 Test-Retest Reliability of ERP Measures Grand average waveforms of the task effect from all channels at baseline and follow-up are displayed in Figure 1. The 3D scalp map is embedded in the figure to demonstrate the task effect at P3. Considerable overlap in ERP response within the P3 time window (200 – 400 ms post-stimulus) was observed at baseline and two-week follow-up. ![Figure 1.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/06/22/2020.05.27.20109157/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2020/06/22/2020.05.27.20109157/F1) Figure 1. Grand Average Event-related Potential Waveform of (a) 0-back, (b) 1-back and (c) 2-back. The ICC values of P3 peak amplitude and peak latency of the key electrode locations are displayed in Table 1. Overall, P3 amplitude showed greater reliability compared to P3 latency across channels and task difficulty levels. Also, ICCs of the 0-back and 1-back were consistently higher than those calculated for the 2-back. View this table: [Table 1.](http://medrxiv.org/content/early/2020/06/22/2020.05.27.20109157/T1) Table 1. Comparison of task effect (target – nontarget) P3 response at baseline and two-week follow-up. For the main channel location Fz, excellent reliability was found in P3 amplitude for 0-back (ICC = 0.82) and 1-back (ICC = 0.87). P3 amplitude of Fz for 2-back only showed fair reliability. Reliability scores of P3 latency at Fz were fair for 0-back (ICC = 0.54) and 1-back (ICC = 0.47), but poor for 2- back (ICC = 0.17). Figure 2 shows the Bland-Altman plots for P3 peak amplitude and peak latency at the Fz channel. All plots demonstrated equal distribution of the data around zero, indicating no bias in the results and no heteroscedasticity within the data. ![Figure 2.](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2020/06/22/2020.05.27.20109157/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2020/06/22/2020.05.27.20109157/F2) Figure 2. Bland Altman Plotsof (a) 0-back Fz Peak Amplitude (b) 1-back Fz Peak Amplitude; (c) 2- back Fz Peak Amplitude; (d) 0-back Fz Peak Latency; (e) 1-back Peak Latency; (f) 2-back Peak Latency. Finally, generalized linear mixed models were employed to evaluate the effect of age, disease diagnosis (Aβ-; Aβ+; MCI/AD), cognitive status, and task difficulty on stability of squared P3 peak amplitude and latency at the Fz channel. Age (p=0.74), disease diagnosis (p = 0.67), and task difficulty (p = 0.70) did not affect the stability of the P3 amplitude response, although individuals with lower MOCA cognitive scores tended to show more variability in P3 amplitude (p = 0.07). Age (p = 0.60), disease diagnosis (p = 0.55), MOCA (p = 0.52), or task difficulty (p = 0.95) did not affect the stability of the P3 latency response. We recalculated ICC for 0-back, 1-back, and 2-back in participants who scored 26 or higher on MOCA (n = 32) and those scoring lower than 26 (n = 7). ICC values showed more variance in 0-back and in 2-back in the group with lower MOCA scores, but ICC values were not worse across the n- back tests in this group (Supplementary Tables 2 and 3). Whereas ICCs were similar in the Aβ- and Aβ+ groups, lower ICCs were found for the MCI/AD group (Supplementary Tables 2 and 3). ## 4 Discussion This test-retest reliability study provides critical information on the stability of electrophysiological measures related to working memory in healthy older adults, older adults with increased risk of dementia, and those with MCI or AD. Our results showed that most P3 ERPs in the frontal channels provide fair to excellent reliability to measure electrophysiological processes of cognitive aging in older adults with and without cognitive impairments. Similar to previous studies, the reliability is superior in measures of amplitude compared to latency.(20, 21, 30, 31) The robustness of P3 stability is not affected by age, disease diagnosis, or task difficulty, however, there is a trend that lower MOCA scores may affect the stability of the P3 amplitude response. The body of evidence related to reliability of P3 ERPs is sparse, and typically restricted to healthy young,(30-34) middle-aged,(31) and older individuals.(19-21). Few studies have reported reliability measures in neurological conditions.(35) The reliability analyses in our study produced fair to excellent ICC values across the n-back tests. Whereas ICC values provide a single measure of the magnitude of agreement, Bland-Altman plots depict a graphical display of bias across the two test moments.(36) Visual inspection of the Bland-Altman plots showed average difference in ERP responses between first and second testing close to 0, with equal spread of data points around the average difference line. These findings suggest that 2 weeks follow-up is sufficient to wash out any potential adaptation, test, or practice effect of the n-back on ERPs in older individuals. Comparison of our results with other test-retest studies of ERPs in older adults is complicated by lack of consistency in terms of the ERP components that are investigated, the tests of working memory, the choice of channel locations, the extracted P3 metric, the P3 window measurement, and the test- retest reliability intervals.(19-21). Our research design most closely aligns with a study that compared ERPs to novel stimuli collected at baseline and 7-week follow-up in healthy older individuals.(21) Similar to our study, this study also found excellent reliability for P3 mean amplitude (ICC =0.86, 95% CI, 0.78 – 0.92), and poorer reliability for P3 mean latency (ICC = 0.56, 0.30 – 0.73). Our study demonstrated larger confidence intervals in some of the amplitude and latency measures, which might have been due to the greater cognitive heterogeneity of our sample. Another reliability study also reported considerably lower reliability in P3 amplitude (ICC = -0.02) and latency (ICC = -0.17) in seven individuals experiencing cognitive difficulties following traumatic brain injury compared to healthy peers (ICC = 0.84 for amplitude and 0.64 for latency).(35) Combined, these findings point towards a potential confounding effect of cognitive impairment on stability of ERPs in neurological conditions. No effect of age, task difficulty, or disease diagnosis was found on stability of the P3 ERP in the n- back task. Most participants in our study were cognitively normal, either without (n = 15) or with (n = 16) elevated beta amyloid. Accumulation of beta amyloid deposits in the brain is known to increase the risk of developing AD.(37) The fair to excellent reliability of P3 amplitude and latency in our group of older adults provides opportunities for studying the effect of amyloid deposits on neural transmission in preclinical AD using ERP. P3 amplitudes are smaller in AD compared to controls.(38) ERPs also show useful in predicting conversion to AD, with accuracy rates ranging between 70% and 94%.(39) Patients with AD exhibit prolonged latency in P3 ERP compared to age- matched controls(40). These prolonged latencies observed in patients with AD become particularly apparent in the cognitive domains of executive function, memory, and language.(41) The ability of P3 ERP to discriminate between mild cognitive impairment (MCI) and AD(42) opens avenues for investigation of ERP in detecting preclinical AD.(43) Although this study included participants with a wide range of cognitive impairment, most were cognitively normal. Future studies should include a larger sample of participants with MCI and AD to confirm the confounding effect of impaired cognition on the stability of the P3 response. Although the n-back is arguably the most ubiquitous working memory test used in ERP studies across the age spectrum,(23) we did not establish reliability of ERP in other cognitive domains known to deteriorate in older age, such as memory and language, and this remains an opportunity for further investigation. Our results are therefore only generalizable to the specific subdomain of working memory. Future research should also include multiple testing sessions over extended periods of time to evaluate the sensitivity of ERP to detect subtle neurobiological changes due to normal and pathological aging. ## Data Availability The data are available upon request. ## 5 Conclusion We set out to assess the test-retest reliability of ERP response in older adults with a heterogeneous cognitive profile. Consistent with other studies, P3 amplitude and latency show fair to excellent reliability across different levels of task difficulty. However, impaired cognition may potentially affect the stability of the P3 ERP response. ## Ethics Statement The study was carried out in accordance with the Declaration of Helsinki. The study was approved by the University of Kansas Medical Center Internal Review Board. Informed consent was obtained from each participant. Each participant received $100 for participating. ## Conflict of Interest The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. ## Author Contributions HD, JB, JM, WMB, and KG conceptualized the study. HD, KL, and KG worked out the EEG data processing steps. HD, PA, and KL administered the tests. HD and JM analysed the data. HD wrote the initial manuscript. JB, KL, PA, JM, WMB, and KG reviewed the manuscript and provided valuable comments. ## Funding Research reported in this publication was supported by the National Institute on Aging of the National Institutes of Health under Award Number K01 AG058785. This study was supported in part by a pilot grant of the KU Alzheimer Disease Center (P30 AG035982). The Hoglund Biomedical Imaging Center is supported in part by S10 RR29577 and generous gifts from Forrest and Sally Hoglund. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. ## Acknowledgments The authors thank the volunteers for their time and willingness to participate in this research. We are also grateful for the staff at the KU Alzheimer Disease Center. * Received May 27, 2020. * Revision received June 21, 2020. * Accepted June 22, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Carmona-Torres JM, Rodríguez-Borrego Ma, Laredo-Aguilera JA, López-Soto Pj, Santacruz-Salas E, Cobo-Cuenca AI. Disability for basic and instrumental activities of daily living in older individuals. PLOS ONE (2019) 14(7):e0220157. doi: 10.1371/journal.pone.0220157. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0220157&link_type=DOI) 2. 2.Moon S, Ranchet M, Abiodun, Tant M, David, Mukaila, et al. The Impact of Advanced Age on Driving Safety in Adults with Medical Conditions. Gerontology (2018) 64(3):291–9. doi: 10.1159/000486511. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000486511&link_type=DOI) 3. 3.Jekel K, Damian M, Wattmo C, Hausner L, Bullock R, Connelly PJ, et al. Mild cognitive impairment and deficits in instrumental activities of daily living: a systematic review. Alzheimer’s Research & Therapy (2015) 7(1). doi: 10.1186/s13195-015-0099-0. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13195-015-0099-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25815063&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 4. 4.Overdorp EJ, Kessels RPC, Claassen JA, Oosterman JM. The Combined Effect of Neuropsychological and Neuropathological Deficits on Instrumental Activities of Daily Living in Older Adults: a Systematic Review. Neuropsychology Review (2016) 26(1):92–106. doi: 10.1007/s11065-015-9312-y. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11065-015-9312-y&link_type=DOI) 5. 5.Tabira T, Hotta M, Murata M, Yoshiura K, Han G, Ishikawa T, et al. Age-Related Changes in Instrumental and Basic Activities of Daily Living Impairment in Older Adults with Very Mild Alzheimer’s Disease. Dementia and Geriatric Cognitive Disorders Extra (2020):27–37. doi: 10.1159/000506281. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1159/000506281&link_type=DOI) 6. 6.Baddeley A. Working memory. Science (1992) 255(5044):556–9. doi: 10.1126/science.1736359. [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6Mzoic2NpIjtzOjU6InJlc2lkIjtzOjEyOiIyNTUvNTA0NC81NTYiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wNi8yMi8yMDIwLjA1LjI3LjIwMTA5MTU3LmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 7. 7.Eriksson J, Edward Lansner A, Bergström F, Nyberg L. Neurocognitive Architecture of Working Memory. Neuron (2015) 88(1):33–46. doi: 10.1016/j.neuron.2015.09.020. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuron.2015.09.020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26447571&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 8. 8.Bahmani Z, Clark K, Merrikhi Y, Mueller A, Pettine W, Isabel Vanegas M, et al. Prefrontal Contributions to Attention and Working Memory. Curr Top Behav Neurosci (2019) 41:129–53. Epub 2019/02/11. doi: 10.1007/7854\_2018\_74. PubMed PMID: 30739308; PubMed Central PMCID: PMCPMC6689265. [CrossRef](http://medrxiv.org/lookup/external-ref?access\_num=10.1007/7854_2018_74&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30739308&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 9. 9.West R, Murphy KJ, Armilio ML, Craik FIM, Stuss DT. Lapses of Intention and Performance Variability Reveal Age-Related Increases in Fluctuations of Executive Control. (2002) 49(3):402–19. doi: 10.1006/brcg.2001.1507. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/brcg.2001.1507&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12139961&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000177165500009&link_type=ISI) 10. 10.Ranchet M, Morgan JC, Akinwuntan AE, Devos H. Cognitive workload across the spectrum of cognitive impairments: A systematic review of physiological measures. Neuroscience & Biobehavioral Reviews (2017) 80:516–37. doi: 10.1016/j.neubiorev.2017.07.001. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neubiorev.2017.07.001&link_type=DOI) 11. 11.Yaple ZA, Stevens WD, Arsalidou M. Meta-analyses of the n-back working memory task: fMRI evidence of age-related changes in prefrontal cortex involvement across the adult lifespan. NeuroImage (2019) 196:16–31. doi: 10.1016/j.neuroimage.2019.03.074. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuroimage.2019.03.074&link_type=DOI) 12. 12.Van Dinteren R, Arns M, Jongsma MLA, Kessels RPC. P300 Development across the Lifespan: A Systematic Review and Meta-Analysis. PLoS ONE (2014) 9(2):e87347. doi: 10.1371/journal.pone.0087347. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0087347&link_type=DOI) 13. 13.Lubitz AF, Niedeggen M, Feser M. Aging and working memory performance: Electrophysiological correlates of high and low performing elderly. Neuropsychologia (2017) 106:42–51. Epub 2017/09/12. doi: 10.1016/j.neuropsychologia.2017.09.002. PubMed PMID: 28889995. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuropsychologia.2017.09.002&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28889995&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 14. 14.Saliasi E, Geerligs L, Lorist MM, Maurits NM. The Relationship between P3 Amplitude and Working Memory Performance Differs in Young and Older Adults. (2013) 8(5):e63701. doi: 10.1371/journal.pone.0063701. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0063701&link_type=DOI) 15. 15.Fjell AM, Walhovd KB. P300 and neuropsychological tests as measures of aging: Scalp topography and cognitive changes. Brain Topography (2001) 14(1):25–40. doi: 10.1023/a:1012563605837. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/A:1012563605837&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11599530&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000171705600003&link_type=ISI) 16. 16.Feinkohl I, Borchers F, Burkhardt S, Krampe H, Kraft A, Speidel S, et al. Stability of neuropsychological test performance in older adults serving as normative controls for a study on postoperative cognitive dysfunction. BMC Research Notes (2020) 13(1). doi: 10.1186/s13104-020-4919-3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s13104-020-4919-3&link_type=DOI) 17. 17.Troyer AK, Vandermorris S, Murphy KJ. Intraindividual variability in performance on associative memory tasks is elevated in amnestic mild cognitive impairment. Neuropsychologia (2016) 90:110–6. Epub 2016/06/15. doi: 10.1016/j.neuropsychologia.2016.06.011. PubMed PMID: 27297728. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuropsychologia.2016.06.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27297728&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 18. 18.Robertson S, Myerson J, Hale S. Are There Age Differences in Intraindividual Variability in Working Memory Performance? The Journals of Gerontology Series B: Psychological Sciences and Social Sciences (2006) 61(1):P18–P24. doi: 10.1093/geronb/61.1.p18. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/geronb/61.1.p18&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16399937&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 19. 19.Sandman CA, Patterson JV. The auditory event-related potential is a stable and reliable measure in elderly subjects over a 3 year period. Clinical Neurophysiology (2000) 111(8):1427–37. doi: 10.1016/s1388-2457(00)00320-5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/s1388-2457(00)00320-5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10904225&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 20. 20.Walhovd KB, Fjell AM. One-year test-retest reliability of auditory ERPs in young and old adults. Int J Psychophysiol (2002) 46(1):29–40. Epub 2002/10/11. doi: 10.1016/s0167-8760(02)00039-9. PubMed PMID: 12374644. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0167-8760(02)00039-9&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=12374644&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000178472000003&link_type=ISI) 21. 21.Behforuzi H, Feng NC, Billig AR, Ryan E, Tusch ES, Holcomb PJ, et al. Markers of Novelty Processing in Older Adults Are Stable and Reliable. Frontiers in Aging Neuroscience (2019) 11. doi: 10.3389/fnagi.2019.00165. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fnagi.2019.00165&link_type=DOI) 22. 22.Nasreddine ZS, Phillips NA, Bedirian V, Charbonneau S, Whitehead V, Collin I, et al. The Montreal Cognitive Assessment, MoCA: A brief screening tool for mild cognitive impairment. J Am Geriatr Soc (2005) 53(4):695–9. doi: 10.1111/j.1532-5415.2005.53221.x. PubMed PMID: WOS:000227899200021. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1532-5415.2005.53221.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15817019&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000227899200021&link_type=ISI) 23. 23.Bopp KL, Verhaeghen P. Aging and n-Back Performance: A Meta-Analysis. The Journals of Gerontology: Series B (2018). doi: 10.1093/geronb/gby024. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/geronb/gby024&link_type=DOI) 24. 24.Gevins A, Smith ME, McEvoy LK, Ilan AB, Chan CS, Jiang A, et al. A cognitive and neurophysiological test of change from an individual’s baseline. Clinical Neurophysiology (2011) 122(1):114–20. doi: 10.1016/j.clinph.2010.06.010. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.clinph.2010.06.010&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20619727&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 25. 25.Delorme A, Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods (2004) 134(1):9–21. Epub 2004/04/23. doi: 10.1016/j.jneumeth.2003.10.009. PubMed PMID: 15102499. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jneumeth.2003.10.009&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15102499&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000189330500002&link_type=ISI) 26. 26.Lopez-Calderon J, Luck SJ. ERPLAB: an open-source toolbox for the analysis of event-related potentials. Frontiers in Human Neuroscience (2014) 8(213). doi: 10.3389/fnhum.2014.00213. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3389/fnhum.2014.00213&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24782741&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 27. 27.Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull (1979) 86(2):420–8. Epub 1979/03/01. doi: 10.1037//0033-2909.86.2.420. PubMed PMID: 18839484. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037/0033-2909.86.2.420&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18839484&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1979HE52500019&link_type=ISI) 28. 28.Cicchetti DV. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychological assessment (1994) 6(4):284. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037//1040-3590.6.4.284&link_type=DOI) 29. 29.Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet (London, England) (1986) 1(8476):307–10. Epub 1986/02/08. PubMed PMID: 2868172. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0140-6736(86)90837-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2868172&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1986AYW4000013&link_type=ISI) 30. 30.Cassidy SM, Robertson IH, O’Connell RG. Retest reliability of event-related potentials: Evidence from a variety of paradigms. Psychophysiology (2012) 49(5):659–64. doi: 10.1111/j.1469-8986.2011.01349.x. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1469-8986.2011.01349.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22335452&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000302620800008&link_type=ISI) 31. 31.Kinoshita S, Inoue M, Maeda H, Nakamura J, Morita K. Long-term patterns of change in ERPs across repeated measurements. Physiol Behav (1996) 60(4):1087–92. Epub 1996/10/01. doi: 10.1016/0031-9384(96)00130-8. PubMed PMID: 8884937. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/0031-9384(96)00130-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8884937&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 32. 32.Segalowitz SJ, Barnes KL. The reliability of ERP components in the auditory oddball paradigm. Psychophysiology (1993) 30(5):451–9. Epub 1993/09/01. doi: 10.1111/j.1469-8986.1993.tb02068.x. PubMed PMID: 8416071. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1469-8986.1993.tb02068.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8416071&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1993LU38200004&link_type=ISI) 33. 33.Brunner JF, Hansen TI, Olsen A, Skandsen T, Haberg A, Kropotov J. Long-term test-retest reliability of the P3 NoGo wave and two independent components decomposed from the P3 NoGo wave in a visual Go/NoGo task. Int J Psychophysiol (2013) 89(1):106–14. Epub 2013/06/19. doi: 10.1016/j.ijpsycho.2013.06.005. PubMed PMID: 23770084. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijpsycho.2013.06.005&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23770084&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 34. 34.Huffmeijer R, Bakermans-Kranenburg MJ, Alink LR, van Ijzendoorn MH. Reliability of event-related potentials: the influence of number of trials and electrodes. Physiol Behav (2014) 130:13–22. Epub 2014/03/20. doi: 10.1016/j.physbeh.2014.03.008. PubMed PMID: 24642000. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.physbeh.2014.03.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24642000&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 35. 35.Lew HL, Gray M, Poole JH. Temporal Stability of Auditory Event-Related Potentials in Healthy Individuals and Patients With Traumatic Brain Injury. Journal of Clinical Neurophysiology (2007) 24(5):392–7. doi: 10.1097/wnp.0b013e31814a56e3. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/WNP.0b013e31814a56e3&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17912063&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 36. 36.Ranganathan P, Pramesh CS, Aggarwal R. Common pitfalls in statistical analysis: Measures of agreement. Perspect Clin Res (2017) 8(4):187–91. doi: 10.4103/picr.PICR\_123\_17. PubMed PMID: 29109937. [CrossRef](http://medrxiv.org/lookup/external-ref?access\_num=10.4103/picr.PICR_123_17&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29109937&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 37. 37.Klunk WE, Engler H, Nordberg A, Wang Y, Blomqvist G, Holt DP, et al. Imaging brain amyloid in Alzheimer’s disease with Pittsburgh Compound-B. Ann Neurol (2004) 55(3):306–19. Epub 2004/03/03. doi: 10.1002/ana.20009. PubMed PMID: 14991808. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/ana.20009&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14991808&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000189275800002&link_type=ISI) 38. 38.Hedges D, Janis R, Mickelson S, Keith C, Bennett D, Brown BL. P300 Amplitude in Alzheimer’s Disease. Clinical EEG and Neuroscience (2016) 47(1):48–55. doi: 10.1177/1550059414550567. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1550059414550567&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25253434&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) 39. 39.Chapman RM, McCrary JW, Gardner MN, Sandoval TC, Guillily MD, Reilly LA, et al. Brain ERP components predict which individuals progress to Alzheimer’s disease and which do not. (2011) 32(10):1742–55. doi: 10.1016/j.neurobiolaging.2009.11.010. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neurobiolaging.2009.11.010&link_type=DOI) 40. 40.Pedroso RV, Fraga FJ, Corazza DI, Andreatto CAA, Coelho Fgdm, Costa JLR, et al. Latência e amplitude do P300 auditivo na doença de Alzheimer: uma revisão sistemática. Brazilian Journal of Otorhinolaryngology (2012) 78(4):126–32. doi: 10.1590/s1808-86942012000400023. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1590/s1808-86942012000400023&link_type=DOI) 41. 41.Lee M-S, Lee S-H, Moon E-O, Moon Y-J, Kim S, Kim S-H, et al. Neuropsychological correlates of the P300 in patients with Alzheimer’s disease. Progress in Neuro-Psychopharmacology and Biological Psychiatry (2013) 40:62–9. doi: 10.1016/j.pnpbp.2012.08.009. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.pnpbp.2012.08.009&link_type=DOI) 42. 42.Bennys K, Portet F, Touchon J, Rondouin G. Diagnostic value of event-related evoked potentials N200 and P300 subcomponents in early diagnosis of Alzheimer’s disease and mild cognitive impairment. J Clin Neurophysiol (2007) 24(5):405–12. Epub 2007/10/04. doi: 10.1097/WNP.0b013e31815068d5. PubMed PMID: 17912065. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/WNP.0b013e31815068d5&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17912065&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F06%2F22%2F2020.05.27.20109157.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000250213400006&link_type=ISI) 43. 43.Rossini PM, Di Iorio R, Vecchio F, Anfossi M, Babiloni C, Bozzali M, et al. Early diagnosis of Alzheimer’s disease: the role of biomarkers including advanced EEG signal analysis. Report from the IFCN-sponsored panel of experts. Clinical Neurophysiology (2020) 131(6):1287–310. doi: 10.1016/j.clinph.2020.03.003. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.clinph.2020.03.003&link_type=DOI)