Cognitive subtypes in youth at clinical high risk for psychosis =============================================================== * Walid Yassin * James Green * Matcheri Keshavan * Elisabetta C. del Re * Jean Addington * Carrie E Bearden * Kristin S Cadenhead * Tyrone D Cannon * Barbara A Cornblatt * Daniel H Mathalon * Diana O Perkins * Elaine F Walker * Scott W Woods * William S. Stone ## Abstract **Introduction** Schizophrenia is a mental health condition that severely impacts well-being. Cognitive impairment is among its core features, often presenting well before the onset of overt psychosis, underscoring a critical need to study it in the psychosis proneness (clinical high risk; CHR) stage, to maximize the benefits of interventions and to improve clinical outcomes. However, given the heterogeneity of cognitive impairment in this population, a one-size-fits-all approach to therapeutic interventions would likely be insufficient. Thus, identifying cognitive subtypes in this population is crucial for tailored and successful therapeutic interventions. Here we identify, validate, and characterize cognitive subtypes in large CHR samples and delineate their baseline and longitudinal cognitive and functional trajectories. **Methods** Using machine learning, we performed cluster analysis on cognitive measures in a large sample of CHR youth (n = 764), and demographically comparable controls (HC; n = 280) from the North American Prodrome Longitudinal Study (NAPLS) 2, and independently validated our findings with an equally large sample (NAPLS 3; n = 628 CHR, 84 HC). By utilizing several statistical approaches, we compared the clusters on cognition and functioning at baseline, and over 24 months of followup. We further delineate the conversion status within those clusters. **Results** Two main cognitive clusters were identified, “impaired” and “intact” across all cognitive domains in CHR compared to HC. Baseline differences between the cognitively intact cluster and HC were found in the verbal abilities and attention and working memory domains. Longitudinally, those in the cognitively impaired cluster group demonstrated an overall floor effect and did not deteriorate further over time. However, a “catch up” trajectory was observed in the attention and working memory domain. This group had higher instances of conversion overall, with these converters having significantly more non-affective psychotic disorder diagnosis versus bipolar disorder, than those with intact cognition. In the cognitively intact group, we observed differences in trajectory based on conversion status, where those who start with intact cognition and later convert demonstrate a sharp decline in attention and functioning. Functioning was significantly better in the cognitively intact than in the impaired group at baseline. Most of the cognitive trajectories demonstrate a positive relationship with functional ones. **Conclusion** Our findings provide evidence for intact and impaired cognitive subtypes in youth at CHR, independent of conversion status. They further indicate that attention and working memory are important to distinguish between the CHR with intact cognition and controls. The cognitively intact CHR group becomes less attentive after conversion, while the cognitively impaired one demonstrates a catch up trajectory on both attention and working memory. Overall, early evaluation, covering several cognitive domains, is crucial for identifying trajectories of improvement and deterioration for the purpose of tailoring intervention for improving outcomes in individuals at CHR for psychosis. ## Introduction Schizophrenia is a debilitating mental health condition that affects overall well-being and is ranked among the top causes of disability-adjusted life years (1–4). The disability weight for an acute psychotic state is estimated to be the highest of any psychiatric condition (5). Schizophrenia is characterized by positive, negative, and other symptoms in addition to impaired cognition (6–8). Cognitive impairment in this population includes challenges with attention, verbal and working memory, executive functioning, and processing speed, among others (9–11). This has been demonstrated to significantly impact daily functioning (9–12). Given the chronicity of schizophrenia, focus is now shifting towards those in the early phase of the condition, clinical high risk for psychosis (CHR), during teenage years and early adulthood, when intervention is key (13,14). Individuals at CHR also experience varying levels of disruptions in perceptual and cognitive processes, as well as functioning, leading to worse overall outcomes (15–17). Cognitive variability also characterizes individuals with first-episode psychosis (FEP) (18), and other psychotic disorders (19–21). Individuals who develop bipolar disorder, for example, typically show better overall cognition than those who develop schizophrenia (22–24). Several studies report cognitive deficits in the CHR population similar to those observed in other psychotic conditions, albeit milder, and these deficits are also quite heterogeneous (22,25–27). This heterogeneity hinders efforts for tailored therapeutic interventions (19). It is hoped that individuals at CHR will benefit from early intervention that would ameliorate symptoms and even prevent conversion to frank psychosis (28,29). This presents a valuable public health target and an opportunity to substantially improve the wellbeing of those impacted. The challenges addressed above, among others, prompted a search for biomarkers to enhance the accuracy of diagnosis, prognosis, and treatment using machine learning (ML) and advanced statistical approaches (19,30–33). Cognition is among these biomarkers (17,22,27,34–37). Given the functional importance of cognitive deficits in this population and the heterogeneity of their presentation, the identification of cognitive clusters may reveal subtypes of individuals who are most likely to benefit from tailored therapeutic interventions to achieve better outcomes (27,29,38,39). Several studies attempted to produce cognitive subtypes in the CHR population, however, these studies either included other populations, such as genetic high-risk, FEP, recent onset psychosis or depression, controls (22,25), or other input variables such as clinical measures in addition to cognition (18,19,35). Moreover, the samples from the majority of these studies are relatively small with no external validation sample. The cognitive baseline and longitudinal trajectories in the CHR groups within different conversion statuses as well as their associated clinical and functional outcomes have not yet been reported. The goal of this study is to identify, validate, and characterize cognitive subtypes in a large CHR population and to evaluate how these subtypes differ in clinical and functional outcomes. In addition, we explore these subtypes within converters and non-converters. To do this, we utilized the North American Prodrome Longitudinal Study (NAPLS) 2 neurocognitive data, clustered them using machine learning, and validated our findings independently using NAPLS 3. We then compared the clusters on different cognitive and functional measures. ## Methods ### Participants The participants’ data in this study were obtained from NAPLS 2 (CHR (n = 764) HC (n = 280)) and 3 (CHR (n = 628) HC (n = 84)) consortia (13,14,40). Participants were help-seeking individuals who were referred from multiple community-based sources. Some were also self-referred. All CHR participants met the Criteria of Prodromal Syndrome (COPS) which is an assessment based on the Structured Interview for Psychosis-Risk Syndromes (SIPS) (41). After a thorough assessment which included administering the Structured Clinical Interview for DSM (SCID) IV (42) for NAPLS 2 and SCID 5 for NAPLS 3 (43), and the SIPS, vignettes were established for each CHR participant to obtain a consensus diagnosis. The attenuated psychotic symptoms rated on the Scale of Psychosis-Risk Symptoms (SOPS) are comprehensively described and include both recent and long-standing symptoms. The cross-site reliability in the SOPS rating was conducted in the same manner, with NAPLS 2 having an intraclass correlation range for the total SOPS of 0.82 to 0.93 and NAPLS 3 of 0.83 to 0.91, while the attenuated psychotic symptom score from the SOPS ranged from 0.92 to 0.96 and 0.83 to 0.94 for NAPLS 2 and 3, respectively. The differences across the individual sites were minimal and had excellent interclass correlations range. The Institutional Review Board approval was abstained from all sites and all the participants signed an informed consent. ### Inclusion and exclusion criteria Participants in NAPLS 2 and 3 were between 12 and 35 years old and met the diagnostic criteria for the psychosis risk syndrome as part of the COPS criteria (41). Participants were excluded for i) criteria for current or lifetime Axis I psychotic disorder (Including affective psychosis), ii) IQ < 70, iii) history of disorders of the central nervous system, iv) diagnostic psychosis risk symptoms better explained by another Axis I disorder. Unless the disorder did not account for the individual’s prodromal symptoms, other non-psychotic DSM-IV and 5 disorders were not exclusionary, e.g., substance use disorder (13,14,40). The use of antipsychotics was not exclusionary if there was clear evidence that psychosis risk, not psychotic symptoms, were present when the medication was started. The healthy controls (HC) did not meet the criteria for any psychosis risk syndrome, a current or past psychotic disorder, or a Cluster A personality disorder diagnosis. In addition, we excluded those who are currently using psychotropic medication, as well as those with first-degree relatives having a history of a psychotic disorder or other disorders involving psychotic symptoms. Detailed information about the recruitment, diagnostic measures, and inclusion and exclusion criteria can be found elsewhere (13,14,40). ### Assessments #### Clinical Clinical measures included in this study were the SOPS, Global Assessment of Functioning (GAF) (44), and the Global Functioning Social (GF:SS) and Role Scales (GF:RS) (45). #### Neurocognition Neurocognitive measures that are common between NAPLS 2 and 3 were used. For example, newer versions of tests used in NAPLS2 were used in NAPLS 3, such as the Wechsler Abbreviated Scale of Intelligence (WASI) in NAPLS2 and the WASI-II in NAPLS 3 (46). In both instances, IQ was estimated from two subtests that included Vocabulary, but NAPLS2 used Block Design as the second test and NAPLS3 used Matrix Reasoning. Thus, block design and matrix reasoning were not included, only Vocabulary was. Similarly, portions of Seidman’s Auditory Continuous Performance (A-CPT) that were administered in NAPLS2 on cassette tapes were repeated in NAPLS3 but in a digitized format and administered by computer. Portions of the A-CPT that were used in both NAPLS2 and NAPLS3 were included in the study and comprised the vigilance (QA) and high working memory load/no interference (Q3A-MEM) conditions (47), as was The Wide Range Achievement Test, Fourth Edition (WRAT-4) (48), and several MATRICS Consensus Cognitive Battery (MCCB) tests, including Brief Assessment of Cognition in Schizophrenia Symbol Coding (BACS SC) (49), Hopkins Verbal Learning Test-Revised (HVLT-R) (50), Letter-Number Span (LNS) (51). #### Magnetic Resonance Imaging The Magnetic Resonance Imaging scans were acquired using 3 Telsa Siemens (12-channel coil used in 5 sites) or a General Electric (8-channel coil used in 3 sites) scanners. The parameters of the sequences were optimized for the scanner manufacturer, version of the software, and coil configuration ([http://adni.loni.ucla.edu/research/protocols/mri-protocols/](http://adni.loni.ucla.edu/research/protocols/mri-protocols/)) (52). Segmentation was performed using the Freesurfer software suite version 5.3. (53,54). Further details can be found elsewhere (55,56). #### Conversion to psychosis Conversion to psychosis was established by meeting the Presence of Psychotic Symptoms (POPS) criteria (57). At least one of the 5 SOPS positive symptoms reached a psychotic level of intensity, rated 6, for a frequency of more than one hour/day for 4 days/week during the past month, or symptoms seriously impacted functioning were needed to determine conversion. ### Machine learning #### Preprocessing NAPLS 2 and 3 were processed in the same way. Instances with more than 50% missingness of cognitive data were not included in the study. NAPLS 2 functioned as a discovery dataset and NAPLS 3 as a validation dataset. Only neurocognitive variables that were common to both consortia were included as input variables for clustering (See *Neurocognition* section). Clustering was applied to CHR only (i.e., not healthy controls), since our aim was to identify cognitive subtypes within the CHR population. First, NAPLS 2 data were imputed using KKNImputer, part of scikit learn v1.4.2 ([https://scikit-learn.org/](https://scikit-learn.org/)), then the imputation of NAPLS 2 was applied to NAPLS 3 using the imputer fit/transofrm functions. Site correction was done using ComBat Family of Harmonization Methods ([https://github.com/andy1764/ComBatFamily](https://github.com/andy1764/ComBatFamily)) in R Studios v4.3.2. The site effect removal was also applied to NAPLS 3 data using the function provided in ComBat. After, the age and sex effects were removed from both NAPLS 2 and 3 using the control data with Python v3.11.5 via scikit learn’s linear regression model. This was accomplished using regression to model the beta coefficients in the control group, to preserve illness effects, and to use them to residualize the cognitive variables in NAPLS 2 and 3 (13,14,40). Then, the data were standardized using the StandardScalar preprocessing function of scikit learn. Again, the standardization was done on NAPLS 2 and applied to NAPLS 3 using the fit-transform function. Lastly, principal component analysis (PCA) was used to reduce the multicollinearity of the data in NAPLS 2 using the decomposition package in scikit learn and applied to NAPLS 3. #### Clustering algorithms The clustering was done using the R package clValid v0.7 (58). The resulting principal components with 86.8% explained variance from the PCA entered the clustering. An in-house script was written to iterate over several algorithms (including k means, Partition Around Medoids (PAM), AGglomerative NESting (Agnes), and DIvisive ANAlysis Clustering (DIANA)), and available parameters (including metrics; euclidian; correlation; manhattan, methods; average; single; complete; ward, cluster number range [2:9], was based on the literature with some leeway for potentially higher number of clusters) and produce results that were compared based on the Silhouette score, to determine the best algorithm, with the optimal parameters, and the number of clusters. After identifying the algorithm and optimal parameters, as well as the number of clusters, the cluster membership was extracted from that algorithm, and used for further analysis. #### Validation NAPLS 2 clusters were validated in three ways using NAPLS 3 as indicated in Ullmann et al. (59). i) using external validators. We used total brain volume as an external validator (19) since a reduction in brain volume has been observed in those with impaired cognition (60,61). ii) Methods based validation. The PCs from NAPLS 3 were run using the same algorithm, parameters, and cluster number that were deemed best for the NAPLS 2 data and selected to produce the NAPLS 3 clusters. Then, the two were compared based on the Silhouette score. A similar silhouette score is indicative of good validation (59). iii) Results based validation. The centroids of the clusters in NAPLS 2 were calculated and the instances in NAPLS 3 were mapped on those clusters, i.e., the proximity of each instance in NAPLS 3 to the centroid produced from the clusters of NAPLS 2 would determine their cluster membership (59). After, a machine learning classification algorithm (random forest classifier using sklearn) was run using the cluster membership as labels and the PCs of the instances as input variables from NAPLS 2 for training, and then the trained algorithm from NAPLS 2 was used for classifying NAPLS 3 data. The results were reported including precision, recall, and f1-score. In addition, a confusion matrix is also provided. #### Clusters vs converter status After the clusters were identified and described, we were interested in delineating the characteristics of the converters and non-converters belonging to each of the cognitive clusters. ### Statistical Analysis Descriptive statistics, including chi-square and t-tests, were implemented to assess the differences across demographic variables. Then, for NAPLS2 (discovery sample), demographics were described for the clusters produced. The clusters within the converter and non-converter groups are also presented. To describe baseline differences in cognitive, clinical, and functional outcomes between the clusters and controls, Kruskal-Wallis test with post-hoc pairwise comparisons was conducted. Dunn’s test was used for post-hoc pairwise comparisons where needed. Longitudinal analyses were conducted using a linear mixed effects model in R, where the HC are used as reference. False Discovery Rate was used to correct for multiple comparisons. The comparisons of the clusters between the cognitive variables at baseline were the only analyses that were not corrected for multiple comparisons as they were produced synthetically and were presented for descriptive purposes. Kaplan-Meyer survival curve was used to identify differences in conversion between the clusters. This was conducted with the entire sample, however, another analysis was run with participants who remained in the study < 2.5 years to reduce outlier effects (62). The number of converters vs non -converters in the clusters was evaluated using the Chi square test. We assessed the longitudinal association between cognition and SOPS to further evaluate their longitudinal relationship post hoc, using a Generalized Estimating Equation (GEE). We also examine the relationship between cognition and functioning while accounting for the repeated measures within individual overtime, we used a GEE on the entire sample in R (63,64). Functioning was specified as outcome variables and cognition as predictors, and vice versa, both were scaled. The “waves” were the time points, and the participant ID was used to identify repeated measures for each participant. The correlation structures were chosen based on the quasi information criterion (QIC) (65). The “independence” correlation structure was not entered into the QIC model evaluation, as it is not suitable for this analysis. The percentage of psychotic vs bipolar diagnoses were calculated by using the total of the converters in each cluster as the total number of converters (e.g., of all the converters in C1, how many have a psychotic versus bipolar diagnosis?). ## Results ### Demographic data The demographic data of NAPLS 2 and NAPLS 3 are presented in table S1. The demographic data of NAPLS 2 and 3 did not differ within each study between the controls and CHR for sex. Control participants in NAPLS 2 were about a year older (x̄ : 19.73 vs. 18.5) than CHR (p < 0.001). In both NAPLS2 and NAPLS3, CHR participants were more likely to have lower levels of education than the controls (p < 0.001). The resulting clusters did not differ in age and sex, but differ in income (p < 0.001), and education (p < 0.001). More details regarding other demographic factors can be found in table 1. View this table: [Table 1:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T1) Table 1: Demographic comparisons for cluster membership ### Machine learning #### Clustering The clustering analysis using NAPLS 2 (discovery) data demonstrated that DIANA was the best algorithm resulting in two clusters (Cluster 1, impaired (C1), Cluster 2, intact (C2)), with good separability indicated by the internal validation measures including the Silhouette score of 0.5045 (Table S2). Hierarchical clustering is a valuable overall clustering algorithm and has proven to be suitable for clustering cognition in CHR (22,25). The full details of the clustering results including parameters can be seen in the supplementary material (Table S2, S3). The clusters in 3D and dendrograms are also presented (Figures S1). #### Validation Total brain volume (mm3) was significantly different between the clusters (p=0.024), where C2 (Mean (M)=606,085, standard deviation (SD)=614,89) had a larger volume than C1 (M=592,433, SD=66,835), which is what we would expect based on evidence from the literature (19,60,61). NAPLS 3 clustering demonstrated a similar silhouette score (0.5408) to the NAPLS 2 (0.5045). The second validation method demonstrated an accuracy of 0.83 (Precision =0.88, recall=0.86, f1-score=.87) (See Table S3 for confusion matrix). #### Cognition (Baseline) As expected, there are significant differences in the cognitive domains that entered the clustering algorithm, where C1 is characterized by “impaired” cognition and C2 by “intact” cognition (WRAT (p <0.001), WASI Vocab (p <0.001), CPT QA (p < 0.001) and Q3A (p < 0.001), BACS (p < 0.001), HVLT-R (p < 0.001), LNS (p < 0.001)). Interestingly, only CPT Q3A (p <0.01) and WRAT (p < 0.05) show differences between C2 and controls (Table 2, Fig 1.) ![Figure 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F1.medium.gif) [Figure 1:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F1) Figure 1: Showing the baseline cognitive differences between the clusters and controls View this table: [Table 2:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T2) Table 2: Pairwise Comparisons between Cognitive Clusters, Controls, and Cognitive Outcomes #### Cognition (Longitudinal) C1 demonstrated a notable catch up trajectory in CPT QA (*b* = 2.17, *p* < 0.001), and Q3A (*b* = 3.03, *p* < 0.01), compared to controls. (Table 3, Fig 2). No other notable differences were observed. ![](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F2/graphic-6.medium.gif) [](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F2/graphic-6) ![](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F2/graphic-7.medium.gif) [](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F2/graphic-7) Figure 2: Longitudinal cognitive outcomes across clusters and controls. View this table: [Table 3:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T3) Table 3: Cognitive Clusters and Longitudinal Cognition Outcomes ### Cognitive clusters vs. conversion status #### Cognition (Baseline) Pairwise comparisons demonstrated that controls and C2NC had higher scores than C1 independent of their conversion (p <0.001). C2C also had higher scores than C1 on all cognitive domains independent of conversion status (CPT Q3A: p<0.05, otherwise p<0.001). The only differences between C2NC and controls were found in CPT Q3A and WRAT, where CPT Q3A was lower (p<0.05), and WRAT was higher (p<0.05) (Table 4, Fig 3). ![Figure 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F3.medium.gif) [Figure 3:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F3) Figure 3: Pairwise comparisons between clusters, converters, and healthy controls. View this table: [Table 4:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T4) Table 4: Conversion and Cluster Post-hoc Comparisons. #### Cognition (Longitudinal) Several cognitive variables demonstrated a “deficit” trajectory with improvement over time similar to controls. Other variables show more improvement comparatively (C1NC in WRAT (*b* = 0.57, *p* < .05), even catch up QA (*b* =1.89, *p* < 0.001) and Q3A (*b* =2.31, *p* < 0.05), and C1C in QA (*b* =3.55, *p* < 0.001), and Q3A (*b* =6.77, *p* < 0.001)). A “deterioration” trajectory was observed in C2C QA compared to the controls (*b* =-2.89, *p* < 0.05). No other cognitive variables demonstrated differences (Table 5, Fig 4). ![](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F4/graphic-15.medium.gif) [](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F4/graphic-15) ![](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F4/graphic-16.medium.gif) [](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F4/graphic-16) Figure 4: Longitudinal outcomes across cognitive variables for clusters and healthy controls. View this table: [Table 5:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T5) Table 5: Conversion and Cluster Cognitive Longitudinal Outcomes #### Functioning (Baseline) A significant difference in functioning was observed, where C1 showed lower GF: RS than C2 (p < 0.001), and C1 (p < 0.001), and C2 (p < 0.001), lower than controls (p < 0.001). The same trend was observed in GF: SS (C1 vs C2: p < 0.01, C1 vs HC: p < 0.001, C2 vs. HC: p < 0.001). In GAF, C1 (p < 0.001) and C2 (p < 0.001) were lower than controls. (Table 6, Fig 5). ![Figure 5:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F5.medium.gif) [Figure 5:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F5) Figure 5: Baseline comparing in functioning between the clusters and control View this table: [Table 6:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T6) Table 6: Functioning #### Functioning (Longitudinal) GF: SS demonstrated a trajectory with mild improvement in both clusters (C1: (*b* = .19, *p* < 0.001), C2 (*b* = .16, *p* < 0.001)) compared to controls. This was also observed in GAF (C1: (*b* =1.74, *p* < 0.001), C2 (*b* = 1.67, *p* < 0.001)). No change was observed in GF: RS (Table 7, Fig 6). ![](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F6/graphic-22.medium.gif) [](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F6/graphic-22) ![](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F6/graphic-23.medium.gif) [](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F6/graphic-23) Figure 6: Longitudinal functioning outcomes between cluster groups and health controls. View this table: [Table 7:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T7) Table 7: Longitudinal Outcomes for Functioning, Cluster membership, and Cluster Membership and Conversion Status #### Functioning (Baseline) Pairwise comparisons demonstrated that C1C had the lowest functioning compared to C2C (p < 0.05), C2NC (p < 0.001), and controls (p < 0.001) for the GF: RS. Additionally in GF:RS, C1NC had lower functioning than C2NC (p < 0.001). In GAF, C2NC (p < 0.05) and controls (p < 0.001) had higher functioning than C1C, and C1NC (p < 0.001), C2C (p < 0.001), and C2NC (p < 0.001), lower than controls. The differences observed in GF: SS were between the controls and the cluster versus conversion groups (C1C vs. HC: p< 0.001, C1NC vs. HC: p< 0.001, C2C vs. HC: p< 0.001, C2NC vs. HC: p< 0.001) (Table 8, Fig 7). ![Figure 7:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F7.medium.gif) [Figure 7:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F7) Figure 7: Pairwise comparisons between clusters, converter status, and healthy controls View this table: [Table 8:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T8) Table 8: Conversion and Cluster Post-hoc Functioning Comparisons. #### Functioning (Longitudinal) In GF:SS, only C1NC (*b* = 0.22, *p* < 0.001) and C2NC (*b* = 0.17, *p* < 0.001) demonstrated a significant, but slight, improved trajectory in both clusters compared to controls. In GF:RS, C2C demonstrated significant decline overtime compared to controls (*b* = -0.57, *p* < 0.001). In GAF, C1NC (*b* = 1.97, *p* < 0.001), and C2NC (*b* = 2.09, *p* < 0.001), both demonstrated trajectories with slight improvement, while C2C demonstrated a significant decline (*b* = -3.95, *p* < 0.001). (Table 9, Figure 8). ![Figure 8:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F8.medium.gif) [Figure 8:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F8) Figure 8: Longitudinal differences between clusters, converters, and healthy controls. View this table: [Table 9:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T9) Table 9: Cluster and Converter Status with Functional Outcomes Longitudinal #### Clinical (Baseline) The clusters did not demonstrate baseline differences in SOPS positive, negative, disorganization, or general domains (Table 10, Fig. 9). ![Figure 9:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2024/08/08/2024.08.07.24311240/F9.medium.gif) [Figure 9:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/F9) Figure 9: Differences in the Scale of Prodromal Symptoms between the clusters and control View this table: [Table 10:](http://medrxiv.org/content/early/2024/08/08/2024.08.07.24311240/T10) Table 10: Clinical and Cognitive Cluster pairwise comparisons #### Clinical (Longitudinal) For Positive symptoms, converter groups demonstrated worsening symptom trajectories (C1C: *b* = 3.53, *p* < 0.001; C2C: *b* = 2.27, *p* <0.001) while non-converter groups showed improvement trajectories (C1NC: *b* = -2.26, *p* <0.001; C2NC: *b* = -2.04, *p* <0.001) compared to controls. In Negative, Disorganization, and General symptoms non-converter groups demonstrated significant improvement compared to controls (Negative (C1NC: *b* = -1.89, *p* <0.001; C2NC: *b* = -1.95, *p* <0.001); Disorganization (C1NC: *b* = -0.81, *p* <0.0001; C2NC: *b* = -0.92, *p* <0.0001); General(C1NC: *b* = -1.43, *p* <0.0001; C2NC: *b* = -1.50, *p* <0.0001), while converter groups didn’t change significantly over time. (Table S4-S5, Fig. S2-S3). #### Time to conversion The Kaplan-Meyer survival curve and log-rank test demonstrated that individuals in C1 have a lower survival probability than C2 that is consistent over time *(X*(1) *=* 8.4, *p* < 0.01) (Fig. S4.a). For the sample having participants who remained in the study for < 2.5 years, C1 continued to have significantly lower survival than C2 consistently (*X*(1) = 8.1, *p* < 0.01) (Fig. S4.b). #### Converter vs. non-convert proportion The converters constitute 18% of C1 and 11% of C2 (*X*2=6, p=0.02). #### Relationship between cognition and SOPS The GEE showed that only negative symptoms and disorganization are associated with cognition in the non-converters, while positive symptoms are not associated with any of the cognitive measures in both converters and non converters (Table S6). #### Relationship between cognition and functioning The GEE showed that all the cognitive variables have a significant and positive relationship with functioning (Table S7, S8). Variables related to attention and working memory, verbal abilities, and declarative memory, had the largest estimates. #### Converter diagnosis In C1C, 81.9%, and in C2C 62%, had a diagnosis code that falls within the psychotic disorder diagnoses (*X*2=5, p=0.02). In contrast, C1C had 16.8% and C2C had 44.7% bipolar diagnosis (*X*2=7, p=0.006). ## Discussion To our knowledge, this is the first and largest study to use machine learning clustering on cognitive variables in a CHR sample, independently validate the clusters with an equally large CHR sample having common cognitive variables, and investigate the clusters within the conversion status. Our findings reveal two CHR cognitive subtypes with “impaired” and “intact” cognition compared to controls. Previous work by Haining et al. and Allott et al. reported on two cognitive clusters with “intact” and “impaired” cognitive profiles as well (25,27). This is also consistent more broadly with results from the psychosis spectrum (23,66–69). While other studies report varying numbers of clusters, these differences could be attributed to the inclusion of different conditions or a mix of variable types beyond cognition in the clustering (70). Notably, only attention and working memory as well as verbal abilities domains demonstrated differences between the cognitively intact CHR group, specifically non-converters, and controls at baseline. Despite that group having comparable cognitive abilities to controls in most domains, independent of the conversion status, it is still unable to perform as well on CPT Q3A. This task can be difficult, even for the controls, and thus assessments challenging attention and working memory abilities could potentially be an important cognitive marker to distinguish between the cognitively intact CHR group and controls early on. Our previous work does demonstrate worse performance by the CHR group, compared to controls, on attention and working memory abilities than any of the other domains, which is consistent with the current findings (35). Verbal abilities at baseline, on the other hand, were higher in the CHR group. DeRosse et al. also reported significantly higher WRAT scores in the psychotic disorder group, which they have attributed to a compensation mechanism or “resilient factor against clinically significant psychosis” indicated by negative symptoms (71). We do not observe differences in negative symptoms, or any other SOPS subscores, between the cognitively intact and impaired CHR groups at baseline, nor do we observe a longitudinal association between WRAT and negative symptoms in the CHR group as a whole (71). While the converters had worsened positive symptoms overtime, which is expected after conversion, the non converters improved across all SOPS subscores. The association we found between negative symptoms and most of the cognitive assessments, especially CPT QA, is in alignment with previous studies. Nonetheless, others, e.g., Allott et al., did not observe an association between cognition and symptomatology, even longitudinally (27,72). This could still be a compensation mechanism, but not necessarily specifically attributed to differences in psychosis related clinical findings in the converter group, at least those measured by SOPS, in our study. Rather our results are more in alignment with Walter et al. demonstrating that, independent of symptomatology, verbal abilities are more resistant to the impact of psychiatric conditions, specifically schizophrenia, than the rest of the cognitive domains that can be quite vulnerable (73). So it is possible that the cognitive domains, other than verbal abilities, that are comparable between the cognitively intact CHR group and the controls at the time of assessment could have even been higher if it weren’t for their CHR status. The attention and working memory domains follow a “catch up” trajectory in the cognitively impaired CHR group, independent of the conversion status, with QA reaching closer to the controls than Q3A, probably due to the task’s difficulty. In the theoretical framework by Allott et al., they describe the “deficit” trajectory in which there is no change in the trajectory itself, but the deficit is consistent throughout the 12 month follow-up from baseline. All cognitive trajectories in the cognitively impaired CHR group in our study follow this pattern except for attention and working memory domains (CPT QA and Q3A) and less so in verbal abilities (WRAT). Thus, our results support those of Allott et al. and extend their findings up to 2 years (27). The impaired cognition groups would benefit from early intervention by boosting their catch up trajectory to meet the performance of the controls. The cognitively intact CHR group on the other hand did not demonstrate any significant trajectory differences from controls, probably because of a ceiling effect. Interestingly, however, the baseline differences in CPT Q3A distinguishing between the cognitively intact CHR group and controls fades away after 24 months with a gradual, non-significant, “catch up” trajectory. It is possible that the increased vigilance we see in the CHR post conversion, where they are more wary of their surroundings, could have contributed to this sort of compensation mechanism. This can be more clearly captured in the cognitively impaired CHR group, due to the lack of a ceiling effect, with improvement beyond the practice effects seen in the controls. Further examination of these trajectories showed that the attention and working memory domain in the cognitively intact converter group demonstrates a sharp decline. The deterioration in CPT QA, after conversion, without an associated decline in CPT Q3A might indicate that attention, or vigilance, is more impacted than working memory. Given the substantial decline in this group, we believe that they might require close observation. Since this group starts seemingly similar to controls, clinicians might assume that they are not in need of therapeutic intervention, which might be a missed opportunity for improvement of outcome or at least sustaining their pre conversion capacity. Therapeutic interventions targeting vigilance might be most useful for this group (74). Unlike GAF, the role and social scales were designed to assess functioning specifically in the CHR population (75), and thus it was unsurprising to observe differences in the global functioning role and social scales between the cognitively intact and impaired CHR groups, but not in GAF (75). After 24 months, however, the social scale becomes indistinguishable between the two CHR groups. More detailed examination of functioning demonstrated that the cognitively intact converters drop drastically in both GAF and role scale after conversion. This resembles the deterioration observed in CPT QA in that group as well. Our results do indicate a positive relationship between CPT QA and functioning, however, the point change is minimal compared to the association with other cognitive domains when the response variable is role scale and the predictor is CPT QA. On the other hand, when these are reversed, role scale is predictor and CPT QA is response, the estimate increases substantially. Meaning that the change in functioning has more impact on the change in cognition here than cognition has on functioning. Nonetheless, we do acknowledge the trajectory heterogeneity within CPT QA in the cognitively intact group, and thus an approach similar to Allott et al. (27), investigating CPT QA, is warranted. It is also possible that there are mediators informing the association between CPT QA and functioning that would be worth exploring. The CHR group with impaired cognition has a significantly higher number of converters and the members convert earlier overall than the cognitively intact group. This is supported by previous research in CHR and the schizophrenia spectrum, demonstrating that neurocognitive performance at baseline is generally lower in those who later convert (76). The converters with impaired cognition also had a significantly higher percentage of a psychotic disorder diagnosis than the intact groups and significantly lower bipolar disorder diagnosis (23,24). It is possible that those who convert and have impaired cognition will end up converting to schizophrenia, which makes cognition be an important predictor of future diagnosis. This is consistent with prior work indicating that cognitive impairment of those with schizophrenia exists early in life preceding onset and is more pronounced, while those with bipolar develop such impairment later in adulthood but not as severe (23,24,77). There is a high potential for tailored and preventative cognitive therapeutic interventions here. For example, It is intuitive that the cognitively impaired group would be recommended for cognitive intervention given their cognitive abilities early on. Therapeutic approaches here might improve their catch-up trajectory to reach a level similar to controls. Those with intact cognition on the other hand, even though they are similar to controls at baseline and follow up in most of the measures, those of them who convert, change drastically in both vigilance, and functioning, especially role scale. This group should be under close observation, as they initially seem that they would not benefit from cognitive intervention, but they actually need it to prevent their deterioration later on (38,39). Unlike what has been reported previously– that those with good cognition at baseline might not benefit from cognitive therapeutic intervention (27), we recommend that this is taken with more consideration and scrutiny to better understand those with intact cognition who end up converting. ## Conclusion This study identified impaired and intact cognitive subtypes, independent of their conversion status. The results further indicate that attention and working memory are important to distinguish between those who are CHR with intact cognition and controls. The cognitively intact CHR group becomes less vigilant generally post conversion after 24 month followup, while the cognitively impaired one demonstrates a catch up trajectory on both attention and working memory. Overall, early assessment, covering several cognitive domains, especially attention and working memory, is crucial for identifying trajectories of improvement and deterioration for the purpose of tailoring intervention for improving outcomes in individuals at CHR. ### Limitations There are some limitations to consider: 1) Like other longitudinal studies, the drop out rate of the participants is a limitation. 2) This is a secondary data analysis study, meaning that we were limited by the variables available to us. Additionally, the cognitive measures were limited by what is common between the discovery and validation dataset. Nonetheless, the variables included in this study do explore a wide range of cognitive domains. 3) The causality, and its direction, between the observed decline in cognition and functioning could not be established in this current work. ## Supporting information Supplementary materials [[supplements/311240_file03.pdf]](pending:yes) ## Data Availability All data are available online at the National Institute of Mental Health Data Archive. [https://nda.nih.gov/](https://nda.nih.gov/) ## Acknowledgments This work was supported by the National Institute of Mental Health (R21MH133001 to Dr Yassin; U01MH081984 to Dr Addington; R01MH60720, U01MH081944 and K24MH76191 to Dr Cadenhead; P50MH066286 (Prodromal Core) to Dr Bearden; U01MH082004 to Dr Perkins; U01MH081988 to Dr Walker; U01MH082022 to Drs Cadenhead and Woods; U01MH081928 to Dr Stone; U01MH081902 to Drs Bearden andCannon; U01MH076989 to Dr Mathalon; and U01MH081857 to Dr Cornblatt). * Received August 7, 2024. * Revision received August 7, 2024. * Accepted August 8, 2024. * © 2024, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Solmi M, Seitidis G, Mavridis D, Correll CU, Dragioti E, Guimond S, et al. Incidence, prevalence, and global burden of schizophrenia - data, with critical appraisal, from the Global Burden of Disease (GBD) 2019. Mol Psychiatry [Internet]. 2023 Jul 27 [cited 2024 Jan 24]; Available from: [https://www.nature.com/articles/s41380-023-02138-4](https://www.nature.com/articles/s41380-023-02138-4) 2. 2.McGrath J, Saha S, Chant D, Welham J. Schizophrenia: a concise overview of incidence, prevalence, and mortality. Epidemiol Rev. 2008;30:67–76. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/epirev/mxn001&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18480098&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000260555900004&link_type=ISI) 3. 3.an der Heiden W, Häfner H. The epidemiology of onset and course of schizophrenia. Eur Arch Psychiatry Clin Neurosci. 2000;250(6):292–303. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s004060070004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11153964&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000166150900004&link_type=ISI) 4. 4.Charlson FJ, Ferrari AJ, Santomauro DF, Diminic S, Stockings E, Scott JG, et al. Global Epidemiology and Burden of Schizophrenia: Findings From the Global Burden of Disease Study 2016. Schizophr Bull. 2018 Oct 17;44(6):1195–203. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/schbul/sby058&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=29762765&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 5. 5.Global, regional, and national burden of 12 mental disorders in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Psychiatry. 2022 Feb;9(2):137–50. 6. 6.Heinrichs RW. The primacy of cognition in schizophrenia. Am Psychol. 2005 Apr;60(3):229–42. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037/0003-066X.60.3.229&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15796677&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000228112300002&link_type=ISI) 7. 7.Kahn RS, Keefe RSE. Schizophrenia is a cognitive illness: time for a change in focus. JAMA Psychiatry. 2013 Oct;70(10):1107–12. 8. 8.Nuechterlein KH, Subotnik KL, Ventura J, Green MF, Gretchen-Doorly D, Asarnow RF. The puzzle of schizophrenia: tracking the core role of cognitive deficits. Dev Psychopathol. 2012 May;24(2):529–36. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0954579412000132&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22559128&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 9. 9.Bowie CR, Harvey PD. Cognitive deficits and functional outcome in schizophrenia. Neuropsychiatr Dis Treat. 2006 Dec;2(4):531–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2147/nedt.2006.2.4.531&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19412501&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 10. 10.McCutcheon RA, Keefe RSE, McGuire PK. Cognitive impairment in schizophrenia: aetiology, pathophysiology, and treatment. Mol Psychiatry. 2023 May;28(5):1902–18. 11. 11.Fujiwara H, Yassin W, Murai T. Neuroimaging studies of social cognition in schizophrenia. Psychiatry Clin Neurosci. 2015 May;69(5):259–67. 12. 12.Green MF. What are the functional consequences of neurocognitive deficits in schizophrenia? Am J Psychiatry. 1996 Mar;153(3):321–30. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1176/ajp.153.3.321&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8610818&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1996TY35600004&link_type=ISI) 13. 13.Addington J, Liu L, Buchy L, Cadenhead KS, Cannon TD, Cornblatt BA, et al. North American Prodrome Longitudinal Study (NAPLS 2): The Prodromal Symptoms. J Nerv Ment Dis. 2015 May;203(5):328–35. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/NMD.0000000000000290&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25919383&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 14. 14.Addington J, Liu L, Brummitt K, Bearden CE, Cadenhead KS, Cornblatt BA, et al. North American Prodrome Longitudinal Study (NAPLS 3): Methods and baseline description. Schizophr Res. 2022 May;243:262–7. 15. 15.Bora E, Murray RM. Meta-analysis of cognitive deficits in ultra-high risk to psychosis and first-episode psychosis: do the cognitive deficits progress over, or after, the onset of psychosis? Schizophr Bull. 2014 Jul;40(4):744–55. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/schbul/sbt085&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23770934&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000338130300007&link_type=ISI) 16. 16.Fusar-Poli P, Deste G, Smieskova R, Barlati S, Yung AR, Howes O, et al. Cognitive functioning in prodromal psychosis: a meta-analysis. Arch Gen Psychiatry. 2012 Jun;69(6):562–71. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archgenpsychiatry.2011.1592&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22664547&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000304771000003&link_type=ISI) 17. 17.Giuliano AJ, Li H, Mesholam-Gately RI, Sorenson SM, Woodberry KA, Seidman LJ. Neurocognition in the psychosis risk syndrome: a quantitative and qualitative review. Curr Pharm Des. 2012;18(4):399–415. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2174/138161212799316019&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22239571&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 18. 18.Amoretti S, Rabelo-da-Ponte FD, Rosa AR, Mezquida G, Sánchez-Torres AM, Fraguas D, et al. Cognitive clusters in first-episode psychosis. Schizophr Res. 2021 Nov;237:31–9. 19. 19.Clementz BA, Sweeney JA, Hamm JP, Ivleva EI, Ethridge LE, Pearlson GD, et al. Identification of Distinct Psychosis Biotypes Using Brain-Based Biomarkers. Am J Psychiatry. 2016;173(4):373–84. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1176/appi.ajp.2015.14091200&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26651391&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 20. 20.Lewandowski KE, Sperry SH, Cohen BM, Öngür D. Cognitive variability in psychotic disorders: a cross-diagnostic cluster analysis. Psychol Med. 2014 Nov;44(15):3239–48. 21. 21.Lewandowski KE, Baker JT, McCarthy JM, Norris LA, Öngür D. Reproducibility of Cognitive Profiles in Psychosis Using Cluster Analysis. J Int Neuropsychol Soc. 2018 Apr;24(4):382–90. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S1355617717001047&link_type=DOI) 22. 22.Velthorst E, Meyer EC, Giuliano AJ, Addington J, Cadenhead KS, Cannon TD, et al. Neurocognitive profiles in the prodrome to psychosis in NAPLS-1. Schizophr Res. 2019 Feb;204:311–9. 23. 23.Lee J, Rizzo S, Altshuler L, Glahn DC, Miklowitz DJ, Sugar CA, et al. Deconstructing Bipolar Disorder and Schizophrenia: A cross-diagnostic cluster analysis of cognitive phenotypes. J Affect Disord. 2017 Feb;209:71–9. 24. 24.Krabbendam L, Arts B, Van Os J, Aleman A. Cognitive functioning in patients with schizophrenia and bipolar disorder: A quantitative review. Schizophr Res. 2005 Dec;80(2–3):137–49. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2005.08.004&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16183257&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000234757600001&link_type=ISI) 25. 25.Haining K, Gajwani R, Gross J, Gumley AI, Ince RAA, Lawrie SM, et al. Characterising cognitive heterogeneity in individuals at clinical high-risk for psychosis: a cluster analysis with clinical and functional outcome prediction. Eur Arch Psychiatry Clin Neurosci. 2022 Apr;272(3):437–48. 26. 26.Haddad NM, Hortêncio L, Andrade JC, Serpa MH, Alves TM, Van De Bilt MT, et al. Cognitive Patterns and Conversion in a Representative Sample of Individuals at Risk for Psychosis. J Nerv Ment Dis. 2022 May;210(5):335–41. 27. 27.Allott K, Schmidt SJ, Yuen HP, Wood SJ, Nelson B, Markulev C, et al. Twelve-Month Cognitive Trajectories in Individuals at Ultra-High Risk for Psychosis: A Latent Class Analysis. Schizophr Bull Open. 2022 Jan 1;3(1):sgac008. 28. 28.Glenthøj LB, Hjorthøj C, Kristensen TD, Davidson CA, Nordentoft M. The effect of cognitive remediation in individuals at ultra-high risk for psychosis: a systematic review. Npj Schizophr. 2017 May 8;3(1):20. 29. 29.Zhang T, Wang J, Xu L, Wei Y, Tang X, Hu Y, et al. Subtypes of Clinical High Risk for Psychosis that Predict Antipsychotic Effectiveness in Long-Term Remission. Pharmacopsychiatry. 2021 Jan;54(01):23–30. 30. 30.Yassin W, Nakatani H, Zhu Y, Kojima M, Owada K, Kuwabara H, et al. Machine-learning classification using neuroimaging data in schizophrenia, autism, ultra-high risk and first-episode psychosis. Transl Psychiatry. 2020 Aug 17;10(1):278. 31. 31.Zhu Y, Nakatani H, Yassin W, Maikusa N, Okada N, Kunimatsu A, et al. Application of a Machine Learning Algorithm for Structural Brain Images in Chronic Schizophrenia to Earlier Clinical Stages of Psychosis and Autism Spectrum Disorder: A Multiprotocol Imaging Dataset Study. Schizophr Bull. 2022 Mar;48(3):563–74. 32. 32.Yassin W, Hoftman GD, Bergen SE, Del Re EC. Editorial: Diagnostic and prognostic brain-based biomarkers in psychosis spectrum. Front Psychiatry. 2023 Nov 22;14:1332447. 33. 33.Mirzakhanian H, Singh F, Cadenhead KS. Biomarkers in psychosis: an approach to early identification and individualized treatment. Biomark Med. 2014;8(1):51–7. 34. 34.Bolt LK, Amminger GP, Farhall J, McGorry PD, Nelson B, Markulev C, et al. Neurocognition as a predictor of transition to psychotic disorder and functional outcomes in ultra-high risk participants: Findings from the NEURAPRO randomized clinical trial. Schizophr Res. 2019 Apr;206:67–74. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2018.12.013&link_type=DOI) 35. 35.Seidman LJ, Shapiro DI, Stone WS, Woodberry KA, Ronzio A, Cornblatt BA, et al. Association of Neurocognition With Transition to Psychosis: Baseline Functioning in the Second Phase of the North American Prodrome Longitudinal Study. JAMA Psychiatry. 2016 Dec 1;73(12):1239. 36. 36.Hauser M, Zhang JP, Sheridan EM, Burdick KE, Mogil R, Kane JM, et al. Neuropsychological Test Performance to Enhance Identification of Subjects at Clinical High Risk for Psychosis and Be Most Promising for Predictive Algorithms for Conversion to Psychosis: A Meta-Analysis. J Clin Psychiatry. 2017 Jan 25;78(01):e28–40. 37. 37.Catalan A, Salazar De Pablo G, Aymerich C, Damiani S, Sordi V, Radua J, et al. Neurocognitive Functioning in Individuals at Clinical High Risk for Psychosis: A Systematic Review and Meta-analysis. JAMA Psychiatry. 2021 Aug 1;78(8):859. 38. 38.Piskulic D, Barbato M, Liu L, Addington J. Pilot study of cognitive remediation therapy on cognition in young people at clinical high risk of psychosis. Psychiatry Res. 2015 Jan 30;225(1–2):93–8. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.psychres.2014.10.021&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25467705&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 39. 39.Hooker CI, Carol EE, Eisenstein TJ, Yin H, Lincoln SH, Tully LM, et al. A pilot study of cognitive training in clinical high risk for psychosis: initial evidence of cognitive benefit. Schizophr Res. 2014 Aug;157(1–3):314–6. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2014.05.034&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24954429&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 40. 40.Addington J, Cadenhead KS, Cornblatt BA, Mathalon DH, McGlashan TH, Perkins DO, et al. North American Prodrome Longitudinal Study (NAPLS 2): Overview and recruitment. Schizophr Res. 2012 Dec;142(1–3):77–82. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2012.09.012&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23043872&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 41. 41.McGlashan TH, Walsh B, Woods S. The psychosis-risk syndrome: handbook for diagnosis and follow-up. New York: Oxford University Press; 2010. 243 p. 42. 42.First MB, Spitzer RL, Gibbon M, Williams JBW, Davies M, Borus J, et al. The Structured Clinical Interview for DSM-III-R Personality Disorders (SCID-II). Part II: Multi-Site Test-Retest Reliability Study. J Personal Disord. 1995 Jun;9(2):92–104. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1521/pedi.1995.9.2.92&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1995RL33100002&link_type=ISI) 43. 43.First MB, Williams JBW, Karg RS, Spitzer RL. SCID-5-CV: Structured Clinical Interview for DSM-5 Disorders: clinician version. Arlington, VA: American Psychiatric Association Publishing; 2016. 95 p. 44. 44.Aas IM. Guidelines for rating Global Assessment of Functioning (GAF). Ann Gen Psychiatry. 2011;10(1):2. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21251305&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 45. 45.Cornblatt BA, Auther AM, Niendam T, Smith CW, Zinberg J, Bearden CE, et al. Preliminary Findings for Two New Measures of Social and Role Functioning in the Prodromal Phase of Schizophrenia. Schizophr Bull. 2007 Mar 19;33(3):688–702. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/schbul/sbm029&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17440198&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000247241700012&link_type=ISI) 46. 46.Wechsler D. Wechsler Abbreviated Scale of Intelligence [Internet]. 2012 [cited 2023 Sep 14]. Available from: [http://doi.apa.org/getdoi.cfm?doi=10.1037/t15170-000](http://doi.apa.org/getdoi.cfm?doi=10.1037/t15170-000) 47. 47.Seidman LJ, Meyer EC, Giuliano AJ, Breiter HC, Goldstein JM, Kremen WS, et al. Auditory working memory impairments in individuals at familial high risk for schizophrenia. Neuropsychology. 2012 May;26(3):288–303. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037/a0027970&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22563872&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 48. 48.Wide Range Achievement Test - 4. In: SpringerReference [Internet]. Springer-Verlag; Available from: doi:10.1007/springerreference_184831 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/springerreference_184831&link_type=DOI) 49. 49.Keefe R. The Brief Assessment of Cognition in Schizophrenia: reliability, sensitivity, and comparison with a standard neurocognitive battery. Schizophr Res. 2004 Jun 1;68(2–3):283–97. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2003.09.011&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15099610&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000221172900016&link_type=ISI) 50. 50.Belkonen S. Hopkins Verbal Learning Test. In: Kreutzer JS, DeLuca J, Caplan B, editors. Encyclopedia of Clinical Neuropsychology [Internet]. New York, NY: Springer New York; 2011 [cited 2024 Jul 1]. p. 1264–5. Available from: [http://link.springer.com/10.1007/978-0-387-79948-3\_1127](http://link.springer.com/10.1007/978-0-387-79948-3_1127) 51. 51.Atkins SM, Sprenger AM, Colflesh GJH, Briner TL, Buchanan JB, Chavis SE, et al. Measuring Working Memory Is All Fun and Games: A Four-Dimensional Spatial Game Predicts Cognitive Task Performance. Exp Psychol. 2014 Jan 1;61(6):417–38. 52. 52.Mueller SG, Weiner MW, Thal LJ, Petersen RC, Jack CR, Jagust W, et al. Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI). Alzheimers Dement. 2005 Jul;1(1):55–66. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jalz.2005.06.003&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17476317&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 53. 53.Dale AM, Fischl B, Sereno MI. Cortical Surface-Based Analysis. NeuroImage. 1999 Feb;9(2):179–94. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/nimg.1998.0395&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=9931268&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000078608900001&link_type=ISI) 54. 54.Fischl B. Automatically Parcellating the Human Cerebral Cortex. Cereb Cortex. 2004 Jan 1;14(1):11–22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/cercor/bhg087&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14654453&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000187219900002&link_type=ISI) 55. 55.Cannon TD, Sun F, McEwen SJ, Papademetris X, He G, Van Erp TGM, et al. Reliability of neuroanatomical measurements in a multisite longitudinal study of youth at risk for psychosis. Hum Brain Mapp. 2014 May;35(5):2424–34. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/hbm.22338&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23982962&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 56. 56.Chung Y, Allswede D, Addington J, Bearden CE, Cadenhead K, Cornblatt B, et al. Cortical abnormalities in youth at clinical high-risk for psychosis: Findings from the NAPLS2 cohort. NeuroImage Clin. 2019;23:101862. 57. 57.Miller TJ, McGlashan TH, Rosen JL, Cadenhead K, Ventura J, McFarlane W, et al. Prodromal Assessment With the Structured Interview for Prodromal Syndromes and the Scale of Prodromal Symptoms: Predictive Validity, Interrater Reliability, and Training to Reliability. Schizophr Bull. 2003 Jan 1;29(4):703–15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/oxfordjournals.schbul.a007040&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=14989408&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188873800007&link_type=ISI) 58. 58.Brock G, Pihur V, Datta S, Datta S. **clValid** : An *R* Package for Cluster Validation. J Stat Softw [Internet]. 2008 [cited 2024 Jul 5];25(4). Available from: [http://www.jstatsoft.org/v25/i04/](http://www.jstatsoft.org/v25/i04/) 59. 59.Ullmann T, Hennig C, Boulesteix A. Validation of cluster analysis results on validation data: A systematic framework. WIREs Data Min Knowl Discov. 2022 May;12(3):e1444. 60. 60.Van Rheenen TE, Cropley V, Zalesky A, Bousman C, Wells R, Bruggemann J, et al. Widespread Volumetric Reductions in Schizophrenia and Schizoaffective Patients Displaying Compromised Cognitive Abilities. Schizophr Bull. 2018 Apr 6;44(3):560–74. 61. 61.Weinberg D, Lenroot R, Jacomb I, Allen K, Bruggemann J, Wells R, et al. Cognitive Subtypes of Schizophrenia Characterized by Differential Brain Volumetric Reductions and Cognitive Decline. JAMA Psychiatry. 2016 Dec 1;73(12):1251. 62. 62.Ryan AT, Addington J, Bearden CE, Cadenhead KS, Cornblatt BA, Mathalon DH, et al. Latent class cluster analysis of symptom ratings identifies distinct subgroups within the clinical high risk for psychosis syndrome. Schizophr Res. 2018 Jul;197:522–30. 63. 63.Hardin JW, Hilbe JM. Generalized estimating equations. Boca Raton, Fla.: Chapman & Hall/CRC; 2003. 222 p. 64. 64.Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73(1):13–22. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/biomet/73.1.13&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1986A734100002&link_type=ISI) 65. 65.Pan W. Akaike’s Information Criterion in Generalized Estimating Equations. Biometrics. 2001 Mar;57(1):120–5. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.0006-341X.2001.00120.x&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11252586&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000167376900015&link_type=ISI) 66. 66.Green MJ, Cairns MJ, Wu J, Dragovic M, Jablensky A, Tooney PA, et al. Genome-wide supported variant MIR137 and severe negative symptoms predict membership of an impaired cognitive subtype of schizophrenia. Mol Psychiatry. 2013 Jul;18(7):774–80. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/mp.2012.84&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22733126&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 67. 67.Wenzel J, Haas SS, Dwyer DB, Ruef A, Oeztuerk OF, Antonucci LA, et al. Cognitive subtypes in recent onset psychosis: distinct neurobiological fingerprints? Neuropsychopharmacology. 2021 Jul;46(8):1475–83. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41386-021-00963-1&link_type=DOI) 68. 68.Cobia DJ, Csernansky JG, Wang L. Cortical thickness in neuropsychologically near-normal schizophrenia. Schizophr Res. 2011 Dec;133(1–3):68–76. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.schres.2011.08.017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21981933&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 69. 69.Benassi M, Garofalo S, Ambrosini F, Sant’Angelo RP, Raggini R, De Paoli G, et al. Using Two-Step Cluster Analysis and Latent Class Cluster Analysis to Classify the Cognitive Heterogeneity of Cross-Diagnostic Psychiatric Inpatients. Front Psychol. 2020 Jun 10;11:1085. 70. 70.Dwyer DB, Buciuman MO, Ruef A, Kambeitz J, Sen Dong M, Stinson C, et al. Clinical, Brain, and Multilevel Clustering in Early Psychosis and Affective Stages. JAMA Psychiatry. 2022 Jul 1;79(7):677. 71. 71.DeRosse P, Karlsgodt KH. Examining the Psychosis Continuum. Curr Behav Neurosci Rep. 2015 Jun;2(2):80–9. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s40473-015-0040-7&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26052479&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 72. 72.Allott K, Wood SJ, Yuen HP, Yung AR, Nelson B, Brewer WJ, et al. Longitudinal Cognitive Performance in Individuals at Ultrahigh Risk for Psychosis: A 10-year Follow-up. Schizophr Bull. 2019 Sep 11;45(5):1101–11. 73. 73.Heinrichs RW, Miles AA, Smith D, Zargarian T, Vaz SM, Goldberg JO, et al. Cognitive, clinical, and functional characteristics of verbally superior schizophrenia patients. Neuropsychology. 2008 May;22(3):321–8. [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18444710&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) 74. 74.Al-Shargie F, Tariq U, Mir H, Alawar H, Babiloni F, Al-Nashash H. Vigilance Decrement and Enhancement Techniques: A Review. Brain Sci. 2019 Jul 26;9(8):178. 75. 75.Cornblatt BA, Carrión RE, Addington J, Seidman L, Walker EF, Cannon TD, et al. Risk factors for psychosis: impaired social and role functioning. Schizophr Bull. 2012 Nov;38(6):1247–57. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/schbul/sbr136&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22080497&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000310944500020&link_type=ISI) 76. 76.Fusar-Poli P, Borgwardt S, Bechdolf A, Addington J, Riecher-Rössler A, Schultze-Lutter F, et al. The Psychosis High-Risk State: A Comprehensive State-of-the-Art Review. JAMA Psychiatry. 2013 Jan 1;70(1):107. 77. 77.Lewandowski KE, Cohen BM, Öngur D. Evolution of neuropsychological dysfunction during the course of schizophrenia and bipolar disorder. Psychol Med. 2011 Feb;41(2):225–41. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1017/S0033291710001042&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=20836900&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2024%2F08%2F08%2F2024.08.07.24311240.atom)