Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Predicting one-year outcome in first episode psychosis using machine learning

  • Samuel P. Leighton,

    Roles Formal analysis, Methodology, Writing – original draft

    Affiliation Institute of Health and Wellbeing, University of Glasgow, Glasgow, United Kingdom

  • Rajeev Krishnadas ,

    Roles Conceptualization, Formal analysis, Methodology, Supervision, Validation, Writing – original draft

    Rajeev.Krishnadas@glasgow.ac.uk

    Affiliations Institute of Neuroscience and Psychology, University of Glasgow, Glasgow, United Kingdom, ESTEEM First Episode Psychosis Service, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom

  • Kelly Chung,

    Roles Data curation, Project administration

    Affiliation Institute of Health and Wellbeing, University of Glasgow, Glasgow, United Kingdom

  • Alison Blair,

    Roles Data curation, Funding acquisition

    Affiliation ESTEEM First Episode Psychosis Service, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom

  • Susie Brown,

    Roles Data curation

    Affiliation ESTEEM First Episode Psychosis Service, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom

  • Suzy Clark,

    Roles Data curation

    Affiliation ESTEEM First Episode Psychosis Service, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom

  • Kathryn Sowerbutts,

    Roles Data curation

    Affiliation ESTEEM First Episode Psychosis Service, NHS Greater Glasgow and Clyde, Glasgow, United Kingdom

  • Matthias Schwannauer,

    Roles Funding acquisition, Methodology, Writing – review & editing

    Affiliation Department of Clinical & Health Psychology, University of Edinburgh, Edinburgh, United Kingdom

  • Jonathan Cavanagh ,

    Roles Writing – review & editing

    ‡ These authors are joint senior authors on this work.

    Affiliation Institute of Health and Wellbeing, University of Glasgow, Glasgow, United Kingdom

  • Andrew I. Gumley

    Roles Conceptualization, Data curation, Funding acquisition, Writing – review & editing

    ‡ These authors are joint senior authors on this work.

    Affiliation Institute of Health and Wellbeing, University of Glasgow, Glasgow, United Kingdom

Abstract

Background

Early illness course correlates with long-term outcome in psychosis. Accurate prediction could allow more focused intervention. Earlier intervention corresponds to significantly better symptomatic and functional outcomes. Our study objective is to use routinely collected baseline demographic and clinical characteristics to predict employment, education or training (EET) status, and symptom remission in patients with first episode psychosis (FEP) at one-year.

Methods and findings

83 FEP patients were recruited from National Health Service (NHS) Glasgow between 2011 and 2014 to a 24-month prospective cohort study with regular assessment of demographic and psychometric measures. An external independent cohort of 79 FEP patients were recruited from NHS Glasgow and Edinburgh during a 12-month study between 2006 and 2009. Elastic net regularised logistic regression models were built to predict binary EET status, period and point remission outcomes at one-year on 83 Glasgow patients (training dataset). Models were externally validated on an independent dataset of 79 patients from Glasgow and Edinburgh (validation dataset). Only baseline predictors shared across both cohorts were made available for model training and validation. After excluding participants with missing outcomes, models were built on the training dataset for EET status, period and point remission outcomes and externally validated on the validation dataset. Models predicted EET status, period and point remission with receiver operating curve (ROC) area under the curve (AUC) performances of 0.876 (95%CI: 0.864, 0.887), 0.630 (95%CI: 0.612, 0.647) and 0.652 (95%CI: 0.635, 0.670) respectively. Positive predictors of EET included baseline EET and living with spouse/children. Negative predictors included higher PANSS suspiciousness, hostility and delusions scores. Positive predictors for symptom remission included living with spouse/children, and affective symptoms on the Positive and Negative Syndrome Scale (PANSS). Negative predictors of remission included passive social withdrawal symptoms on PANSS. A key limitation of this study is the small sample size (n) relative to the number of predictors (p), whereby p approaches n. The use of elastic net regularised regression rather than ordinary least squares regression helped circumvent this difficulty. Further, we did not have information for biological and additional social variables, such as nicotine dependence, which observational studies have linked to outcomes in psychosis.

Conclusions and relevance

Using advanced statistical machine learning techniques, we provide the first externally validated evidence, in a temporally and geographically independent cohort, for the ability to predict one-year EET status and symptom remission in individual FEP patients.

Introduction

Initial clinical presentation and early illness course correlates with long term outcome in first episode psychosis (FEP).[1] The ability to accurately predict outcome at an individual level would allow more focussed intervention. For patients, a meaningful outcome is often more than simple symptom remission but optimising developmental pathways in early psychosis including vocational and educational outcomes.[24]

FEP most often presents at the critical stage in a young person’s life when community, societal roles, educational and vocational achievement are being forged. Consequently, its onset triggers a precipitous decline in education and employment. Missing out on such vocational opportunities, as enshrined in Article 23 of the Universal Declaration of Human Rights[5], impairs not only financial independence but also societal inclusion, the forming of relationships and self-actualisation.[6]

Those with FEP want to work. Evidence suggests that finding employment is more important than any specific mental health intervention.[7] Yet less than a quarter of those with severe mental illness such as schizophrenia receive vocational rehabilitation.[8,9] Clinicians’ attitudes towards their patients’ ability to return to work and their associated estimation of risks are often ambivalent; this is reinforced by a continued decline in employment rates in the years following contact with mental health services.[10]

Attitudes are changing with a new focus on employment, education and training (EET] recognised by the Meaningful Lives international consensus statement for FEP.[11] Successful intervention strategies exist, such as the Individual Placement and Support approach in FEP, which have been evidenced to show employment and education rates of 69% as compared with 35% for controls.[10] If we can correctly identify those with poor EET outcomes at their initial presentation, we could apply such vocational interventions at an earlier stage. Existing evidence suggests this would have a much greater chance of success.[12,13]

At present, our knowledge of factors, which predict outcome in FEP is incomplete. A recent meta-analysis by Lally et al provides the first robust evidence of remission and recovery outcomes in FEP but was unable to establish the key clinical or demographic factors, which discriminated between patients. Specifically, it did not replicate an earlier meta-analysis showing an association between longer duration of untreated psychosis (DUP) and worse outcome in FEP.[14,15] A systematic review identified poor premorbid adjustment, history of developmental disorder, greater symptom severity at baseline and longer DUP to be the most replicated predictors of poor clinical, functional, cognitive, and biological outcomes in early onset psychosis.[16] Within the Scottish population specifically, socioeconomic deprivation and ethnicity have been shown to be risk factors for developing psychosis, while substance misuse, longer first admission, younger age of onset and male gender increased the risk of poor long-term outcome.[1719] However, such group level differences cannot be extrapolated to individuals–the ‘ecological fallacy’[20]–nor can observational results be readily equated with causation or accurate predictions.

Advanced machine learning techniques have potential to revolutionise medicine by looking at causation and the prediction of individual patient outcome.[21] Within psychiatry, machine learning has been already applied to symptomatic and neuroimaging data to classify individuals into diagnostic categories and to predict response to ECT, with high levels of accuracy.[2224]. Koutsouleris et al employed machine learning to predict 4 and 52-week outcome (Global Assessment of Function ≥65) in FEP to a 75% and 73.8% test-fold balanced accuracy on repeated nested internal cross-validation. The authors suggest that before employing a machine learning model “into real-world care, replication is needed in external first episode samples”.[25]

Because of the practicalities surrounding the requirement to recruit an additional and independent cohort of patients, few studies assess predictive model generalisability via temporal and/or geographical validation. External validation in a plausibly related population is a considerably stronger test of predictive models. It assesses model transportability to an untested situation rather than simply reproducibility alone.[26] As outlined by Koutsouleris et al, this is considered essential before applying the predictive model to clinical practice.[25,27]

Our study is based on the hypothesis that it is possible to predict outcome in terms of employment education or training (EET) status or symptom remission at one-year in FEP using baseline (pertaining to the time around study entrance) demographic and clinical psychometric biomarkers. We are not aware of any study to date, which has employed predictive modelling of EET status in a FEP cohort. We then assess the generalisability of our predictive model on an external cohort of patients.

Additionally, we seek to find the relative importance of the individual variables in contributing to prediction performance. These could have clinical relevance and may inform future research. Focussing on patients with FEP, who have not had extensive psychotherapeutic or pharmacological interventions, helps mitigate against potential confounders.

Methods

The study was approved by the West of Scotland Research Ethics Committee. Referece number: 11/AL/0247 and R and D approval (GN11CP130). Written informed consent was obtained from all participants in the study.

Participants and study design

The Compassionate Recovery: Individualised Support in early Psychosis (CR:ISP) study was a 24-month non-randomised prospective study of individuals with FEP. An Integrated Care Pathway for FEP was implemented which facilitated regular routine assessment of demographic and psychometric measures at time 0, month 3, month 6 and month 12.

Recruitment took place in mental health services in the NHS Greater Glasgow & Clyde (NHSGGC) health board between 2011 and 2014. To be considered for inclusion participants had to be: (a) in-patients or out-patients with (b) first presentation to mental health services for psychosis, (c) ICD-10 diagnosis of non-organic psychosis. 83 participants were entered into the study. The CR:ISP patients formed the training dataset.

The validation dataset was formed of 79 FEP patients recruited to an earlier study (1 September 2006 to 31 August 2009), which took place in mental health services in the NHS in Glasgow and in Edinburgh. Demographic and psychometric measures were assessed at time 0, month 6 and month 12. Participants and study design have been described previously (see S1 File).[28]

Predictor and outcome measures

Baseline predictors shared across both cohorts were made available for model training and external validation. Demographic predictors included admission to hospital, age, citizenship, educational attainment, ethnicity, gender, household composition, EET at baseline, parental status, relationship status, accommodation, alcohol use, and recreational drug use status. Psychometric clinical predictors included individual PANSS items[29] and ordinalised depression rating (training cohort used Hospital Anxiety and Depression Scale (HADS), validation cohort used Beck’s Depression Inventory II (BDI-II)–scores were categorised as none, mild, moderate or severe according to published cut offs.[30,31]

The one-year binary outcome measures included EET status at one-year, PANSS point remission (meeting Andreasen PANSS criteria at month 12), and period remission (meeting Andreasen PANSS criteria at both month 6 and month 12). Andreasen et al defined remission as scores of less than or equal to 3 in PANSS items P1 Delusions, P2 Conceptual Disorganisation, P3 Hallucinatory Behaviour, N1 Blunted Affect, N4 Apathetic Social Withdrawal, N6 Lack of Spontaneity and G9 Unusual Thought Content, present for a period of at least 6 months.[32]

Statistical analysis

All statistical analyses were carried out within the R programming environment.[33]

Between-group (training versus validation) differences were tested using Welch’s independent t-test for continuous variables and Pearson’s chi-squared or Fisher’s exact test for categorical variables. Bonferroni correction was employed for multiple comparisons.

Machine learning analysis was carried out using the ‘Caret’ package.[34] R code is available in S2 File. The anonymised training and validation datasets are available in S3 File.

During pre-processing, data were centred and scaled, variables with zero variance and near-zero variance removed and missing data imputed using k (5) nearest neighbour imputation, prior to model construction. In psychiatric research, data are commonly seen as ‘missing not at random’. For example, with drop-out more frequent in those who have relapsed. Ignoring this leads to systematic attrition bias in any inference drawn, in addition to a loss of power. Imputation increases power and reduces bias.[35,36]

A logistic regression model was fit by elastic net regularisation with variable selection in ‘Caret’ with the ‘glmnet’ package.[37] This was following initial scoping utilising alternative classifiers including linear and radial support vector machines, random forest, naïve Bayes and linear discriminant analysis. Glmnet fits a generalized linear model via penalised maximum likelihood.[38] The objective function for the penalised logistic regression uses the negative binomial log-likelihood:

The alpha (α) hyperparameter determines the balance between ridge (α = 0) and lasso (α = 1) penalty and the lambda (λ) hyperparameter determines the amount of penalty. Elastic net regression improves upon ordinary least squares via regularisation (shrinkage) of the estimated beta coefficients (β). This results in superior performance when number of predictors (p) approaches the number of subjects (N) (in such circumstances there is no unique least squares estimate), or in the presence multicollinearity by offsetting a small amount of bias with large reductions in variance. Further, a consequence of regularisation is feature selection, which improves model interpretability.[39]

We used n (5) fold repeated (100 times) cross validation to train and tune our model over a grid of alpha and lambda hyperparameters on our training dataset (see S1 File). All splits were balanced by outcome class. The model was refit on the whole training set (with the best performing hyperparameters) to calculate its standardised beta coefficients. The exponentials of which, the odds ratios, are presented.

We estimated the discriminative performance of the models using receiver operating curve (ROC) area under the curve (AUC). ROCAUC is independent of class distribution and selects for discriminative models with false and true positive rates significantly better than random chance.

External validity was established by using the model built on the training dataset to predict the probability of the outcome class and comparing it to the actual outcome class in the validation dataset, then, calculating the ROC AUC performance metric using the ‘pROC’ package.[40,41] The 95%CI of the ROCAUC was established based on U-statistic theory using the ‘clinfun’ package. [41,42] In addition, permutation testing was used to confirm significance, whereby, the actual ROCAUC was compared to its null ROCAUC distribution derived by testing the model on randomly permutated class outcomes, repeated 10000 times. The p value is the proportion of permutated values greater than or equal to the actual value.[43] The model accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) are presented based on the point closest to the top-left of the ROC plot. The formulae for these quantities have been described previously.[44]

The above procedure was repeated for each of the three dependent outcome variables.

Results

Table 1 summarises the predictor and outcome data for the training and validation cohorts. The only statistically significantly different predictor variable between the cohorts was accommodation. More people lived in rented accommodation in the training cohort, and more people lived with their family in the validation cohort (p<0.001). There were no statistically significant differences in each of the three outcomes between the cohorts.

thumbnail
Table 1. Characteristics of training cohort and validation cohort.

https://doi.org/10.1371/journal.pone.0212846.t001

Table 2 summarises the EET status, point remission, and period remission models’ externally validated performance. Fig 1 show each model’s respective ROC curve–EET status (A), point remission (B) and period remission (C). All three models’ ROCAUC and 95%CI were significantly better than chance. The model predicting EET status had particularly high performance with a ROCAUC of 0.876 (95%CI: 0.864, 0.887; p = <0.001).

thumbnail
Fig 1.

ROC curves for model of one-year EET status (A), point remission (B), and period remission (C). ROC = Receiver Operating Characteristic; EET = Employment, Education or Training.

https://doi.org/10.1371/journal.pone.0212846.g001

thumbnail
Table 2. One-year EET status, point remission, and period remission models’ externally validated performance metrics.

https://doi.org/10.1371/journal.pone.0212846.t002

Figs 24 summarise the feature selected predictors and their odds ratios for the EET, point remission and period remission models.

thumbnail
Fig 2. One-year EET status model’s predictors and respective odds ratios.

Features selected by elastic net regularisation and respective odds ratios for one-year EET status model. EET = Employment, Education or Training; PANSS = Positive And Negative Symptoms Scale.

https://doi.org/10.1371/journal.pone.0212846.g002

thumbnail
Fig 3. One-year point remission model’s predictors and respective odds ratios.

Features selected by elastic net regularisation and respective odds ratios for one-year point remission model. PANSS = Positive And Negative Symptoms Scale.

https://doi.org/10.1371/journal.pone.0212846.g003

thumbnail
Fig 4. One-year period remission model’s predictors and respective odds ratios.

Features selected by elastic net regularisation and respective odds ratios for one-year period remission model. PANSS = Positive And Negative Symptoms Scale.

https://doi.org/10.1371/journal.pone.0212846.g004

Discussion

We believe our study to be the first externally validated evidence, in a temporally and geographically independent cohort, for predictive modelling in FEP at an individual patient level. This builds on existing studies of group-level differences and the elegant work of Koutsouleris et al, which outlined the first internally validated evidence of the ability to predict functional outcome in individual patients with FEP.[25]

Our results demonstrate the ability to predict both symptom remission and more importantly, functioning (in employment, education or training). The performance of our EET model was particularly robust, with an ability to accurately predict the one-year EET outcome in more than 85% of patients. This is clinically relevant, especially in the context of the 2010 international first episode psychosis “Meaningful Lives” consensus statement on unemployment as a source of significant social disability.[11]

Predictive statistics focus on the predictive performance of models on new ‘unseen’ data. This helps avoid the inherent problems with conventional statistical techniques, which only explore relationships within the study dataset. Few such relationships are externally validated and, of those that are, many lose significance.[45,46]

Because of the inherent feature selection of regularised logistic regression classifiers, which avoids overfitting in ill-conditioned regression problems where the number of variables is close to the sample size, our models are sparse and uniquely interpretable. Further, without a priori selection of variables but instead allowing the classifier to select features, we avoid the introduction of additional observer bias.[37,39]

The period remission model shares all the same predictors that are selected for the point remission model. However, the period remission model selects for additional predictors but with a reduced performance. This suggests possible overfitting on the training dataset, whereby the signal to noise variable ratio captured by the model is reduced, hence poorer generalisability to the independent validation dataset.[47]

Focussing on the shared predictors for both point and period remission, higher scores on the individual PANSS items P4 Excitement and G6 Depression correspond to a positive outcome. Over a century later, this continues to lend credence to a central tenant of the Kraepelian dichotomy–that psychosis in the context of phenomenologically prominent affective presentations tends to have a better chance of recovery.[48] Recent research in this area is conflicting and hampered by the few prospective studies looking at the influence of affective symptoms on outcome.[49]

PANSS item N4 passive or apathetic social withdrawal predicts poor one-year remission outcomes, a so called negative symptom frequently found in the classical schizophrenia syndrome.[50] We have previously identified social and interpersonal factors as influencing outcome in FEP.[28,51] Such initial negative symptoms persist longer and are more difficult to treat, with limited evidence for benefit with conventional pharmacotherapy.[52] In contrast, there is better evidence in support of psychosocial interventions. For example, cognitive behavioural therapy has been shown to be effective up to 24-months.[53] Recent meta-analysis supports the use of interventions which enhance social cognition and interpersonal skills in the treatment of negative symptoms in psychosis.[54] Our finding highlights the requirement for a multimodal approach to treatment of FEP earlier in the illness course. Further, our model selects variables representative of higher social support including living with spouse and children and not renting as predictive of good symptomatic outcome which are consistent with established literature.[55]

Similarly, living with spouse and children is predictive of a good one-year EET outcome. Again, consistent with established literature, our model selects for good baseline EET and higher educational attainment as positive predictors of positive one-year EET outcome.[55] Interestingly, different individual PANSS items–P1 Delusions, P6 Suspiciousness and P7 Hostility–emerged as negative predictors of one-year EET. Such features may hinder service engagement and attachment which is known to influence recovery and adaption in FEP.[28] That distinct variables predict EET compared to those predicting remission, suggests the outcomes are capturing different yet complementary areas of recovery.

Our focus on EET status is timely and reflects the growing recognition within the literature that this functional outcome is at least as important as symptom remission. Young people not being in employment, education or training [NEET] has a considerable economic cost, accounting for a loss of €153bn or 1.2% of the European Union’s gross domestic product. Prolonged economic inactivity leads to serious mental health sequelae with higher rates of depression, alcohol or substance misuse and increased suicide attempts. Reducing youth unemployment remains a policy priority in high-income countries including the USA and Europe but the quality of evidence available to legislators surrounding NEET status is low.[56] We suggest that our robust finding related to the prediction of NEET outcome in first episode psychosis is the first high quality externally validated evidence.

The use of an externally (temporally and geographically) validated dataset with the same predictor and outcome variables is a major strength. Further, our models are interpretable, employ easily obtained variables and, especially in relation to prediction of EET status, are robust. However, our study has limitations. Small sample size relative to the number of predictors is a potential concern. Plus, given its post-hoc nature, our analysis may have been underpowered. Using elastic net regularised regression, which simulation studies have shown outperforms conventional ordinary least squares or lasso while still enjoying the interpretability of feature selection missing with ridge, helps circumvent such concerns. Another concern may be that our cohorts have a statistically significant difference in accommodation status with more participants renting and less living in private accommodation with family in the training cohort as compared with the validation cohort. Similarly, changes in the wider macroeconomic environment, such as those following the 2008 financial crisis which would have been felt by the training cohort only, are known to influence mental health and functional outcomes.[57] However, we would contend that such differences are in fact a strength of our study and reinforces the importance of external validation. Despite these issues, and that our model includes the accommodation status ‘renting’ as a negative predictor of EET outcome, our models still had a statistically significant and robust predictive performance in a geographically and temporally distinct cohort. However, we are not aware of published studies on how expert clinicians perform at predicting these outcomes which would be a required benchmark against which to compare new machine learning models. A further limitation is the lack of additional social variables, such as nicotine dependence[58], which observational studies have linked to outcomes in psychosis, or any cognitive and of biological markers of illness, including neuroimaging and blood markers, which previous studies have highlighted as important and may have improved predictive performance.[59]

These limitations should be addressed when designing future studies. A required next step prior to implementation into routine clinical practice would be to establish whether, by the accurate identification of individuals who will have poor outcomes, we can meaningful intervene to improve their prognosis. Further, few machine learning studies prospectively apply and update a prediction model to new patients in a clinical setting. Clinical problems are not stationary and intelligent algorithms should adapt to changing circumstance and response to treatment. Current research, including this present study, focuses on a classifier’s performance rather than its clinical usefulness. In a clinical context, often false negatives (under treatment) are more harmful than false positives (over treatment). Statistical performance should be weighted by the net benefit of potential harm resulting from either over or undertreatment versus the benefit of correct diagnosis, for example, by employing decision curve analysis.[27]

Conclusions

We have demonstrated that it is possible to accurately predict one-year symptomatic and functional employment education or training status in first episode psychosis. This has not been reported previously in an externally validated cohort. We propose that our results represent an important and exciting next step in unlocking the potential of predictive modelling in psychiatric illness.

Supporting information

S3 File. Anonymised training and validation datasets.

https://doi.org/10.1371/journal.pone.0212846.s003

(CSV)

References

  1. 1. Petersen L, Nordentoft M, Jeppesen P, Ohlenschaeger J, Thorup A, Christensen TØ, et al. Improving 1-year outcome in first-episode psychosis: OPUS trial. Br J Psychiatry Suppl [Internet]. 2005 Aug 1 [cited 2018 Jan 18];48(48):s98–103. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16055817
  2. 2. Lambert M, Schimmelmann BG, Naber D, Schacht A, Karow A, Wagner T, et al. The Journal of clinical psychiatry. [Internet]. Vol. 67, The Journal of Clinical Psychiatry. [Physicians Postgraduate Press]; 2006 [cited 2018 Jan 18]. 1690–1697 p. Available from: http://www.psychiatrist.com/jcp/article/Pages/2006/v67n11/v67n1104.aspx
  3. 3. Bond GR, Drake RE, Luciano A. Employment and educational outcomes in early intervention programmes for early psychosis: a systematic review. Epidemiol Psychiatr Sci [Internet]. 2015 Oct 14 [cited 2018 Feb 13];24(05):446–57. Available from: http://www.journals.cambridge.org/abstract_S2045796014000419
  4. 4. Tapfumaneyi A, Johnson S, Joyce J, Major B, Lawrence J, Mann F, et al. Predictors of vocational activity over the first year in inner-city early intervention in psychosis services. Early Interv Psychiatry [Internet]. 2015 Dec 1 [cited 2018 Feb 13];9(6):447–58. Available from: http://doi.wiley.com/10.1111/eip.12125 pmid:26104585
  5. 5. United Nations. Universal Declaration of Human Rights. 1948 [cited 2018 Jan 18]; Available from: http://www.ohchr.org/EN/UDHR/Documents/UDHR_Translations/eng.pdf
  6. 6. Maslow AH. A theory of human motivation. Psychol Rev [Internet]. 1943 [cited 2018 Jan 18];50(4):370–96. Available from: http://content.apa.org/journals/rev/50/4/370
  7. 7. Shepherd G. Perspectives on schizophrenia: A survey of user, family carer and professional views regarding effective care. J Ment Heal [Internet]. 1995 Jan 6 [cited 2018 Jan 18];4(4):403–22. Available from: http://www.tandfonline.com/doi/full/10.1080/09638239550037451
  8. 8. Lehman AF, Steinwachs DM. Patterns of usual care for schizophrenia: initial results from the Schizophrenia Patient Outcomes Research Team (PORT) Client Survey. Schizophr Bull [Internet]. 1998 [cited 2018 Feb 13];24(1):11–20; discussion 20–32. Available from: http://www.ncbi.nlm.nih.gov/pubmed/9502543 pmid:9502543
  9. 9. Bertram M, Howard L. Employment status and occupational care planning for people using mental health services. Psychiatr Bull [Internet]. 2006 Feb 2 [cited 2018 Feb 13];30(02):48–51. Available from: https://www.cambridge.org/core/product/identifier/S0955603600030257/type/journal_article
  10. 10. Rinaldi M, Killackey E, Smith J, Shepherd G, Singh SP, Craig T. First episode psychosis and employment: A review. Int Rev Psychiatry [Internet]. 2010 Jan 26 [cited 2018 Jan 18];22(2):148–62. Available from: http://www.tandfonline.com/doi/full/10.3109/09540261003661825 pmid:20504055
  11. 11. International First Episode Vocational Recovery (iFEVR) Group. Meaningful lives: Supporting young people with psychosis in education, training and employment: an international consensus statement. Early Interv Psychiatry [Internet]. 2010 Nov [cited 2018 Jan 18];4(4):323–6. Available from: http://www.ncbi.nlm.nih.gov/pubmed/21043208 pmid:21043208
  12. 12. Killackey EJ, Jackson HJ, Gleeson J, Hickie IB, Mcgorry PD. Exciting Career Opportunity Beckons! Early Intervention and Vocational Rehabilitation in First-Episode Psychosis: Employing Cautious Optimism. Aust New Zeal J Psychiatry [Internet]. 2006 Nov 17 [cited 2018 Jan 18];40(11–12):951–62. Available from: http://journals.sagepub.com/doi/10.1080/j.1440-1614.2006.01918.x
  13. 13. Craig T, Shepherd G, Rinaldi M, Smith J, Carr S, Preston F, et al. Vocational rehabilitation in early psychosis: cluster randomised trial. Br J Psychiatry [Internet]. 2014 Aug 10 [cited 2018 Feb 13];205(2):145–50. Available from: http://www.ncbi.nlm.nih.gov/pubmed/24855129 pmid:24855129
  14. 14. Lally J, Ajnakina O, Stubbs B, Cullinane M, Murphy KC, Gaughran F, et al. Remission and recovery from first-episode psychosis in adults: systematic review and meta-analysis of long-term outcome studies. Br J Psychiatry [Internet]. 2017 Dec 1 [cited 2018 Jan 18];211(6):350–8. Available from: http://www.ncbi.nlm.nih.gov/pubmed/28982659 pmid:28982659
  15. 15. Marshall M, Lewis S, Lockwood A, Drake R, Jones P, Croudace T. Association Between Duration of Untreated Psychosis and Outcome in Cohorts of First-Episode Patients. Arch Gen Psychiatry [Internet]. 2005 Sep 1 [cited 2018 Jan 18];62(9):975. Available from: http://www.ncbi.nlm.nih.gov/pubmed/16143729 pmid:16143729
  16. 16. Díaz-Caneja CM, Pina-Camacho L, Rodríguez-Quiroga A, Fraguas D, Parellada M, Arango C. Predictors of outcome in early-onset psychosis: a systematic review. npj Schizophr [Internet]. 2015 Dec 1 [cited 2018 Jan 18];1(1):14005. Available from: http://www.nature.com/articles/npjschz20145
  17. 17. Srireddy P, Agnihotri A, Park J, Taylor J, Connolly M, Krishnadas R. Ethnicity, deprivation and psychosis: the Glasgow experience. Epidemiol Psychiatr Sci [Internet]. 2012 Sep 13 [cited 2018 Feb 15];21(03):311–6. Available from: http://www.journals.cambridge.org/abstract_S2045796012000352
  18. 18. Queirazza F, Semple DM, Lawrie SM. Transition to schizophrenia in acute and transient psychotic disorders. Br J Psychiatry [Internet]. 2014 Apr 1 [cited 2018 Feb 15];204(4):299–305. Available from: http://www.ncbi.nlm.nih.gov/pubmed/24357576
  19. 19. Alderson HL, Semple DM, Blayney C, Queirazza F, Chekuri V, Lawrie SM. Risk of transition to schizophrenia following first admission with substance-induced psychotic disorder: a population-based longitudinal cohort study. Psychol Med [Internet]. 2017 Oct 3 [cited 2018 Feb 15];47(14):2548–55. Available from: https://www.cambridge.org/core/product/identifier/S0033291717001118/type/journal_article pmid:28464965
  20. 20. Sedgwick P. Understanding the ecological fallacy. BMJ [Internet]. 2015 Sep 21 [cited 2018 Jan 18];351:h4773. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26391012 pmid:26391012
  21. 21. Darcy AM, Louie AK, Roberts LW. Machine Learning and the Profession of Medicine. JAMA [Internet]. 2016 Feb 9 [cited 2018 Jan 18];315(6):551. Available from: http://jama.jamanetwork.com/article.aspx?doi=10.1001/jama.2015.18421 pmid:26864406
  22. 22. Redlich R, Opel N, Grotegerd D, Dohm K, Zaremba D, Bürger C, et al. Prediction of Individual Response to Electroconvulsive Therapy via Machine Learning on Structural Magnetic Resonance Imaging Data. JAMA Psychiatry [Internet]. 2016 Jun 1 [cited 2018 Jan 18];73(6):557. Available from: http://archpsyc.jamanetwork.com/article.aspx?doi=10.1001/jamapsychiatry.2016.0316 pmid:27145449
  23. 23. Jauhar S, Krishnadas R, Nour MM, Cunningham-Owens D, Johnstone EC, Lawrie SM. Is there a symptomatic distinction between the affective psychoses and schizophrenia? A machine learning approach. Schizophr Res [Internet]. 2018 Jul 24 [cited 2018 Nov 22]; Available from: http://www.ncbi.nlm.nih.gov/pubmed/30054176
  24. 24. Supekar K, Cai W, Krishnadas R, Palaniyappan L, Menon V. Dysregulated Brain Dynamics in a Triple-Network Saliency Model of Schizophrenia and Its Relation to Psychosis. Biol Psychiatry [Internet]. 2018 Aug 1 [cited 2018 Nov 22]; Available from: https://linkinghub.elsevier.com/retrieve/pii/S0006322318317153
  25. 25. Koutsouleris N, Kahn RS, Chekroud AM, Leucht S, Falkai P, Wobrock T, et al. Multisite prediction of 4-week and 52-week treatment outcomes in patients with first-episode psychosis: a machine learning approach. The Lancet Psychiatry [Internet]. 2016 Oct [cited 2018 Jan 18];3(10):935–46. Available from: http://www.ncbi.nlm.nih.gov/pubmed/27569526 pmid:27569526
  26. 26. Justice AC, Covinsky KE, Berlin JA. Assessing the generalizability of prognostic information. Ann Intern Med [Internet]. 1999 Mar 16 [cited 2018 Jan 18];130(6):515–24. Available from: http://www.ncbi.nlm.nih.gov/pubmed/10075620 pmid:10075620
  27. 27. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J [Internet]. 2014 Aug 1 [cited 2018 Jan 18];35(29):1925–31. Available from: https://academic.oup.com/eurheartj/article-lookup/doi/10.1093/eurheartj/ehu207 pmid:24898551
  28. 28. Gumley AI, Schwannauer M, Macbeth A, Fisher R, Clark S, Rattrie L, et al. Insight, duration of untreated psychosis and attachment in first-episode psychosis: prospective study of psychiatric recovery over 12-month follow-up. Br J Psychiatry [Internet]. 2014 Jul 1 [cited 2018 Jan 18];205(1):60–7. Available from: http://www.ncbi.nlm.nih.gov/pubmed/24723630 pmid:24723630
  29. 29. Kay SR, Fiszbein A, Opler LA. The Positive and Negative Syndrome Scale (PANSS) for Schizophrenia. Schizophr Bull [Internet]. 1987 Jan 1 [cited 2018 Jan 18];13(2):261–76. Available from: https://academic.oup.com/schizophreniabulletin/article-lookup/doi/10.1093/schbul/13.2.261 pmid:3616518
  30. 30. Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta Psychiatr Scand [Internet]. 1983 Jun [cited 2018 Jan 18];67(6):361–70. Available from: http://www.ncbi.nlm.nih.gov/pubmed/6880820 pmid:6880820
  31. 31. Beck AT, Steer RA, Brown GK. BDI-II, Beck depression inventory: manual [Internet]. 2nd ed. San Antonio Tex.; Boston: Psychological Corp.; 1996 [cited 2018 Jan 18]. 38 p. Available from: http://www.worldcat.org/title/bdi-ii-beck-depression-inventory-manual/oclc/36075838
  32. 32. Andreasen NC, Carpenter WT, Kane JM, Lasser RA, Marder SR, Weinberger DR. Remission in Schizophrenia: Proposed Criteria and Rationale for Consensus. Am J Psychiatry [Internet]. 2005 Mar [cited 2018 Jan 18];162(3):441–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/15741458 pmid:15741458
  33. 33. R Core Team. R: A language and environment for statistical computing. [Internet]. Vienna, Austria.: R Foundation for Statistical Computing; 2017. Available from: https://www.r-project.org/
  34. 34. Kuhn M. caret: Classification and Regression Training [Internet]. Available from: https://github.com/topepo/caret/
  35. 35. von Hippel P, Lynch J. Efficiency Gains from Using Auxiliary Variables in Imputation. 2013 Nov 20 [cited 2018 Jan 18]; Available from: http://arxiv.org/abs/1311.5249
  36. 36. Liu Y, Gopalakrishnan V. An Overview and Evaluation of Recent Machine Learning Imputation Methods Using Cardiac Imaging Data. Data. 2017 Mar;2(1).
  37. 37. Friedman J, Hastie T, Tibshirani R. Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw [Internet]. 2010 Feb 2 [cited 2018 Jan 18];33(1):1–22. Available from: http://www.jstatsoft.org/v33/i01/ pmid:20808728
  38. 38. Iniesta R, Stahl D, McGuffin P. Machine learning, statistical learning and the future of biological research in psychiatry. Psychol Med [Internet]. 2016 [cited 2018 Nov 18];46(12):2455–65. Available from: http://www.ncbi.nlm.nih.gov/pubmed/27406289 pmid:27406289
  39. 39. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B (Statistical Methodol [Internet]. 2005 Apr 1 [cited 2018 Jan 18];67(2):301–20. Available from: http://doi.wiley.com/10.1111/j.1467-9868.2005.00503.x
  40. 40. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics [Internet]. 2011 Mar 17 [cited 2018 Jan 18];12(1):77. Available from: http://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-12-77
  41. 41. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the Areas under Two or More Correlated Receiver Operating Characteristic Curves: A Nonparametric Approach. Biometrics [Internet]. 1988 Sep [cited 2018 Jan 18];44(3):837. Available from: http://www.jstor.org/stable/2531595?origin=crossref pmid:3203132
  42. 42. Seshan VE. clinfun: Clinical Trial Design and Data Analysis Functions [Internet]. 2017. Available from: https://cran.r-project.org/package=clinfun
  43. 43. Pérez-Guaita D, Kuligowski J, Garrigues S, Quintás G, Wood BR. Assessment of the statistical significance of classifications in infrared spectroscopy based diagnostic models. Analyst [Internet]. 2015 Mar 16 [cited 2018 Jan 18];140(7):2422–7. Available from: http://xlink.rsc.org/?DOI=C4AN01783H pmid:25382314
  44. 44. Leighton SP, Nerurkar L, Krishnadas R, Johnman C, Graham GJ, Cavanagh J. Chemokines in depression in health and in inflammatory illness: a systematic review and meta-analysis. Mol Psychiatry [Internet]. 2018 Jan 14 [cited 2018 Jan 18];23(1):48–58. Available from: http://www.nature.com/doifinder/10.1038/mp.2017.205 pmid:29133955
  45. 45. Schooler JW. Metascience could rescue the ‘replication crisis.’ Nature [Internet]. 2014 Nov 4 [cited 2018 Jan 18];515(7525):9–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/25373639 pmid:25373639
  46. 46. Open Science Collaboration OS. Estimating the reproducibility of psychological science. Science [Internet]. 2015 Aug 28 [cited 2018 Jan 18];349(6251):aac4716. Available from: http://www.ncbi.nlm.nih.gov/pubmed/26315443 pmid:26315443
  47. 47. Waldron L, Pintilie M, Tsao M-S, Shepherd FA, Huttenhower C, Jurisica I. Optimized application of penalized regression methods to diverse genomic data. Bioinformatics [Internet]. 2011 Dec 15 [cited 2018 Jan 18];27(24):3399–406. Available from: https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/btr591 pmid:22156367
  48. 48. Kraepelin E. Psychiatrie: Ein Lehrbuch für Studirende und Aerzte I [Internet]. Leipzig: J. A. Barth; 1899 [cited 2018 Jan 18]. 366 p. Available from: https://archive.org/details/psychiatrieeinle02krae
  49. 49. Jäger M, Haack S, Becker T, Frasch K. Schizoaffective disorder—an ongoing challenge for psychiatric nosology. Eur Psychiatry [Internet]. 2011 Apr 1 [cited 2018 Jan 18];26(3):159–65. Available from: http://www.ncbi.nlm.nih.gov/pubmed/20646917 pmid:20646917
  50. 50. Boonstra N, Klaassen R, Sytema S, Marshall M, De Haan L, Wunderink L, et al. Duration of untreated psychosis and negative symptoms—A systematic review and meta-analysis of individual patient data. Schizophr Res [Internet]. 2012 Dec 1 [cited 2018 Jan 18];142(1–3):12–9. Available from: http://www.ncbi.nlm.nih.gov/pubmed/23025994 pmid:23025994
  51. 51. Mcleod H, Gumley A, Macbeth A, Schwannauer M, Lysaker. Metacognitive functioning predicts positive and negative symptoms over 12 months in first episode psychosis. J Psychiatr Res [Internet]. 2014 [cited 2018 Feb 14];54(10):109–15. Available from: http://www.research.ed.ac.uk/portal/files/19352163/Macbeth_Metacognitive_Functioning.pdf
  52. 52. Leucht S, Heres S, Kissling W, Davis JM. Evidence-based pharmacotherapy of schizophrenia. Int J Neuropsychopharmacol [Internet]. 2011 Mar 6 [cited 2018 Jan 18];14(02):269–84. Available from: https://academic.oup.com/ijnp/article-lookup/doi/10.1017/S1461145710001380
  53. 53. National Collaborating Centre for Mental Health (UK). Psychosis and Schizophrenia in Adults; NICE Clinical Guidelines, No. 178 [Internet]. Psychosis and Schizophrenia in Adults: Treatment and Management: Updated Edition 2014. London: National Institute for Health and Care Excellence (UK); 2014 [cited 2018 Jan 18]. Available from: http://www.ncbi.nlm.nih.gov/pubmed/25340235
  54. 54. Turner DT, McGlanaghy E, Cuijpers P, van der Gaag M, Karyotaki E, MacBeth A. A Meta-Analysis of Social Skills Training and Related Interventions for Psychosis. Schizophr Bull [Internet]. 2017 Nov 11 [cited 2018 Feb 14]; Available from: http://academic.oup.com/schizophreniabulletin/article/doi/10.1093/schbul/sbx146/4618008
  55. 55. Rosen K, Garety P. Predicting Recovery From Schizophrenia: A Retrospective Comparison of Characteristics at Onset of People With Single and Multiple Episodes. Schizophr Bull [Internet]. 2005 Jul 27 [cited 2018 Jan 18];31(3):735–50. Available from: https://academic.oup.com/schizophreniabulletin/article-lookup/doi/10.1093/schbul/sbi017 pmid:16123527
  56. 56. Scott J, Fowler D, McGorry P, Birchwood M, Killackey E, Christensen H, et al. Adolescents and young adults who are not in employment, education, or training. BMJ [Internet]. 2013 Sep 18 [cited 2018 Jan 18];347:f5270. Available from: http://www.ncbi.nlm.nih.gov/pubmed/24048295 pmid:24048295
  57. 57. Ramanathan S, Balasubramanian N, Krishnadas R. Macroeconomic Environment During Infancy as a Possible Risk Factor for Adolescent Behavioral Problems. JAMA Psychiatry [Internet]. 2013 Feb 1 [cited 2018 Feb 14];70(2):218. Available from: http://archpsyc.jamanetwork.com/article.aspx?doi=10.1001/jamapsychiatry.2013.280 pmid:23277351
  58. 58. Krishnadas R, Jauhar S, Telfer S, Shivashankar S, McCreadie RG. Nicotine dependence and illness severity in schizophrenia. Br J Psychiatry [Internet]. 2012 Oct 1 [cited 2018 Feb 14];201(4):306–12. Available from: http://www.ncbi.nlm.nih.gov/pubmed/22878134 pmid:22878134
  59. 59. Kapur S, Phillips AG, Insel TR. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? Mol Psychiatry. 2012 Dec;17(12):1174–9. pmid:22869033