Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Genome-wide discovery for diabetes-dependent triglycerides-associated loci

  • Margaret Sunitha Selvaraj,

    Roles Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, United States of America, Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, United States of America, Department of Medicine, Harvard Medical School, Boston, MA, United States of America

  • Kaavya Paruchuri,

    Roles Data curation

    Affiliations Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, United States of America, Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, United States of America

  • Sara Haidermota,

    Roles Data curation

    Affiliations Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, United States of America, Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, United States of America

  • Rachel Bernardo,

    Roles Data curation

    Affiliations Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, United States of America, Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, United States of America

  • Stephen S. Rich,

    Roles Conceptualization, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States of America

  • Gina M. Peloso ,

    Roles Conceptualization, Project administration, Supervision, Writing – original draft, Writing – review & editing

    pnatarajan@mgh.harvard.edu (PN); gpeloso@bu.edu (GMP)

    ‡ GMP and PN are jointly supervised the work.

    Affiliation Department of Biostatistics, Boston University School of Public Health, Boston, MA, United States of America

  • Pradeep Natarajan

    Roles Conceptualization, Formal analysis, Funding acquisition, Supervision, Writing – original draft, Writing – review & editing

    pnatarajan@mgh.harvard.edu (PN); gpeloso@bu.edu (GMP)

    ‡ GMP and PN are jointly supervised the work.

    Affiliations Cardiovascular Research Center, Massachusetts General Hospital, Boston, MA, United States of America, Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, United States of America, Department of Medicine, Harvard Medical School, Boston, MA, United States of America

Abstract

Purpose

We aimed to discover loci associated with triglyceride (TG) levels in the context of type 2 diabetes (T2D). We conducted a genome-wide association study (GWAS) in 424,120 genotyped participants of the UK Biobank (UKB) with T2D status and TG levels.

Methods

We stratified the cohort based on T2D status and conducted association analyses of TG levels for genetic variants with minor allele count (MAC) at least 20 in each stratum. Effect differences of genetic variants by T2D status were determined by Cochran’s Q-test and we validated the significantly associated variants in the Mass General Brigham Biobank (MGBB).

Results

Among 21,176 T2D and 402,944 non-T2D samples from UKB, stratified GWAS identified 19 and 315 genomic risk loci significantly associated with TG levels, respectively. Only chr6p21.32 exhibited genome-wide significant heterogeneity (I2 = 98.4%; pheterogeneity = 2.1x10-15), with log(TG) effect estimates of -0.066 (95%CI: -0.082, -0.050) and 0.002 (95%CI: -0.002, 0.006) for T2D and non-T2D, respectively. The lead variant rs9274619:A (allele frequency 0.095) is located 2Kb upstream of the HLA-DQB1 gene, between HLA-DQB1 and HLA-DQA2 genes. We replicated this finding among 25,137 participants (6,951 T2D cases) of MGBB (pheterogeneity = 9.5x10-3). Phenome-wide interaction association analyses showed that the lead variant was strongly associated with a concomitant diagnosis of type 1 diabetes (T1D) as well as diabetes-associated complications.

Conclusion

In conclusion, we identified an intergenic variant near HLA-DQB1/DQA2 significantly associates with decreased triglycerides only among those with T2D and highlights an immune overlap with T1D.

Introduction

Diabetes, largely due to type 2 diabetes (T2D), was estimated to afflict 9.3% of population in 2019 and projected to increase to 10.9% by 2045 [1]. Despite ongoing scientific advances [2], T2D remains a leading cause of morbidity and mortality in the US and increasingly worldwide [3, 4]. Novel approaches to discover the factors influencing T2D-related metabolic alterations may yield new insights toward the prevention of T2D-related complications.

Plasma lipid, particularly triglycerides (TG), alterations represent early metabolic changes linked to insulin resistance. Hypertriglyceridemia is often observed among individuals at risk for T2D and is more severe among individuals with poorly controlled T2D [5]. Enhanced hepatic secretion of TG rich lipoproteins due to insulin resistance and delayed clearance involving lipoprotein lipase-mediated lipolysis may further exacerbate hypertriglyceridemia [6]. Hypertriglyceridemia is an independent predictor of cardiovascular disease in T2D [7, 8], as well as a predictor of T2D itself [9]. Characterizing the genetic determinants of TG concentrations specific to those with T2D may yield new insights into diabetes pathogenesis and complications.

Here, we tested the hypothesis that there are genetic variants associated with TG levels specific to T2D using GWAS and heterogeneity analysis in 424,120 participants of the UKB. Further, we assessed the role of the identified lead variant for multiple diabetes-related phenotypes.

Results

Baseline characteristics

The overall study schematic is depicted in S1 Fig Among 424,120 and 25,137 included samples, 21,176 (5.0%) and 6,951 (27.7%) of samples had T2D in UKB and MGBB, respectively. Overall UKB was composed of participants with a mean age (standard deviation [SD]) of 56.6 (8.1) years, 195,966 (46.2%) male, and 356,023 (83.9%) White British self-reported race. MGBB participants were mean 62.1 (16.2) years, 11,579 (46.1%) male, and 21,172 (84.2%) White British self-reported race. As expected, individuals with T2D versus non-T2D had greater median TG concentrations in both cohorts (Table 1).

thumbnail
Table 1. Baseline characteristics for discovery and replication cohorts.

Distribution of samples across the T2D strata in discovery (UKB) and replication (MGBB) cohorts are provided. Number of samples by gender, ancestry and lipid lowering medications are documented. Lipid measurements for the four main lipid class is tabulated based on T2D strata.

https://doi.org/10.1371/journal.pone.0275934.t001

T2D-stratified GWAS of TG identified an associated locus on chromosome 6

We performed GWAS on normalized natural log TG stratified by T2D status in the discovery cohort. Among the 402,944 non-T2D samples, 315 significant loci were identified. Among the 21,176 T2D samples, 19 significant loci were identified (S2 Fig and S1 Table). We then assessed for differential TG effects for 67M variants by T2D status using Cochran’s Q-test for heterogeneity. We identified 478 variants which were genome-wide significant, all at chr6p21.32 (lead variant: rs9274619:G>A; I2 = 98.4%; pheterogeneity = 2x10-15) (Fig 1). The most significantly heterogenous variant was an intergenic variant near the HLA-DQB1/DQA2 genes (S3 Fig), where the minor allele (frequency 0.095) decreases natural log TG among those with T2D but yields no difference among those with non-T2D (T2D group: beta = -0.066, p-value = 3.9x10-15; non-T2D group: beta = 0.002, p-value = 0.21; pinteraction = 1.9x10-11). We observed that the difference test (ZDiff) identified this top significant lead variant (rs9274619:A; Zdiff = -7.935) in the HLA locus as well. The mean raw TG measurements were significantly different between the reference and alternative genotypes in T2D samples (rs9274619(G/A or A/G): meandiff = 5.45 mg/dl; p-value = 1.26x10-02; rs9274619(A/A): meandiff = 25.6 mg/dl; p-value = 3.78x10-03). To evaluate the independence of the lead variant, we clumped the 478 genome-wide significant variants and performed conditional analyses. After clumping, we retained 8 individual signals, and rs3957148 was in strong LD with the lead variant. However, conditional analyses with all these eight clumped variants for rs9274619:A interaction with T2D status did not abrogate the lead variant’s signal (S2 Table). Since BMI is associated with both TG and T2D, we used scaled BMI as an additional covariate in testing the association of the identified lead variant with TG. We observed evidence for persistent albeit attenuated interaction between rs9274619:A and T2D after adjusting for BMI (pinteraction = 2.9x10-7). We observed a slight reduction in effects in the interaction model with T2D (betainteraction = -0.046), which shows that BMI has a confounding effect.

thumbnail
Fig 1. Genome-wide heterogeneity between T2D strata.

Manhattan plot for heterogeneity p-values comparing T2D and non-T2D groups. Only one locus achieved genome-wide significance and the corresponding variants are colored in green. Lambda GC values from GWAS stratified by T2D status is shown in the figure. Red line: Genome-wide significance (p-value = 5x10-8), Blue line: Suggestive significance (p-value = 1x10-5). GWAS–Genome wide association studies; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.g001

Individuals with T2D with lower TG concentrations were enriched for rs9274619:A (Fig 2A). Among individuals with normal TG (i.e., <150 mg/dL), rs9274619:A was associated with T2D by 1.23-fold (95% CI 1.15,1.29; p-value 2.8x10-11). However, among individuals with TG > 450 mg/dl, rs9274619:A was not associated with T2D (OR 0.96, 95% CI 0.76–1.18; p-value 0.67). We replicated the findings in an independent cohort of 25,137 participants (6,951 T2D cases) of MGBB (pheterogeneity = 9.5x10-3) (Fig 2B). Additionally, we evaluated the association of the lead variant interacting with T2D status with other lipids in discovery cohort (S3 Table). We observed a significant interaction between rs9274619:A and T2D on HDL-C (pinteraction = 2.9x10-8) with higher concentrations among those with T2D, and nominally greater reductions in LDL-C among those with T2D (pinteraction = 6.0x10-4).

thumbnail
Fig 2. HLA-DQB1/DQA2 rs9274619:A significantly interacts with T2D on triglycerides.

A) Allele frequency of rs9274619:A in T2D and non-T2D samples grouped by raw TG values. Significance of samples proportions between the groups was assessed using Fisher’s exact test for the lower and higher TG bins. B) GWAS and heterogeneity statistics of the lead variant rs9274619:A at the HLA-DQB1/DQA2 locus from discovery (UKB) and replication (MGBB) cohorts based on T2D stratification. CI–Confidence Intervals; GWAS–Genome wide association studies; HLA–Human Leukocyte Antigens; MGBB–Mass General Brigham Biobank; NS–Non Significant; T2D –Type 2 Diabetes; TG–Triglycerides; UKB–UK Biobank.

https://doi.org/10.1371/journal.pone.0275934.g002

We next bioinformatically prioritized the putative causal gene responsible for the T2D-dependent TG genetic association observed. Using T2D GWAS summary statistics PoPS prioritized the HLA-DQB1 gene to be one of the top 20 genes along with other known TG genes such as APOE, LPL and APOB. However, HLA-DQB1 was not prioritized in the non-T2D GWAS (S4 Table). Intersecting rs9274619:A with GTEx eQTL data for five different tissues (Methods) shows that the variant is an eQTL for multiple HLA genes including HLA-DQB1 but more significantly for HLA-DQA2 and HLA-DRB6 (S5 Table). We curated all the eQTLs of the three HLA genes from GTEx database, T2D GWAS and correlated the Z-scores. eQTLs of all three HLA genes had a similar degree of correlations, but with opposite directions (S4 Fig). We further interrogated pQTL and mQTL data. rs9274619:A is a pQTL for HLA-DQA2 (beta = 0.31; p-value = 6.2x10-14) but it is an mQTL for multiple CpG regions at genome-wide significance. We identified 133 cis-associations and mapped the CpGs to Illumina HumanMethylation450 BeadChip (Illumina Inc., San Diego, USA) identification numbers (GEO data: GPL13534) to obtain the corresponding genes. Multiple HLA genes and other genes in chromosome 6 were identified (S6 Table). Gene prioritization using PoPS and QTL curation identified multiple HLA-genes (S5 Fig).

rs9274619:A tags HLA-DQB1*0302

Since the significant locus was at the HLA region, we correlated the rs9274619:A with 362 imputed HLA genotypes from 11 classes in the UKB. DQB1 and DQA1 were the most strongly correlated with rs9274619:A. Furthermore, DQB1_302 and DQA1_301 were most strongly correlated with rs9274619:A (DQB1_302: r = 0.95, pcorrelation<3.83x10-313; DQA1_301: r = 0.62, pcorrelation<3.83x10-313) (S6A Fig). We subsequently tested the interaction of all 362 HLA genotypes with T2D status on log(TG) as outcome. From this focused assessment of 362 HLA genotypes (S7 Table), 7 passed Bonferroni corrected significance (0.05/362 = 1x10-4) (S6B Fig). Consistent with our discovery and correlation analyses, only DQB1_302 had a genome-wide significant interaction (pinteraction = 1.05x10-9). Although rs9274619:A is associated with increased expression of HLA-DQA2 gene in eQTL and pQTL analysis, alleles from these HLA types were not previously imputed in UKB. Since rs9274619:A located between both HLA-DQB1 and HLA-DQA2, the variant could be a potential quantitative trait loci to both the genes.

Since the HLA locus is a strong predictor of type 1 diabetes (T1D), we used polygenic risk scores (PRS) from 67 variants previously associated with T1D as reported by Oram et al. [10], to exclude potential T1D cases in sensitivity analysis. First, we used the 67 SNPs from both HLA and non-HLA loci to create a genome-wide PRS for all the UKB samples. The top one percentile of the samples based on PRS were classified as T1D (N = 4242). Next, from the whole UKB samples, we removed any sample which identified as T1D by either ICD codes or PRS scores. With the remaining 18460 T2D cases, we observe a nominal interaction of the lead SNP with T2D (betainteraction = -0.026; pinteraction = 1.16x10-02).

Phenome-wide interaction analyses implicates multiple diabetes-related complications

We assessed the interactions between the rs9274619:A and T2D with 1567 disease conditions as outcomes (combination of incidence and prevalence) adjusted for all the covariates (age, age2, sex, race, PC1-10). Using a Bonferroni correction (0.05/1567 = 3.19x10-5), 45 disease phenotypes exhibited significant interactions. The strongest interaction was for the concomitant diagnosis of type 1 diabetes (T1D) among those with T2D (pinteraction<1.72x10-274) (S8 Table). We applied logistic regression models stratified by T2D status on the 45 significant phenotypes, while adjusting for all covariates as mentioned above (S9 Table). Multiple diabetes-related microvascular and macrovascular complications including hypoglycemia, retinopathy, polyneuropathy, angiopathy, atherosclerosis and osteomyelitis were significantly associated, with the T2D-specific TG-lowering rs9274619:A allele leading to increased risks (Fig 3). Interestingly, a recent investigation on Anti-neutrophil cytoplasmic antibody (ANCA)-associated vasculitides (AAV) by Dahlqvist et al. have identified rs9274619 as a lead variant for myeloperoxidase (MPO)-ANCA association [11], showing its importance in immune related vascular diseases. However, this allele was associated with reduced odds for obesity and related phenotypes.

thumbnail
Fig 3. Phenome wide association (PheWAS) of disease conditions stratified by T2D status.

Interaction between the rs9274619:A and T2D with multiple disease conditions as outcomes (combination of incidence and prevalence) was modeled while adjusting for all covariates (sex, age, age2, race, PC1-10). Bonferroni corrected 45 significant phenotypes were stratified by T2D status and analyzed using logistic regression. The T2D effect estimates for rs9274619:A and the interaction p-value are documented, the disease conditions are ordered based on T2D beta. From the 45 disease conditions tested, highly correlated Type1 diabetic conditions were removed while plotting the figure. PC–Principal Components; PheWAS–Phenome Wide Association Studies; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.g003

Since HLA-DQB1 is a GWAS locus for an overlap between T1D and T2D previously referred to as latent autoimmune diabetes in adults (LADA) [12], as well as T1D itself [13], we further explored the relationships between the rs9274619:A, T1D, T2D, and their respective interactions on TGs (S10 Table). The TG-lowering rs9274619:A was strongly associated with T1D after adjusting for T2D (p-value = 8.6x10-113), but not significantly associated with T2D status after adjusting for T1D (p-value = 0.29). However, when assessing for interactions on TGs, there was still a significant interaction with T2D independent of T1D (betainteraction = -0.055; pinteraction = 5.28x10-9) and more strongly with T1D independently of T2D (betainteraction = -0.252; pinteraction = 5.53x10-49). Furthermore, we removed T1D samples and the interaction of rs9274619:A with T2D on TG was nominally significant (betainteraction = -0.022; pinteraction = 2.9x10-2). We additionally checked the main effects of rs9274619:A for presumed LADA samples (N = 2580) in our dataset and observed a significant association (p-value = 8.37x10-41), showing that overlapping T1D and T2D samples could potentially contribute to the observed association of this locus.

Cousminer et al. reported four loci to be significantly associated with LADA [14], therefore we assessed T2D/TG interactions for these lead variants (S11 Table). None of the variants tested had a genome-wide significant interaction. The variant rs9273368 from HLA-DQB1 (pinteraction = 2.58x10-5) was genome-wide significant in our heterogeneity analysis (pheterogeneity = 1.8x10-8) and was in moderate LD with rs9274619:A (R2 = 0.3). The four LADA loci were examined for interactions for additional diabetes-related phenotypes as noted in S12 Table.

Metabolic characterizations of interactions with T2D

We secondarily explored the relationship between the rs9274619:A, interacting with T2D, and relationships with other metabolic features in UKB. Both the main effects and interaction models were adjusted for all the covariates (age, age2, sex, race, PC1-10). Outcomes assessed included waist/hip ratio (WHR), body mass index (BMI), macronutrients from 24-hour dietary recall surveys, and 60 plasma biomarkers (S13 and S14 Tables). Several features were associated with the rs9274619:A itself, including increased eosinophil and neutrophil counts as well as hemoglobin A1c and C-reactive protein concentrations. We identified several outcomes that demonstrated differential association by T2D status in interaction testing (alpha 0.05/72 = 6.94x10-4). The TG-lowering rs9274619:A allele was associated with greater concentrations for T2D vs non-T2D for hemoglobin A1c, sex hormone binding globulin, HDL-C, glucose, and apolipoprotein-A1 concentrations. However, the TG-lowering rs9274619:A allele was associated with reduced concentrations for T2D vs non-T2D for urate, BMI, WHR, and reticulocyte count (S13 and S14 Tables). Since HbA1c is a strong predictor of T2D, HbA1c is the most significant biomarker interacting with the lead SNP (S14 Table). When also using HbA1c > = 48 mmol/mol to define T2D samples as described by Young et al. [15], 15,180 samples were identified, with an additional 5212 samples to the previous T2D group. The interaction model of HbA1c-based only T2D with the lead SNP showed higher significance with TG (betainteraction = -0.159; pinteraction = 1.26x10-54). Next, we added the 5212 samples to our existing ICD10-based T2D groups and conducted a sensitivity analysis for the interaction. The addition of samples showed consistent significant interaction (betainteraction = -0.102; pinteraction = 1.03x10-35).

We further assessed lipid metabolomic data comprising 102,575 UKB samples (5200 T2D cases) and 249 metabolites for interactions. For each normalized metabolomic phenotype we analyzed the main effect of rs9274619:A and the interaction of rs9274619:A with T2D, with both models adjusted for all aforementioned covariates. Using a Bonferroni correction (0.05/249 = 2.01x10-4), we identified 6 metabolomic features associated with the rs9274619:A and only 1 interaction with T2D (S15 and S16 Tables). With respect to the interaction detected, we observed that the average diameter for LDL particles among rs9274619:A carriers was greater for T2D versus non-T2D.

Discussion

Independent GWAS studies have identified multiple loci strongly associated with TG and T2D separately [16, 17], and we now observe a variant tagging HLA-DQB1/DQA2 associated with TGs only among those with T2D and not among those without T2D. Despite being associated with reduced TGs specifically in T2D, the lead variant is associated with greater diabetes-related complications and an overlap with T1D. Previous studies have shown TG as an independent risk factor for cardiovascular diseases in T2D patients, and a predictor of T2D itself. Through a genome-wide stratified analysis, we now show that the HLA locus may triangulate some of these relationships by specifically influencing T2D-associated triglycerides.

Our study permits several conclusions regarding T2D pathogenesis. First, our study highlights heterogeneous metabolism among individuals with adult-onset diabetes. Hyperinsulinemia contributes to reduced hydrolysis and clearance of TG rich lipoproteins and thus persistence of these atherogenic lipoproteins toward heightened macrovascular risk [18]. TGs have more moderate associations with microvascular complications among diabetics [19]. Indeed, glycemic control is a more potent risk factor for microvascular complications. Here, we find that an immune-related locus (i.e., HLA-DQB1/DQA2 genotype of major histocompatibility class II) linked to reduced TGs interestingly associates with greater microvascular versus macrovascular risk among individuals with T2D. Lipid homeostasis plays a key role in immune cells, where lipids are key constituents of major histocompatibility complex molecules and other cell membrane microdomains [20]. These observations highlight complementary roles of immune dysfunction and hyperinsulinemia in adult-onset diabetes pathogenesis.

Second, the distinct lipid pattern observed by HLA-DQB1*0302 genotype may reflect etiologically distinct subgroups of adult-onset diabetes. The HLA-DQB1*0302 has long been recognized as a very strong risk factor for T1D and tags the potent T1D DR4 risk haplotype [2124]. Approximately 2–12% of adults diagnosed with type 2 diabetes have glutamic acid decarboxylase autoantibodies (GADA), thereby leading to the proposed term of latent autoimmune diabetes in adults (LADA) [2527]. Such individuals are often classified as T2D because they typically do not initially require insulin. Indeed, a recent GWAS of LADA showed that HLA-DQB1 was the most significant locus [14]. Recent data-driven approaches to cluster diabetes have grouped diabetes with GADA, traditionally classified as T1D or LADA, as severe autoimmune diabetes (SAID) [28]. Consistent with separate T1D and LADA analyses, the HLA-DQB1 locus is significantly associated with SAID unlike with other diabetes subgroups [29]. Thus, the relatively reduced TG concentrations among adults classified as having T2D and the HLA-DQB1 risk allele may reflect the lack of hypertriglyceridemia typically observed with more typical hyperinsulinemic T2D.

Third, TG concentrations among individuals diagnosed with T2D may help identify individuals with features more consistent with T1D. GADA testing only in adult-onset diabetics with normal TGs would optimize diagnostic yield. Furthermore, with increasingly available genotyping through expanding research testing and widely used direct-to-consumer approaches, HLA genotypes may further improve efficiency of testing. While large-scale randomized controlled trials for LADA are lacking, expert consensus recommend personalized management approaches deviating from conventional T2D and surveillance management [30]. Such approaches include biomarker-based surveillance of residual beta cell function and to determine insulin initiation.

A few limitations of our study deserve mention. First, causal gene prioritization through multiple methods did not converge on a single gene. Whether our observations reflect coordinated regulation merits further study. Based on the current results we were not able to elucidate the exact mechanism of TG lowering by HLA rs9274619:A in the context of T2D. Second, GADA or other islet autoantibodies and C-peptide levels are not present, despite using ICD codes and T1D polygenic risk score in sensitivity analyses, we cannot rule out the possibility of T1D-like features in some of the classified T2D cases. Third, UKB and MGBB are predominantly White and this may limit generalizability to other genetic backgrounds. Finally, Cochran’s Q test is under powered to detect differences in heterogeneity of effect sizes and we may have missed other loci with differences in effects. However, we also performed a difference test and found similar results to using the Cochran’s Q test.

In conclusion, we observed that rs9274619:A linked to HLA-DQB1/DQA2 is associated with reduced TGs only among adults with diabetes. Presence of this allele reflects an autoinflammatory subgroup of adult-onset diabetes most consistent with T1D, without characteristic hyperinsulinemia and thus relatively reduced TG concentrations. Among individuals classified as T2D, these individuals have greater risks for diabetes-associated complications.

Materials and methods

Study participants

We used the UK Biobank (UKB), which is a prospective population-based cohort composed of approximately 500,000 samples with rich phenotypic and genotypic information, as the discovery cohort [31]. UKB includes volunteer residents of the UK aged 40 to 69 years recruited during 2006–2010. The phenotypic information includes details on lifestyle, medical history, food habits, weight, height, body measurements, scans, blood routines, and electronic medical record (EMR) coded data. Out of 488,377 total individuals, we removed unconsented individuals and samples with >10% genotypic missingness thereby retaining 424,120 individuals for whom the TGs were also available.

We used data from the Mass General Brigham Biobank (MGBB), which comprises volunteer patients of the large Mass General Brigham Healthcare system in Massachusetts with greater than 105,000 participants, as replication [32]. In total 36,424 randomly selected individuals were genotyped using three versions of the Multi-Ethnic Genotyping Array (MEGA) Single-Nucleotide Polymorphism (SNP) array (Multiethnic Exome Global (Meg), Human multi-ethnic array (Mega), Expanded multi-ethnic genotyping array (Megex)). Out of 36,424 individuals, we retained 25,137 samples for whom T2D status and TG measurements were available for the current study.

Ethics.

All UK Biobank participants gave written, informed consent per the UKB primary protocol. Secondary use of these data was approved by the Massachusetts General Hospital Institutional Review Board (protocol 2021P002228) and was facilitated through UKB application 7089. All MGB Biobank participants provided written, informed consent per the MGBB primary protocol. Secondary use of these data was approved by the Massachusetts General Hospital Institutional Review Board (protocol 2020P000904). All participants consent to broad use of their samples and data for research, no minor participants were included in this study.

Phenotypes

In the UKB, we defined T2D based on self-reported status (data field 20002) and ICD10 codes E11:0–9 (data fields 41202, 41204, 40001, and 40002). The first instance of TG measurement (data field 30870) was defined as the primary lipid phenotype of interests. We also included other lipid levels as secondary outcomes: total cholesterol (TC) (data field 30690), low-density lipoprotein cholesterol (LDL-C) (data field 30780), and high-density lipoprotein cholesterol (HDL-C) (data field 30760). TG measurements were converted to mg/dL by multiplying mmol/L values by 88.57 and natural log transformed. TC, LDL-C and HDL-C values in mmol/L were converted to mg/dL by multiplying 38.67. When lipid-lowering medications were prescribed, TC measurements were divided by 0.8 and LDL-C by 0.7, as previously done [17]. All four lipid measurements were further inverse rank normalized to the residuals scaled by the standard deviation, where the model was adjusted for covariates (sex, age, age2, PC1-10).

We curated multiple diseases for UKB samples into phecodes for PheWAS analysis. The PheWAS R package (version PheWAS_0.99.5–4) was used to map ICD codes to phecodes based on the phecode map 1.2 and 1.2b1 from https://phewascatalog.org/ [33]. Codes that failed to map were excluded, which were relatively few and often procedural. Mapped codes were defined as multiple disease conditions and specified as incident or prevalent based on the time of sample collection. Next, we obtained the secondary phenotypes, which included waist circumference (data field 48), hip circumference (data field 49), body mass index (BMI) (data field 23104) and 24-hour diet recall (data field 110001) for downstream analysis. Additionally, we included blood biochemistry (category id 18518) and count (category id 9081) measures. These phenotypes were normalized to a mean 0 and standard deviation 1 for analysis. We obtained the NMR metabolic biomarkers generated by Nightingale Health (Helsinki, Finland) from the first tranche of 249 metabolic biomarkers in 118,032 UKB participants [34]. We included 102,528 samples that intersected with the discovery cohort and each of the metabolites were inverse rank normalized and regressed against the covariates (age, age2, sex, race, PC1-10). The residuals were used as our phenotypes in analyses.

In MGBB, electronic health record (EHR) data were used to define incident and prevalent cases based on enrollment date and ICD-9/ICD-10 codes on clinical phenotype definitions from phecode groups [35, 36], where samples with phecode 250.2X were defined as T2D in our study. Similarly, lipid test results, medication information, demographic status of genotyped samples were curated from EHR records. LDL-C was measured directly or calculated using Friedewald equation when TG were <400 mg/dL, all lipid measurements were in mg/dL units. The lipid measurements closest to sequencing date was curated. A sample was defined as on statin medication, if statin treatment was prescribed within the last one year of the sequencing date. We performed phenotype harmonization and normalization for the validation data as described above.

Genotypes

Genetic data from 488,377 UKB samples were assayed using two similar genotyping arrays from Affymetrix (Santa Clara, CA): i) Applied Biosystems UK BiLEVE Axiom Array ii) Applied Biosystems UK Biobank Axiom Array. 49,950 participants with 807,411 markers were genotyped at using the Applied Biosystems UK BiLEVE Axiom Array and 438,427 participants with 825,927 markers were genotyped using the closely related Applied Biosystems UK Biobank Axiom Array. Both arrays shared 95% of marker content and the UK Biobank Axiom array was chosen to capture genome-wide genetic variation (single nucleotide polymorphism (SNPs) and short insertions and deletions (indels)) [31]. The imputation from the UKB array-derived genotypes was performed using merged UK10K and 1000 Genomes phase 3 reference panels [37] and was combined to the Haplotype Reference Consortium (HRC) [38] panel using IMPUTE4 program (https://jmarchini.org/software/) as implemented in IMPUTE2 [39]. We obtained the UKB imputed human leukocyte antigen (HLA) genotypes (data field 22182) composed of classical allelic variation of 11 HLA types (A, B, C, DRB5, DRB4, DRB3, DRB1, DQB1, DQA1, DPB1, DPA1). HLA imputation from allele pairs was performed using HLA*IMP:02 in the UKB, as previously described [31]. Genotypic data in MGBB cohort was generated using three different arrays (Multiethnic Exome Global [MEG], Human multi-ethnic array [MEGA], Expanded multi-ethnic genotyping array [MEGEX]) from Illumina (San Diego, CA).

Statistical analysis

We performed genome wide association analysis (GWAS) stratified based on T2D status. We first performed quality control (QC) of the full UKB dataset regardless of T2D status by applying additional filters, including minor allele frequency (MAF) < 1%, Hardy-Weinberg equilibrium p-value not exceeding 1x10-15 and genotype missingness > 10% to filter variants, and sample-level genotype missingness > 10%. The QC-passed dataset was used to create NULL model with sex, age, age2, genotype array, race and PC1-10 as covariates. We employed REGENIE with leave-one-out-cross-validation (LOOCV) [40] approach adjusted for covariates stated above to perform GWAS on UKB imputed data with minor allele count (MAC) 20 in both T2D and non-T2D samples, independently. We annotated the genomic risk-loci from GWAS summary statistics using FUMA [41].

We tested for differences in effect estimates per genotype by T2D status using Cochran’s Q-test for heterogeneity in the METAL package [42]. For the genome-wide heterogeneity assessment, we used the conventional alpha threshold of 5x10-8 to assign statistical significance accounting for multiple-hypothesis testing. We validated the outcomes from the Cochran’s Q-test using the Difference Test [43]. We clumped the genome significant variants using plink [44] (p1: 5e-08; r2:0.5; kb:250). Significant lead variant in the discovery dataset were replicated at an alpha threshold of 0.05 accounting for the single SNP assessed.

The significant and replicated variant (lead variant) was pursued for further downstream analysis. Given its genomic location, we correlated the UKB imputed classical HLA genotypes with the significant lead variant using corplot R-package (method-pearson; version-0.90). We applied sample and variant quality measures to the initial data (—geno 0.05;—mind 0.05;—hwe 1E-06) and calculated PRS using plink [44]. We performed regression-based interaction analyses using the model where adiposity-related, diet-related and other blood biomarker phenotypes were separately analyzed with the lead variant along with T2D status. The regression analysis (main and interaction model) was carried out in R, adjusting for sex, age, age2, genotype array, race, and PC1-10 as covariates. Bonferroni corrected alpha threshold of 0.05/number of tests was considered statistically significant for these analyses. Fisher’s exact test was performed to test the significance of sample proportions among group of samples.

We implemented the Polygenic Priority Score (PoPS) enrichment method [45] for gene prioritization with the GWAS summary statistics. PoPS integrates multiple public bulk and single-cell expression datasets, protein-protein interaction and pathway databases to implement enrichment analysis using MAGMA [46] based gene association scores to identify top list of genes functionally linked to the phenotype of interest. We complementarily performed quantitative trait locus (QTLs) interrogations using multiple publicly available datasets. We downloaded expression quantitative trait locus (eQTLs) data from GTEx (v8_eQTL_all_associations) database (https://gtexportal.org/home/datasets) and curated significant hits (p-value < 5x10-8) for the lead SNP from 5 different tissues relevant to diabetes, lipids, and inflammation (i.e., Liver, Adipose Subcutaneous, Adipose Visceral Omentum, Whole Blood, and Pancreas). We utilized protein quantitative trait loci (pQTL) data in blood from the INTERVAL study [47] and GoDMC database [48] for methylation quantitative trait loci (mQTL) in blood to curate pQTLs and mQTLs related to the lead variant (p-value < 5x10-8).

Supporting information

S1 Fig. Overall study schematic.

We carried out stratified GWAS on UKB discovery cohort based on T2 status, using Regenie LOOCV models adjusting for age, age2, sex, race, and PC1-10. We implemented heterogeneity analysis to identify loci that was differentially associated between the two strata. Out of the 67M variants analyzed, only one locus achieved genome-wide significance. We replicated the significant locus using MGBB, an independent cohort, and further analyzed the lead variant using various secondary analysis in the discovery cohort. GWAS–Genome wide association; LOOCV—leave-one-out-cross-validation; MGBB–Mass General Brigham Biobank; T2D –Type 2 Diabetes; UKB–UK Biobank.

https://doi.org/10.1371/journal.pone.0275934.s001

(TIF)

S2 Fig. Manhattan (MH) plots for T2D stratified GWAS.

A) MH plots for T2D GWAS B) MH plots for non-T2D GWAS. Genes near to the most significant lead variant in each loci are documented, full list of lead SNPs are tabulated in S1 Table. Red line: Genome significance (p-value = 5x10-8), Blue line: Suggestive significance (p-value = 1x10-5). GWAS–Genome wide association studies; MH–Manhattan; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s002

(ZIP)

S3 Fig. HLA-DQB1/DQA2 lead variant locus in T2D and non-T2D strata.

A) Locus zoom plot for HLA-DQB1/DQA2 loci in T2D strata. B) Locus zoom plot for HLA-DQB1/DQA2 loci in non-T2D strata. X-axis defines the genomic position where variants +/-500 kb on either side of rs9274619—chr6:32635954:G:A (grc37) is mapped on the genome. The variants are colored based on the r2 with the lead variant and the genes are mapped based on their genomic position. Y-axis is the -log10(p-values) from the respective strata and the scale of y-axis is different between the two plots. HLA–Human Leukocyte Antigen; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s003

(ZIP)

S4 Fig. Correlation between GTEx and GWAS Z-scores of eQTLs from HLA-DQB1, HLA-DQA2 and HLA-DRB6.

Z-scores were calculated from T2D GWAS and GTEx (version 8) summary statistics for all the eQTLs for the three genes. Pearson correlation coefficient was calculated, and scatter plots were generated for eQTL data from five different tissues. Most of the TG lowering variants increases the expression of HLA-DQA2/HLA-DRB6, whereas decreases the expression of HLA-DQB1.

https://doi.org/10.1371/journal.pone.0275934.s004

(ZIP)

S5 Fig. Gene prioritization using PoPS and QTL mining.

PoPS method was used to prioritize genes using GWAS summary statistics from both T2D and non-T2D stratum. Multiple HLA genes were prioritized, where HLA-DQB1 topped the list. eQTL, pQTL and mQTL curation of rs9274619:A from various public repositories mapped the lead variant to multiple HLA-genes, where HLA-DQA2 was identified by all three QTL searches.

https://doi.org/10.1371/journal.pone.0275934.s005

(TIF)

S6 Fig. Imputed HLA alleles and its correlation with variant of interest—rs9274619:A.

A) Correlation between rs9274619:A and HLA alleles in DQB1 and DQA1 class. DQB1-302 is the most strongly correlated allele. B) Forest plot showing the different alleles that passed the Bonferroni correction on interacting with T2D with log(TG) as outcome, the model was adjusted age, age2, sex, race, PC1-10 and rs9274619:A. DQB1-302 allele is the only allele with significant interaction with T2D and highly correlated to rs9274619:A (mapped in red). HLA–Human Leukocyte Antigen; T2D –Type 2 Diabetes; TG–Triglycerides; VOI–Variant of interest.

https://doi.org/10.1371/journal.pone.0275934.s006

(ZIP)

S1 Table. Genomic risk loci identified from GWAS stratified by T2D status in UKB.

19 and 315 risk loci were identified by FUMA on T2D and non-T2D GWAS respectively. Significant loci and lead variants identified from FUMA analysis for each risk loci are documented. GWAS–Genome wide association studies; UKB–UK Biobank; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s007

(XLSX)

S2 Table. List of variants identified after clumping the 478 SNPs in the chromosome 6 loci.

The r2 of the clumped variant with the lead SNP and their respective summary data are documented. Each clumped SNP was additionally used to carry out conational analysis with the lead SNP interaction model and the summary statistics are documented.

https://doi.org/10.1371/journal.pone.0275934.s008

(XLSX)

S3 Table. Summary statistics of lead SNP with lipids.

Summary statistics from linear regression model where rs9274619:A interacting with T2D is documented for log(TG) as outcome for the 3 lipids including LDL-C, HDL-C and TC. HLA-DQB1/DQA2 is enriched only in T2D and not in non-T2D. HLD-C–High-Density Lipoprotein Cholesterol; LDL-C–Low-Density Lipoprotein Cholesterol; T2D –Type 2 Diabetes; TG–Triglycerides; TC–Total Cholesterol.

https://doi.org/10.1371/journal.pone.0275934.s009

(XLSX)

S4 Table. Gene prioritization scores.

PoPS enrichment scores for the top100 genes from T2D and nonT2D GWAS summary statistics. The Genes are ordered based on PoPS scores within each strata. T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s010

(XLSX)

S5 Table. Summary statistics for curated eQTLs.

eQTL–gene pairs for rs9274619:A was curated from GTEx data and genome significant hits were obtained (p-value = 5x10-8) from 5 different tissues. Each associated gene is tabulated and ordered based on GTEx p-value. eQTL–Expression Quantitative Trait Locus.

https://doi.org/10.1371/journal.pone.0275934.s011

(XLSX)

S6 Table. Summary statistics for curated mQTLs.

mQTL–CpG pairs for rs9274619:A and the corresponding genes that regulate the CpG regions was curated from GoDMC database where the genome significant hits were obtained (p-value = 5x10-8) from the cis-associations. The data is ordered based on p-value and the corresponding CpG island regions were curated from Illumina HumanMethylation450 BeadChip resource. mQTL–Methylation Quantitative Trait Locus.

https://doi.org/10.1371/journal.pone.0275934.s012

(XLSX)

S7 Table. Summary statistics for curated HLA-alleles.

Summary statistics for linear regression HLA interaction model with T2D status adjusted for age, age^2, sex, race, PC1-10 and rs9274619:A, where the outcome was normalized log(TG). The correlation coefficient of each HLA allele to rs9274619:A is documented. The HLA alleles are ordered based on their significance, out of the total 362 HLA alleles 331 alleles had interaction summary statistics. PC–Principal Components; T2D –Type 2 Diabetes; TG–Triglycerides.

https://doi.org/10.1371/journal.pone.0275934.s013

(XLSX)

S8 Table. Summary statistics from PheWAS.

Summary statistics for logistic regression model for PheWAS where disease conditions used as outcomes, adjusted for age, age^2, sex, race and PC1-10. The effects from the rs9274619:A interacting with T2D status is documented and the disease conditions are ordered based on significance. PC–Principal Components; PheWAS–Phenome Wide Association Studies; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s014

(XLSX)

S9 Table. Summary statistics from significant disease associations.

The 45 significant disease conditions identified using interaction model were analyzed using logistic regression, adjusted for age, age^2, sex, race and PC1-10 and stratified by T2D status. The summary statistics from T2D and non-T2D models are tabulated, where the phenotypes are ordered based on T2D beta. PC–Principal Components; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s015

(XLSX)

S10 Table. Summary statistics from T1D and LADA linear models.

Multiple linear main and interaction models used to validate the influence of T1D and LADA samples. LADA–Latent Autoimmune Diabetes in Adults; T1D –Type 1 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s016

(XLSX)

S11 Table. Summary statistics from LADA interaction models.

Interaction model between the four LADA loci and T2D with TG as outcome. LADA–Latent Autoimmune Diabetes in Adults; T2D –Type 2 Diabetes; TG–Triglycerides.

https://doi.org/10.1371/journal.pone.0275934.s017

(XLSX)

S12 Table. Significant disease phenotypes with LADA loci.

Interaction p-values between the four LADA loci and T2D with T1D related and hyperlipidemia related phenotypes as outcomes. The top 5 and bottom 5 phenotypes significantly associated with rs9274619:A were selected (Fig 3). All four loci have significant interaction with diabetes related diseases, but not with obesity related phenotypes. LADA–Latent Autoimmune Diabetes in Adults; T1D –Type 1 Diabetes; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s018

(XLSX)

S13 Table. Summary statistics from linear models with biomarkers.

Summary statistics for linear regression main model with fat, diet and biomarkers phenotypes as outcomes, adjusted for age, age^2, sex, race and PC1-10. The phenotypes are orders based on p-value. PC–Principal Components.

https://doi.org/10.1371/journal.pone.0275934.s019

(XLSX)

S14 Table. Summary statistics from interaction models with biomarkers.

Summary statistics for linear interaction model with fat, diet and biomarkers phenotypes as outcomes, adjusted for age, age^2, sex, race and PC1-10. The phenotypes are orders based on p-value. PC–Principal Components.

https://doi.org/10.1371/journal.pone.0275934.s020

(XLSX)

S15 Table. Summary statistics from linear models with metabolites.

Summary statistics for linear regression main model with metabolomic phenotype as outcomes, adjusted for age, age^2, sex, race and PC1-10. The effects from rs9274619:A are documented and the metabolomes are ordered based on significance. PC–Principal Components.

https://doi.org/10.1371/journal.pone.0275934.s021

(XLSX)

S16 Table. Summary statistics from interaction models with metabolites.

Summary statistics for linear regression interaction model with metabolomic phenotype as outcomes, adjusted for age, age^2, sex, race and PC1-10. The effects from rs9274619:A interacting with T2D status is documented and the metabolomes are ordered based on significance. PC–Principal Components; T2D –Type 2 Diabetes.

https://doi.org/10.1371/journal.pone.0275934.s022

(XLSX)

Acknowledgments

We thank all the participants from UKB and MGBB.

References

  1. 1. Saeedi P. et al., “Global and regional diabetes prevalence estimates for 2019 and projections for 2030 and 2045: Results from the International Diabetes Federation Diabetes Atlas, 9th edition,” Diabetes Res. Clin. Pract., vol. 157, p. 107843, Nov. 2019, pmid:31518657
  2. 2. Gillies C. L. et al., “Pharmacological and lifestyle interventions to prevent or delay type 2 diabetes in people with impaired glucose tolerance: systematic review and meta-analysis,” BMJ, vol. 334, no. 7588, p. 299, Feb. 2007, pmid:17237299
  3. 3. Tancredi M. et al., “Excess Mortality among Persons with Type 2 Diabetes,” N. Engl. J. Med., vol. 373, no. 18, pp. 1720–1732, Oct. 2015, pmid:26510021
  4. 4. Wright A. K. et al., “Life Expectancy and Cause-Specific Mortality in Type 2 Diabetes: A Population-Based Cohort Study Quantifying Relationships in Ethnic Subgroups,” Diabetes Care, vol. 40, no. 3, pp. 338–345, Mar. 2017, pmid:27998911
  5. 5. Low S. et al., “The role of triglyceride glucose index in development of Type 2 diabetes mellitus,” Diabetes Res. Clin. Pract., vol. 143, pp. 43–49, Sep. 2018, pmid:29936253
  6. 6. Pang J., Chan D. C., and Watts G. F., “Origin and therapy for hypertriglyceridaemia in type 2 diabetes,” World J. Diabetes, vol. 5, no. 2, pp. 165–175, Apr. 2014, pmid:24748930
  7. 7. Feher M., Greener M., and Munro N., “Persistent hypertriglyceridemia in statin-treated patients with type 2 diabetes mellitus,” Diabetes Metab. Syndr. Obes. Targets Ther., vol. 6, pp. 11–15, 2013, pmid:23341741
  8. 8. Leiter L. A. et al., “Persistent lipid abnormalities in statin-treated patients with diabetes mellitus in Europe and Canada: results of the Dyslipidaemia International Study,” Diabet. Med. J. Br. Diabet. Assoc., vol. 28, no. 11, pp. 1343–1351, Nov. 2011, pmid:21679231
  9. 9. Zhao J. et al., “Triglyceride is an independent predictor of type 2 diabetes among middle-aged and older adults: a prospective study with 8-year follow-ups in two cohorts,” J. Transl. Med., vol. 17, no. 1, p. 403, Dec. 2019, pmid:31801571
  10. 10. Oram R. A. et al., “Utility of Diabetes Type–Specific Genetic Risk Scores for the Classification of Diabetes Type Among Multiethnic Youth,” Diabetes Care, vol. 45, no. 5, pp. 1124–1131, May 2022, pmid:35312757
  11. 11. Dahlqvist J. et al., “Identification and functional characterization of a novel susceptibility locus for small vessel vasculitis with MPO-ANCA,” Rheumatology, vol. 61, no. 8, pp. 3461–3470, Aug. 2022, pmid:34888651
  12. 12. Basile K. J., Guy V. C., Schwartz S., and Grant S. F. A., “Overlap of genetic susceptibility to type 1 diabetes, type 2 diabetes, and latent autoimmune diabetes in adults,” Curr. Diab. Rep., vol. 14, no. 11, p. 550, 2014, pmid:25189437
  13. 13. Noble J. A. and Valdes A. M., “Genetics of the HLA region in the prediction of type 1 diabetes,” Curr. Diab. Rep., vol. 11, no. 6, pp. 533–542, Dec. 2011, pmid:21912932
  14. 14. Cousminer D. L. et al., “First Genome-Wide Association Study of Latent Autoimmune Diabetes in Adults Reveals Novel Insights Linking Immune and Metabolic Diabetes,” Diabetes Care, vol. 41, no. 11, pp. 2396–2403, Nov. 2018, pmid:30254083
  15. 15. Young K. G., McDonald T. J., and Shields B. M., “Glycated haemoglobin measurements from UK Biobank are different to those in linked primary care records: implications for combining biochemistry data from research studies and routine clinical care,” Int. J. Epidemiol., vol. 51, no. 3, pp. 1022–1024, Jun. 2022, pmid:35026015
  16. 16. Scott R. A. et al., “An Expanded Genome-Wide Association Study of Type 2 Diabetes in Europeans,” Diabetes, vol. 66, no. 11, pp. 2888–2902, Nov. 2017, pmid:28566273
  17. 17. NHLBI TOPMed Lipids Working Group et al., “Deep-coverage whole genome sequences and blood lipids among 16,324 individuals,” Nat. Commun., vol. 9, no. 1, p. 3391, Dec. 2018, pmid:30140000
  18. 18. Miller M. et al., “Triglycerides and Cardiovascular Disease: A Scientific Statement From the American Heart Association,” Circulation, vol. 123, no. 20, pp. 2292–2333, May 2011, pmid:21502576
  19. 19. Sacks F. M. et al., “Association Between Plasma Triglycerides and High-Density Lipoprotein Cholesterol and Microvascular Kidney Disease and Retinopathy in Type 2 Diabetes Mellitus: A Global Case–Control Study in 13 Countries,” Circulation, vol. 129, no. 9, pp. 999–1008, Mar. 2014, pmid:24352521
  20. 20. Hiltbold E. M., Poloso N. J., and Roche P. A., “MHC Class II-Peptide Complexes and APC Lipid Rafts Accumulate at the Immunological Synapse,” J. Immunol., vol. 170, no. 3, pp. 1329–1338, Feb. 2003, pmid:12538693
  21. 21. Erlich H. et al., “HLA DR-DQ Haplotypes and Genotypes and Type 1 Diabetes Risk: Analysis of the Type 1 Diabetes Genetics Consortium Families,” Diabetes, vol. 57, no. 4, pp. 1084–1092, Apr. 2008, pmid:18252895
  22. 22. Noble J. A. et al., “HLA Class I and Genetic Susceptibility to Type 1 Diabetes: Results From the Type 1 Diabetes Genetics Consortium,” Diabetes, vol. 59, no. 11, pp. 2972–2979, Nov. 2010, pmid:20798335
  23. 23. Noble J. A. and Erlich H. A., “Genetics of Type 1 Diabetes,” Cold Spring Harb. Perspect. Med., vol. 2, no. 1, pp. a007732–a007732, Jan. 2012, pmid:22315720
  24. 24. Jacobi T. et al., “HLA Class II Allele Analyses Implicate Common Genetic Components in Type 1 and Non–Insulin-Treated Type 2 Diabetes,” J. Clin. Endocrinol. Metab., vol. 105, no. 3, pp. e245–e254, Mar. 2020, pmid:31974565
  25. 25. Buzzetti R. et al., “High Titer of Autoantibodies to GAD Identifies a Specific Phenotype of Adult-Onset Autoimmune Diabetes,” Diabetes Care, vol. 30, no. 4, pp. 932–938, Apr. 2007, pmid:17392553
  26. 26. Radtke M. A., Midthjell K., Nilsen T. I. L., and Grill V., “Heterogeneity of Patients With Latent Autoimmune Diabetes in Adults: Linkage to Autoimmunity Is Apparent Only in Those With Perceived Need for Insulin Treatment: Results from the Nord-Trondelag Health (HUNT) study,” Diabetes Care, vol. 32, no. 2, pp. 245–250, Feb. 2009, pmid:19001190
  27. 27. Tuomi T., Santoro N., Caprio S., Cai M., Weng J., and Groop L., “The many faces of diabetes: a disease with increasing heterogeneity,” The Lancet, vol. 383, no. 9922, pp. 1084–1094, Mar. 2014, pmid:24315621
  28. 28. Ahlqvist E. et al., “Novel subgroups of adult-onset diabetes and their association with outcomes: a data-driven cluster analysis of six variables,” Lancet Diabetes Endocrinol., vol. 6, no. 5, pp. 361–369, May 2018, pmid:29503172
  29. 29. Mansour Aly D. et al., “Genome-wide association analyses highlight etiological differences underlying newly defined subtypes of diabetes,” Nat. Genet., vol. 53, no. 11, pp. 1534–1542, Nov. 2021, pmid:34737425
  30. 30. Buzzetti R. et al., “Management of Latent Autoimmune Diabetes in Adults: A Consensus Statement From an International Expert Panel,” Diabetes, vol. 69, no. 10, pp. 2037–2047, Oct. 2020, pmid:32847960
  31. 31. Bycroft C. et al., “The UK Biobank resource with deep phenotyping and genomic data,” Nature, vol. 562, no. 7726, pp. 203–209, Oct. 2018, pmid:30305743
  32. 32. Smoller J. et al., “An eMERGE Clinical Center at Partners Personalized Medicine,” J. Pers. Med., vol. 6, no. 1, p. 5, Jan. 2016, pmid:26805891
  33. 33. Denny J. C. et al., “PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations,” Bioinformatics, vol. 26, no. 9, pp. 1205–1210, May 2010, pmid:20335276
  34. 34. Julkunen H., Cichońska A., Slagboom P. E., Würtz P., and Nightingale Health UK Biobank Initiative, “Metabolic biomarker profiling for identification of susceptibility to severe pneumonia and COVID-19 in the general population,” eLife, vol. 10, p. e63033, May 2021, pmid:33942721
  35. 35. Wei W.-Q. et al., “Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record,” PloS One, vol. 12, no. 7, p. e0175508, 2017, pmid:28686612
  36. 36. Wu P. et al., “Mapping ICD-10 and ICD-10-CM Codes to Phecodes: Workflow Development and Initial Evaluation,” JMIR Med. Inform., vol. 7, no. 4, p. e14325, Nov. 2019, pmid:31553307
  37. 37. UK10K Consortium et al., “Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel,” Nat. Commun., vol. 6, no. 1, p. 8111, Nov. 2015, pmid:26368830
  38. 38. the Haplotype Reference Consortium, “A reference panel of 64,976 haplotypes for genotype imputation,” Nat. Genet., vol. 48, no. 10, pp. 1279–1283, Oct. 2016, pmid:27548312
  39. 39. Howie B., Fuchsberger C., Stephens M., Marchini J., and Abecasis G. R., “Fast and accurate genotype imputation in genome-wide association studies through pre-phasing,” Nat. Genet., vol. 44, no. 8, pp. 955–959, Aug. 2012, pmid:22820512
  40. 40. Mbatchou J. et al., “Computationally efficient whole-genome regression for quantitative and binary traits,” Nat. Genet., vol. 53, no. 7, pp. 1097–1103, Jul. 2021, pmid:34017140
  41. 41. Watanabe K., Taskesen E., van Bochoven A., and Posthuma D., “Functional mapping and annotation of genetic associations with FUMA,” Nat. Commun., vol. 8, no. 1, p. 1826, Dec. 2017, pmid:29184056
  42. 42. Willer C. J., Li Y., and Abecasis G. R., “METAL: fast and efficient meta-analysis of genomewide association scans,” Bioinformatics, vol. 26, no. 17, pp. 2190–2191, Sep. 2010, pmid:20616382
  43. 43. Winkler T. W. et al., “Approaches to detect genetic effects that differ between two strata in genome-wide meta-analyses: Recommendations based on a systematic evaluation,” PLOS ONE, vol. 12, no. 7, p. e0181038, Jul. 2017, pmid:28749953
  44. 44. Purcell S. et al., “PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses,” Am. J. Hum. Genet., vol. 81, no. 3, pp. 559–575, Sep. 2007, pmid:17701901
  45. 45. Weeks E. M. et al., “Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases,” Genetic and Genomic Medicine, preprint, Sep. 2020.
  46. 46. de Leeuw C. A., Mooij J. M., Heskes T., and Posthuma D., “MAGMA: Generalized Gene-Set Analysis of GWAS Data,” PLOS Comput. Biol., vol. 11, no. 4, p. e1004219, Apr. 2015, pmid:25885710
  47. 47. Sun B. B. et al., “Genomic atlas of the human plasma proteome,” Nature, vol. 558, no. 7708, pp. 73–79, Jun. 2018, pmid:29875488
  48. 48. Min J. L. et al., “Genomic and phenotypic insights from an atlas of genetic effects on DNA methylation,” Nat. Genet., vol. 53, no. 9, pp. 1311–1321, Sep. 2021, pmid:34493871