Publications

2024

Sordillo, Joanne E, Frédérique White, Sana Majid, François Aguet, Kristin G Ardlie, Ananth Karumanchi, Jose C Florez, et al. (2024) 2024. “Higher Maternal Body Mass Index Is Associated With Lower Placental Expression of EPYC: A Genome-Wide Transcriptomic Study.”. The Journal of Clinical Endocrinology and Metabolism 109 (3): e1159-e1166. https://doi.org/10.1210/clinem/dgad619.

CONTEXT: Elevated body mass index (BMI) in pregnancy is associated with adverse maternal and fetal outcomes. The placental transcriptome may elucidate molecular mechanisms underlying these associations.

OBJECTIVE: We examined the association of first-trimester maternal BMI with the placental transcriptome in the Gen3G prospective cohort.

METHODS: We enrolled participants at 5 to 16 weeks of gestation and measured height and weight. We collected placenta samples at delivery. We performed whole-genome RNA sequencing using Illumina HiSeq 4000 and aligned RNA sequences based on the GTEx v8 pipeline. We conducted differential gene expression analysis of over 15 000 genes from 450 placental samples and reported the change in normalized gene expression per 1-unit increase in log2 BMI (kg/m2) as a continuous variable using Limma Voom. We adjusted models for maternal age, fetal sex, gestational age at delivery, gravidity, and surrogate variables accounting for technical variability. We compared participants with BMI of 18.5 to 24.9 mg/kg2 (N = 257) vs those with obesity (BMI ≥30 kg/m2, N = 82) in secondary analyses.

RESULTS: Participants' mean ± SD age was 28.2 ± 4.4 years and BMI was 25.4 ± 5.5 kg/m2 in early pregnancy. Higher maternal BMI was associated with lower placental expression of EPYC (slope = -1.94, false discovery rate [FDR]-adjusted P = 7.3 × 10-6 for continuous BMI; log2 fold change = -1.35, FDR-adjusted P = 3.4 × 10-3 for BMI ≥30 vs BMI 18.5-24.9 kg/m2) and with higher placental expression of IGFBP6, CHRDL1, and CXCL13 after adjustment for covariates and accounting for multiple testing (FDR < 0.05).

CONCLUSION: Our genome-wide transcriptomic study revealed novel genes potentially implicated in placental biologic response to higher maternal BMI in early pregnancy.

Szczerbinski, Lukasz, Ravi Mandla, Philip Schroeder, Bianca C Porneala, Josephine H Li, Jose C Florez, Josep M Mercader, Miriam S Udler, and Alisa K Manning. (2024) 2024. “Algorithms for the Identification of Prevalent Diabetes in the All of Us Research Program Validated Using Polygenic Scores.”. Scientific Reports 14 (1): 26895. https://doi.org/10.1038/s41598-024-74730-9.

The All of Us Research Program (AoU) is an initiative designed to gather a comprehensive and diverse dataset from at least one million individuals across the USA. This longitudinal cohort study aims to advance research by providing a rich resource of genetic and phenotypic information, enabling powerful studies on the epidemiology and genetics of human diseases. One critical challenge to maximizing its use is the development of accurate algorithms that can efficiently and accurately identify well-defined disease and disease-free participants for case-control studies. This study aimed to develop and validate type 1 (T1D) and type 2 diabetes (T2D) algorithms in the AoU cohort, using electronic health record (EHR) and survey data. Building on existing algorithms and using diagnosis codes, medications, laboratory results, and survey data, we developed and implemented algorithms for identifying prevalent cases of type 1 and type 2 diabetes. The first set of algorithms used only EHR data (EHR-only), and the second set used a combination of EHR and survey data (EHR+). A universal algorithm was also developed to identify individuals without diabetes. The performance of each algorithm was evaluated by testing its association with polygenic scores (PSs) for type 1 and type 2 diabetes. We demonstrated the feasibility and utility of using AoU EHR and survey data to employ diabetes algorithms. For T1D, the EHR-only algorithm showed a stronger association with T1D-PS compared to the EHR + algorithm (DeLong p-value = 3 × 10-5). For T2D, the EHR + algorithm outperformed both the EHR-only and the existing T2D definition provided in the AoU Phenotyping Library (DeLong p-values = 0.03 and 1 × 10-4, respectively), identifying 25.79% and 22.57% more cases, respectively, and providing an improved association with T2D PS. We provide a new validated type 1 diabetes definition and an improved type 2 diabetes definition in AoU, which are freely available for diabetes research in the AoU. These algorithms ensure consistency of diabetes definitions in the cohort, facilitating high-quality diabetes research.

Syreeni, Anna, Emma H Dahlström, Laura J Smyth, Claire Hill, Stefan Mutter, Yogesh Gupta, Valma Harjutsalo, et al. (2024) 2024. “Blood Methylation Biomarkers Are Associated With Diabetic Kidney Disease Progression in Type 1 Diabetes.”. MedRxiv : The Preprint Server for Health Sciences. https://doi.org/10.1101/2024.11.28.24318055.

BACKGROUND: DNA methylation differences are associated with kidney function and diabetic kidney disease (DKD), but prospective studies are scarce. Therefore, we aimed to study DNA methylation in a prospective setting in the Finnish Diabetic Nephropathy Study type 1 diabetes (T1D) cohort.

METHODS: We analysed baseline blood sample-derived DNA methylation (Illumina's EPIC array) of 403 individuals with normal albumin excretion rate (early progression group) and 373 individuals with severe albuminuria (late progression group) and followed-up their DKD progression defined as decrease in eGFR to <60 mL/min/1.73m2 (early DKD progression group; median follow-up 13.1 years) or end-stage kidney disease (ESKD) (late DKD progression group; median follow-up 8.4 years). We conducted two epigenome-wide association studies (EWASs) on DKD progression and sought methylation quantitative trait loci (meQTLs) for the lead CpGs to estimate genetic contribution.

RESULTS: Altogether, 14 methylation sites were associated with DKD progression (P<9.4×10-8). Methylation at cg01730944 near CDKN1C and at other CpGs associated with early DKD progression were not correlated with baseline eGFR, whereas late progression CpGs were strongly associated. Importantly, 13 of 14 CpGs could be linked to a gene showing differential expression in DKD or chronic kidney disease. Higher methylation at the lead CpG cg17944885, a frequent finding in eGFR EWASs, was associated with ESKD risk (HR [95% CI] = 2.15 [1.79, 2.58]). Additionally, we replicated meQTLs for cg17944885 and identified ten novel meQTL variants for other CpGs. Furthermore, survival models including the significant CpG sites showed increased predictive performance on top of clinical risk factors.

CONCLUSIONS: Our EWAS on early DKD progression identified a podocyte-specific CDKN1C locus. EWAS on late progression proposed novel CpGs for ESKD risk and confirmed previously known sites for kidney function. Since DNA methylation signals could improve disease course prediction, a combination of blood-derived methylation sites could serve as a potential prognostic biomarker.

Kennedy, Ciarán, Ross Doyle, Oisin Gough, Caitriona Mcevoy, Susan McAnallen, Maria Hughes, Xin Sheng, et al. (2024) 2024. “A Novel Role for FERM Domain-Containing Protein 3 in CKD.”. Kidney360 5 (12): 1799-1812. https://doi.org/10.34067/KID.0000000602.

KEY POINTS: We have identified a transcriptional signature of 93 genes associated with CKD severity and progression. Protein 4.1, ezrin, radixin, moesin domain-containing protein 3 gene expression is reduced in the context of more severe kidney disease and in individuals who go on to develop progressive disease. Protein 4.1, ezrin, radixin, moesin domain-containing protein 3 interacts with proteins of the cell cytoskeleton and cell-cell junctions in proximal tubule epithelial cells.

BACKGROUND: Currently, there are limited methods to link disease severity and risk of disease progression in CKD. To better understand this potential relationship, we interrogated the renal transcriptomic profile of individuals with CKD with measures of CKD severity and identified protein 4.1, ezrin, radixin, moesin-domain containing protein 3 (FRMD3) as a candidate gene for follow-up study.

METHODS: RNA-sequencing was used to profile the transcriptome of CKD biopsies from the North Dublin Renal BioBank, the results of which were correlated with clinical parameters. The potential function of FRMD3 was explored by interrogating the FRMD3 interactome and assessing the effect of lentiviral mediated FRMD3 knock down on human renal proximal tubule epithelial cells by assessing cell viability, metabolic activity, and structural markers.

RESULTS: We identified a subset of 93 genes which are significantly correlated with eGFR and percentage tubulointerstitial fibrosis at time of biopsy and with CKD progression 5 years postbiopsy. These results were validated against transcriptomic data from an external cohort of 432 nephrectomy samples. One of the top-ranking genes from this subset, FRMD3, has previously been associated with the risk of developing diabetic kidney disease. Interrogating the interactome of FRMD3 in tubule epithelial cells revealed interactions with cytoskeletal components of cell-cell junctions. Knockdown of FRMD3 expression in tubule epithelial cells resulted in increased proapoptotic activity within the cells, as well as dysregulation of E-Cadherin.

CONCLUSIONS: We have identified a panel of kidney-specific transcripts correlated with severity and progression of kidney disease, and from this, we have identified a possible role for FRMD3 in tubule cell structure and health.

Mandla, Ravi, Philip H Schroeder, Jose C Florez, Josep M Mercader, and Aaron Leong. (2024) 2024. “Hemoglobin A1c Genetics and Disparities in Risk of Diabetic Retinopathy in Individuals of Genetically Inferred African American/African British and European Ancestries.”. Diabetes Care 47 (10): 1731-39. https://doi.org/10.2337/dc23-1691.

OBJECTIVE: Individuals with diabetes who carry genetic variants that lower hemoglobin A1c (HbA1c) independently of glycemia may have higher real, but undetected, hyperglycemia compared with those without these variants despite achieving similar HbA1c targets, potentially placing them at greater risk for diabetes-related complications. We sought to determine whether these genetic variants, aggregated in a polygenic score, and the large-effect African ancestry-specific missense variant in G6PD (rs1050828) that lower HbA1c were associated with higher retinopathy risk.

RESEARCH DESIGN AND METHODS: Using data from 29,828 type 2 diabetes cases of genetically inferred African American/African British and European ancestries, we calculated ancestry-specific nonglycemic HbA1c polygenic scores (ngA1cPS) composed of 122 variants associated with HbA1c at genome-wide significance, but not with glucose. We tested the association of the ngA1cPS and the G6PD variant with retinopathy, adjusting for measured HbA1c and retinopathy risk factors.

RESULTS: Participants in the bottom quintile of the ngA1cPS showed between 20% and 50% higher retinopathy prevalence, compared with those above this quintile, despite similar levels of measured HbA1c. The adjusted meta-analytic odds ratio for the bottom quintile was 1.31 (95% CI 1.0, 1.73; P = 0.05) in African ancestry and 1.31 (95% CI 1.15, 1.50; P = 6.5 × 10-5) in European ancestry. Among individuals of African ancestry with HbA1c below 7%, retinopathy prevalence was higher in individuals below, compared with above, the 50th percentile of the ngA1cPS regardless of sex or G6PD carrier status.

CONCLUSIONS: Genetic effects need to be considered to personalize HbA1c targets and improve outcomes of people with diabetes from diverse ancestries.

Antonio-Villa, Neftali Eduardo, Omar Yaxmehen Bello-Chavolla, Carlos A Fermín-Martínez, Daniel Ramírez-García, Arsenio Vargas-Vázquez, Martín Roberto Basile-Alvarez, Alejandra Núñez-Luna, et al. (2024) 2024. “Diabetes Subgroups and Sociodemographic Inequalities in Mexico: A Cross-Sectional Analysis of Nationally Representative Surveys from 2016 to 2022.”. Lancet Regional Health. Americas 33: 100732. https://doi.org/10.1016/j.lana.2024.100732.

BACKGROUND: Differences in the prevalence of four diabetes subgroups have been reported in Mexico compared to other populations, but factors that may contribute to these differences are poorly understood. Here, we estimate the prevalence of diabetes subgroups in Mexico and evaluate their correlates with indicators of social disadvantage using data from national representative surveys.

METHODS: We analyzed serial, cross-sectional Mexican National Health and Nutrition Surveys spanning 2016, 2018, 2020, 2021, and 2022, including 23,354 adults (>20 years). Diabetes subgroups (obesity-related [MOD], severe insulin-deficient [SIDD], severe insulin-resistant [SIRD], and age-related [MARD]) were classified using self-normalizing neural networks based on a previously validated algorithm. We used the density-independent social lag index (DISLI) as a proxy of state-level social disadvantage.

FINDINGS: We identified 4204 adults (median age: 57, IQR: 47-66, women: 64%) living with diabetes, yielding a pooled prevalence of 16.04% [95% CI: 14.92-17.17]. When stratified by diabetes subgroup, prevalence was 6.62% (5.69-7.55) for SIDD, 5.25% (4.52-5.97) for MOD, 2.39% (1.95-2.83) for MARD, and 1.27% (1.00-1.54) for SIRD. SIDD and MOD clustered in Southern Mexico, whereas MARD and SIRD clustered in Northern Mexico and Mexico City. Each standard deviation increase in DISLI was associated with higher odds of SIDD (OR: 1.12, 95% CI: 1.06-1.12) and lower odds of MOD (OR: 0.93, 0.88-0.99). Speaking an indigenous language was associated with higher odds of SIDD (OR: 1.35, 1.16-1.57) and lower odds of MARD (OR 0.58, 0.45-0.74).

INTERPRETATION: Diabetes prevalence in Mexico is rising in the context of regional and sociodemographic inequalities across distinct diabetes subgroups. SIDD is a subgroup of concern that may be associated with inadequate diabetes management, mainly in marginalized states.

FUNDING: This research was supported by Instituto Nacional de Geriatría in Mexico.

Mandla, Ravi, Philip Schroeder, Bianca Porneala, Jose C Florez, James B Meigs, Josep M Mercader, and Aaron Leong. (2024) 2024. “Polygenic Scores for Longitudinal Prediction of Incident Type 2 Diabetes in an Ancestrally and Medically Diverse Primary Care Physician Network: A Patient Cohort Study.”. Genome Medicine 16 (1): 63. https://doi.org/10.1186/s13073-024-01337-0.

BACKGROUND: The clinical utility of genetic information for type 2 diabetes (T2D) prediction with polygenic scores (PGS) in ancestrally diverse, real-world US healthcare systems is unclear, especially for those at low clinical phenotypic risk for T2D.

METHODS: We tested the association of PGS with T2D incidence in patients followed within a primary care practice network over 16 years in four hypothetical scenarios that varied by clinical data availability (N = 14,712): (1) age and sex; (2) age, sex, body mass index (BMI), systolic blood pressure, and family history of T2D; (3) all variables in (2) and random glucose; and (4) all variables in (3), HDL, total cholesterol, and triglycerides, combined in a clinical risk score (CRS). To determine whether genetic effects differed by baseline clinical risk, we tested for interaction with the CRS.

RESULTS: PGS was associated with incident T2D in all models. Adjusting for age and sex only, the Hazard Ratio (HR) per PGS standard deviation (SD) was 1.76 (95% CI 1.68, 1.84) and the HR of top 5% of PGS vs interquartile range (IQR) was 2.80 (2.39, 3.28). Adjusting for the CRS, the HR per SD was 1.48 (1.40, 1.57) and HR of the top 5% of PGS vs IQR was 2.09 (1.72, 2.55). Genetic effects differed by baseline clinical risk ((PGS-CRS interaction p = 0.05; CRS below the median: HR 1.60 (1.43, 1.79); CRS above the median: HR 1.45 (1.35, 1.55)).

CONCLUSIONS: Genetic information can help identify high-risk patients even among those perceived to be low risk in a clinical evaluation.

Moore, Jill E, Henry E Pratt, Kaili Fan, Nishigandha Phalke, Jonathan Fisher, Shaimae I Elhajjajy, Gregory Andrews, et al. (2024) 2024. “An Expanded Registry of Candidate Cis-Regulatory Elements for Studying Transcriptional Regulation.”. BioRxiv : The Preprint Server for Biology. https://doi.org/10.1101/2024.12.26.629296.

Mammalian genomes contain millions of regulatory elements that control the complex patterns of gene expression. Previously, The ENCODE consortium mapped biochemical signals across many cell types and tissues and integrated these data to develop a Registry of 0.9 million human and 300 thousand mouse candidate cis-Regulatory Elements (cCREs) annotated with potential functions1. We have expanded the Registry to include 2.35 million human and 927 thousand mouse cCREs, leveraging new ENCODE datasets and enhanced computational methods. This expanded Registry covers hundreds of unique cell and tissue types, providing a comprehensive understanding of gene regulation. Functional characterization data from assays like STARR-seq, MPRA, CRISPR perturbation, and transgenic mouse assays now cover over 90% of human cCREs, revealing complex regulatory functions. We identified thousands of novel silencer cCREs and demonstrated their dual enhancer/silencer roles in different cellular contexts. Integrating the Registry with other ENCODE annotations facilitates genetic variation interpretation and trait-associated gene identification, exemplified by discovering KLF1 as a novel causal gene for red blood cell traits. This expanded Registry is a valuable resource for studying the regulatory genome and its impact on health and disease.

Lim, Siew S, Zhila Semnani-Azad, Mario L Morieri, Ashley H Ng, Abrar Ahmad, Hugo Fitipaldi, Jacqueline Boyle, et al. (2024) 2024. “Reporting Guidelines for Precision Medicine Research of Clinical Relevance: The BePRECISE Checklist.”. Nature Medicine 30 (7): 1874-81. https://doi.org/10.1038/s41591-024-03033-3.

Precision medicine should aspire to reduce error and improve accuracy in medical and health recommendations by comparison with contemporary practice, while maintaining safety and cost-effectiveness. The etiology, clinical manifestation and prognosis of diseases such as obesity, diabetes, cardiovascular disease, kidney disease and fatty liver disease are heterogeneous. Without standardized reporting, this heterogeneity, combined with the diversity of research tools used in precision medicine studies, makes comparisons across studies and implementation of the findings challenging. Specific recommendations for reporting precision medicine research do not currently exist. The BePRECISE (Better Precision-data Reporting of Evidence from Clinical Intervention Studies & Epidemiology) consortium, comprising 23 experts in precision medicine, cardiometabolic diseases, statistics, editorial and lived experience, conducted a scoping review and participated in a modified Delphi and nominal group technique process to develop guidelines for reporting precision medicine research. The BePRECISE checklist comprises 23 items organized into 5 sections that align with typical sections of a scientific publication. A specific section about health equity serves to encourage precision medicine research to be inclusive of individuals and communities that are traditionally under-represented in clinical research and/or underserved by health systems. Adoption of BePRECISE by investigators, reviewers and editors will facilitate and accelerate equitable clinical implementation of precision medicine.