Our current understanding of the determinants of plasma proteome variation during pediatric development remains incomplete. Here, we show that genetic variants, age, sex and body mass index significantly influence this variation. Using a streamlined and highly quantitative mass spectrometry-based proteomics workflow, we analyzed plasma from 2,147 children and adolescents, identifying 1,216 proteins after quality control. Notably, the levels of 70% of these were associated with at least one of the aforementioned factors, with protein levels also being predictive. Quantitative trait loci (QTLs) regulated at least one-third of the proteins; between a few percent and up to 30-fold. Together with excellent replication in an additional 1,000 children and 558 adults, this reveals substantial genetic effects on plasma protein levels, persisting from childhood into adulthood. Through Mendelian randomization and colocalization analyses, we identified 41 causal genes for 33 cardiometabolic traits, emphasizing the value of protein QTLs in drug target identification and disease understanding.
Publications
2025
Genetic sequencing technologies are powerful tools for identifying rare variants and genes associated with Mendelian and complex traits; indeed, whole-exome and whole-genome sequencing are increasingly popular methods for population-scale genetic studies. However, careful quality control steps should be taken to ensure study accuracy and reproducibility, and sequencing data require extensive quality filtering to delineate true variants from technical artifacts. Although processing standards are harmonized across pipelines to call variants from sequencing reads, there currently exists no standardized pipeline for conducting quality filtering on variant-level datasets for the purpose of population-scale association analysis. In this Tutorial, we discuss key quality control parameters, provide guidelines for conducting quality filtering of samples and variants, and compare commonly used software programs for quality control of samples, variants and genotypes from sequencing data. As sequencing data continue to gain popularity in genetic research, establishing standardized quality control practices is crucial to ensure consistent, reliable and reproducible results across studies.
A key challenge of the modern genomics era is developing empirical data-driven representations of gene function. Here we present the first unbiased morphology-based genome-wide perturbation atlas in human cells, containing three genome-wide genotype-phenotype maps comprising CRISPR-Cas9-based knockouts of >20,000 genes in >30 million cells. Our optical pooled cell profiling platform (PERISCOPE) combines a destainable high-dimensional phenotyping panel (based on Cell Painting) with optical sequencing of molecular barcodes and a scalable open-source analysis pipeline to facilitate massively parallel screening of pooled perturbation libraries. This perturbation atlas comprises high-dimensional phenotypic profiles of individual cells with sufficient resolution to cluster thousands of human genes, reconstruct known pathways and protein-protein interaction networks, interrogate subcellular processes and identify culture media-specific responses. Using this atlas, we identify the poorly characterized disease-associated TMEM251/LYSET as a Golgi-resident transmembrane protein essential for mannose-6-phosphate-dependent trafficking of lysosomal enzymes. In sum, this perturbation atlas and screening platform represents a rich and accessible resource for connecting genes to cellular functions at scale.
Precision nutrition is a vibrant and rapidly evolving field of scientific research and innovation with the potential to deliver health, societal and economic benefits by improving healthcare delivery and policies. Advances in deep phenotyping technologies, digital tools and artificial intelligence have made possible early proof-of-concept research that expands the understanding of within- and between-person variability in responses to diet. These studies illustrate the promise of precision nutrition to complement the traditional 'one size fits all' dietary guidelines, which, while considering broad life-stage and disease-specific nutritional requirements, often lack the granularity to account fully for individual variations in nutritional needs and dietary responses. Despite these developments, however, considerable challenges remain before precision nutrition can be implemented on a broader scale. This Review examines the current state of precision nutrition research, with a focus on its application to reducing the incidence and burden of cardiometabolic diseases. We critically examine the evidence base, explore the potential benefits and discuss the challenges and opportunities ahead.
Obesity is a major public health challenge. Glucagon-like peptide-1 receptor agonists (GLP1-RA) and bariatric surgery (BS) are effective weight loss interventions; however, the genetic factors influencing treatment response remain largely unexplored. Moreover, most previous studies have focused on race and ethnicity rather than genetic ancestry. Here we analyzed 10,960 individuals from 9 multiancestry biobank studies across 6 countries to assess the impact of known genetic factors on weight loss. Between 6 and 12 months, GLP1-RA users had an average weight change of -3.93% or -6.00%, depending on the outcome definition, with modest ancestry-based differences. BS patients experienced -21.17% weight change between 6 and 48 months. We found no significant associations between GLP1-RA-induced weight loss and polygenic scores for body mass index or type 2 diabetes, nor with missense variants in GLP1R. A higher body mass index polygenic score was modestly linked to lower weight loss after BS (+0.7% per s.d., P = 1.24 × 10-4), but the effect attenuated in sensitivity analyses. Our findings suggest known genetic factors have limited impact on GLP1-RA effectiveness with respect to weight change and confirm treatment efficacy across ancestry groups.
BACKGROUND & AIMS: A quarter of the world population is estimated to have metabolic dysfunction-associated steatotic liver disease. Here, we aim to understand the impact of liver trait-associated genetic variants on fat content and tissue volume across organs and body compartments and on a large set of biomarkers.
METHODS: Genome-wide association analyses were performed on liver fat and liver volume estimated with magnetic resonance imaging in up to 27,243 unrelated European participants from the UK Biobank. Identified variants were assessed for associations with fat fraction and tissue volume in >2 million 'Imiomics' image elements in 22,261 individuals and with circulating biomarkers in 310,224 individuals.
RESULTS: We confirmed four liver fat and nine liver volume previously reported genetic variants (p values <5 × 10-8). We further found evidence suggestive of a novel liver volume locus, ADH4, where each additional T allele increased liver volume by 0.05 SD (SE = 0.01, p value = 3.3 × 10-8). The Imiomics analyses showed that liver fat-increasing variants were specifically associated with fat fraction of the liver tissue (p values <2.8 × 10-3) and with higher inflammation, liver and renal injury biomarkers, and lower lipid levels. Associations of liver volume variants with fat content, tissue volume, and biomarkers were more heterogeneous, for example the liver volume-increasing alleles at CENPW and PPP1R3B were associated with higher skeletal muscle volumes and were more pronounced in men, whereas the GCKR variant was negatively associated with lower skeletal muscle volumes in women (p values <2.8 × 10-3).
CONCLUSIONS: Liver fat-increasing variants were mostly linked to fat fraction of the liver and were positively associated with some adverse metabolic biomarkers and negatively with lipids. In contrast, liver volume-associated variants showed a less consistent pattern across organs and biomarkers.
IMPACT AND IMPLICATIONS: Liver fat and liver volume are common metabolic traits with a strong genetic component, yet the extent to which they exert organ-specific vs. systemic effects remains poorly defined. By integrating genome-wide association analyses and high-resolution neck-to-knee magnetic resonance imaging data through the Imiomics framework, this study reveals distinct genetic architectures for liver fat and liver volume, including sex-specific effects. These findings provide new insights into the biological, organ-level, tissue-specific, and systemic characteristics of steatotic liver disease and its genetic determinants. The results may inform the development of precision imaging genetic approaches, biomarker discovery, and stratified risk assessment strategies, while reinforcing the importance of incorporating sex-specific analyses in future research and clinical applications.
Serum lipid levels, which are influenced by both genetic and environmental factors, are key determinants of cardiometabolic health and are influenced by both genetic and environmental factors. Improving our understanding of their underlying biological mechanisms can have important public health and therapeutic implications. Although psychosocial factors, including depression, anxiety, and perceived social support, are associated with serum lipid levels, it is unknown if they modify the effect of genetic loci that influence lipids. We conducted a genome-wide gene-by-psychosocial factor interaction (G×Psy) study in up to 133,157 individuals to evaluate if G×Psy influences serum lipid levels. We conducted a two-stage meta-analysis of G×Psy using both a one-degree of freedom (1df) interaction test and a joint 2df test of the main and interaction effects. In Stage 1, we performed G×Psy analyses on up to 77,413 individuals and promising associations (P < 10-5) were evaluated in up to 55,744 independent samples in Stage 2. Significant findings (P < 5 × 10-8) were identified based on meta-analyses of the two stages. There were 10,230 variants from 120 loci significantly associated with serum lipids. We identified novel associations for variants in four loci using the 1df test of interaction, and five additional loci using the 2df joint test that were independent of known lipid loci. Of these 9 loci, 7 could not have been detected without modeling the interaction as there was no evidence of association in a standard GWAS model. The genetic diversity of included samples was key in identifying these novel loci: four of the lead variants displayed very low frequency in European ancestry populations. Functional annotation highlighted promising loci for further experimental follow-up, particularly rs73597733 (MACROD2), rs59808825 (GRAMD1B), and rs11702544 (RRP1B). Notably, one of the genes in identified loci (RRP1B) was found to be a target of the approved drug Atenolol suggesting potential for drug repurposing. Overall, our findings suggest that taking interaction between genetic variants and psychosocial factors into account and including genetically diverse populations can lead to novel discoveries for serum lipids.
Genome-wide association studies (GWASs) have identified thousands of genetic loci associated with cardiometabolic disorders. However, the functional interpretation of these loci remains a daunting challenge. This is particularly true for adipose tissue, a critical organ in systemic metabolism and the pathogenesis of various cardiometabolic diseases. We discuss how variant-to-function (V2F) approaches are used to elucidate the mechanisms by which GWAS loci increase the risk of cardiometabolic disorders by directly influencing adipose tissue. We outline GWAS traits most likely to harbor adipose-related variants and summarize tools to pinpoint the putative causal variants, genes, and cell types for the associated loci. We explain how large-scale perturbation experiments, coupled with imaging and multi-omics, can be used to screen variants' effects on cellular phenotypes and how these phenotypes can be tied to physiological mechanisms. Lastly, we discuss the challenges and opportunities that lie ahead for V2F research and propose a roadmap for future studies.
Cohesin-mediated DNA loop extrusion enables gene regulation by distal enhancers through the establishment of chromosome structure and long-range enhancer-promoter interactions. The best characterized cohesin-related structures, such as topologically associating domains (TADs) anchored at convergent CTCF binding sites, represent static conformations. Consequently, loop extrusion dynamics remain poorly understood. To better characterize static and dynamically extruding chromatin loop structures, we use MNase-based 3D genome assays to simultaneously determine CTCF and cohesin localization as well as the 3D contacts they mediate. Here we present CTCF Analyzer (with) Multinomial Estimation (CAMEL), a tool that identifies CTCF footprints at near base-pair resolution in CTCF MNase HiChiP. We also use Region Capture Micro-C to identify a CTCF-adjacent footprint that is attributed to cohesin occupancy. We leverage this substantial advance in resolution to determine that the fully extruded (CTCF-CTCF loop) state is rare genome-wide with locus-specific variation from 1-10%. We further investigate the impact of chromatin state on loop extrusion dynamics and find that active regulatory elements impede cohesin extrusion. These findings support a model of topological regulation whereby the transient, partially extruded state facilitates enhancer-promoter contacts that can regulate transcription.