MRdb | Help

MRdb: a comprehensive database of Univariate & Multivariate mendelian randomization with large-scale GWAS summary data.

In recent years, the number of published Mendelian randomization (MR) analyses has increased exponentially due to the widespread availability of genetic data and the explosion of genome-wide association studies (GWAS). By using genetic variants as instrumental variables for exposure, MR analysis is an epidemiological design that can strengthen causal inference. Compared with observational studies, MR analysis can reduce bias due to confounding because the genetic alleles are randomly selected at conception and therefore have no correlations with environmental and self-administered factors. However, GWAS results for therapeutic targets, such as plasma proteins, inflammatory cytokines, immune cells, eQTL genes, etc. are often insufficiently curated, undermining efficient implementation of the MR approach. To meet this need, we developed MRdb, a comprehensive database of curated GWAS results and MR applications for inferring causal relationships between therapeutic targets and phenotypes.

The integration of data and web tools in MRdb allows the rigorous application of hypothesis-driven analysis, enabling the efficient evaluation of millions of potential associations across phenotypes. MRdb curates thousands of GWAS results for therapeutic targets as exposures, including 19,126 eQTL genes, 4,907 plasma proteins, 1,400 plasma metabolites, 731 immune cells, 486 blood metabolites, 597 skin microbiota, 412 gut microbiota, 179 plasma lipids, 91 circulating proteins, 74 blood cells, 41 inflammatory cytokines and 8 thyroid hormones. MRdb also curates a number of 48,507 GWAS datasets as outcomes.

MRdb provides a user-friendly searching and analyzing interface. The Univariable MR analysis tool allows users to infer causal relationships between phenotypes, using a strategy known as 2-sample Mendelian randomization (2SMR). The Multivariable MR analysis tool was developed to estimate the effect of different mediators on the risk of outcome phenotypes with adjustment for exposures (e.g. adjustment for BMI, smoking initiation and type 2 diabetes mellitus). To assess the sensitivity of different patterns to MR results, MRdb provides a range of diagnostics and sensitivity analyses, including Heterogeneity test, Horizontal pleiotropy test, Horizontal pleiotropy test and Leave-one-out Validation. These results can be visualized as MR Plots, Funnel Plots, Forest Plots and Density Plots. Based on MRdb, user-definable parameters are set for all analysis steps. All data and analyses are freely accessible and downloadable. The MRdb is regularly updated by combining the latest genetic data with the latest statistical methods. Collectively, MRdb will serve as an important resource for investigating the potential causal relationships between therapeutic targets and phenotypes.

MRdb provides a user-friendly searching interface.

1. Main functions of the MRdb are provided in menu bar form (boxed in green).

2. Click "Get Start" button to select an exposure dataset of interest for Mendelian randomization.

3. Click "Analysis" button to select a univariate/multivariate analysis for Mendelian randomization.

4. Click "Get Help" button to get helps of MRdb.

5. This panel displays basic information about the exposure datasets built into MRdb, and you can click on any exposure dataset to perform univariate/multivariate Mendelian randomization analysis.

Figure 1-1

MRdb provides the ability to perform univariate analysis on 12 types of exposure datasets.

1. In the "Exposure (Univariate)" section, input or select the exposure dataset for Mendelian randomization analysis.

2. Click on the pre-defined example datasets (e.g., "IL-10", "CTACK") to quickly fill in exposure data for demonstration purposes.

3. Navigate to the "Outcome" section and input or select the outcome dataset for analysis.

4. Utilize the example buttons (e.g., "Alzheimer disease", "Abscess of lung") to quickly load pre-defined outcome datasets for demonstration.

5. Adjust univariate analysis parameters under the "Params" section, including thresholds, methods, and additional settings to customize the analysis process.

6. Once the configurations are set, click the "Submit" button to execute the univariate analysis.

Figure 2-1

Figure 2-2

MRdb also provides the ability to perform multivariate analysis for complex datasets.

7. In the "Exposure (Multivariate)" section, input or select the exposure datasets for Mendelian randomization multivariate analysis.

8. Click on the pre-defined example datasets (e.g., "IL-10", "CTACK") to quickly populate multiple exposure variables for demonstration.

9. Navigate to the "Outcome" section and input or select the outcome dataset for multivariate analysis.

10. Use the example buttons (e.g., "Alzheimer disease", "Abscess of lung") to load pre-defined outcome datasets for multivariate analysis demonstration purposes.

11. Configure the advanced multivariate parameters under the "Params" section, including variable thresholds, model selection, and distribution types to suit the analytical needs.

12. After confirming all configurations, click the "Submit" button to run the multivariate analysis.

Figure 2-3

Figure 2-4

MRdb provides the ability to perform univariate analysis on 12 types of exposure datasets.

1. Click to select an exposure dataset.

2. Further filtering based on the currently selected dataset.

3. Click on one dataset in the table that interests you as the exposure data in Mendelian randomization univariate analysis.

4. Filter the outcome dataset you are interested in.

5. Click on one dataset in the table that interests you as the outcome data in Mendelian randomization univariate analysis.

6. To display your selected dataset (exposure data and outcome data), you can click on the table icon on the right to view the raw dataset.

7. Choose a variety of univariate Mendelian randomization parameters for analysis.

Figure 3-1

Figure 3-2

Figure 3-3

MRdb provides full process analysis results, including heterogeneity test, horizontal pleiotropy test, etc.

1. Exposure dataset basic information.

2. Outcome dataset basic information.

3. Harmonise Result.

4. MR Result.

5. Heterogeneity Test.

6. Pleiotropy Test.

7. MR Plot.

8. Leave-one-out Validation.

9. Funnel Plot.

10. Forest Plot.

11. Density Plot

Figure 4-1

Figure 4-2

MRdb provides the ability to perform multivariate analysis on 12 types of exposure datasets.

1. Click to select exposure datasets.

2. Further filtering based on the currently selected dataset.

3. Click on one dataset (greater than or equal to 2 and less than or equal to 5) in the table that interests you as the exposure data in Mendelian randomization multivariate analysis.

4. Filter the outcome dataset you are interested in.

5. Click on one dataset in the table that interests you as the outcome data in Mendelian randomization multivariate analysis.

6. To display your selected dataset (exposure datasets and outcome data), you can click "extra exposure" to add additional exposure datasets.

7. Choose a variety of multivariate Mendelian randomization parameters for analysis.

Figure 5-1

Figure 5-2

Figure 5-3

MRdb provides full process analysis results, including MV-IVW result, MV-EGGER result, etc.

1. MVMR Result.

2. Exposure datasets basic information.

3. Outcome dataset basic information.

4. MV-IVW Result.

5. MV-EGGER Result.

6. MRMV-IVW Result.

7. F statistic.

8. Q table.

Figure 6-1

A genome-wide association study meta-analysis of thyroid function in up to 271,040 individuals of European ancestry, including thyrotropin (TSH), free thyroxine (FT4), free T3 (FT3), total T3 (TT3), the FT3/FT4 ratio, the TT3/FT4 ratio, high TSH levels, and low TSH levels of a sex-combined dataset.

Please cite: Sterenborg RBTM, Steinbrenner I, Li Y, et al. Multi-trait analysis characterizes the genetics of thyroid function and identifies causal associations with clinical implications. Nat Commun. 2024;15(1):888. Published 2024 Jan 30. doi:10.1038/s41467-024-44701-9

Figure 7-1

The results reported here are the meta-analysis summary statistics for the 41 inflammatory cytokines without fitting BMI as a covariate in the model (as was done in the previously published GWAS). Results are filtered such that only SNPs with results for 2 or more of the 3 cohorts remain.

Please cite: Kalaoja M, Corbin LJ, Tan VY, et al. The Role of Inflammatory Cytokines as Intermediates in the Pathway from Increased Adiposity to Disease. Obesity (Silver Spring). 2021;29(2):428-437. doi:10.1002/oby.23060

Figure 8-1

The study is a trans-ethnic meta-analysis that evaluated 15 blood cell traits in 746,667 participants (including 184,535 non-European) from 5 global populations.

Please cite: Chen MH, Raffield LM, Mousas A, et al. Trans-ethnic and Ancestry-Specific Blood-Cell Genetics in 746,667 Individuals from 5 Global Populations. Cell. 2020;182(5):1198-1213.e14. doi:10.1016/j.cell.2020.06.045

Figure 9-1

A genome-wide protein quantitative trait locus (pQTL) study of 91 plasma proteins measured using the Olink Target platform in 14,824 participants.

Please cite: Zhao JH, Stacey D, Eriksson N, et al. Genetics of circulating inflammatory proteins identifies drivers of immune-mediated disease risk and therapeutic targets [published correction appears in Nat Immunol. 2023 Sep 7;:]. Nat Immunol. 2023;24(9):1540-1551. doi:10.1038/s41590-023-01588-w

Figure 10-1

An univariate and multivariate genome-wide analyses of 179 lipid species in 7,174 Finnish individuals.

Please cite: Ottensmann L, Tabassum R, Ruotsalainen SE, et al. Genome-wide association analysis of plasma lipidome identifies 495 genetic associations. Nat Commun. 2023;14(1):6934. Published 2023 Oct 31. doi:10.1038/s41467-023-42532-8

Figure 11-1

The study, a meta-analysis of genome-wide association studies (GWAS) of two population-based German cohorts, revealed the effect of genetics on skin microbiota across three different skin microenvironments.

Please cite: Moitinho-Silva L, Degenhardt F, Rodriguez E, et al. Host genetic factors related to innate immunity, environmental sensing and cellular functions are associated with human skin microbiota. Nat Commun. 2022;13(1):6204. Published 2022 Oct 19. doi:10.1038/s41467-022-33906-5

Figure 12-1

A genome-wide association study of 207 taxa and 205 pathways representing microbial composition and function in 7,738 participants of the Dutch Microbiome Project.

Please cite: Lopera-Maya EA, Kurilshikov A, van der Graaf A, et al. Effect of host genetics on the gut microbiome in 7,738 participants of the Dutch Microbiome Project [published correction appears in Nat Genet. 2022 Sep;54(9):1448]. Nat Genet. 2022;54(2):143-151. doi:10.1038/s41588-021-00992-y

Figure 13-1

The study involved 7,824 adult individuals from 2 European population studies. The study reported genome-wide significant associations at 145 metabolic loci and their biochemical connectivity with more than 400 metabolites in human blood.

Please cite: Shin SY, Fauman EB, Petersen AK, et al. An atlas of genetic influences on human blood metabolites. Nat Genet. 2014;46(6):543-550. doi:10.1038/ng.2982

Figure 14-1

This study report on the influence of ~22 million variants on 731 immune cell traits in a cohort of 3,757 Sardinians.

Please cite: Orrù V, Steri M, Sidore C, et al. Complex genetic signatures in immune cells underlie autoimmunity and inform therapy [published correction appears in Nat Genet. 2020 Sep 18;:]. Nat Genet. 2020;52(10):1036-1045. doi:10.1038/s41588-020-0684-4

Figure 15-1

The study was conducted through a genome-wide association study of 1,091 blood metabolites and 309 metabolite ratios.

Please cite: Chen Y, Lu T, Pettersson-Kymmer U, et al. Genomic atlas of the plasma metabolome prioritizes metabolites implicated in human diseases. Nat Genet. 2023;55(1):44-53. doi:10.1038/s41588-022-01270-1

Figure 16-1

A genome-wide association studies (GWASs) of plasma protein levels measured with 4,907 aptamers in 35,559 Icelanders.

Please cite: Ferkingstad E, Sulem P, Atlason BA, et al. Large-scale integration of the plasma proteome with genetics and disease. Nat Genet. 2021;53(12):1712-1721. doi:10.1038/s41588-021-00978-w

Figure 17-1

To investigate the genetics of gene expression, this study performed cis- and trans-expression quantitative trait locus (eQTL) analyses using blood-derived expression from 31,684 individuals through the eQTLGen Consortium. We only use cis-eQTL here.

Please cite: Võsa U, Claringbould A, Westra HJ, et al. Large-scale cis- and trans-eQTL analyses identify thousands of genetic loci and polygenic scores that regulate blood gene expression. Nat Genet. 2021;53(9):1300-1310. doi:10.1038/s41588-021-00913-z

Figure 18-1

MR db

MRdb introduction

Quick start in MRdb

MRdb - Custom analysis

MRdb - Univariate analysis

MRdb - Univariate analysis result

MRdb - Multivariate analysis

MRdb - Multivariate analysis result

MRdb - Analysing ThyroidOmics

MRdb - Analysing Inflammatory cytokines

MRdb - Blood cells

MRdb - Analysing Circulating proteins

MRdb - Analysing Plasma lipidome

MRdb - Analysing Skin microbiota

MRdb - Analysing Dutch gut microbiota

MRdb - Analysing Blood metabolites

MRdb - Analysing Immune cells

MRdb - Analysing Plasma metabolome

MRdb - Analysing Plasma proteome

MRdb - Analysing eQTLGen