Efficient multivariate linear mixed model algorithms for genome-wide association studies

May 22, 2017 genome wide association studies gwas have identified a large amount of singlenucleotide polymorphisms snps associated with complex traits. Examples of linear mixed models based methods include the multitrait mixed models proposed by korte et al. Efficient algorithms for multivariate linear mixed models in genomewide association studies. Efficient multivariate analysis algorithms for longitudinal. Xiang zhou, peter carbonetto and matthew stephens 20. An efficient algorithm for multivariate linear mixed model. Genomewide efficient mixedmodel analysis for association studies. The second step uses the correlation between residuals of the linear mixed model to estimate. Here, we aimed to identify and characterize genetic variants with pleiotropic effects on cytokines. A fast and powerful empirical bayes method for genomewide. An efficient algorithm for multivariate linear mixed model analysis based on genomic information. One class of approaches for this problem builds on classical variance component methodology, utilizing a multitrait version of a linear mixed model. Efficient multivariate analysis algorithms for longitudinal genomewide association studies. Supplementary note algorithms for genome wide association.

Unlabelled we have developed an algorithm for genetic analysis of complex traits using genome wide snps in a linear mixed model framework. Aug 17, 2018 current dynamic phenotyping system introduces time as an extra dimension to genome wide association studies gwas. May 01, 2015 multipletrait association mapping, in which multiple traits are used simultaneously in the identification of genetic variants affecting those traits, has recently attracted interest. The use of multivariate information could enhance the detection power of gwa. Multivariate linear mixed models for statistical genetics.

The r package multimeta provides an implementation of the inversevariancebased method for metaanalysis, generalized to an n dimensional setting. Efficient algorithms for multivariate linear mixed models in genomewide association studies article in nature methods 114 february 2014 with 204 reads how we measure reads. Linear mixed models lmm adopted to gwas kang, et al. Aug 24, 2017 a multivariate genome wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. However, most of these studies were performed in a single phenotype framework without putting into consideration the clinical relatedness among traits. Imputing phenotypes for genomewide association studies. Coconet integrates tissuespecific gene coexpression networks constructed from either bulk or single cell rna sequencing studies with association summary statistics from genome wide association studies. The first contribution of this thesis is mtset, an efficient mixed model approach that enables genome wide association testing between sets of genetic variants and multiple traits while accounting for confounding factors. The emma method utilizes a spectral decomposition algorithm to build the likelihood. Polygenic modeling with bayesian sparse linear mixed models. Of these, the genome wide efficient mixed model association gemma approach should become more widely adopted because it efficiently refits the mixed model at every marker, resulting in exact estimates of marker effects rather than approximate estimates in emmax or p3d. Multivariate linear mixed models mvlmms have been widely used in many areas of genetics, and have attracted considerable recent interest in genome wide association studies gwass. There is an increasing interest in using linear mixed models lmms, also known as mixed linear models, or mlms to test for association in genomewide association studies gwas, because of their demonstrated effectiveness in accounting for relatedness among samples and in controlling for population stratification and other confounding factors 17.

Advancements of transcriptome imputation and related. In contrast to existing univariate linear mixed model analyses, the proposed new method has improved statistic power for association detection and computational speed. Coconet is a composite likelihoodbased covariance regression network model for identifying traitrelevant tissues or cell types. Stephens, efficient algorithms for multivariate linear mixed models in genomewide association studies, arxiv preprint arxiv. Genomeassisted prediction of quantitative traits using. Genome wide association mapping gwa has been widely applied to a variety of species to identify genomic regions responsible for quantitative traits. Linear mixed model lmm is an efficient method for gwas.

Compared to current standard reml software based on the mixed model equation, our method is substantially faster. In contrast with existing univariate mixed linear model analyses, our algorithms have improved. Contribute to genenetworkgemma development by creating an account on github. Apr 26, 2019 genome wide association studies gwas have found hundreds of novel loci associated with full blood count fbc phenotypes. Efficient algorithms for multivariate linear mixed models. Efficient algorithms for multivariate linear mixed models in. Efficient multivariate linear mixed model algorithms for genomewide association studies. Using three populationbased cohorts n 9,263, we performed multivariate genome wide association studies gwas for a correlation network of 11 circulating cytokines, then combined our results in metaanalysis. Current largescale standard univariate and multivariate gwas analyses. Recently, the linear mixed model has become a popular approach for. We present efficient algorithms in the genome wide efficient mixed model association gemma software.

Efficient multipletrait association and estimation of. A recently developed linear mixed model for estimating heritability by simultaneously fitting all snps suggests that common variants can explain a substantial fraction of heritability, which hints at the low power of single variant analysis typically. However, improving statistical power and computing efficiency have always been the research hotspots of the lmmbased gwas methods. Identifying pleiotropic genes in genomewide association. An algorithm for linear mixed models substantially reduces memory usage and run time for genomewide association studies. Multivariate linear mixed models implemented in the gemma software package add speed, power and the ability to test for genomewide associations between. Multivariate linear mixed models mvlmms are powerful tools for testing associations between singlenucleotide polymorphisms and multiple correlated phenotypes while controlling for population stratification in genome wide association studies. However, the power to detect these variants depends on the number of individuals whose phenotypes are collected, and for phenotypes that are difficult to collect, the sample size might be insufficient to achieve the desired statistical power.

Comparison of ftests for univariate and multivariate mixed. Integrative approaches for largescale transcriptomewide association studies. Genomewide association study identifies loci for traits. Genomeassisted prediction of quantitative traits using the r. If you use the bayesian sparse linear mixed model bslmm, please cite. A multivariate genomewide association study of wing shape in. Pdf efficient multivariate analysis algorithms for. Efficient multivariate linear mixed model algorithms for. Efficient algorithms for multivariate linear mixed models in genome.

However, existing methods for calculating the likelihood ratio test statistics in mvlmms are time consuming, and, without approximations, cannot be directly applied to analyze even two traits jointly in a typical. However, fitting mvlmms is computationally nontrivial, and no existing method is computationally practical for performing the likelihood ratio test lrt for mvlmms in gwas settings with moderate sample size n. Genome wide association studies gwass have been successful in detecting variants correlated with phenotypes of clinical interest. Genome wide efficient mixed model association gemma zhou and. Genome wide association apping with gemma alwaysdata. Recently, mvlmms have become increasingly important in genomewide association studies gwas, both because of their effectiveness in. Here, we proposed a fast empirical bayes method, which is based on linear mixed models. Efficient multivariate linear mixed model algorithms for genome wide assocication studies. Dec 11, 2018 efficient multivariate linear mixed model algorithms for genomewide association studies. Multivariate linear mixed models implemented in the gemma software package add speed, power and the ability to test for genome wide associations between genetic polymorphisms and multiple. Efficient multivariate analysis algorithms for longitudinal genome. We herein developed efficient genomewide multivariate association algorithms for longitudinal data. Complimentary methods for multivariate genomewide association. Oct 16, 2015 univariate, bivariate, and conditional genome wide association studies gwas were performed with gemma, a genome wide efficient mixed model association algorithm.

The improved algorithm scales linearly in cohort size, allowing the. Multivariate linear mixed models mvlmms are powerful tools for testing snp associations with multiple correlated phenotypes while controlling for population stratification in genomewide association studies. Multivariate genomewide association analysis of a cytokine. Current dynamic phenotyping system introduces time as an extra dimension to genomewide association studies gwas. A mixed model approach for genome wide association studies of correlated traits in structured populations. We present efficient algorithms in the genomewide efficient mixed model association gemma software. Multivariate linear mixed models mvlmms are powerful tools for testing snp associations with multiple correlated phenotypes while controlling for population stratification in genome wide association studies. Xiang zhou, phd faculty profiles um school of public health. Efficient and accurate multiplephenotype regression.

As new methods for multivariate analysis of genome wide association studies become available, it is important to be able to combine results from different cohorts in a metaanalysis. Fast linear mixed models for genomewide association studies. Zhou x, stephens m 2014 efficient multivariate linear mixed model algorithms for genomewide association studies. The method is particularly useful for multivariate analysis. In contrast to existing univariate linear mixed model analyses, the proposed method has improved statistic power for association detection and computational speed. These approaches both increase power and provide insights into. A fast algorithm for bayesian multilocus model in genome. We herein developed efficient genome wide multivariate association algorithms for longitudinal data. The advantage is largest when there is only a single genetic covariance structure. Genomewide efficient mixed model association gemma zhou and. An efficient algorithm for multivariate linear mixed model analysis. The first step models the association between the genotype and marginal phenotype using a linear mixed model. We develop efficient genomewide multivariate analysis algorithms gma to deal with longitudinal data. Multivariate linear mixed models mvlmms have been widely used in many areas of genetics, and have attracted considerable recent interest in genomewide association studies gwass.

May 12, 2019 results we herein developed efficient genome wide multivariate association algorithms gma for longitudinal data. In this work, in addition to the standard univariate gwas, we also use two different multivariate methods to perform the. Multivariate linear mixed models mvlmms are powerful tools for testing associations between singlenucleotide polymorphisms and multiple correlated phenotypes while controlling for population stratification in genomewide association studies. Multivariate linear mixed models implemented in the gemma software package add speed, power and the ability to test for genomewide associations between genetic polymorphisms and multiple. A central limitation of these methods is that they cannot address the second. Genomewide efficient mixed model association github. Matthew stephens committee on genetics, genomics and. The statistical significance threshold for association was inferred by the simplem method. Genomewide efficient mixed model analysis for association. Genomewide association studies for feed intake and. Efficient computation with a linear mixed model on largescale data sets with applications to genetic studies.

464 1309 875 1185 554 946 731 617 977 478 9 1455 162 1274 1310 296 1097 849 539 1288 1382 439 96 355 1288 242 613 1465 532 95 573 141 835 1090 644 457 461 284 340 1173 730 80 830 104 665 355 286