Omics-squared: human genomic, transcriptomic and phenotypic data for genetic analysis workshop 19
John Blangero 1, Tanya M Teslovich 2, Xueling Sim 2, Marcio A Almeida 1, Goo Jun 3, Thomas D Dyer 1, Matthew Johnson 1, Juan M Peralta 1, Alisa Manning 4, Andrew R Wood 5, Christian Fuchsberger 2, Jack W Kent Jr 6, David A Aguilar 7, Jennifer E Below 8, Vidya S Farook 1, Rector Arya 1, Sharon Fowler 9, Tom W Blackwell 2, Sobha Puppala 6, Satish Kumar 1, David C Glahn 10, Eric K Moses 11, Joanne E Curran 1, Farook Thameem 12, Christopher P Jenkinson 1, Ralph A DeFronzo 13, Donna M Lehman 9, Craig Hanis 8, Goncalo Abecasis 2, Michael Boehnke 2, Harald Göring 1, Ravindranath Duggirala 1, Laura Almasy 14; T2D-GENES Consortium
Affiliations
Affiliations
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Harlingen, TX 78550 USA.
- Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109 USA.
- Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, MI 48109 USA ; Department of Epidemiology, Human Genetics and Environmenal Sciences, University of Texas Health Science Center at Houston, Houston, TX 77030 USA.
- Department of Genetics, Massachusetts General Hospital, Boston, MA 02114 USA.
- Genetics of Complex Traits, Peninsula College of Medicine and Dentistry, University of Exeter, Exeter, UK.
- Department of Genetics, Texas Biomedical Research Institute, 7620 NW Loop 410, San Antonio, TX 78227 USA.
- Cardiovascular Division, Baylor College of Medicine, Houston, TX 77030 USA.
- Department of Epidemiology, Human Genetics and Environmenal Sciences, University of Texas Health Science Center at Houston, Houston, TX 77030 USA.
- Division of Clinical Epidemiology, Department of Medicine, University of San Antonio Health Science Center at San Antonio, San Antonio, TX 78229 USA.
- Department of Psychiatry, Yale University, New Haven, CT 06106 USA.
- Centre for Genetic Origins of Health and Disease, University of Western Australia, Crawley, Australia.
- Department of Biochemistry, Faculty of Medicine, Kuwait University, Safat, Kuwait City, 13110 Kuwait.
- Texas Diabetes Institute, University of San Antonio Health Science Center at San Antonio, San Antonio, TX 78229 USA.
- South Texas Diabetes and Obesity Institute, University of Texas Rio Grande Valley, Harlingen, TX 78550 USA ; Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104 USA.
Abstract
Background: The Genetic Analysis Workshops (GAW) are a forum for development, testing, and comparison of statistical genetic methods and software. Each contribution to the workshop includes an application to a specified data set. Here we describe the data distributed for GAW19, which focused on analysis of human genomic and transcriptomic data.
Methods: GAW19 data were donated by the T2D-GENES Consortium and the San Antonio Family Heart Study and included whole genome and exome sequences for odd-numbered autosomes, measures of gene expression, systolic and diastolic blood pressures, and related covariates in two Mexican American samples. These two samples were a collection of 20 large families with whole genome sequence and transcriptomic data and a set of 1943 unrelated individuals with exome sequence. For each sample, simulated phenotypes were constructed based on the real sequence data. 'Functional' genes and variants for the simulations were chosen based on observed correlations between gene expression and blood pressure. The simulations focused primarily on additive genetic models but also included a genotype-by-medication interaction. A total of 245 genes were designated as 'functional' in the simulations with a few genes of large effect and most genes explaining < 1 % of the trait variation. An additional phenotype, Q1, was simulated to be correlated among related individuals, based on theoretical or empirical kinship matrices, but was not associated with any sequence variants. Two hundred replicates of the phenotypes were simulated. The GAW19 data are an expansion of the data used at GAW18, which included the family-based whole genome sequence, blood pressure, and simulated phenotypes, but not the gene expression data or the set of 1943 unrelated individuals with exome sequence.
Similar articles
Almasy L, Dyer TD, Peralta JM, Jun G, Wood AR, Fuchsberger C, Almeida MA, Kent JW Jr, Fowler S, Blackwell TW, Puppala S, Kumar S, Curran JE, Lehman D, Abecasis G, Duggirala R, Blangero J; T2D-GENES Consortium.BMC Proc. 2014 Jun 17;8(Suppl 1):S2. doi: 10.1186/1753-6561-8-S1-S2. eCollection 2014.PMID: 25519314 Free PMC article.
Paterson AD.Genet Epidemiol. 2014 Sep;38 Suppl 1:S1-4. doi: 10.1002/gepi.21818.PMID: 25112182
Longitudinal analytical approaches to genetic data.
Chiu YF, Justice AE, Melton PE.BMC Genet. 2016 Feb 3;17 Suppl 2(Suppl 2):4. doi: 10.1186/s12863-015-0312-y.PMID: 26866891 Free PMC article.
Constrained multivariate association with longitudinal phenotypes.
Melton PE, Peralta JM, Almasy L.BMC Proc. 2016 Oct 18;10(Suppl 7):329-332. doi: 10.1186/s12919-016-0051-8. eCollection 2016.PMID: 27980657 Free PMC article.
Nelson RT, Shoemaker R.BMC Genomics. 2006 Aug 9;7:204. doi: 10.1186/1471-2164-7-204.PMID: 16899135 Free PMC article. Review.
Cited by
On the application, reporting, and sharing of in silico simulations for genetic studies.
Riggs K, Chen HS, Rotunno M, Li B, Simonds NI, Mechanic LE, Peng B.Genet Epidemiol. 2021 Mar;45(2):131-141. doi: 10.1002/gepi.22362. Epub 2020 Oct 16.PMID: 33063887 Free PMC article. Review.
Aslibekyan S, Almasy L, Province MA, Absher DM, Arnett DK.BMC Proc. 2018 Sep 17;12(Suppl 9):35. doi: 10.1186/s12919-018-0114-0. eCollection 2018.PMID: 30275886 Free PMC article.
Advances in the Genetics of Hypertension: The Effect of Rare Variants.
Russo A, Di Gaetano C, Cugliari G, Matullo G.Int J Mol Sci. 2018 Feb 28;19(3):688. doi: 10.3390/ijms19030688.PMID: 29495593 Free PMC article. Review.
Estimating and testing direct genetic effects in directed acyclic graphs using estimating equations.
Konigorski S, Wang Y, Cigsar C, Yilmaz YE.Genet Epidemiol. 2018 Mar;42(2):174-186. doi: 10.1002/gepi.22107. Epub 2017 Dec 18.PMID: 29265408 Free PMC article.
Espin-Garcia O, Craiu RV, Bull SB.Genet Epidemiol. 2018 Feb;42(1):104-116. doi: 10.1002/gepi.22099. Epub 2017 Dec 14.PMID: 29239496 Free PMC article.
KMEL References
References
-
- Mitchell BD, Kammerer CM, Blangero J, Mahaney MC, Rainwater DL, Dyke B, Hixson JE, Henkel RD, Sharp RM, Comuzzie AG, VandeBerg JL, et al. Genetic and environmental contributions to cardiovascular risk factors in Mexican Americans. The San Antonio Family Heart Study. Circulation. 1996;94:2159–2170. doi: 10.1161/01.CIR.94.9.2159. - DOI - PubMed
-
- Hunt KJ, Lehman DM, Arya R, Fowler S, Leach RJ, Göring HH, Almasy L, Blangero J, Dyer TD, Duggirala R, et al. Genome-wide linkage analyses of type 2 diabetes in Mexican Americans: the San Antonio Family Diabetes/Gallbladder Study. Diabetes. 2005;54:2655–2662. doi: 10.2337/diabetes.54.9.2655. - DOI - PubMed
-
- Coletta DK, Schneider J, Hu SL, Dyer TD, Puppala S, Farook VS, Arya R, Lehman DM, Blangero J, DeFronzo RA, et al. Genome-wide linkage scan for genes influencing plasma triglyceride levels in the Veterans Administration Genetic Epidemiology Study. Diabetes. 2009;58:279–284. doi: 10.2337/db08-0491. - DOI - PMC - PubMed
-
- Hanis CL, Ferrell RE, Barton SA, Aguilar L, Garza-Ibarra A, Tulloch BR, Garcia CA, Schull WJ. Diabetes among Mexican Americans in Starr County, Texas. Am J Epidemiol. 1983;118:659–672. - PubMed
-
- Below JE, Gamazon ER, Morrison JV, Konkashbaev A, Pluzhnikov A, McKeigue PM, Parra EJ, Elbein SC, Hallman DM, Nicolae DL, et al. Genome-wide association and meta-analysis in populations from Starr County, Texas, and Mexico City identify type 2 diabetes susceptibility loci and enrichment for expression quantitative trait loci in top signals. Diabetologia. 2011;54:2047–2055. doi: 10.1007/s00125-011-2188-3. - DOI - PMC - PubMed
-
- McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. - DOI - PMC - PubMed