Correlations in Genetic Risk Scores Produced by Direct-to-Consumer Genetic Testing Companies


A thesis submitted to the Graduate School of the University of Cincinnati in partial fulfillment of the requirements for the degree of Master of Science in the Department of Pediatrics of the College of Medicine

April 2013

by Brian Douglas Reys
B.S., The Ohio State University, 2011

Committee Chair: Ge Zhang, MD, PhD
Committee Members: Mehdi Keddache, PhD; Melanie Myers, PhD, MS, CGC; Daniel Prows, PhD

Abstract

Background. Direct-to-consumer genetic testing companies provide consumers genetic risk scores for common diseases based on genotype. Single nucleotide polymorphism (SNP)-associated risk estimates published by genome-wide association studies are the most common source of genotype-driven risk information for common diseases. However, the risk estimate of any given SNP varies depending on the source population and study design of the original publication. An important factor in establishing clinical validity for genetic testing of common disease is the consistency in genetic risk scoring between direct-to-consumer companies. This consistency, however, has not been well described. While small-scale studies looking at individual sample results between direct-to-consumer companies have been performed, to our knowledge no large-scale studies aiming to measure the consistency in risk scoring have been reported.

Methods. A genotyped cohort of 834 individuals was used to calculate the equivalent genetic risk score that would be produced by the direct-to-consumer genetic testing companies 23andMe and DeCODE Genetics for two diseases, type 2 diabetes (T2D) and age-related macular degeneration (AMD). These scores were compared with scores from comprehensive academic SNP panels to assess the consistency between the risk scoring of different companies and the academic literature.

Results. Our results showed that although the genetic risk scores calculated based on different SNP risk panels (23andMe, DeCODE Genetics and academic) were significantly correlated (r² = for T2D and r² = for AMD), the levels of correlation were far from appropriate to establish the clinical utility of these SNP-based genetic scores. In addition, the ranges of the estimated genetic scores varied substantially among these three SNP risk panels, with a greater number of SNPs utilized roughly correlating with an increased range in risk score.

Conclusion. Significant differences in the number of SNPs used to calculate the risk score, as well as in the selection of SNP risk estimates, are the primary causes of inconsistency in risk scoring between direct-to-consumer companies. To improve consistency, direct-to-consumer genetic testing companies need to incorporate more recently published SNP associations into their calculations, use consistent SNP effect sizes and use similar numbers of SNPs.


Table of Contents

Abstract
Table of Contents
List of Tables
List of Figures
Introduction
Methods
  i. DTC Genetic Testing Company Selection
  ii. Common Disease Selection
  iii. Cohort
  iv. Genotyping
  v. Genetic Risk Prediction
Results
  vi. Results: Type 2 Diabetes
  vii. Results: Age-Related Macular Degeneration
Discussion
  viii. Study Limitations
  ix. Conclusions
References
Tables
Figures

List of Tables

I. Type 2 Diabetes SNP Reported Risks by SNP
II. Age-Related Macular Degeneration Reported Risks by SNP
III. Mean and SDs of Genetic Risk Scores for Type 2 Diabetes
IV. Mean and SDs of Genetic Risk Scores for Age-Related Macular Degeneration

List of Figures

I. Venn Diagram of Number of Identical SNPs Shared between Panels
II. Distributions of Genetic Risk Score by Source for Type 2 Diabetes
III. Correlation Plots between Sources for Type 2 Diabetes
IV. Distributions of Genetic Risk Score by Source for Age-Related Macular Degeneration
V. Correlation Plots between Sources for Age-Related Macular Degeneration

Introduction

Direct-to-consumer (DTC) genetic testing has introduced a new frontier to personalized medicine by providing consumers direct access to genetic testing for common diseases without the medium of a healthcare provider. A growing industry with limited regulation and consumer protections, DTC genetic testing has arrived in the field of genetics with both concerns and opportunity [1-3]. Today single nucleotide polymorphism (SNP)-based genetic testing runs the gamut from nutrition and ancestry testing to predicting the inherent genetic risk of developing common diseases [2, 4]. While the SNP-based genotyping technology behind DTC genetic testing for common diseases is well understood, the interpretation of genotypic information into predictive genetic risk remains complex and varies across DTC genetic testing companies [5, 6].

Advocates of DTC genetic testing note that the benefits of testing include increased access to testing, improved health awareness and potential health benefits to consumers. In the early adoption of personal genomics, individuals were optimistic about using genomic profiling in consultation with physicians to improve health [7]. Further studies gauging interest in personalized genomic testing have shown that the majority of consumers who would consider testing would ask their physician for help with the interpretation of the results [8, 9]. Additionally, DTC genetic testing companies may contribute their own research into the area of common disease; the DTC genetic testing company 23andMe has recently pursued patents for SNPs implicated in Parkinson's disease [10, 11].

Despite the notable benefits of DTC genetic testing, critics of DTC genetic testing for common diseases list concerns about the clinical utility, interpretation and consistency of the results. Given the complex nature of genetic information, a clear understanding of testing results remains important in communicating disease risk. Furthermore, consumers may be prone to overestimating the health benefit of testing and, compared with healthcare providers, to misinterpreting the results [12]. While consumers who overestimate their risk might seek additional healthcare guidance, consumers who underestimate their risk may not seek appropriate healthcare guidance [13]. Likewise, it is unclear whether DTC genetic testing results will change behavior in cases of predictive health testing [14]. Possible misinterpretation of results demonstrates the need for healthcare providers or genetic counseling services to help interpret DTC genetic testing results for consumers. However, the literature suggests that healthcare providers are not prepared to interpret DTC genetic testing results [15].

A study performed by the Government Accountability Office (GAO) noted inconsistency in disease risk results between DTC genetic testing companies, which was concerning given that consumers may treat DTC genetic testing disease risk as clinically significant [16]. Inconsistencies that give an individual different risks for the same disease cause significant concern for the clinical utility and interpretation of the test, and consistency in risk results between companies remains a concern in the field today. Understanding the consistency in disease risk results between major DTC genetic testing companies has been the focus of several recent studies; however, the definition and measurement of consistency vary [17-19].
For this study, consistency represents the uniformity in genetic risk score for an individual for a given common disease across DTC genetic testing companies. In 2009, Nature published a study by Ng et al. that compared test results from five individuals' DNA samples sent to two DTC genetic testing companies, 23andMe and Navigenics [17]. While the study found a high concordance (99.7%) in the SNP data (genotypic microarray results) generated by the companies, their calculated genetic risk scores differed significantly for the individuals tested. In another study, DNA samples from an individual healthy volunteer were sent to the DTC genetic testing companies 23andMe, DeCODE Genetics (DeCODE) and Navigenics [18]. This study similarly found the concordance of SNP data to be high (>99.6%), but the predictive genetic risk scores between companies were poorly correlated. As an example, the relative risk estimates for rheumatoid arthritis from the three companies ranged from 0.9 (protective) to 1.85 (harmful).

These previous studies have focused on comparing the correlation between companies based on a few samples submitted to multiple companies. While this method allows the studies to look across many diseases tested between companies, it provides a narrow view of the possible range of scores provided to consumers who use these services. As each individual's risk is dependent on their genotype, studies with a small sample size are limited to data produced only from their samples' genotypes.

Our study aims to expand the current description of consistency between major DTC genetic testing companies using a much larger sample size to better predict the range in genetic risk scores for two common diseases. For this analysis, we have assessed the consistency between the major DTC genetic testing companies 23andMe and DeCODE, using type 2 diabetes (T2D) and age-related macular degeneration (AMD) as exemplar diseases.

T2D is a well-studied common disease displaying multifactorial inheritance, with an estimated heritability ranging from 26% to 77% [20-22]. The Genetics and Public Policy Center reports that 9 DTC genetic testing companies and 2 DTC genetic testing through-physician companies currently provide testing for the genetic risk of T2D [23]. Currently there exist between SNPs (depending on the rigor of association) in populations with European ancestry known to be associated with T2D; however, these SNPs explain less than 10-13% of the known heritability of T2D [22, 24, 25].

Age-related macular degeneration (AMD) is another well-studied common disease. AMD has a significant known SNP contribution to disease risk, with SNPs accounting for nearly 25% [22] of AMD's genetic contribution to overall heritability, which is estimated to range from 45% to 71% [22, 26-28]. In contrast to T2D, the large SNP heritability in AMD is accounted for by fewer than 30 known SNPs [29, 30]. These diseases are used in our study to describe differing scenarios: a disease with a relatively large genetic contribution from a few SNPs (AMD), and a disease with a small genetic contribution from many SNPs with small effects (T2D). Together these diseases help identify the current range in genetic risk for common diseases estimated by DTC genetic testing companies.

Methods

To observe the correlation of genetic risk interpretation between DTC genetic testing companies, we generated genetic risk scores equivalent to those provided by the DTC genetic testing companies 23andMe and DeCODE for 834 genotyped individuals from a control cohort. The data used for risk calculation were based on the information that DTC genetic testing companies make available on their websites about how they predict genetic risk. The extracted information included which SNPs were used in the calculation as well as the associated risk estimates for each SNP.
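The workflow outlined above (take each panel's SNP list and effect estimates, score every genotyped individual, then compare panels) can be sketched as follows. This is a minimal illustration, not the pipeline used in the thesis. It assumes the score is a plain weighted sum of risk-allele counts with log odds ratios as weights, which is what the definitions of G and β in the Genetic Risk Prediction section suggest; any additional term in the thesis's own equation is not reproduced here. The function names, allele frequencies, panel column choices and odds ratios are all hypothetical, and the genotypes are simulated.

```python
import numpy as np


def genetic_risk_score(genotypes, betas):
    """Return one score per individual as a weighted sum of risk-allele counts.

    genotypes: (n_individuals, n_snps) array of 0/1/2 risk-allele counts.
    betas:     (n_snps,) array of per-SNP effect sizes (log odds ratios).
    Assumption: a plain weighted sum, with no centering or scaling.
    """
    genotypes = np.asarray(genotypes, dtype=float)
    betas = np.asarray(betas, dtype=float)
    if genotypes.ndim != 2 or genotypes.shape[1] != betas.shape[0]:
        raise ValueError("genotypes must be (n_individuals, n_snps) matching betas")
    return genotypes @ betas


def r_squared(x, y):
    """Squared Pearson correlation between two score vectors."""
    r = np.corrcoef(x, y)[0, 1]
    return r ** 2


# Simulated cohort: 834 individuals, 6 SNPs with made-up risk-allele frequencies.
rng = np.random.default_rng(0)
n_individuals = 834
risk_allele_freqs = np.array([0.10, 0.20, 0.30, 0.40, 0.50, 0.25])
genotypes = rng.binomial(2, risk_allele_freqs, size=(n_individuals, 6))

# Two hypothetical panels: panel A uses all six SNPs, panel B uses only the
# first three and reports slightly different odds ratios for them.
panel_a_cols = [0, 1, 2, 3, 4, 5]
panel_a_odds_ratios = np.array([1.10, 1.25, 1.15, 1.05, 1.30, 1.20])
panel_b_cols = [0, 1, 2]
panel_b_odds_ratios = np.array([1.12, 1.20, 1.18])

score_a = genetic_risk_score(genotypes[:, panel_a_cols], np.log(panel_a_odds_ratios))
score_b = genetic_risk_score(genotypes[:, panel_b_cols], np.log(panel_b_odds_ratios))

print("range of panel A scores:", score_a.min(), score_a.max())
print("range of panel B scores:", score_b.min(), score_b.max())
print("r^2 between panels:", r_squared(score_a, score_b))
```

Using log odds ratios as weights keeps the contribution of each SNP additive, so panels that differ in SNP count or in reported effect sizes can be compared on the same individuals simply by correlating their score vectors.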

The data from DTC genetic testing websites were compiled and applied to the genotypic data from our cohort to calculate the genetic risk for the common diseases T2D and AMD.

DTC Genetic Testing Company Selection

DTC genetic testing companies were chosen based on open availability of genomic risk prediction information on the company website. To determine the companies to include in our study, we obtained a list of 27 DTC genetic testing and DTC genetic testing through-physician companies published by the Genetics and Public Policy Center [23]. A web search narrowed the list to four companies based on the availability of the data necessary for genetic risk score prediction, including the SNPs used in risk calculation, the odds ratio (OR) or relative risk (RR) for each SNP, and genotype. The selected companies (23andMe, DeCODE, Navigenics and Pathway Genomics) were then further narrowed to 23andMe and DeCODE because of company acquisitions and changes in website data access over the course of our data collection period (July 2012 to November 2012).

Common Disease Selection

AMD and T2D were selected as exemplar diseases to represent the genetic spectrum of common diseases assessed by DTC genetic testing. T2D is a common disease of phenotypic and genetic heterogeneity, with many small-effect SNP associations contributing to a relatively small genetic contribution to disease risk (10-13%) [22, 24, 25]. In contrast, AMD is the paradigm disease of SNP-based common disease testing, with only a few SNP associations representing nearly 25% [22] of the significant genetic contribution (45-71%) [22, 26-28] to heritability. However, despite significant research in the genetics of both these diseases, much of the heritability remains unexplained. As such, T2D and AMD represent many common diseases in which continued research is needed to explain further genetic contribution to disease.

Cohort

The Cincinnati Genomic Control Cohort was used to provide the genotypic data for the study. The cohort consists of 995 individuals aged 3-18 who were healthy at the time of enrollment, drawn from the Greater Cincinnati area, including the tri-state area of Ohio, northern Kentucky and southeastern Indiana. The cohort population was selected to be representative of the Cincinnati population based on 2010 US Census data and consists of roughly 80% Caucasian individuals, 20% African American individuals and <1% individuals of Asian descent. These percentages are based on parental report. For the cohort study, parents of the participants filled out a form detailing age, race and gender for their child, and the participants provided samples of urine, serum, hair and DNA. Each individual in the cohort was genotyped using an Affymetrix 6.0 array. The genotypic information used in our study was obtained de-identified from the cohort. The joint Cincinnati Children's Hospital Medical Center and University of Cincinnati Institutional Review Board deemed this study to be non-human subject research (Study ID: ).

Genotyping

The 995 samples of the Cincinnati Genomic Control Cohort were genotyped using the Affymetrix Human SNP Array 6.0 following the manufacturer's protocol. Genotype calls were determined using the CRLMM algorithm [31, 32] among chips that passed the vendor-suggested quality control (Contrast QC > 0.4). Specifically, applying Contrast QC to the 995 raw genotype intensity files (CEL files) removed poor-quality samples, narrowing the cohort to 834 individuals, a pass rate of about 84%. Genotype calling was performed using CRLMM for the 906,600 SNP markers generated for each individual. Imputation of the SNPs used by DTC genetic testing companies but not represented on the Affymetrix 6.0 array was performed using MACH and the Minimac program [33, 34]. The reference haplotypes for the imputation were extracted from the phased genotype calls of all samples (ALL, N=1092) of the 1000 Genomes Integrated Phase I release [35].

Genetic Risk Prediction

Genetic risk scores for each individual in the cohort were calculated for T2D and AMD using the equation: ( ), where G is the genotype coded as 0, 1 or 2 for the number of risk alleles at SNP i and β is the reported effect size estimate (beta coefficient or log odds ratio) of the risk allele at SNP i. Instead of using odds ratios (ORs), 23andMe uses an adjusted OR and DeCODE uses an adjusted genotype relative risk (RR) in reporting the effect sizes of the risk alleles. For direct comparison, these risk measures were converted to ORs by accounting for allele frequency and prevalence in control populations. The allele frequencies used in the conversion of risk measures to ORs were taken from the 1000 Genomes Project [36] and are based on European ancestry. Disease prevalences of 25.7% for T2D and 6.5% for AMD were used in the calculations and were obtained from the 23andMe website [11].

The genetic risk scores for the DTC genetic testing companies were then compared with scores generated using SNPs and ORs extracted from a published academic report for T2D [37] and for AMD [30]. A PubMed search using keywords was performed to identify academic papers with comprehensive SNP lists. Two academic papers, one each for T2D and AMD, were selected based on their recent publication (since 2011) and the maximum number of SNPs listed (79 for T2D and 24 for AMD) compared with other academic papers found. Strength of SNP association was not considered, as there is currently no consensus on an appropriate cutoff value.

Results

Of the 27 DTC genetic testing companies we reviewed online, two (23andMe and DeCODE) provided open web access to the SNPs and adjusted OR/RR values they use to generate genetic risk scores. For T2D, 23andMe uses 11 SNPs, DeCODE uses 21 SNPs and an academic paper by Sanghera et al. listed 33 European SNPs (79 total SNPs). Only the 33 European SNPs, shown in Table 1, were used in the current study [37]. For AMD, 23andMe uses 3 SNPs, DeCODE uses 6 SNPs and the academic paper by Cipriani et al. listed 24 SNPs (see Table 2) [30]. Before any calculations were performed, the wide discrepancies in the number of SNPs used between sources for each disease provided strong insight into the likely correlation between risk results provided to consumers.

Results: Type 2 Diabetes

A Venn diagram for each disease was created to display the overlap in SNPs used between the three sources (academic, 23andMe and DeCODE) (Figure 1). Within T2D, the academic source shared 6/33 SNPs with both 23andMe and DeCODE, 8/33 with 23andMe and 10/33 with DeCODE, and had 21/33 unique SNPs. 23andMe shared 6/11 SNPs with both the academic source and DeCODE, 8/11 with the academic source and 7/11 with DeCODE, and had 2/11 unique SNPs. DeCODE shared 6/21 SNPs with both the academic source and 23andMe, 10/21 with the academic source and 7/21 with 23andMe, and had 10/21 unique SNPs. After accounting for the shared SNPs, the number of unique SNPs ranged from 2 used by 23andMe to 21 used in the academic paper, representing a large gap between current SNP knowledge and the use of all known SNPs in commercial testing. However, linkage disequilibrium between these SNPs was not accounted for and could explain some of the lack of overlap between the comprehensive panel and the DTC genetic testing companies.

The mean genetic risk scores and standard deviations calculated for each source are shown in Table 3. The distribution of genetic scores across the three groups (academic, DeCODE and 23andMe) is shown in Figure 2 (A-C). Each of the three sources showed a normal distribution, as needed to calculate correlation, and each source had a mean genetic score close to zero (representing population risk), as would be expected from a large random sample. Notably, the academic panel, which included the greatest number of SNPs, had the genetic score mean closest to zero as well as the widest variation in genetic scores, as expected given the greater number of SNPs used in the calculation.

Pearson correlation was calculated for T2D between the genetic scores generated for each SNP source (academic, 23andMe, DeCODE). The r² values can be found in the x-y plots showing the correlation between genetic scores estimated from the different risk panels (Figure 3, A-C). The correlations of the DeCODE and 23andMe genetic scores with the academic panel were similar and statistically significant (r² = 0.46, p < 0.05 and r² = 0.42, p < 0.05, respectively; Figure 3 A,B). The correlation between the two DTC genetic testing companies 23andMe and DeCODE (r² = 0.66, p < 0.05) was the highest of the three comparisons (Figure 3C). The higher correlation between 23andMe and DeCODE was not surprising, given that 23andMe shared 7 of 11 SNPs with DeCODE. Significant correlation was found for T2D between all sources; however, given the overlap in SNPs used, the correlation was far from satisfactory. While an appropriate correlation for molecular genetic testing has not been established, a correlation between companies of r > 0.99 would be expected from a test used for clinical purposes, as suggested by Cary et al. [38]. In addition, the variation in genetic scores between sources (Figure 2, A-C) shows that the highest and lowest possible genetic risk scores are roughly proportional to the number of SNPs used in the panel, with the academic panel having the widest range of possible genetic scores.

Results: Age-Related Macular Degeneration

A Venn diagram displaying the overlap in SNPs used between the three sources (academic, 23andMe and DeCODE) for AMD is shown in Figure 1B. Within AMD, no SNPs were shared between all three sources. The academic sources shared 4/28 SNPs with DeCODE and had 24/28 unique SNPs. 23andMe shared 2/3 SNPs with DeCODE and had 1/3 unique SNPs. DeCODE shared 4/6 SNPs with the academic sources and 2/6 with 23andMe; DeCODE had no unique SNPs. No SNPs were shared between the academic panel and 23andMe.

The mean genetic risk scores and standard deviations calculated for each source are shown in Table 4. The distribution of genetic risk scores for AMD across the three panels is shown in Figure 4 (A-C). While a normal distribution was present for all three sources, 23andMe and DeCODE had multiple peaks (Figure 4 B,C) caused by the relatively small number of SNPs used in calculating their distributions. Across all three sources the mean genetic score was negative, ranging from to -0.58, indicating that within our cohort there was a general trend toward having a protective genotype. As with the results for T2D, the range of possible genetic risk scores was roughly proportional to the number of SNP markers used by the source.

Pearson correlation was calculated for AMD between the genetic scores generated for each SNP source (academic, 23andMe, DeCODE). The r² values can be found in the x-y plots showing the correlation between genetic scores estimated from the different risk panels (Figure 5, A-C). The academic source and 23andMe had an r² = 0.70, p < 0.05; the academic source and DeCODE had an r² = 0.30, p < 0.05; and 23andMe and DeCODE had an r² = 0.31, p < . All three correlations were statistically significant. As with the results for T2D, while there was significant correlation between sources, the correlation was not as high (r > 0.99) as would be expected from a test used for clinical purposes. For AMD the highest correlation was between 23andMe and the academic source, which shared no SNPs. This suggests high linkage disequilibrium between SNPs from the two sources. The position column of Table 2 suggests that all three SNPs used by 23andMe are in high linkage disequilibrium with SNPs used in the academic source (a SNP within 1 cM, ~1,000,000 nucleotides).

Discussion

Our results confirm that genetic testing for predisposition to common disease is far from consistent between DTC genetic testing companies. While previous studies have shown that individual samples submitted to multiple DTC genetic testing companies can produce significantly differing results, the consistency in genetic risk scoring between DTC genetic testing companies has been poorly understood. The genetic risk score used in our study is not a predictor of an individual's absolute genetic risk, which accounts for disease prevalence, age, gender and a number of other risk factors. Rather, the genetic risk score provides a tool to compare consistency in the genetic risk reported between companies. The results from our study indicate significant inconsistencies in risk scoring between major DTC genetic testing companies for common disease, as well as clear differences between the number of SNPs used by companies and the number reported in the academic literature.

When considering the equation for genetic risk score, three factors combine to produce genetic risk: the number of SNPs used, the risk effect of each SNP and the genotype. Because an individual's genotype stays consistent between companies and DTC genetic testing companies use genotyping technology with high analytical validity (>99.6%) [17, 18], it can be assumed that genetic risk score inconsistency does not come from genotyping but rather from the number of SNPs used and the risk effect of each SNP. To address differences in the number of SNPs used between sources, our study compiled a list of the SNPs used (including the effect size of each SNP) by the DTC genetic testing companies 23andMe and DeCODE. Additionally, SNP markers were collected from academic studies to compare the SNPs used by DTC genetic testing companies with the number of SNPs reported in the academic literature [17, 18]. Correlation was calculated between the sources to measure the consistency in genetic risk scoring both between DTC genetic testing companies and between those companies and recent academic studies.

Our results show that an increased number of SNPs utilized by a company is associated with an increased range of genetic scores provided. Interestingly, the number of SNPs shared between two sources was not consistently a good indicator of high correlation, as would be expected. For example, 23andMe shared 7/11 total SNPs with DeCODE (r² = 0.66) and 8/11 total SNPs with the academic source (r² = 0.41), yet the latter correlation was considerably lower. Such a phenomenon could indicate that SNPs not shared between sources have a significant impact on the correlation (in this example the academic source has 25 SNPs not shared with 23andMe) or that several SNPs are in high linkage disequilibrium between sources. An example of high linkage disequilibrium could be seen in AMD between the academic source and 23andMe, which share no SNPs yet have a high correlation (r² = 0.70).

A direct comparison of the numbers of SNPs utilized by DTC genetic testing companies suggests that while linkage disequilibrium is likely present, it cannot account for the large gaps in the number of SNPs used between companies. Ideally, linkage disequilibrium would be taken into account for all SNPs within the study, but confounding factors such as allelic heterogeneity complicate this: SNPs may appear to be in high linkage disequilibrium (r² > 0.8) but actually have significantly different risk effects and much lower linkage disequilibrium. Instances of allelic heterogeneity may be challenging to identify from simple differences in effect size estimation and are therefore often difficult to take into account.

Our findings suggest that while the number of SNPs used by a source is critical, it is not necessary for all SNPs to be identical between sources. However, if two SNPs have an r² = 1.0, then use of the same effect size is critical in calculating the risk score. An example of allelic heterogeneity can be seen in our study between two SNPs for AMD, rs# and rs#. The SNPs rs# and rs# lie only ~40 kb apart, suggesting that they may be in high linkage disequilibrium (<1 cM in separation). However, these SNPs have an r² = 0.36, not r² = 1.0, supporting the view that they likely represent two different effects contributing to AMD. If a source were to assume the two SNPs were in high linkage disequilibrium, the source might incorrectly use only one of the two SNPs in risk calculation rather than both. While SNPs in high linkage disequilibrium used by different companies can allow different platforms (Affymetrix vs. Illumina) to produce the same risk score, the effects of allelic heterogeneity, linkage disequilibrium and differences in effect size need to be taken into account.

Our study suggests that much of the discrepancy in the number of SNPs used by DTC genetic testing companies lies in the lack of incorporation of newly found SNPs into risk calculation. For example, in the calculation of AMD risk, 23andMe uses three SNPs from three different chromosomes, whereas SNPs on 12 different chromosomes have been associated with AMD in the academic paper [30]. The discrepancy in the number of unique SNPs used in risk calculation cannot be attributed to linkage disequilibrium when risk information from 9 chromosomes is not accounted for by 23andMe. Standardization in the incorporation of newly discovered SNP associations is a key factor in increasing the consistency in genetic risk scoring for SNP-based testing. Additionally, clinical validity will not improve with consistency alone, only with additional research into the genetic cause of disease. However, if only 10-13% of the heritability of T2D is accounted for by the >50 known SNPs, DTC genetic testing companies need to use the full panel of known SNPs to achieve a genetic score that represents 10-13% of known heritability [22, 24, 25].

While this is an intuitive concept, barriers exist to DTC genetic testing companies integrating new SNPs into practice. The DTC genetic testing company 23andMe publishes the standards it uses to incorporate SNPs into its panel [39], but standards that are meant to protect consumers from the addition of poorly studied SNPs may also cause DTC genetic testing companies to lag behind in the inclusion of new SNPs in disease risk calculation. Barriers to the incorporation of new SNPs often include the quality of the research performed, the sample size of the study, the strength of association between the SNP and disease, and factors such as population ethnicity, gender and age, among others. It would be easy to assume that high standards cause the disparity in the number of SNPs incorporated into DTC genetic testing panels. As an example, 23andMe has not updated the SNPs it uses for T2D risk calculation since 2010 [11], yet the number of SNP associations discovered between outpaced the entire preceding decade [40], a trend of rapid discovery that has continued past. However, many of the SNPs listed in Sanghera et al. [37], used in our study as a comprehensive SNP list for T2D, have already met the current standards listed by 23andMe for inclusion but have not yet been incorporated by 23andMe [37, 39]. Such findings suggest that, rather than standards preventing the incorporation of recently discovered SNPs, DTC genetic testing companies are not keeping up with the rate of discovery.

While the focus of this study was on the consistency between DTC genetic testing company risk scores, inconsistency between the effect size estimates reported for SNPs from genome-wide association studies (GWAS) remains a concern. As mentioned previously, the risk effect of each SNP is a key component in the calculation of the disease risk score. As the effect size estimates reported depend on the study population and study design, there are significant differences between the effect size estimates reported by different academic sources, even for a single SNP. When DTC genetic testing companies select a SNP to incorporate, they may choose to average effect size estimates taken from multiple academic sources or choose the effect size estimate from the academic source with the largest study population. Use of effect size estimates that vary slightly may seem inconsequential; however, even slight variations have the potential to shift risk when many SNPs are used in conjunction for disease risk calculation.

An additional consideration not accounted for by the equation that generates the genetic risk score is ethnic and genetic population diversity. When population diversity is taken into account, it is known that 96% of GWAS performed up to 2011 have used primarily European populations [41, 42]. While the DTC genetic testing company 23andMe incorporates SNPs for other ethnicities when available, users who are unaware of these differences may not recognize the importance of their ancestry in the genetic prediction of disease risk. In the case of T2D, 23andMe uses only one SNP for individuals of African ancestry [11]. For consumers of mixed heritage or of ethnic backgrounds not widely studied, a clear need remains for additional studies in these populations. In the case of mixed ethnic backgrounds, it is unclear what benefit DTC genetic testing would have for the consumer.

Study Limitations

This study was limited in the number of diseases compared. Whereas previous studies have looked between companies at multiple diseases, this was not the primary aim of our study. T2D and AMD were chosen as well-studied common diseases, each with features that are similar to, and overlap with, those relevant to predisposition SNP testing for other common diseases. Increasing the number of diseases examined in this study would provide corresponding correlation results for additional diseases but would not necessarily broaden the study scope. The results of this study remain a confirmation of an intuitive hypothesis: that an increased number of SNPs increases the range of possible genetic scores provided for a disease, and that consistency in risk scoring between DTC genetic testing companies is far from adequate.

While mentioned in this study, linkage disequilibrium was not examined in depth when comparing the differences and overlap in risk SNPs used by the different risk panels. Linkage disequilibrium is a confounding factor in the study of SNP-based disease associations, where SNPs positioned closely on a chromosome may convey similar disease risk. To avoid duplication in selecting risk SNPs for the academic panel, only one academic publication each was selected to represent T2D and AMD instead of multiple papers. Both of the selected studies, however, cited SNPs from multiple GWAS, so some linkage disequilibrium between SNPs is expected.

Conclusions

Despite recent efforts by the FDA to scrutinize the field of DTC genetic testing, many scientific concerns remain over how best to interpret the clinical validity and utility of SNP-based genetic risk results. While samples sent to multiple DTC genetic testing companies may receive differing results, the underlying problem lies not in the genotyping technology used by companies [17, 18] but in the interpretation of genotype into absolute genetic risk. Furthermore, the current consistency, regardless of clinical utility and validity, does not meet standards for clinical testing.

Continued research into SNP-based associations with disease and the genetic contribution to disease remains critical to the forward growth of testing for SNP-based disease risk prediction. In addition, while the number and risk effects of SNPs associated with disease risk continue to be an important component in the consistency between companies, the current knowledge of complex disease is far from the point of clinical utility. To increase consistency, companies should utilize collective available knowledge in incorporating newly validated SNPs. To do so, however, appropriate standards should be created to provide consistency for SNP incorporation into disease risk prediction.

DTC genetic testing companies interested in improving the field of SNP-based disease-risk testing should consider an open source policy, as practiced by 23andMe and DeCODE, in SNP selection and usage. Such a policy would allow companies to compare SNP markers and increase inter-company consistency, as well as provide a framework for appropriate SNP inclusion in disease risk panels. In addition, companies should create and openly publish (together or separately) quality standards for SNP and effect size selection in an effort to promote greater consistency between companies.

The road for DTC genetic testing is a new one, with future obstacles to overcome. However, as DTC genetic testing companies strive to promote their product to consumers and expand their testing to cover more diseases, the need for improved consistency remains. Even after incorporation of the current SNP knowledge of common disease, the clinical validity and utility of SNP-based genetic risk prediction is a long way from incorporation into medical

26 practice. Only continued research effort into the field of complex disease will reveal the longterm potential of SNP-based testing. With the additions of whole-exome and whole-genome sequencing, SNP-based genotyping technology may quickly be replaced by DTC genetic testing companies and the medical field alike. Regardless of the technology behind the genotyping, consistency in interpretation of genetic information into disease risk will continue to be a topic of consideration well into the future. 19

References

1. Melzer, D., et al., Genetic tests for common diseases: new insights, old concerns. BMJ, (7644): p
2. Goddard, K.A., et al., Awareness and use of direct-to-consumer nutrigenomic tests, United States, Genet Med, (8): p
3. Magnus, D., M.K. Cho, and R. Cook-Deegan, Direct-to-consumer genetic tests: beyond medical regulation? Genome Med, (2): p
4. Janssens, A.C., A.A. Wilde, and I.M. van Langen, The sense and nonsense of direct-to-consumer genetic testing for cardiovascular disease. Neth Heart J, (2): p
5. Swan, M., Multigenic condition risk assessment in direct-to-consumer genomic services. Genetics in Medicine, (5): p
6. Amin, N., C.M. van Duijn, and A.C. Janssens, Genetic scoring analysis: a way forward in genome wide association studies? Eur J Epidemiol, (10): p
7. Gollust, S.E., et al., Motivations and perceptions of early adopters of personalized genomics: perspectives from research participants. Public Health Genomics, (1): p
8. McGuire, A.L., et al., Social networkers' attitudes toward direct-to-consumer personal genome testing. Am J Bioeth, (6-7): p
9. Myers, M.F., Health care providers and direct-to-consumer access and advertising of genetic testing in the United States. Genome Med, (12): p
10. Sterckx, S., et al., "Trust is not something you can reclaim easily": patenting in the field of direct-to-consumer genetic testing. Genet Med,
11. 23andMe. Genetic Testing for Health, Disease & Ancestry; DNA Test [cited 2013; Available from:
12. Leighton, J.W., K. Valverde, and B.A. Bernhardt, The general public's understanding and perception of direct-to-consumer genetic test results. Public Health Genomics, (1): p
13. Kaufman, D.J., et al., Risky Business: Risk Perception and the Use of Medical Services among Customers of DTC Personal Genetic Testing. J Genet Couns, (3): p
14. Bloss, C.S., N.J. Schork, and E.J. Topol, Effect of direct-to-consumer genomewide profiling to assess disease risk. N Engl J Med, (6): p
15. Myers, M.F., et al., Genetic testing for susceptibility to breast and ovarian cancer: Evaluating the impact of a direct-to-consumer marketing campaign on physicians' knowledge and practices. Genetics in Medicine, (6): p
16. Kutz, G., Direct to consumer genetic tests: misleading test results are further complicated by deceptive marketing and other questionable practices, G.A. Office, Editor
17. Ng, P.C., et al., An agenda for personalized medicine. Nature, (7265): p
18. Imai, K., L.J. Kricka, and P. Fortina, Concordance study of 3 direct-to-consumer genetic-testing services. Clin Chem, (3): p
19. Kuehn, B.M., Inconsistent results, inaccurate claims plague direct-to-consumer gene tests. JAMA, (12): p
20. Carlsson, S., et al., Shared genetic influence of BMI, physical activity and type 2 diabetes: a twin study. Diabetologia,
21. Poulsen, P., et al., Heritability of type II (non-insulin-dependent) diabetes mellitus and abnormal glucose tolerance--a population-based twin study. Diabetologia, (2): p
22. Do, C.B., et al., Comparison of family history and SNPs for predicting risk of complex disease. PLoS Genet, (10): p. e
23. Center, G.P.P. List of DTC genetic testing companies August 11, 2011 [cited /21/2012]; Available from:
24. Kwak, S.H. and K.S. Park, Genetics of type 2 diabetes and potential clinical implications. Arch Pharm Res, (2): p
25. Perry, J.R., et al., Stratifying type 2 diabetes cases by BMI identifies genetic risk variants in LAMA1 and enrichment for risk variants in lean compared to obese cases. PLoS Genet, (5): p. e
26. Fine, S.L., et al., Age-related macular degeneration. N Engl J Med, (7): p
27. Hammond, C.J., et al., Genetic influence on early age-related maculopathy: a twin study. Ophthalmology, (4): p
28. Seddon, J.M., et al., The US twin study of age-related macular degeneration: relative roles of genetic and environmental influences. Arch Ophthalmol, (3): p
29. Fritsche, L.G., et al., Seven new loci associated with age-related macular degeneration. Nat Genet,
30. Cipriani, V., et al., Genome-wide association study of age-related macular degeneration identifies associated variants in the TNXB-FKBPL-NOTCH4 region of chromosome 6p21.3. Hum Mol Genet, (18): p
31. Carvalho, B., et al., Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. Biostatistics, (2): p
32. Scharpf, R.B., et al., Using the R Package crlmm for Genotyping and Copy Number Estimation. J Stat Softw, (12): p
33. Li, Y., et al., Genotype imputation. Annu Rev Genomics Hum Genet, : p
34. Howie, B., et al., Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet, (8): p
35. Abecasis, G.R., et al., An integrated map of genetic variation from 1,092 human genomes. Nature, (7422): p
36. Abecasis, G.R., et al., A map of human genome variation from population-scale sequencing. Nature, (7319): p
37. Sanghera, D.K. and P.R. Blackett, Type 2 Diabetes Genetics: Beyond GWAS. J Diabetes Metab, (198).
38. Cary, R.N., C.C. Garber, and D.D. Koch, Concepts and Practices in the Evaluation of Laboratory Methods., in Am Assoc Clin Chem. 1993: New York.
39. Naughton, B.W., S., Guidelines on Vetting Genetic Associations, in White Paper 23-03, 23andMe, Editor. 2011, 23andMe. p
40. McCarthy, M.I., et al., Genome-wide association studies for complex traits: consensus, uncertainty and challenges. Nat Rev Genet, (5): p
41. Bustamante, C.D., E.G. Burchard, and F.M. De la Vega, Genomics for the world. Nature, (7355): p
42. Rosenberg, N.A., et al., Genome-wide association studies in diverse populations. Nat Rev Genet, (5): p

Tables

Table 1 Type 2 Diabetes SNP Reported Risks by SNP

Columns: rs#, Chr., Position, Ref., Alt., Academic, 23andMe*, DeCODE**

Summary of the SNPs used by the DTC genetic testing companies and the academic source, with the risk estimate each source uses for each SNP. rs# denotes the reference SNP ID. Chr. is the chromosome on which the SNP is located. Position is the genomic nucleotide position of the SNP based on Build 37 of the Genome Reference Consortium human reference assembly. Ref. indicates the reference allele, based on the dbSNP database and the 1000 Genomes reference allele on the forward strand. Alt. indicates the alternate allele. Odds ratios for the Academic source are taken from the academic paper; all odds ratios are listed based on the reference allele risk estimate. *23andMe reports its risk panel as adjusted odds ratios, which are listed in this table. **DeCODE reports its risks as adjusted relative risks, which are listed in this table. A dash indicates the SNP was not used by that source.
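Table 1 gives a separate risk estimate per source for each SNP, and the Discussion notes that slight differences in these estimates can add up across many SNPs. The following is a minimal sketch of that effect, not the thesis's own calculation: it assumes an additive score on the log odds ratio scale, and the SNP names, genotypes, and odds ratios are all hypothetical.

```python
import math


def genetic_score(risk_allele_counts, odds_ratios):
    """Assumed additive model: sum of ln(odds ratio) x risk-allele count (0, 1, or 2)."""
    return sum(
        math.log(odds_ratios[snp]) * count
        for snp, count in risk_allele_counts.items()
        if snp in odds_ratios  # a SNP the source does not use contributes nothing
    )


# Hypothetical individual: number of risk alleles carried at each SNP.
risk_allele_counts = {"snp_a": 2, "snp_b": 1, "snp_c": 0, "snp_d": 2}

# Two hypothetical sources reporting slightly different odds ratios for the same SNPs.
source_x = {"snp_a": 1.10, "snp_b": 1.20, "snp_c": 1.15, "snp_d": 1.05}
source_y = {"snp_a": 1.15, "snp_b": 1.25, "snp_c": 1.10, "snp_d": 1.10}

score_x = genetic_score(risk_allele_counts, source_x)
score_y = genetic_score(risk_allele_counts, source_y)

# Exponentiating the score difference gives the multiplicative gap in combined odds.
odds_multiplier = math.exp(score_y - score_x)

print(f"source X score: {score_x:.3f}")
print(f"source Y score: {score_y:.3f}")
print(f"combined odds, Y relative to X: {odds_multiplier:.3f}")
```

Each per-SNP difference here is only 0.05 in odds ratio, yet the gaps multiply on the odds scale, so the two sources would report noticeably different combined odds for the same genotype.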

Table 2 Age-Related Macular Degeneration Reported Risks by SNP

Columns: rs#, Chr., Position, Ref., Alt., Academic, 23andMe*, DeCODE**

Summary of the SNPs used by the DTC genetic testing companies and the academic source, with the risk estimate each source uses for each SNP. rs# denotes the reference SNP ID. Chr. is the chromosome on which the SNP is located. Position is the genomic nucleotide position of the SNP based on Build 37 of the Genome Reference Consortium human reference assembly. Ref. indicates the reference allele, based on the dbSNP database and the 1000 Genomes reference allele on the forward strand. Alt. indicates the alternate allele. Odds ratios for the Academic source are taken from the academic paper; all odds ratios are listed based on the reference allele risk estimate. *23andMe reports its risk panel as adjusted odds ratios, which are listed in this table. **DeCODE reports its risks as adjusted relative risks, which are listed in this table. A dash indicates the SNP was not used by that source.

Table 3 Mean and SDs of Genetic Risk Scores for Type 2 Diabetes

Academic: mean = 0.01; SD = 0.21
DeCODE: mean = ; SD = 0.14
23andMe: mean = ; SD = 0.14

The mean and standard deviation of the risk scores calculated for the combined 834 samples for each source contributing to type 2 diabetes are shown here. These numbers are displayed in units of genetic risk score (S). Genetic risk score is the sum of the de-adjusted odds ratios/relative risk multiplied by the genotype for each SNP a source uses; see Methods, Genetic Risk Prediction.

Table 4 Mean and SDs of Genetic Risk Scores for Age-Related Macular Degeneration

Academic: mean = ; SD = 0.97
DeCODE: mean = ; SD = 0.37
23andMe: mean = ; SD = 0.46

The mean and standard deviation of the risk scores calculated for the combined 834 samples for each source contributing to age-related macular degeneration are shown here. These numbers are displayed in units of genetic risk score (S). Genetic risk score is the sum of the de-adjusted odds ratios/relative risk multiplied by the genotype for each SNP a source uses; see Methods, Genetic Risk Prediction.
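Tables 3 and 4 show the largest standard deviation for the academic panels, in line with the study's point that a panel with more SNPs spans a wider range of possible scores. A rough sketch of why this happens is given below. It assumes an additive log odds ratio score with 0, 1, or 2 risk alleles per SNP, which may differ from the exact formula in Methods, and the odds ratios are hypothetical.

```python
import math


def score_range(odds_ratios):
    """Lowest and highest attainable score when each SNP contributes ln(OR) x {0, 1, 2}."""
    log_effects = [math.log(odds_ratio) for odds_ratio in odds_ratios]
    lowest = sum(min(0.0, 2 * effect) for effect in log_effects)
    highest = sum(max(0.0, 2 * effect) for effect in log_effects)
    return lowest, highest


# Hypothetical panels: the larger panel contains the smaller one plus two more SNPs.
small_panel = [1.20, 0.90]
large_panel = small_panel + [1.10, 1.30]

small_low, small_high = score_range(small_panel)
large_low, large_high = score_range(large_panel)

small_width = small_high - small_low
large_width = large_high - large_low

print(f"small panel range: {small_low:.3f} to {small_high:.3f}")
print(f"large panel range: {large_low:.3f} to {large_high:.3f}")
```

Under this assumption every added SNP widens the attainable range by twice the absolute value of its log odds ratio, so the range can only grow as SNPs are added.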

Figures

Figure 1 Venn Diagram of SNPs Shared between Panels (A: Type 2 Diabetes; B: Age-Related Macular Degeneration)

Figure 1A represents the number of SNPs unique to each source and the number of SNPs shared between sources for T2D; Figure 1B reflects the same findings for AMD. SNPs were compared directly using their rs# values. Linkage disequilibrium was not taken into consideration in this figure, but could potentially account for even greater overlap in the sharing of SNPs between sources.
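The direct rs# comparison behind Figure 1 can be sketched with ordinary set operations. This is an illustration only: the identifiers are made-up placeholders rather than SNPs from the study panels, and, as in the figure, linkage disequilibrium is ignored.

```python
def venn_counts(academic, company_a, company_b):
    """Count SNP IDs in each region of a three-set Venn diagram."""
    return {
        "academic_only": len(academic - company_a - company_b),
        "company_a_only": len(company_a - academic - company_b),
        "company_b_only": len(company_b - academic - company_a),
        "academic_and_a_only": len((academic & company_a) - company_b),
        "academic_and_b_only": len((academic & company_b) - company_a),
        "a_and_b_only": len((company_a & company_b) - academic),
        "all_three": len(academic & company_a & company_b),
    }


# Placeholder identifiers standing in for rs# values (hypothetical).
academic = {"rs_demo_1", "rs_demo_2", "rs_demo_3", "rs_demo_4"}
company_a = {"rs_demo_2", "rs_demo_3", "rs_demo_5"}
company_b = {"rs_demo_3", "rs_demo_6"}

counts = venn_counts(academic, company_a, company_b)
for region, count in counts.items():
    print(f"{region}: {count}")
```

Because the match is on exact IDs, two panels that tag the same locus with different SNPs in linkage disequilibrium are counted as not overlapping, which is the caveat the caption raises.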

Figure 2 Distributions of Genetic Risk Score by Source for Type 2 Diabetes (panels A, B, C)

The distribution of genetic risk score by source.

Figure 3 Correlation Plots between Sources for Type 2 Diabetes (panels A, B, C)

Figure 4 Distributions of Genetic Risk Score by Source for Age-Related Macular Degeneration (panels A, B, C)

The distribution of genetic risk score by source.

Figure 5 Correlation Plots between Sources for Age-Related Macular Degeneration (panels A, B, C)
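The correlation plots in Figures 3 and 5 compare, for the same individuals, the score from one source against the score from another, and the abstract summarizes the agreement as r 2. A minimal sketch of that statistic follows, assuming a plain Pearson correlation; the score lists are hypothetical and do not come from the study cohort.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least two values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance_x = sum((x - mean_x) ** 2 for x in xs)
    variance_y = sum((y - mean_y) ** 2 for y in ys)
    if variance_x == 0 or variance_y == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return covariance / math.sqrt(variance_x * variance_y)


# Hypothetical scores for the same five individuals from two sources.
scores_source_1 = [0.10, -0.20, 0.35, 0.00, 0.25]
scores_source_2 = [0.05, -0.10, 0.15, 0.20, 0.10]

r = pearson_r(scores_source_1, scores_source_2)
r_squared = r ** 2
print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
```

An r 2 well below 1 means that knowing a person's score from one source explains only part of the variation in their score from the other, which is the sense in which the panels are inconsistent.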

Genome-wide analyses in admixed populations: Challenges and opportunities

Genome-wide analyses in admixed populations: Challenges and opportunities Genome-wide analyses in admixed populations: Challenges and opportunities E-mail: esteban.parra@utoronto.ca Esteban J. Parra, Ph.D. Admixed populations: an invaluable resource to study the genetics of

More information

AN EVALUATION OF POWER TO DETECT LOW-FREQUENCY VARIANT ASSOCIATIONS USING ALLELE-MATCHING TESTS THAT ACCOUNT FOR UNCERTAINTY

AN EVALUATION OF POWER TO DETECT LOW-FREQUENCY VARIANT ASSOCIATIONS USING ALLELE-MATCHING TESTS THAT ACCOUNT FOR UNCERTAINTY AN EVALUATION OF POWER TO DETECT LOW-FREQUENCY VARIANT ASSOCIATIONS USING ALLELE-MATCHING TESTS THAT ACCOUNT FOR UNCERTAINTY E. ZEGGINI and J.L. ASIMIT Wellcome Trust Sanger Institute, Hinxton, CB10 1HH,

More information

Genome Wide Association Studies

Genome Wide Association Studies Genome Wide Association Studies Liz Speliotes M.D., Ph.D., M.P.H. Instructor of Medicine and Gastroenterology Massachusetts General Hospital Harvard Medical School Fellow Broad Institute Outline Introduction

More information

Genetic Variation and Genome- Wide Association Studies. Keyan Salari, MD/PhD Candidate Department of Genetics

Genetic Variation and Genome- Wide Association Studies. Keyan Salari, MD/PhD Candidate Department of Genetics Genetic Variation and Genome- Wide Association Studies Keyan Salari, MD/PhD Candidate Department of Genetics How many of you did the readings before class? A. Yes, of course! B. Started, but didn t get

More information

EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011

EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011 EPIB 668 Genetic association studies Aurélie LABBE - Winter 2011 1 / 71 OUTLINE Linkage vs association Linkage disequilibrium Case control studies Family-based association 2 / 71 RECAP ON GENETIC VARIANTS

More information

Familial Breast Cancer

Familial Breast Cancer Familial Breast Cancer SEARCHING THE GENES Samuel J. Haryono 1 Issues in HSBOC Spectrum of mutation testing in familial breast cancer Variant of BRCA vs mutation of BRCA Clinical guideline and management

More information

Principal Component Analysis in Genomic Data

Principal Component Analysis in Genomic Data Principal Component Analysis in Genomic Data Seunggeun Lee Department of Biostatistics University of North Carolina at Chapel Hill March 4, 2010 Seunggeun Lee (UNC-CH) PCA March 4, 2010 1 / 12 Bio Korean

More information

Genome-wide association studies (GWAS) Part 1

Genome-wide association studies (GWAS) Part 1 Genome-wide association studies (GWAS) Part 1 Matti Pirinen FIMM, University of Helsinki 03.12.2013, Kumpula Campus FIMM - Institiute for Molecular Medicine Finland www.fimm.fi Published Genome-Wide Associations

More information

Analysis of genome-wide genotype data

Analysis of genome-wide genotype data Analysis of genome-wide genotype data Acknowledgement: Several slides based on a lecture course given by Jonathan Marchini & Chris Spencer, Cape Town 2007 Introduction & definitions - Allele: A version

More information

Perceptions of genetic counseling services in direct-to-consumer personal genomic testing

Perceptions of genetic counseling services in direct-to-consumer personal genomic testing Clin Genet 2013: 84: 335 339 Printed in Singapore. All rights reserved 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd CLINICAL GENETICS doi: 10.1111/cge.12166 Social and Behavioural Research

More information

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary

Personal Genomics Platform White Paper Last Updated November 15, Executive Summary Executive Summary Helix is a personal genomics platform company with a simple but powerful mission: to empower every person to improve their life through DNA. Our platform includes saliva sample collection,

More information

Understanding genetic association studies. Peter Kamerman

Understanding genetic association studies. Peter Kamerman Understanding genetic association studies Peter Kamerman Outline CONCEPTS UNDERLYING GENETIC ASSOCIATION STUDIES Genetic concepts: - Underlying principals - Genetic variants - Linkage disequilibrium -

More information

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016

CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 CS273B: Deep Learning in Genomics and Biomedicine. Recitation 1 30/9/2016 Topics Genetic variation Population structure Linkage disequilibrium Natural disease variants Genome Wide Association Studies Gene

More information

Exploring the Genetic Basis of Congenital Heart Defects

Exploring the Genetic Basis of Congenital Heart Defects Exploring the Genetic Basis of Congenital Heart Defects Sanjay Siddhanti Jordan Hannel Vineeth Gangaram szsiddh@stanford.edu jfhannel@stanford.edu vineethg@stanford.edu 1 Introduction The Human Genome

More information

Genome-Wide Association Studies. Ryan Collins, Gerissa Fowler, Sean Gamberg, Josselyn Hudasek & Victoria Mackey

Genome-Wide Association Studies. Ryan Collins, Gerissa Fowler, Sean Gamberg, Josselyn Hudasek & Victoria Mackey Genome-Wide Association Studies Ryan Collins, Gerissa Fowler, Sean Gamberg, Josselyn Hudasek & Victoria Mackey Introduction The next big advancement in the field of genetics after the Human Genome Project

More information

Human Genetics and Gene Mapping of Complex Traits

Human Genetics and Gene Mapping of Complex Traits Human Genetics and Gene Mapping of Complex Traits Advanced Genetics, Spring 2015 Human Genetics Series Thursday 4/02/15 Nancy L. Saccone, nlims@genetics.wustl.edu ancestral chromosome present day chromosomes:

More information

Multi-SNP Models for Fine-Mapping Studies: Application to an. Kallikrein Region and Prostate Cancer

Multi-SNP Models for Fine-Mapping Studies: Application to an. Kallikrein Region and Prostate Cancer Multi-SNP Models for Fine-Mapping Studies: Application to an association study of the Kallikrein Region and Prostate Cancer November 11, 2014 Contents Background 1 Background 2 3 4 5 6 Study Motivation

More information

Association studies (Linkage disequilibrium)

Association studies (Linkage disequilibrium) Positional cloning: statistical approaches to gene mapping, i.e. locating genes on the genome Linkage analysis Association studies (Linkage disequilibrium) Linkage analysis Uses a genetic marker map (a

More information

Introduction to Genome Wide Association Studies 2014 Sydney Brenner Institute for Molecular Bioscience/Wits Bioinformatics Shaun Aron

Introduction to Genome Wide Association Studies 2014 Sydney Brenner Institute for Molecular Bioscience/Wits Bioinformatics Shaun Aron Introduction to Genome Wide Association Studies 2014 Sydney Brenner Institute for Molecular Bioscience/Wits Bioinformatics Shaun Aron Genotype calling Genotyping methods for Affymetrix arrays Genotyping

More information

Introduction to Genome Wide Association Studies 2015 Sydney Brenner Institute for Molecular Bioscience Shaun Aron

Introduction to Genome Wide Association Studies 2015 Sydney Brenner Institute for Molecular Bioscience Shaun Aron Introduction to Genome Wide Association Studies 2015 Sydney Brenner Institute for Molecular Bioscience Shaun Aron Many sources of technical bias in a genotyping experiment DNA sample quality and handling

More information

INTERDISCIPLINARY COLLABORATIONS MEDICAL RECORDS AND GENOMICS (EMERGE) NETWORK AND EHRS: THE ELECTRONIC. October 30, 2015

INTERDISCIPLINARY COLLABORATIONS MEDICAL RECORDS AND GENOMICS (EMERGE) NETWORK AND EHRS: THE ELECTRONIC. October 30, 2015 INTERDISCIPLINARY COLLABORATIONS AND EHRS: THE ELECTRONIC MEDICAL RECORDS AND GENOMICS (EMERGE) NETWORK October 30, 2015 Dana C. Crawford, PhD Associate Professor Epidemiology and Biostatistics Institute

More information

Clinical Applications in Pharmacogenomics/Genomic Medicine. Post-Course Survey

Clinical Applications in Pharmacogenomics/Genomic Medicine. Post-Course Survey Clinical Applications in Pharmacogenomics/Genomic Medicine Post-Course Survey Note: Students will be asked questions specific to the course in which they are enrolled. This is denoted throughout by use

More information

Computational Workflows for Genome-Wide Association Study: I

Computational Workflows for Genome-Wide Association Study: I Computational Workflows for Genome-Wide Association Study: I Department of Computer Science Brown University, Providence sorin@cs.brown.edu October 16, 2014 Outline 1 Outline 2 3 Monogenic Mendelian Diseases

More information

THE HEALTH AND RETIREMENT STUDY: GENETIC DATA UPDATE

THE HEALTH AND RETIREMENT STUDY: GENETIC DATA UPDATE : GENETIC DATA UPDATE April 30, 2014 Biomarker Network Meeting PAA Jessica Faul, Ph.D., M.P.H. Health and Retirement Study Survey Research Center Institute for Social Research University of Michigan HRS

More information

Appendix 5: Details of statistical methods in the CRP CHD Genetics Collaboration (CCGC) [posted as supplied by

Appendix 5: Details of statistical methods in the CRP CHD Genetics Collaboration (CCGC) [posted as supplied by Appendix 5: Details of statistical methods in the CRP CHD Genetics Collaboration (CCGC) [posted as supplied by author] Statistical methods: All hypothesis tests were conducted using two-sided P-values

More information

Human SNP haplotypes. Statistics 246, Spring 2002 Week 15, Lecture 1

Human SNP haplotypes. Statistics 246, Spring 2002 Week 15, Lecture 1 Human SNP haplotypes Statistics 246, Spring 2002 Week 15, Lecture 1 Human single nucleotide polymorphisms The majority of human sequence variation is due to substitutions that have occurred once in the

More information

Estimation problems in high throughput SNP platforms

Estimation problems in high throughput SNP platforms Estimation problems in high throughput SNP platforms Rob Scharpf Department of Biostatistics Johns Hopkins Bloomberg School of Public Health November, 8 Outline Introduction Introduction What is a SNP?

More information

Prostate Cancer Genetics: Today and tomorrow

Prostate Cancer Genetics: Today and tomorrow Prostate Cancer Genetics: Today and tomorrow Henrik Grönberg Professor Cancer Epidemiology, Deputy Chair Department of Medical Epidemiology and Biostatistics ( MEB) Karolinska Institutet, Stockholm IMPACT-Atanta

More information

Genomics Resources in WHI. WHI ( ) Extension Study Steering Committee Meeting Seattle, WA May 05-06, 2011

Genomics Resources in WHI. WHI ( ) Extension Study Steering Committee Meeting Seattle, WA May 05-06, 2011 Genomics Resources in WHI WHI (2010-2015) Extension Study Steering Committee Meeting Seattle, WA May 05-06, 2011 WHI Genomic Resources in dbgap Outcomes and traits in AA and Hispanics GWAS-SHARe Sequencing-ESP

More information

S G. Design and Analysis of Genetic Association Studies. ection. tatistical. enetics

S G. Design and Analysis of Genetic Association Studies. ection. tatistical. enetics S G ection ON tatistical enetics Design and Analysis of Genetic Association Studies Hemant K Tiwari, Ph.D. Professor & Head Section on Statistical Genetics Department of Biostatistics School of Public

More information

Potential of human genome sequencing. Paul Pharoah Reader in Cancer Epidemiology University of Cambridge

Potential of human genome sequencing. Paul Pharoah Reader in Cancer Epidemiology University of Cambridge Potential of human genome sequencing Paul Pharoah Reader in Cancer Epidemiology University of Cambridge Key considerations Strength of association Exposure Genetic model Outcome Quantitative trait Binary

More information

Crash-course in genomics

Crash-course in genomics Crash-course in genomics Molecular biology : How does the genome code for function? Genetics: How is the genome passed on from parent to child? Genetic variation: How does the genome change when it is

More information

Polygenic Influences on Boys & Girls Pubertal Timing & Tempo. Gregor Horvath, Valerie Knopik, Kristine Marceau Purdue University

Polygenic Influences on Boys & Girls Pubertal Timing & Tempo. Gregor Horvath, Valerie Knopik, Kristine Marceau Purdue University Polygenic Influences on Boys & Girls Pubertal Timing & Tempo Gregor Horvath, Valerie Knopik, Kristine Marceau Purdue University Timing & Tempo of Puberty Varies by individual (Marceau et al., 2011) Risk

More information

Supplementary Note: Detecting population structure in rare variant data

Supplementary Note: Detecting population structure in rare variant data Supplementary Note: Detecting population structure in rare variant data Inferring ancestry from genetic data is a common problem in both population and medical genetic studies, and many methods exist to

More information

Imputation. Genetics of Human Complex Traits

Imputation. Genetics of Human Complex Traits Genetics of Human Complex Traits GWAS results Manhattan plot x-axis: chromosomal position y-axis: -log 10 (p-value), so p = 1 x 10-8 is plotted at y = 8 p = 5 x 10-8 is plotted at y = 7.3 Advanced Genetics,

More information

Introduction to Genetics and Pharmacogenomics

Introduction to Genetics and Pharmacogenomics Introduction to Genetics and Pharmacogenomics Ching-Lung Cheung, PhD Assistant Professor, Department of Pharmacology and Pharmacy, Centre for Genomic Sciences, HKU Survey on pharmacogenomic knowledge Survey

More information

Bioinformatic Analysis of SNP Data for Genetic Association Studies EPI573

Bioinformatic Analysis of SNP Data for Genetic Association Studies EPI573 Bioinformatic Analysis of SNP Data for Genetic Association Studies EPI573 Mark J. Rieder Department of Genome Sciences mrieder@u.washington washington.edu Epidemiology Studies Cohort Outcome Model to fit/explain

More information

SNPs - GWAS - eqtls. Sebastian Schmeier

SNPs - GWAS - eqtls. Sebastian Schmeier SNPs - GWAS - eqtls s.schmeier@gmail.com http://sschmeier.github.io/bioinf-workshop/ 17.08.2015 Overview Single nucleotide polymorphism (refresh) SNPs effect on genes (refresh) Genome-wide association

More information

Single Nucleotide Polymorphisms (SNPs)

Single Nucleotide Polymorphisms (SNPs) Single Nucleotide Polymorphisms (SNPs) Sequence variations Single nucleotide polymorphisms Insertions/deletions Copy number variations (large: >1kb) Variable (short) number tandem repeats Single Nucleotide

More information

What is genetic variation?

What is genetic variation? enetic Variation Applied Computational enomics, Lecture 05 https://github.com/quinlan-lab/applied-computational-genomics Aaron Quinlan Departments of Human enetics and Biomedical Informatics USTAR Center

More information

Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Supplementary information

Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Supplementary information Fast and accurate genotype imputation in genome-wide association studies through pre-phasing Supplementary information Bryan Howie 1,6, Christian Fuchsberger 2,6, Matthew Stephens 1,3, Jonathan Marchini

More information

Redefine what s possible with the Axiom Genotyping Solution

Redefine what s possible with the Axiom Genotyping Solution Redefine what s possible with the Axiom Genotyping Solution From discovery to translation on a single platform The Axiom Genotyping Solution enables enhanced genotyping studies to accelerate your research

More information

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome. Glossary of Terms Genetics is a term that refers to the study of genes and their role in inheritance the way certain traits are passed down from one generation to another. Genomics is the study of all

More information
