Standard Genetic Nomenclature

Size: px
Start display at page:

Download "Standard Genetic Nomenclature"

Transcription

1 Animal Science Publications Animal Science 2014 Standard Genetic Nomenclature Zhiliang Hu Iowa State University, James M. Reecy Iowa State University, Fiona M. McCarthy University of Arizona Carissa A. Park Iowa State University, Follow this and additional works at: Part of the Agriculture Commons, Animal Sciences Commons, and the Genetics Commons The complete bibliographic information for this item can be found at ans_pubs/171. For information on how to cite this item, please visit howtocite.html. This Book Chapter is brought to you for free and open access by the Animal Science at Iowa State University Digital Repository. It has been accepted for inclusion in Animal Science Publications by an authorized administrator of Iowa State University Digital Repository. For more information, please contact

2 Standard Genetic Nomenclature Abstract Genetics includes the study of genotypes and phenotypes, the mechanisms of genetic control between them, and information transfer between generations. Genetic terms describe processes, genes and traits with which genetic phenomena are examined and described. While the genetic terminologies are extensively discussed in this book and elsewhere, the standardization of their names has been an ongoing process. Therefore, this chapter will only concentrate on discussions about the issues involved in the standardization of gene and trait terminologies. Disciplines Agriculture Animal Sciences Genetics Comments This is a chapter in The Genetics of Cattle, 2nd edition, chapter 24 (2014): 598. Posted with permission. This book chapter is available at Iowa State University Digital Repository:

3 24 Standard Ganetic Nomencl,ture Zhi-Liang Hu, 1 James M. Reecy, 1 Fiona McCarthy2 and Carissa A. Park1 1/owa State University, Ames, Iowa, USA; 2 University of Arizona, Tucson, Arizona, USA Introduction Locus and Gene Names and Symbols Locus name and symbol Allele name and symbol Genotype terminology Gene annotations and the gene ontology (GO) Trait and Phenotype Terminology Traits Super-traits Trait hierarchy and ontology Current status of research Trait and phenotype nomenclature Future Prospects Acknowledgements References Introduction Genetics includes the study of genotypes and phenotypes, the mechanisms of genetic control between them, and information transfer between generations. Genetic terms describe processes, genes and traits with which genetic phenomena are examined and described. While the genetic terminologies are extensively discussed in this book and elsewhere, the standardization of their names has been an ongoing process. Therefore, this chapter will only concentrate on discussions about the issues involved in the standardization of gene and trait terminologies. Readers may wish to refer to online resources (see Table 24.1 for URLs) for lists of the glossaries currently in use. A standardized genetic nomenclature is vital for unambiguous concept description, efficient genetic data management and effective communications not only among scientists, but also among those who are involved in cattle production and genetic improvement. This issue has become even more critical in the post-genomics era due to rapid accumulation of large quantities of genetic and phenotypic data, and the requirement for data management and computational analysis, which increases the need for precise definition and interpretation of gene and trait terms. For example, the Myostatin (MSTN) gene is known as Growth and Differentiation Factor 8 (GDF8 or GDF-8) in some literature and is also referred to as the 'muscle hypertrophy' or 'double-muscling' locus in cattle. While the 598 CAB International The Genetics of Cattle, 2nd Edn (eds D.J. Garrick and A. Ruvinsky)

4 interchangeable use of all these names in the literature can cause confusion, it gets more complicated when one considers paralogous gene duplications across species, which led Rodgers et a/. (2007) to propose MSTN-1 and MSTN- 2. Unfortunately, this naming scheme does not follow the Human Genome Organization (HUGO) Gene Nomenclature Committee (HGNC) guidelines, which indicate that these paralogues should be named MSTNl and MSTN2, respectively. In terms of traits, an example that would benefit from consistent nomenclature is the longissimus dorsi muscle area, which is also referred to as the loin eye area (LEA), loin muscle area (LMA), meat area (MLD), ribeye area (REA), etc. Each of these is known to certain researchers as their default name for the trait. Complexity is further increased by variation in anatomic locations, physiological stages and methods used to measure a given trait. This may seem manageable at first, but once one starts to compare data across different laboratories, publications or species, it quickly becomes very confusing. The 'standard genetic nomenclature' recommendations made by the Committee on Genetic Nomenclature of Sheep and Goats (COGNOSAG) in the 1980s and 1990s initially covered sheep and goats and were later extended. to cattle (Broad et a/., 1999). Dolling (1999) summarized these efforts and abstracted guidelines for practical use. In 2009, an international meeting to discuss coordination of gene names across vertebrate species was held in Cambridge, UK (Bruford, 2010). While we may hesitate to dictate how genetic terms are defined, adopting a standardized genetic nomenclature system enables researchers to more easily manage and compare their data, both within and across species. The emergence of the use of ontologies in biological research has contributed a new way to effectively organize biological data and facilitate analysis of large datasets. Adopting standardized nomenclature will further enable researchers to unambiguously organize and manage their data. When genomic information must be transferred across species to perpetuate genetic discoveries, the role of a standardized genetic nomenclature becomes even more important. The goal of this chapter is to clearly state guidelines for nomenclature, with the hope that they will facilitate comparison of results between experiments and, most importantly, prevent confusion. Locus and Gene Names and Symbols Locus name and symbol The following guidelines for cattle gene nomenclature are adapted and abbreviated from the HUGO Gene Nomenclature Committee (HGNC; see Table 24.1 for URL). A gene is defined as 'a DNA segment that contributes to phenotype/function. In the absence of demonstrated function a gene may be characterized by sequence, transcription or homology.' A locus is not synonymous with a gene. It is defined as 'a point in the genome, identified by a marker, which can be mapped by some means. A locus could be an anonymous non-coding DNA segment or a cytogenetic feature.' A single gene may have numerous loci within it (each may be defined by different markers). A gene name should be short and specific, and convey the character or function of the gene. Gene names should be written using American spelling and contain only Latin letters or a combination of Latin letters and Arabic numerals. A gene symbol should start with the same letter as the gene name. The gene symbol should consist of upper-case Latin letters and possibly Arabic numerals. Gene symbols must be unique. A locus name should be in capitalized Latin letters or a combination of Latin letters and Arabic numerals. A locus symbol should consist of as few Latin letters as possible or a combination of Latin letters and Arabic numerals. The characters of a symbol should always be capital Latin characters and should begin with the initial letter of the name of the locus. If the locus name is two or more words, then the initial letters of each word should be used in the locus symbol. Gene and locus names and symbols should be printed in italics whenever possible; otherwise they should be underlined.

5 500 Z -LHu.etal.) When assigning cattle gene nomenclature, the gene name and symbol should be assigned based on existing HGNC nomenclature when 1:1 human:bovine orthology is well established. Recognized members of gene families should be named following existing naming schemes. Initial efforts to provide information about genes predicted during the cattle genome sequencing project resulted in the assignment of standardized names for cattle genes based on human gene nomenclature (Bovine Genome Sequencing and Analysis Consortium, 2009). There are two categories of novel cattle genes: (i) novel genes predicted by bioinformatic gene prediction programs; and (ii) novel genes that have been studied prior to the completion of the cattle genome. In addition, it is anticipated that, in the future, additional novel genes will be identified by RNA-sequencing experiments. In cases where no strict 1: 1 human orthologue exists that has been assigned nomenclature, the NCB! LOC# or Ensembl 10 should be used as a temporary gene symbol for predicted genes with no known function. In order to assign a symbol/ name to novel genes, they will need to be manually curated and assigned a unique symbol/name following these guidelines. Allele name and symbol These guidelines for allele nomenclature are adapted from Dolling (1999) and mouse genome nomenclature guidelines (see Table 24.1 for URL), consistent with HGNC guidelines. Alleles do not have to be named, but should be assigned symbols. An allele symbol should always be written following the locus symbol. It can consist of Latin letters or a combination of Latin letters and Arabic numerals. An allele name should be as brief as possible, and should convey the variation associated with the allele. If a new allele is similar to one that has already been named, it should be named according to the breed, geographic location or population of origin. If new alleles are to be named for a recognized locus, they should conform to nomenclature established for that locus. The first letter of the allele name should be lower case. The allele name and symbol may be identical for a locus detected by biochemical, serological or nucleotide methods. The HGNC guideline recommends that 'allele designation should be written on the same line as gene symbol separated by an asterisk e.g. PGM1 * 1, the allele is printed as * 1 '. The wild-type allele can be denoted with a+ (e.g. MSTN+). Neither+ nor - symbols should be used in alleles detected by biochemical, serological or nucleotide methods. Null alleles should be designated by the number zero. A single nucleotide polymorphism (SNP) allele should be designated based on its dbsnp _id, followed by a hyphen and the specific nucleotide (e.g. MSTNrs T). If the SNP occurs outside of an identified gene, the SNP locus can be designated using the dbsnp _id as the locus symbol, followed by a hyphen and the nucleotide allelic variants as in rs t. The allele name and symbol should be printed in italics whenever possible; otherwise they should be underlined. Genotype terminology The genotype of an individual should be shown by printing the relevant locus and allele symbols for the two homologous chromosomes concerned, separated by a slash, e.g. MSTNrs T/rs C. Unlinked loci should be separated by a semicolon, e.g. CD 11Rsal-2400!2200; ESRPvuii-5 700/4200. Linked loci should be separated by a space or dash and listed in linkage order (e.g. POU1F1A!G-STCHC/G-PRSS7A!T), or in alphabetical order if the linkage order is not known. For X-linked loci, the hemizygous case should have a /Y following the locus and allele symbol, e.g. AR-Eco57I-1094/Y. Likewise, Y-linked loci should be designated by /X following the locus and allele symbol. Gene annotations and the gene ontology (GO) Advances in genomic technologies require that researchers be able to functionally analyse large, high-throughput datasets to gain insight into the complex systems they are studying. By using the same nomenclature and procedures

6 to describe gene function, gene components can be consistently linked to function in a way that facilitates effective computational analysis and promotes comparative genomics. In 1998, the GO Consortium was formed to standardize functional annotation in the form of gene ontologies that can be used across all eukaryotes (Gene Ontology Consortium, 2000). This effort not only provided a standard method for functional annotation but also promoted data sharing and enabled modelling of functional genomics datasets. The GO consists of three separate ontologies: Biological Process, Cellular Component, and Molecular Function. Genes or gene products are associated with GO terms that represent gene attributes. A GO term is defined with a term name, a unique identifier and a definition (preferably indicating which of the three sub-ontologies it belongs to, information about its relationships to other GO terms and cited sources). GO terms may also have synonyms, database cross-references and comments to provide more detailed information. A unique GO identifier consists of the prefix 'GO' followed by a colon and six to eight numerical digits, e.g. G0: It serves as a key to reference GO terms in a GO database. An example of a GO term is shown in Fig Standard GO annotations are maintained by the GO Consortium (see Table 24.1 for URL), which provides updates of qualitychecked data for public access. The GO annotations are used by secondary source databases like Entrez Gene (see Table 24.1 for URL; Sayers eta/., 2012) and UniProt (UniProt Consortium, 2010), genome browsers like Ensembl (see Table 24.1 for URL; Flicek, 2013), and analysis tools like DAVID (see Table 24.1 for URL; Huang, 2009), among other publicly accessible resources and tools. A growing number of model organism and livestock animal species (including bovine) databases and working groups contribute annotation sets to the GO repository (McCarthy, 2007; Reese, 2010). GO annotations are created by capturing the gene product information (database, database accession, name and symbol, type of gene product and species taxon), its associated GO term, GO sub-ontology and evidence for the assertion with references. The current practice for bovine GO annotation is to provide names and symbols based upon a combination of NCBI Entrez Gene and UniProtKB names. In instances where there is no suitable gene symbol, database accessions are used. Continued efforts are made to improve the accuracy of the bovine GO annotations by transferring GO annotations from better annotated proteins in human and mouse based on Ensembl orthology. As of September 2012, GO annotation for bovine (McCarthy, 2007) comprises 306,7 46 annotation entries for 41,63 7 gene products; 86.7% of these annotations id: G0: name: lactase activity namespace: molecular_function def: "Catalysis of the reaction: lactose + H20 = D-glucose + D galactose. [EC: ] synonym: "lactase-phlorizin hydrolase activity BROAD [EC: ] synonym: "lactose galactohydrolase activity EXACT [EC: ] xref: EC: xref: MetaCyc:LACTASE-RXN xref: Reactome:20536 is_a: G0: ! hydrolase activity, hydrolyzing 0-glycosyl compounds Fig An example of a GO term. (For further information, see Table 24.1 for GO website URL.)

7 are computationally derived (AgBase: see Table 24.1 for URL). To contribute annotations to the GO, or for a complete list of bovine GO data, users are encouraged to contact either the GO Consortium or AgBase at their respective websites. Trait and Phenotype Terminology Cattle traits are conventionally named based on performance (e.g. body weight), physiological parameters (e.g. blood cholesterol level), anatomic locations/dissections (e.g. loin muscle area), physical-chemical properties (e.g. milk protein content), livelihood soundness (e.g. immune capacity) and exterior appearance (e.g. coat colour), etc. As such, there is a good chance a trait will be named differently by different people, even within a species community. Furthermore, traits have been studied across many species, which adds additional complexity to their naming. The study of traits may also involve the study of underlying genes and markers, environments and management protocols that contribute to the manifestation of a trait. Therefore, it is obvious that factors that contribute to the naming of a trait are multi-dimensional. As the amount of trait information associated with a gene or chromosomal region is growing exponentially, we cannot overemphasize the need for a standard nomenclature to be used by researchers to communicate as consistently and unambiguously as possible, with the aid of bioinformatics tools. Traits Cattle trait terms can be found ubiquitously throughout journal articles, farm reports and daily communications among scientists and cattle industry personnel. A trait term can be created by anyone, and each person may have a slightly different definition for any given term. As such, hundreds of thousands of terms can be found in the literature with various naming conventions used. Previously, there was no central repository where the uniqueness of a trait term could be maintained and checked, until two relatively recent database development efforts emerged: the Online Mendelian Inheritance in Animals (OMIA) database and the Animal QTL database (QTLdb). OMIA (see Table 24.1 for URL) was initiated in To date, it contains >400 cattle trait variations and/or abnormalities from cattle genetic research publications (Nicholas, Chapter 5). The Animal QTLdb (see Table 24.1 for URL) has a collection of 4 70 cattle traits, including measurement method variations (Hu eta/., 2013), of which 407 traits have at least one QTL. Curators at both OMIA and Animal QTLdb made efforts to make each database entry unique in terms of the names and their representations. Expanded from. the QTLdb development, an Animal Trait Ontology (ATO) project at Iowa State University (see Table 24.1 for URL) has been launched to standardize traits for livestock species including cattle. Its initial purpose was to help with organization and management of trait information through the use of a controlled vocabulary to facilitate comparison of QTL results and standardize trait data annotation and retrieval (Hu et a/., 2005, 2007). It was soon introduced to the community (Hughes eta/., 2008). Super-traits Compared to standard gene nomenclature, trait name standardization is far more complex, not only because the same trait can be named differently (e.g. 'loin eye area' versus 'ribeye area'), but also because many factors contribute to how a trait is defined under various circumstances. For example, Fig shows a list of 10 'backfat thickness' variations, each of which is defined by their specific measurement methods, measuring time and specific anatomic locations, which may contribute to trait comparison difficulties and increase the potential for confusion. One attempt to simplify the comparisons was by introduction of the concept of 'trait types' or 'super-traits'. Hu et a/. (2005) described trait type as a general physical or chemical property of, or the processes that lead to, or types of measurements that result in, an observation (phenotype). The 'trait types'

8 ~~a~~~~~""~fhejil~~~~ 60S Backfat thickness (average backfat) by ultrasound Backfat thickness (average backfat) by ruler Backfat thickness at the 7th rib Backfat thickness at the 121h rib Backfat thickness at the 12th_ 13th rib Backfat thickness at the 13th rib Backfat thickness measured at 1-3 days postpartum } Backfat thickness measured at days postpartum Backfat thickness measured at days postpartum Backfat thickness measured at days postpartum } bymethods } by locat;orn; "'"""' Fig An example of the trait name variations by different modifiers such as measurement methods, time and sampling locations. This variation can easily add difficulties for accurate and unambiguous trait comparisons. or 'super-traits' were initially used to serve as a general concept for a trait, regardless of possible variations in trait names based on measurement times, locations or methods. As the ATO project progressed, the factors in the methods of trait measurements, such as point in time or time span, anatomic locations, instruments, etc., were classified as 'trait modifiers', because they do not constitute a component of a trait, but only affect the way a trait is described. Therefore, the 'super-trait' may only be employed to categorize variations in how a trait is defined or named. For example, 'rib eye area', 'rib-eye area', 'rib muscle area', 'longissimus dorsi muscle area', 'longissimus muscle area', 'loin eye area', 'loin muscle area', etc. can be unified as 'longissimus dorsi muscle area (LMA)'. 'Backfat', 'backfat depth', 'backfat thickness', 'backfat above muscle dorsi', 'backfat intercept', 'backfat linear', etc. may all simply be referred to as 'subcutaneous fat thickness'. Trait hierarchy and ontology In order to compare QTL across experiments, the Cattle QTLdb uses a trait hierarchy (Fig. 24.3) to provide a framework for organizing the traits and easily locating them (Hu et a/., 2013). This approach simplifies the procedures by which traits are defined, linked and compared. Subsequently, a computer program could be implemented to automatically process the database searches, so that when a user queries for a trait by keywords, the database can gather and retrieve related trait names and their associated QTL, put them together and present them to the user in real time. However, people of different disciplines may see the need for a different trait hierarchy, which may better capture the subtleties required in their field. For example, for body weight gained over a period of time (e.g. average daily gain, ADG), a farmer considers it a production trait, a nutritionist may see it as an indicator for feed conversion efficiency and a veterinarian may find it a health status parameter. Similarly, blood cholesterol levels may be used to predict meat quality by beef producers, and may also be used as a parameter to predict coronary heart disease by those who use cattle as an animal model for human heart disease research. Therefore, a simple hierarchy may be helpful to reduce the complexity in some cases, although may not be adequate in all cases. In addition, due to the existence of multiple overlapping hierarchies for cattle traits, the management of such data may introduce one more dimension of complexity to the ontology structure. Ontologies are controlled vocabularies used to describe objects and relationships between them in a formal manner. In an ontology, the Directed Acyclic Graph (DAG), a mathematical graphic modelling method, is used to solve data management problems with complex hierarchical structures. For example, the trait 'marbling' may belong to the 'meat quality', 'adipose trait' or 'muscular system physiology' hierarchies. Computer tools have been developed and are freely available to manage such ontology data with DAG structures. The two most popular tools that are likely to be useful to the cattle genetics community

9 Cattle traits Disease susceptibility General health parameters Mastitis Organ disorder Parasite load Parasite resistance Carcass characteristics Meat quality + Milk composition - fat Milk composition - other Milk composition - protein Milk processing trait Milk yield Production traits Energy efficiency Feed conversion Feed intake Growth Life history traits Lifetime production Reproduction traits Fertility General Semen quality Behavioural Conformation Pigmentation Fig A simple cattle trait class hierarchy used in the Animal QTLdb for users to browse for traits of interest. (See Table 24.1 for URL.) are AmiGO and OBO-Edit (Gene Ontology Tools, see Table 24.1 for URL). AmiGO is an ontology browser adapted to the ATO database, which allows users to share and view trait data stored in ATO with any web browser on the internet. OBO-Edit is a java-based ontology data editor that can be used by anyone to edit ontology term definitions and relationships, and to export data in Open Biological/Biomedical Ontologies (080) format to share data. Current status of research The ATO has been a successful project since its development from the QTLdb several years ago. Recently, the developers of ATO have begun working with Mouse Genome Informatics, the Rat Genome Database, European Animal Disease Genomics Network of Excellence (EADGENE) and the French National Institute for Agricultural Research (INRA) to incorporate the Mammalian Phenotype Ontology (MPO) and the ATO into a unified Vertebrate Trait (VT) Ontology (Park et a/., 2013; see Table 24.1 for URL). To reach a proper granularity level of the trait ontology, Product Trait (PT) Ontology (see Table 24.1 for URL) and Clinical Measurement Ontology (CMO; Shimoyama et a/., 2012; see Table 24.1 for URL) were introduced. By reuse of existing ontologies and integration of production-specific livestock traits, researchers at INRA have also launched an Animal Trait Ontology for Livestock (ATOL) site, containing over 1000 traits including those of cattle (Golik eta/., 2012). Current efforts have been aimed at enhancing the ability to standardize trait nomenclature within and across species. For example, a disease such as mastitis in dairy cattle may have been considered a 'trait' in classical animal genetic studies. In fact, in terms of concept specifications, it is not a characteristic cattle trait observable in the general population, but rather an abnormal manifestation in some cattle (in fact, resistance to mastitis is a trait). In addition, a trait name may have variations because it is 'modified' by measurement time or method (Fig. 24.2), but the names actually represent the same trait. The separation of diseases from traits reflects the efforts toward a well-defined and standardized trait nomenclature. Standardization of the trait nomenclature will undoubtedly help the cattle genomics community make meaningful trait comparisons, as well as facilitate the transfer of genomics information from some well-studied species. The challenge of using ontologies to standardize and manage trait nomenclature is not only a technical issue, but a community issue, in the sense that it has to be commonly recognized, mutually agreed upon, and widely shared.

10 (standard Genetic Nomendat~re 005 Trait and phenotype nomenclature Until an international committee issues rules for trait and phenotype nomenclature, a good practice with wide acceptability is to follow the 'norm' in published materials. Listed in Table 24.1 are some of the best trait reference resources available to date (see table footnote for details). Since this has been an active research area in recent years, it is highly recommended that users check multiple databases for the best and most up-to-date information. Phenotype is the actual manifestation of observable traits. A phenotype is a trait observed in an individual. It usually consists of a trait with characteristic features (e.g. twinning), variations that can be described (e.g. black spots on the body) or qualities that can be measured (e.g. birth weight of 30 kg). Since there are so many variations as to how a phenotype can be 'observed' (often such observation is made indirectly with instruments or through tests) and obtained, a technical guide for recording each trait might be ideal. Often a description of comments for a phenotype record may be necessary to correctly understand and use the data. For example, when blood samples are taken, the number of hours the animal is fasted might be an important co-factor for the measurement of blood cholesterol concentration. When a phenotype is a reflection of a certain genotype, the phenotype symbol should be the same as the genotype symbol. The difference is that the characters should not be underlined or in italics, and they should be written with a space between locus characters and allele characters, instead of an asterisk. Square brackets [ ] may also be used. In classical genetics, phenotypes were sometimes used to denote Mendelian genotypes. This was done using an abbreviation of the trait, post-fixed with a plus(+) or minus(-) sign to represent 'presence' or 'absence' of certain trait features. For example, halothanenegative was denoted as 'Hal-', and halothanepositive as 'Hal+'. A phenotype denotation can also be used to represent genetic haplotypes, such that 'K88ab+, ac+, ad-' are written together as an entire denotation. Likewise, numbers or letters may be used to denote alleles when polymorphisms are observed, for example, ApoB1/2, ApoB2/3, etc. (Note the difference from recording genotypes, where italics or asterisks are required.) Future Prospects The Gene Ontology and Mammalian Phenotype Ontology are already playing a role in robust annotation of mammalian genes and phenotypes in the context of mutations, quantitative trait loci, etc. (Smith et a/., 2005). Undoubtedly, a standardized cattle genetic nomenclature will more effectively facilitate efficient cattle genome annotation and transfer of knowledge from information-rich species such as humans and mouse, and make it possible for new bioinformatics tools to easily streamline data management and genetic analysis. Meanwhile, it is noteworthy to mention that the term 'phene' for 'trait' is being used more frequently in the scientific literature in recent years. It is interesting that in terms of etymology lineage, 'phene' is to 'phenotype' and 'phenome' as 'gene' is to 'genotype' and 'genome' (Wikipedia, 2012), where 'phene' is an equivalent term for 'trait'. However, Dr Frank Nicholas from the University of Sydney has used the term 'phene' in OMIA in a slightly different but more concise context, namely 'phene is to gene as phenotype is to genotype', where 'phene' refers to a set of phenotypes that correspond to a set of genotypes determined by a gene. This is practically very useful in light of the future structured genetic terminology standardization in the genomics era. Several genome databases, such as ArkDB, Animal QTLdb, Bovine Genome Database, Ensembl and NCB! GeneDB, have played a role in the usage of commonly accepted gene/trait notations. Undoubtedly, existing and new genome databases and tools will further develop and evolve. As such, a standardized genetic nomenclature in cattle will definitely become crucial for information sharing and comparisons between different research groups, across experiments and even across species. Recently the Animal Genetics journal has updated its Author Guidelines insisting that proper gene nomenclature be followed: 'All gene names and symbols should be italicized throughout the text, table and figures'; 'Locus

11 Table Internet URL addresses for the web resources used in this chapter and cattle trait glossary information. Data source URL AgBASE Animal QTLdb Animal Trait Ontology project ATOL Cattle trait hierarchy CMO project DAVID Ensembl Entrez Gene Gene Ontology Tools Genetic glossaries GO Consortium GO structure HGNC guidelines Mouse genome nomenclature guidelines OMIA PT project UniProt VT project (BioPortal) Resource:Gene_ Ontology_ Tools (BioPortal) VT, Vertebrate Trait Ontology is a controlled vocabulary for the description of traits (measurable or observable characteristics) pertaining to the morphology, physiology or development of vertebrate organisms. CMO, Clinical Measurement Ontology is designed to be used to standardize morphological and physiological measurement records generated from clinical and model organism research and health programmes. PT, Product Trait Ontology is a controlled vocabulary for the description of traits (measurable or observable characteristics) pertaining to products produced by or obtained from the body of an agricultural animal or bird maintained for use and profit. QTLdb, Animal QTLdb is a database to house all QTL data for all livestock species. OMIA, Online Mendelian Inheritance in Animals is a comprehensive collection of phenotypic information on heritable animal traits and genes in a comparative context, relating traits to genes where possible. ATOL, Animal Trait Ontology for Livestock is aimed at defining livestock traits, with a focus on the main types of animal production in line with societal priorities.

12 (stand&'4 ~~No!J'I~~clatut$ 607 symbols used in Animal Genetics publications must be confirmed with HGNC' and 'nonhuman gene names should be checked against NCBI's Entrez Gene database'. This is a good move towards educating the community on the proper use of standardized genetic nomenclatures. Active development and use of a standardized genetic nomenclature will surely help to improve data quality and reusability, and facilitate data comparisons between experiments, laboratories, even species. Acknowledgements The authors wish to thank Dr Frank Nicholas from the University of Sydney for useful discussions, inputs and kind review of the draft. References Bovine Genome Sequencing and Analysis Consortium eta/. (2009) The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324, Broad, T.E., Dolling, C.H.S., Lauvergne, J.J. and Millar, P. (1999) Revised COGNOSAG guidelines for gene nomenclature in ruminants Genetics, Selection, Evolution 31, Bruford, E.A. (201 0) Highlights of the 'Gene Nomenclature Across Species' meeting. Human Genomics 4, Dolling, C.H.S. (1999) Standardized genetic nomenclature for cattle. In: Fries, R. and Ruvinsky, A. (eds) The Genetics of Cattle. CAB International, Wallingford, UK, pp Flicek, P., Ahmed, 1., Amode, M.R., Barrell, D., Beal, K., Brent, S., Carvalho-Silva, D., Clapham, P., Coates, G., Fairley, S. eta/. (2013) Ensembl Nucleic Acids Research 41 (Database issue), D48-D55. Gene Ontology Consortium (2000) Gene Ontology: tool for the unification of biology. Nature Genetics 25, Golik, W., Dameron, 0., Bugeon, J., Fatet, A., Hue, 1., Hurtaud, C., Reichstadt, M., Meunier-Salaun, M.C., Vernet, J., Joret, L. eta/. (2012) ATOL: the multi-species livestock trait ontology. 6th International Conference on Metadata and Semantic Research (MTSR'12), Cadiz, Spain, November. Hu, Z.-L., Dracheva, S., Jang, W.-H., Maglott, D., Bastiaansen, J., Rothschild, M.F. and Reecy, J.M. (2005) A QTL resource and comparison tool for cattle: PigQTLDB. Mammalian Genome 16, Hu, Z.-L., Fritz, E.R. and Reecy, J.M. (2007) AnimaiQTLdb: a livestock QTL database tool set for positional QTL information mining and beyond. Nucleic Acids Research 35 (Database issue), D604-D609. Hu, Z.-L., Park, C.A., Wu, X.-L. and Reecy, J.M. (2013) Animal QTLdb: an improved database tool for livestock animal QTUassociation data dissemination in the post-genome era. Nucleic Acids Research 41' D871-D879. Huang, D:W., Sherman, B.T. and Lempicki, R.A. (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols 4, Hughes, L.M., Bao, J., Hu, Z.-L., Honavar, V.G. and Reecy, J.M. (2008) Animal Trait Ontology (ATO): The importance and usefulness of a unified trait vocabulary for animal species. Journal of Animal Science 86, McCarthy, F.M., Bridges, S.M., Wang, N., Magee, G.B., Williams, W.P., Luthe, D.S. and Burgess, S.C. (2007) AgBase: a unified resource for functional analysis in agriculture. Nucleic Acids Research 35 (Database issue), D599-D603. Park, C.A., Bello, S.M., Smith, C.L., Hu, Z.-L., Munzenmaier, D.H., Nigam, R., Smith, J.R., Shimoyama, M., Eppig, J.T. and Reecy, J.M. (2013) The Vertebrate Trait Ontology: a controlled vocabulary for the annotation of trait data across species. Journal of Biomedical Semantics 4, 13. Reese, J.T., Childers, C.P., Sundaram, J.P., Dickens, C.M., Childs, K.L., Vile, D.C. and Elsik, C.G. (2010) Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome. BMC Genomics 11, 645. Rodgers, B.D., Roalson, E. H., Weber, G.M., Roberts, S.B. and Goetz, F.W. (2007) A proposed nomenclature consensus for the myostatin gene family. American Journal of Physiology- Endocrinology and Metabolism 292, E371-E372.

13 008 l.-l.. Hu et al.) Sayers, E.W., Barrett, T., Benson, D.A., Bolton, E., Bryant, S.H., Canese, K., Chetvernin, V., Church, D.M., Dicuccio, M., Federhen, S. eta/. (2012) Database resources of the National Center for Biotechnology Information. Nucleic Acids Research 40 (Database issue), D13-D25. Shimoyama, M., Nigam, R., Mcintosh, L.S., Nagarajan, R., Rice, T., Rao, D.C. and Dwinell, M.R. (2012) Three ontologies to define phenotype measurement data. Frontiers in Genetics 3, 87. Smith, C.L., Goldsmith, C.A. and Epcattle, J.T. (2005) The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic information. Genome Biology6, R7. UniProt Consortium (201 O) The Universal Protein Resource (UniProt) in Nucleic Acids Research 38 (Database issue), D142-D148. Wikipedia (2012) Phene. Available at: (accessed 30 March 2013).

Standard Genetic Nomenglature of the Pig, with Glossaries

Standard Genetic Nomenglature of the Pig, with Glossaries Animal Science Publications Animal Science 2011 Standard Genetic Nomenglature of the Pig, with Glossaries Zhiliang Hu Iowa State University, zhu@iastate.edu Carissa A. Park Iowa State University, caripark@iastate.edu

More information

USDA NRSP-8 BIOINFORMATICS COORDINATION PROGRAM ACTIVITIES

USDA NRSP-8 BIOINFORMATICS COORDINATION PROGRAM ACTIVITIES USDA NRSP-8 BIOINFORMATICS COORDINATION PROGRAM ACTIVITIES Supported by Regional Research Funds, Hatch Act for the Period 10/1/15-9/30/16 James Reecy, Sue Lamont, Chris Tuggle, Max Rothschild and Fiona

More information

Genetic Nomenclature. Zhiliang Hu Iowa State University, James M. Reecy Iowa State University,

Genetic Nomenclature. Zhiliang Hu Iowa State University, James M. Reecy Iowa State University, Animal Science Publications Animal Science 2012 Genetic Nomenclature Zhiliang Hu Iowa State University, zhu@iastate.edu James M. Reecy Iowa State University, jreecy@iastate.edu Follow this and additional

More information

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology.

This place covers: Methods or systems for genetic or protein-related data processing in computational molecular biology. G16B BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY Methods or systems for genetic

More information

Niemann-Pick Type C Disease Gene Variation Database ( )

Niemann-Pick Type C Disease Gene Variation Database (   ) NPC-db (vs. 1.1) User Manual An introduction to the Niemann-Pick Type C Disease Gene Variation Database ( http://npc.fzk.de ) curated 2007/2008 by Dirk Dolle and Heiko Runz, Institute of Human Genetics,

More information

Optimizing Traditional and Marker Assisted Evaluation in Beef Cattle

Optimizing Traditional and Marker Assisted Evaluation in Beef Cattle Optimizing Traditional and Marker Assisted Evaluation in Beef Cattle D. H. Denny Crews, Jr., 1,2,3 Stephen S. Moore, 2 and R. Mark Enns 3 1 Agriculture and Agri-Food Canada Research Centre, Lethbridge,

More information

Genetics and Bioinformatics

Genetics and Bioinformatics Genetics and Bioinformatics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be Lecture 1: Setting the pace 1 Bioinformatics what s

More information

Applied Bioinformatics

Applied Bioinformatics Applied Bioinformatics In silico and In clinico characterization of genetic variations Assistant Professor Department of Biomedical Informatics Center for Human Genetics Research ATCAAAATTATGGAAGAA ATCAAAATCATGGAAGAA

More information

Sequencing the Swine Genome: Progress and Prospects

Sequencing the Swine Genome: Progress and Prospects Sequencing the Swine Genome: Progress and Prospects Max F. Rothschild C.F. Curtiss Distinguished Professor Department of Animal Science Center of Integrated Animal Genomics Iowa State University mfrothsc@iastate.edu

More information

Chapter 2: Access to Information

Chapter 2: Access to Information Chapter 2: Access to Information Outline Introduction to biological databases Centralized databases store DNA sequences Contents of DNA, RNA, and protein databases Central bioinformatics resources: NCBI

More information

The University of California, Santa Cruz (UCSC) Genome Browser

The University of California, Santa Cruz (UCSC) Genome Browser The University of California, Santa Cruz (UCSC) Genome Browser There are hundreds of available userselected tracks in categories such as mapping and sequencing, phenotype and disease associations, genes,

More information

GREG GIBSON SPENCER V. MUSE

GREG GIBSON SPENCER V. MUSE A Primer of Genome Science ience THIRD EDITION TAGCACCTAGAATCATGGAGAGATAATTCGGTGAGAATTAAATGGAGAGTTGCATAGAGAACTGCGAACTG GREG GIBSON SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc.

More information

Gene-centered resources at NCBI

Gene-centered resources at NCBI COURSE OF BIOINFORMATICS a.a. 2014-2015 Gene-centered resources at NCBI We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about genes, serving

More information

Training materials.

Training materials. Training materials - Ensembl training materials are protected by a CC BY license - http://creativecommons.org/licenses/by/4.0/ - If you wish to re-use these materials, please credit Ensembl for their creation

More information

Section 4 - Guidelines for DNA Technology. Version October, 2017

Section 4 - Guidelines for DNA Technology. Version October, 2017 Section 4 - Guidelines for DNA Technology Section 4 DNA Technology Table of Contents Overview 1 Molecular genetics... 4 1.1 Introduction... 4 1.2 Current and potential uses of DNA technologies... 4 1.2.1

More information

DNA Technologies and Production Markers

DNA Technologies and Production Markers DNA Technologies and Production Markers Alison Van Eenennaam, PhD Cooperative Extension Specialist Animal Biotechnology and Genomics University of California, Davis, USA Email: alvaneenennaam@ucdavis.edu

More information

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1

AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1 AGRO/ANSC/BIO/GENE/HORT 305 Fall, 2016 Overview of Genetics Lecture outline (Chpt 1, Genetics by Brooker) #1 - Genetics: Progress from Mendel to DNA: Gregor Mendel, in the mid 19 th century provided the

More information

Gene-centered databases and Genome Browsers

Gene-centered databases and Genome Browsers COURSE OF BIOINFORMATICS a.a. 2015-2016 Gene-centered databases and Genome Browsers We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about

More information

Gene-centered databases and Genome Browsers

Gene-centered databases and Genome Browsers COURSE OF BIOINFORMATICS a.a. 2016-2017 Gene-centered databases and Genome Browsers We searched Accession Number: M60495 AT NCBI Nucleotide Gene has been implemented at NCBI to organize information about

More information

The what, why and how of Genomics for the beef cattle breeder

The what, why and how of Genomics for the beef cattle breeder The what, why and how of Genomics for the beef cattle breeder Este van Marle-Köster (Pr. Anim. Sci) 25 January 2019 Waguy info day 2 Today s talk Focus on what is genomics Brief definition & references

More information

Traditional Genetic Improvement. Genetic variation is due to differences in DNA sequence. Adding DNA sequence data to traditional breeding.

Traditional Genetic Improvement. Genetic variation is due to differences in DNA sequence. Adding DNA sequence data to traditional breeding. 1 Introduction What is Genomic selection and how does it work? How can we best use DNA data in the selection of cattle? Mike Goddard 5/1/9 University of Melbourne and Victorian DPI of genomic selection

More information

Retrieval of gene information at NCBI

Retrieval of gene information at NCBI Retrieval of gene information at NCBI Some notes 1. http://www.cs.ucf.edu/~xiaoman/fall/ 2. Slides are for presenting the main paper, should minimize the copy and paste from the paper, should write in

More information

BIOINFORMATICS FOR DUMMIES MB&C2017 WORKSHOP

BIOINFORMATICS FOR DUMMIES MB&C2017 WORKSHOP Jasper Decuyper BIOINFORMATICS FOR DUMMIES MB&C2017 WORKSHOP MB&C2017 Workshop Bioinformatics for dummies 2 INTRODUCTION Imagine your workspace without the computers Both in research laboratories and in

More information

Proceedings, The Range Beef Cow Symposium XXII Nov. 29, 30 and Dec. 1, 2011, Mitchell, NE. IMPLEMENTATION OF MARKER ASSISTED EPDs

Proceedings, The Range Beef Cow Symposium XXII Nov. 29, 30 and Dec. 1, 2011, Mitchell, NE. IMPLEMENTATION OF MARKER ASSISTED EPDs Proceedings, The Range Beef Cow Symposium II Nov. 29, 30 and Dec. 1, 2011, Mitchell, NE IMPLEMENTATION OF MARKER ASSISTED EPDs Matt Spangler Department of Animal Science University of Nebraska Lincoln,

More information

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome.

Exome Sequencing Exome sequencing is a technique that is used to examine all of the protein-coding regions of the genome. Glossary of Terms Genetics is a term that refers to the study of genes and their role in inheritance the way certain traits are passed down from one generation to another. Genomics is the study of all

More information

Genetics of Beef Cattle: Moving to the genomics era Matt Spangler, Assistant Professor, Animal Science, University of Nebraska-Lincoln

Genetics of Beef Cattle: Moving to the genomics era Matt Spangler, Assistant Professor, Animal Science, University of Nebraska-Lincoln Genetics of Beef Cattle: Moving to the genomics era Matt Spangler, Assistant Professor, Animal Science, University of Nebraska-Lincoln Several companies offer DNA marker tests for a wide range of traits

More information

High-density SNP Genotyping Analysis of Broiler Breeding Lines

High-density SNP Genotyping Analysis of Broiler Breeding Lines Animal Industry Report AS 653 ASL R2219 2007 High-density SNP Genotyping Analysis of Broiler Breeding Lines Abebe T. Hassen Jack C.M. Dekkers Susan J. Lamont Rohan L. Fernando Santiago Avendano Aviagen

More information

Initiatives to Speed up Data Mining. Ségolène Aymé ICORD 2014, Ede, Netherlands 9 October 2014

Initiatives to Speed up Data Mining. Ségolène Aymé ICORD 2014, Ede, Netherlands 9 October 2014 Initiatives to Speed up Data Mining Ségolène Aymé ICORD 2014, Ede, Netherlands 9 October 2014 1 Objectives Make the most of remarkable advances in the molecular basis of human diseases dissect the physiological

More information

MAPPING GENES TO TRAITS IN DOGS USING SNPs

MAPPING GENES TO TRAITS IN DOGS USING SNPs MAPPING GENES TO TRAITS IN DOGS USING SNPs OVERVIEW This instructor guide provides support for a genetic mapping activity, based on the dog genome project data. The activity complements a 29-minute lecture

More information

Two Mark question and Answers

Two Mark question and Answers 1. Define Bioinformatics Two Mark question and Answers Bioinformatics is the field of science in which biology, computer science, and information technology merge into a single discipline. There are three

More information

LowInputBreeds & Genomic breeding

LowInputBreeds & Genomic breeding LowInputBreeds & Genomic breeding Gillian Butler Newcastle University http://www.nefg-organic.org/ COMMUNITY RESEARCH Developing integrated livestock breeding and management strategies to improve animal

More information

B I O I N F O R M A T I C S

B I O I N F O R M A T I C S B I O I N F O R M A T I C S Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be SUPPLEMENTARY CHAPTER: DATA BASES AND MINING 1 What

More information

Whole Genome-Assisted Selection. Alison Van Eenennaam, Ph.D. Cooperative Extension Specialist

Whole Genome-Assisted Selection. Alison Van Eenennaam, Ph.D. Cooperative Extension Specialist Whole Genome-Assisted Selection Alison Van Eenennaam, Ph.D. Cooperative Extension Specialist Animal Biotechnology and Genomics UC Davis alvaneenennaam@ucdavis.edu http://animalscience.ucdavis.edu/animalbiotech/

More information

EBI web resources I: databases and tools. Yanbin Yin Spring 2013

EBI web resources I: databases and tools. Yanbin Yin Spring 2013 EBI web resources I: databases and tools Yanbin Yin Spring 2013 1 Outline Intro to EBI Databases and web tools UniProt Gene Ontology Hands on PracBce MOST MATERIALS ARE FROM: hkp://www.ebi.ac.uk/training/online/course-

More information

Domestic animal genomes meet animal breeding translational biology in the ag-biotech sector. Jerry Taylor University of Missouri-Columbia

Domestic animal genomes meet animal breeding translational biology in the ag-biotech sector. Jerry Taylor University of Missouri-Columbia Domestic animal genomes meet animal breeding translational biology in the ag-biotech sector Jerry Taylor University of Missouri-Columbia Worldwide distribution and use of cattle A brief history of cattle

More information

ANIMAL SCIENCE (ANSI)

ANIMAL SCIENCE (ANSI) Animal Science (ANSI) 1 ANIMAL SCIENCE (ANSI) ANSI 1111 Animal and Food Science Experience Description: Student development through connections among the student s major curriculum, career goals specific

More information

Question. In the last 100 years. What is Feed Efficiency? Genetics of Feed Efficiency and Applications for the Dairy Industry

Question. In the last 100 years. What is Feed Efficiency? Genetics of Feed Efficiency and Applications for the Dairy Industry Question Genetics of Feed Efficiency and Applications for the Dairy Industry Can we increase milk yield while decreasing feed cost? If so, how can we accomplish this? Stephanie McKay University of Vermont

More information

Chapter 9. Gene Interactions. As we learned in Chapter 3, Mendel reported that the pairs of loci he observed segregated independently

Chapter 9. Gene Interactions. As we learned in Chapter 3, Mendel reported that the pairs of loci he observed segregated independently Chapter 9 Gene Interactions Figure 9-1 The coat colour on this juvenile horse is called Bay Roan Tobiano. Bay is the brown base coat colour; Roan is the mixture of white hairs with the base coat, making

More information

Triticeae Gene Nomenclature

Triticeae Gene Nomenclature A Wheat Initiative workshop Triticeae Gene Nomenclature 11-13 October 2016, Munich (Germany) Report 1 Background A number of high quality genome sequences have been completed in recent years for species

More information

BLUP and Genomic Selection

BLUP and Genomic Selection BLUP and Genomic Selection Alison Van Eenennaam Cooperative Extension Specialist Animal Biotechnology and Genomics University of California, Davis, USA alvaneenennaam@ucdavis.edu http://animalscience.ucdavis.edu/animalbiotech/

More information

Answers to additional linkage problems.

Answers to additional linkage problems. Spring 2013 Biology 321 Answers to Assignment Set 8 Chapter 4 http://fire.biol.wwu.edu/trent/trent/iga_10e_sm_chapter_04.pdf Answers to additional linkage problems. Problem -1 In this cell, there two copies

More information

Big picture and history

Big picture and history Big picture and history (and Computational Biology) CS-5700 / BIO-5323 Outline 1 2 3 4 Outline 1 2 3 4 First to be databased were proteins The development of protein- s (Sanger and Tuppy 1951) led to the

More information

Understanding genomic selection in poultry breeding

Understanding genomic selection in poultry breeding doi:10.1017/s0043933914000324 Understanding genomic selection in poultry breeding A. WOLC 1, 2 1 Hy-Line International, Dallas Center, IA, USA; 2 Iowa State University, Ames, IA, USA Corresponding author:

More information

Axiom mydesign Custom Array design guide for human genotyping applications

Axiom mydesign Custom Array design guide for human genotyping applications TECHNICAL NOTE Axiom mydesign Custom Genotyping Arrays Axiom mydesign Custom Array design guide for human genotyping applications Overview In the past, custom genotyping arrays were expensive, required

More information

Trudy F C Mackay, Department of Genetics, North Carolina State University, Raleigh NC , USA.

Trudy F C Mackay, Department of Genetics, North Carolina State University, Raleigh NC , USA. Question & Answer Q&A: Genetic analysis of quantitative traits Trudy FC Mackay What are quantitative traits? Quantitative, or complex, traits are traits for which phenotypic variation is continuously distributed

More information

Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers

Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Week 1 BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html

More information

Deakin Research Online

Deakin Research Online Deakin Research Online This is the published version: Church, Philip, Goscinski, Andrzej, Wong, Adam and Lefevre, Christophe 2011, Simplifying gene expression microarray comparative analysis., in BIOCOM

More information

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential

The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential The 150+ Tomato Genome (re-)sequence Project; Lessons Learned and Potential Applications Richard Finkers Researcher Plant Breeding, Wageningen UR Plant Breeding, P.O. Box 16, 6700 AA, Wageningen, The Netherlands,

More information

The human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28.

The human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28. Data mining in Ensembl with BioMart Worked Example The human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28. Which other genes related to human

More information

The Gene Ontology Annotation (GOA) project application of GO in SWISS-PROT, TrEMBL and InterPro

The Gene Ontology Annotation (GOA) project application of GO in SWISS-PROT, TrEMBL and InterPro Comparative and Functional Genomics Comp Funct Genom 2003; 4: 71 74. Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/cfg.235 Conference Review The Gene Ontology Annotation

More information

Annotation. (Chapter 8)

Annotation. (Chapter 8) Annotation (Chapter 8) Genome annotation Genome annotation is the process of attaching biological information to sequences: identify elements on the genome attach biological information to elements store

More information

A systematic approach to breeding programs for game PART 1

A systematic approach to breeding programs for game PART 1 A systematic approach to breeding programs for game PART 1 Japie van der Westhuizen South African Stud Book and Animal Improvement Association japie@studbook.co.za As with many other things in life, the

More information

Crash-course in genomics

Crash-course in genomics Crash-course in genomics Molecular biology : How does the genome code for function? Genetics: How is the genome passed on from parent to child? Genetic variation: How does the genome change when it is

More information

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow

From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow From Variants to Pathways: Agilent GeneSpring GX s Variant Analysis Workflow Technical Overview Import VCF Introduction Next-generation sequencing (NGS) studies have created unanticipated challenges with

More information

Trilateral Project WM4 Comparative studies in new technologies (biotechnology, business methods, etc.)

Trilateral Project WM4 Comparative studies in new technologies (biotechnology, business methods, etc.) Trilateral Project WM4 Comparative studies in new technologies (biotechnology, business methods, etc.) Report on comparative study on Examination Practice Relating to Single Nucleotide Polymorphisms (SNPs)

More information

Gene Mapping in Natural Plant Populations Guilt by Association

Gene Mapping in Natural Plant Populations Guilt by Association Gene Mapping in Natural Plant Populations Guilt by Association Leif Skøt What is linkage disequilibrium? 12 Natural populations as a tool for gene mapping 13 Conclusion 15 POPULATIONS GUILT BY ASSOCIATION

More information

Genomic selection in the Australian sheep industry

Genomic selection in the Australian sheep industry Genomic selection in the Australian sheep industry Andrew A Swan Animal Genetics and Breeding Unit (AGBU), University of New England, Armidale, NSW, 2351, AGBU is a joint venture between NSW Department

More information

Biology 644: Bioinformatics

Biology 644: Bioinformatics Processes Activation Repression Initiation Elongation.... Processes Splicing Editing Degradation Translation.... Transcription Translation DNA Regulators DNA-Binding Transcription Factors Chromatin Remodelers....

More information

SeattleSNPs Interactive Tutorial: Database Inteface Entrez, dbsnp, HapMap, Perlegen

SeattleSNPs Interactive Tutorial: Database Inteface Entrez, dbsnp, HapMap, Perlegen SeattleSNPs Interactive Tutorial: Database Inteface Entrez, dbsnp, HapMap, Perlegen The tutorial is designed to take you through the steps necessary to access SNP data from the primary database resources:

More information

Online Mendelian Inheritance in Man (OMIM)

Online Mendelian Inheritance in Man (OMIM) HUMAN MUTATION 15:57 61 (2000) MDI SPECIAL ARTICLE Online Mendelian Inheritance in Man (OMIM) Ada Hamosh, Alan F. Scott,* Joanna Amberger, David Valle, and Victor A. McKusick McKusick-Nathans Institute

More information

Revised COGNOSAG guidelines for gene nomenclature in ruminants 1998

Revised COGNOSAG guidelines for gene nomenclature in ruminants 1998 Letter to the editor Revised COGNOSAG guidelines for gene nomenclature in ruminants 1998 Thomas E. Broad Charles H.S. Dolling Jean-Jacques Lauvergne c Paul Millar a AgResearch, P.B. 50034, Mosgiel, New

More information

A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING

A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING A WEB-BASED TOOL FOR GENOMIC FUNCTIONAL ANNOTATION, STATISTICAL ANALYSIS AND DATA MINING D. Martucci a, F. Pinciroli a,b, M. Masseroli a a Dipartimento di Bioingegneria, Politecnico di Milano, Milano,

More information

Genome Informatics. Systems Biology and the Omics Cascade (Course 2143) Day 3, June 11 th, Kiyoko F. Aoki-Kinoshita

Genome Informatics. Systems Biology and the Omics Cascade (Course 2143) Day 3, June 11 th, Kiyoko F. Aoki-Kinoshita Genome Informatics Systems Biology and the Omics Cascade (Course 2143) Day 3, June 11 th, 2008 Kiyoko F. Aoki-Kinoshita Introduction Genome informatics covers the computer- based modeling and data processing

More information

Strategy for Applying Genome-Wide Selection in Dairy Cattle

Strategy for Applying Genome-Wide Selection in Dairy Cattle Strategy for Applying Genome-Wide Selection in Dairy Cattle L. R. Schaeffer Centre for Genetic Improvement of Livestock Department of Animal & Poultry Science University of Guelph, Guelph, ON, Canada N1G

More information

EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011

EPIB 668 Genetic association studies. Aurélie LABBE - Winter 2011 EPIB 668 Genetic association studies Aurélie LABBE - Winter 2011 1 / 71 OUTLINE Linkage vs association Linkage disequilibrium Case control studies Family-based association 2 / 71 RECAP ON GENETIC VARIANTS

More information

user s guide Question 3

user s guide Question 3 Question 3 During a positional cloning project aimed at finding a human disease gene, linkage data have been obtained suggesting that the gene of interest lies between two sequence-tagged site markers.

More information

Decoding genomic selection and the benefit for unconventional traits

Decoding genomic selection and the benefit for unconventional traits Decoding genomic selection and the benefit for unconventional traits Pete Hansen Dept. of Animal Sciences University of Florida Change in Breeding Value for Milk Yield in Holsteins 1957-2013 Breeding Value

More information

Applied Bioinformatics

Applied Bioinformatics Applied Bioinformatics Bing Zhang Department of Biomedical Informatics Vanderbilt University bing.zhang@vanderbilt.edu Course overview What is bioinformatics Data driven science: the creation and advancement

More information

The first and only fully-integrated microarray instrument for hands-free array processing

The first and only fully-integrated microarray instrument for hands-free array processing The first and only fully-integrated microarray instrument for hands-free array processing GeneTitan Instrument Transform your lab with a GeneTitan Instrument and experience the unparalleled power of streamlining

More information

BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers

BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers BCHM 6280 Tutorial: Gene specific information using NCBI, Ensembl and genome viewers Web resources: NCBI database: http://www.ncbi.nlm.nih.gov/ Ensembl database: http://useast.ensembl.org/index.html UCSC

More information

Cronicon VETERINARY SCIENCE

Cronicon VETERINARY SCIENCE Cronicon OPEN ACCESS S Mercer 1, N Walker 2 and MD Garcia 1,2 * 1 Schoolof Animal Sciences, Louisiana State University, USA 2 Louisiana State University Ag Center, Baton Rouge, USA VETERINARY SCIENCE Research

More information

Implementing direct and indirect markers.

Implementing direct and indirect markers. Chapter 16. Brian Kinghorn University of New England Some Definitions... 130 Directly and indirectly marked genes... 131 The potential commercial value of detected QTL... 132 Will the observed QTL effects

More information

Molecular and Applied Genetics

Molecular and Applied Genetics Molecular and Applied Genetics Ian King, Iain Donnison, Helen Ougham, Julie King and Sid Thomas Developing links between rice and the grasses 6 Gene isolation 7 Informatics 8 Statistics and multivariate

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools News About NCBI Site Map

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 8

More information

J.P. Gibson The Institute for Genetics and Bioinformatics University of New England Australia

J.P. Gibson The Institute for Genetics and Bioinformatics University of New England Australia APPLICATION OF MOLECULAR GENETIC TECHNOLOGIES FOR GENETIC IMPROVEMENT OF RUMINANTS J.P. Gibson The Institute for Genetics and Bioinformatics University of New England Australia Some applications of genomic

More information

Current Reality of the Ayrshire Breed and the Opportunity of Technology for Future Growth

Current Reality of the Ayrshire Breed and the Opportunity of Technology for Future Growth Current Reality of the Ayrshire Breed and the Opportunity of Technology for Future Growth First Current Reality August 2016 US CDCB-AGIL Genetic Breed Difference From Holstein Breed Milk Fat Protein Productive

More information

Chapter 3: Evolutionary genetics of natural populations

Chapter 3: Evolutionary genetics of natural populations Chapter 3: Evolutionary genetics of natural populations What is Evolution? Change in the frequency of an allele within a population Evolution acts on DIVERSITY to cause adaptive change Ex. Light vs. Dark

More information

Research proposal. Title: Development and Implementation of a Data Model for Pathway Mapping. Student: Linghao Yi

Research proposal. Title: Development and Implementation of a Data Model for Pathway Mapping. Student: Linghao Yi Research proposal Title: Development and Implementation of a Data Model for Pathway Mapping Student: Linghao Yi Supervisors: Kevin Robertson Muriel Mewissen Peter Ghazal Douglas Armstrong (GTI), (GTI),

More information

KnetMiner USER TUTORIAL

KnetMiner USER TUTORIAL KnetMiner USER TUTORIAL Keywan Hassani-Pak ROTHAMSTED RESEARCH 10 NOVEMBER 2017 About KnetMiner KnetMiner, with a silent "K" and standing for Knowledge Network Miner, is a suite of open-source software

More information

10. BIOTECHNOLOGY (Code No. 045)

10. BIOTECHNOLOGY (Code No. 045) 10. BIOTECHNOLOGY (Code No. 045) An unprecedented growth of human knowledge in the field of Biological Sciences coupled with equally significant developments in the field of technology have brought significant

More information

Validating Genetic Markers

Validating Genetic Markers Validating Genetic Markers Dick Quaas, Cornell University Mark Thallman, USMARC Alison Van Eenennaam, UC Davis Cooperative Extension Specialist Animal Biotechnology and Genomics alvaneenennaam@ucdavis.edu

More information

From Genotype to Phenotype

From Genotype to Phenotype From Genotype to Phenotype Johanna Vilkki Green technology, Natural Resources Institute Finland Systems biology Genome Transcriptome genes mrna Genotyping methodology SNP TOOLS, WG SEQUENCING Functional

More information

Addressing the Phenomic Gap: Next generation genomic tools in beef

Addressing the Phenomic Gap: Next generation genomic tools in beef Addressing the Phenomic Gap: Next generation genomic tools in beef Basarab, Aalhus, Stothard, Wang, Crews, Dixon, Moore, Plastow Deseret Ranches of Alberta The Canadian beef cattle industry faces severe

More information

GS Analysis of Microarray Data

GS Analysis of Microarray Data GS01 0163 Analysis of Microarray Data Keith Baggerly and Brad Broom Department of Bioinformatics and Computational Biology UT M. D. Anderson Cancer Center kabagg@mdanderson.org bmbroom@mdanderson.org 7

More information

Ecological genomics and molecular adaptation: state of the Union and some research goals for the near future.

Ecological genomics and molecular adaptation: state of the Union and some research goals for the near future. Ecological genomics and molecular adaptation: state of the Union and some research goals for the near future. Louis Bernatchez Genomics and Conservation of Aquatic Resources Université LAVAL! Molecular

More information

Overview. 50,000 Markers on a Chip. Definitions. Problem: Whole Genome Selection Project Involving 2,000 Industry A.I. Sires

Overview. 50,000 Markers on a Chip. Definitions. Problem: Whole Genome Selection Project Involving 2,000 Industry A.I. Sires Whole Genome Selection Project Involving 2,000 Industry A.I. Sires Mark Thallman, Mark Allan, Larry Kuehn, John Keele, Warren Snelling, Gary Bennett Introduction Overview 2,000 Bull Project Training Data

More information

Searching for mutations in pigs using the human genome

Searching for mutations in pigs using the human genome Animal Industry Report AS 651 ASL R2024 2005 Searching for mutations in pigs using the human genome Laura Grapes Iowa State University Stephen Rudd Centre for Biotechnology Rohan L. Fernando Iowa State

More information

University of York Department of Biology B. Sc Stage 2 Degree Examinations

University of York Department of Biology B. Sc Stage 2 Degree Examinations Examination Candidate Number: Desk Number: University of York Department of Biology B. Sc Stage 2 Degree Examinations 2016-17 Evolutionary and Population Genetics Time allowed: 1 hour and 30 minutes Total

More information

ROAD TO STATISTICAL BIOINFORMATICS CHALLENGE 1: MULTIPLE-COMPARISONS ISSUE

ROAD TO STATISTICAL BIOINFORMATICS CHALLENGE 1: MULTIPLE-COMPARISONS ISSUE CHAPTER1 ROAD TO STATISTICAL BIOINFORMATICS Jae K. Lee Department of Public Health Science, University of Virginia, Charlottesville, Virginia, USA There has been a great explosion of biological data and

More information

2. Materials and Methods

2. Materials and Methods Identification of cancer-relevant Variations in a Novel Human Genome Sequence Robert Bruggner, Amir Ghazvinian 1, & Lekan Wang 1 CS229 Final Report, Fall 2009 1. Introduction Cancer affects people of all

More information

Wheat Genomic Resources in a post reference sequence era: a road map to establish priorities, timelines and outcomes

Wheat Genomic Resources in a post reference sequence era: a road map to establish priorities, timelines and outcomes Wheat Genomic Resources in a post reference sequence era: a road map to establish priorities, timelines and outcomes Prospects in Wheat Genomics -- what is needed from a research perspective Rudi Appels,

More information

Basic Concepts of Human Genetics

Basic Concepts of Human Genetics Basic Concepts of Human Genetics The genetic information of an individual is contained in 23 pairs of chromosomes. Every human cell contains the 23 pair of chromosomes. One pair is called sex chromosomes

More information

Practical Application of DNA Markers for Genetic Improvement

Practical Application of DNA Markers for Genetic Improvement Practical Application of DNA Markers for Genetic Improvement G. Plastow 1, S. Sasaki 2, T-P. Yu 3, N. Deeb 3, G. Prall 2, K. Siggens 4 and E. Wilson 2 Summary 1 Sygen International, Kingston Bagpuize,

More information

Genetic Evaluations. Stephen Scott Canadian Hereford Association

Genetic Evaluations. Stephen Scott Canadian Hereford Association Genetic Evaluations Stephen Scott Canadian Hereford Association Canadian Hereford Association Herefords were first imported into Canada in 1860 The CHA was incorporated in 1902 under the Government of

More information

Genetic databases. Anna Sowińska-Seidler, MSc, PhD Department of Medical Genetics

Genetic databases. Anna Sowińska-Seidler, MSc, PhD Department of Medical Genetics Genetic databases Anna Sowińska-Seidler, MSc, PhD seidler@ump.edu.pl Department of Medical Genetics Genetic databases what to start with? www.ncbi.nml.nih.gov NCBI National Center for Biotechnology Information

More information

Frumkin, 2e Part 1: Methods and Paradigms. Chapter 6: Genetics and Environmental Health

Frumkin, 2e Part 1: Methods and Paradigms. Chapter 6: Genetics and Environmental Health Frumkin, 2e Part 1: Methods and Paradigms Chapter 6: Genetics and Environmental Health Genetics Genetics, the study of individual genes, has expanded to include genomics, which is the study of all the

More information

Data representation for clinical data and metadata

Data representation for clinical data and metadata Data representation for clinical data and metadata WP1: Data representation for clinical data and metadata Inconsistent terminology creates barriers to identifying common clinical entities in disparate

More information

ECONOMICS OF USING DNA MARKERS FOR SORTING FEEDLOT CATTLE Alison Van Eenennaam, Ph.D.

ECONOMICS OF USING DNA MARKERS FOR SORTING FEEDLOT CATTLE Alison Van Eenennaam, Ph.D. ECONOMICS OF USING DNA MARKERS FOR SORTING FEEDLOT CATTLE Alison Van Eenennaam, Ph.D. Cooperative Extension Specialist Animal Biotechnology and Genomics University of California, Davis alvaneenennaam@ucdavis.edu

More information

2010 Annual Research Report

2010 Annual Research Report 2010 Annual Research Report 2. Animal Genomics & Bioinformatics Research (1) Construction of Standard Reference Genome Map in Hanwoo The new sequencing technologies for whole genome of an organism have

More information