Human KIR sequences 2003

Immunogenetics (2003) 55:227-239
DOI 10.1007/s00251-003-0572-y

ORIGINAL PAPER

C. A. Garcia · J. Robinson · L. A. Guethlein · P. Parham · J. A. Madrigal · S. G. E. Marsh

Received: 17 March 2003 / Accepted: 18 March 2003 / Published online: 28 June 2003
© Springer-Verlag 2003

C. A. Garcia · J. Robinson · J. A. Madrigal · S. G. E. Marsh (✉)
Anthony Nolan Research Institute, Royal Free Hospital, Pond Street, London, NW3 2QG, UK
e-mail: marsh@ebi.ac.uk
Tel.: +44-20-72848321
Fax: +44-20-72848331

C. A. Garcia · J. A. Madrigal · S. G. E. Marsh
Department of Haematology, Royal Free Hospital, Pond Street, London, NW3 2QG, UK

L. A. Guethlein · P. Parham
Department of Structural Biology and Microbiology & Immunology, Stanford University, California, USA

Abstract We have compiled the nucleotide sequences and their amino acid translations from a total of 89 Killer Immunoglobulin-like Receptor (KIR) alleles, derived from 17 different KIR genes. The alignments use the KIR3DL2*001 allele as a reference sequence. Each of the KIR sequences included in these alignments has been checked and where discrepancies have arisen between reported sequences, the original authors have been contacted where possible, and necessary amendments to published sequences have been incorporated into this alignment. Future sequencing may identify errors in this list and we would welcome any evidence that helps to maintain the accuracy of this compilation.

Keywords Alignments · KIR · Nucleotide · Protein · Sequences

The sequences included in this compilation are taken from the publications listed in the Killer Immunoglobulin-like Receptor (KIR) Nomenclature Committee Report (Marsh et al. 2003). The KIR Nomenclature Committee has officially assigned the names to all the sequences included in these sequence alignments with the exception of two. Details of the officially named sequences, including accession numbers and publication details, can be found in the accompanying Nomenclature Report (Marsh et al. 2003).
The two sequences listed under the names KIR2DL5(KIR2DLXa) (AF271607) and KIR2DL5(KIR2DLXb) (AF271608) have not yet been assigned official names, as it is unclear whether they represent alleles of the KIR2DL5A or KIR2DL5B genes. Each of the KIR sequences included in these alignments has been checked and where discrepancies have arisen between reported sequences, the original authors have been contacted where possible, and necessary amendments to published sequences have been incorporated into this alignment. Future sequencing may identify errors in this list and we would welcome any evidence that helps to maintain the accuracy of this compilation.

In the nucleotide (Fig. 1) and amino acid (Fig. 2) sequence alignments, a total of 89 sequences comprising 14 KIR genes and three pseudogenes have been aligned to the KIR3DL2*001 sequence. KIR3DL2*001 was used as the reference because it is sufficiently long and has a high level of nucleotide identity and structural homology to the majority of the other KIR genes. The KIR sequences were retrieved from the EMBL Nucleotide Sequence Database or GenBank by means of the accession numbers given in the KIR Nomenclature Report (Marsh et al. 2003). The criteria used to group these sequences into genes are the number of immunoglobulin domains, the length of the cytoplasmic tail and sequence homology, as proposed in previous publications (Long et al. 1996; Steffens et al. 1998; Vilches and Parham 2002).

The sequences were compared using a combination of Clustal (Thompson et al. 1994) and manual analysis. The resulting alignments were then passed through an in-house reformatting tool, which allowed us to show identity to a reference sequence and to translate the nucleotide sequences into their corresponding protein sequences (Fig. 2).
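The two reformatting steps described above, showing identity to a reference and translating nucleotides to protein, can be illustrated with a minimal Python sketch. It is not the in-house tool, which is not described in detail; the function names `mask_identity` and `translate` and the toy sequences are hypothetical. The hyphen for identity, the period for gaps, the asterisk for unavailable sequence and the X for stop codons follow the conventions used in the alignments. Emitting a period for an all-gap codon and a question mark for an incomplete codon are assumptions made for this sketch.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG order (stops are "*").
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def mask_identity(reference: str, allele: str, identity: str = "-") -> str:
    """Replace positions identical to the reference with the identity symbol."""
    if len(reference) != len(allele):
        raise ValueError("sequences must already be aligned to the same length")
    masked = []
    for ref_base, base in zip(reference, allele):
        if base in ".*":
            # Gaps (.) and unavailable positions (*) are kept as they are.
            masked.append(base)
        elif base == ref_base:
            masked.append(identity)
        else:
            masked.append(base)
    return "".join(masked)


def translate(aligned_nt: str) -> str:
    """Translate an in-frame aligned nucleotide sequence, codon by codon."""
    protein = []
    for i in range(0, len(aligned_nt) - 2, 3):
        codon = aligned_nt[i:i + 3]
        if codon == "...":
            # Assumption: a whole-codon gap is carried over as one gap symbol.
            protein.append(".")
        else:
            # Assumption: incomplete or unknown codons are shown as "?".
            amino_acid = CODON_TABLE.get(codon, "?")
            protein.append("X" if amino_acid == "*" else amino_acid)
    return "".join(protein)


# Toy sequences for illustration only; these are not real KIR data.
reference = "ATGTCGCTCACG"
allele = "ATGTTGCTC..G"
print(mask_identity(reference, allele))  # ----T----..-
print(translate(reference))              # MSLT
```

Masking is done position by position on sequences that are already aligned, so the sketch assumes the Clustal and manual alignment steps have been completed beforehand.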
In the alignments, exon-intron boundaries and protein domains are marked with a pipe (|), asterisks (*) indicate positions where sequence is unavailable, and periods (.) indicate an insertion or deletion. Identity to the reference sequence KIR3DL2*001 is shown by a hyphen (-). In the amino acid alignment, stop codons are indicated by an X. Minimum gaps in the sequence, indicated by a period (.), have been inserted to maintain the alignment between alleles of differing length, in such a way as to maintain the reading frame. The pseudo-exon 3 sequences for type I KIR2Ds have been included in the nucleotide alignment where available. The nucleotides are numbered starting at 1 for the first nucleotide of the codon for the initiation methionine. The numbering of the codons of the mature protein, after cleavage of the signal sequence, begins at +1, while the signal sequence is numbered backwards from -1.

Fig. 1 Alignment of KIR nucleotide sequences

Fig. 2 Alignment of KIR amino acid sequences

These alignments include KIR genes and alleles for which complete cDNA sequences or full genomic sequences are available. Alternative splice variants fall outside the scope of this publication, and partial cDNA sequences will not be included until further information on them becomes available.

These sequences are also available from the IPD/KIR Database, http://www.ebi.ac.uk/ipd/kir. The database provides an online repository for the KIR sequences officially named by the KIR Nomenclature Committee, together with online versions of the sequence alignments and the nomenclature reports. In time it is envisaged that the website will also provide tools for the submission of new and confirmatory KIR sequences to the KIR Nomenclature Committee. The IPD/KIR Database is part of the Immuno Polymorphism Database (IPD), which provides a suite of tools and databases for the study of polymorphisms in the immune system.

Acknowledgements We would like to thank Peter Stoehr and the staff of the European Bioinformatics Institute for their support of the IPD/KIR Database.

References

Long EO, Colonna M, Lanier LL (1996) Inhibitory MHC class I receptors on NK and T cells: a standard nomenclature. Immunol Today 17:100

Marsh SGE, Parham P, Dupont B, Geraghty DE, Trowsdale J, Middleton D, Vilches C, Carrington M, Witt C, Guethlein LA, Shilling H, Garcia CA, Hsu KC, Wain H (2003) Killer Immunoglobulin-like Receptor (KIR) Nomenclature Report. Immunogenetics DOI 10.1007/s00251-003-0571-z

Steffens U, Vyas Y, Dupont B, Selvakumar A (1998) Nucleotide and amino acid sequence alignment for human killer cell inhibitory receptors (KIR), 1998. Tissue Antigens 51:398-413

Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673-4680

Vilches C, Parham P (2002) KIR: diverse, rapidly evolving receptors of innate and adaptive immunity. Annu Rev Immunol 20:217-251