Open Data Open Access to Scientific Data

Similar documents
University of Michigan Eco-Driving Index (EDI) Latest data: August 2017

Chapter 6 Planning and Controlling Production: Work-in-Process and Finished-Good Inventories. Omar Maguiña Rivero

Local 2110 Instructions for Web Time Entry

A how-to session featuring a behind-the-scenes look at how Best-in-Show winner, HCSS, decides when to outsource and when to create from within

COURSE LISTING. Courses Listed. with Customer Relationship Management (CRM) SAP CRM. 15 December 2017 (12:23 GMT)

Electric Forward Market Report

OPTIMISATIONS IN THE USE OF DIGITAL DOCUMENT MANAGEMENT FOR COLLABORATIVE ENGINEERING DESIGN. deliver projects better, faster, together

Woking. q business confidence report

Administration Division Public Works Department Anchorage: Performance. Value. Results.

EFSA Strategy 2020 AF meeting. Luxembourg, 8 9 December 2015

TL 9000 Quality Management System. Measurements Handbook. BRR Examples

Traffic Department Anchorage: Performance. Value. Results.

Reconciliation of produced grade and tonnage

Raynet Software Lifecycle

Hetch Hetchy Water and Power

2017 KEY INSIGHTS ON. Employee Attendance and Tardiness

Revised SMIS+ implementation plan. 4 th August 2015 SMIS+ Team

Project Risk Management Bootcamp. Contents are subject to change. For the latest updates visit

Raynet Software Lifecycle

YOU WON T FALL. WITH RUN CHARTS

The Statutes of Upper Canada and the Province of Canada, 1792 to 1866

Migrating to Rose & Cylc ISENES2 Workshop on Workflow Solutions in Earth System Modelling

Lawrence Ludlow. for Agile Tour Toronto

Traffic Division Public Works Department Anchorage: Performance. Value. Results.

CDASH Clinical Data Acquisition Standards Harmonization

Financial Intelligence NCIA 2016 National Training Conference Pittsburg April 18 th & 19 th

Understanding Usage Statistics and Its Implications on Collection Management

CI User Engagement Activities and Schedule Nov Apr 2013

PI SERVER 2015 AND FUTURE DATA

Draft Refined Incremental Proposal GAC WS 2

Recalculation of Seasonal Adjustment Factors

The monthly variation of the Business Turnover 1 stands at 1.6%, after seasonal and calendar adjustment

2016/SOM1/SCSC/025 Agenda Item: 6.2

5 Star London Hotels - Example Report

Department of Medicine Faculty Mentoring Plan

Groundwater Discharge Permit Application ~ Part A Business Information

HRA User Satisfaction Report

IT tool developments: R4BP 3 and Summary of Product Characteristics (SPC) editor

Continuous Data Quality Improvement Process King County Continuum of Care

National Reference Laboratory Quality Assurance Dashboard. Quality Improvement Metrics Q2 2017

Puget Sound Green Infrastructure Summit 2017

FINANCE AND STRATEGY PRACTICE CFO EXECUTIVE BOARD. Safeguarding Supply. Protecting the Enterprise from Unforeseen Supply Chain Risks

Air quality sensing in europe

Are you prepared to make the decisions that matter most? Decision making in banking & capital markets

UCMR 4. What will be its challenges and costs?

The Certified Business Analyst Professional. Contents are subject to change. For the latest updates visit

The Role of Data in Nonprofit Storytelling Strategy? Presenter: Kerri VanMeveren Amazing Traditions, LLC

Eco-Impact Evaluator Project for ICT Equipment

Peter Randall Solar Kingdom Ltd

Business plan toolkit

CO2, SO2 and NOX Emission Rates. March 15, 2018

AUSTIN AREA SUMMARY REPORT ON BUSINESS February 2018

California Independent System Operator Corporation. California ISO. Import resource adequacy. Department of Market Monitoring

Cotton Update. Hosted by United States Fashion Industry Association (USFIA) and Cotton Incorporated

Safety Document (SD) COMPLIANCE MONITORING. Executive Management Team (EMT), Board of Directors.

Young Professionals Award

UBT Performance Tracking Tool

Minimum Information About a Microarray Experiment (MIAME) Successes, Failures, Challenges

European Freight Forwarding Index

Finch Drinking Water System O. Reg 170/03 Schedule 22 - Summary Report for Municipalities

Department of Transportation Rapid City Region Office 2300 Eglin Street P.O. Box 1970 Rapid City, SD Phone: 605/ FAX: 605/

Open Metrics. Open Metrics. Martin Fenner Technical Lead Article-Level Metrics Public Library of Science

INDUSTRIAL SURVEY. Entrepreneurs show optimism in December. Expectations indices Diffusion index (0-100 points)*

Managing Your WiP By Larry Maccherone

COURSE LISTING. Courses Listed. Training for Applications with Integration in SAP Business One. 27 November 2017 (07:09 GMT) Advanced

K.Arokiam. Innovative ICT and Knowledge Sharing Platforms for. Challenges. Global Consultation Global Consultation on

High Impact Internal Audit Leadership. Contents are subject to change. For the latest updates visit

CIP Routine/Small Purchasing Team Close-out

Managing Project Portfolio. Contents are subject to change. For the latest updates visit

Formalize Corporate Social Responsibility Information Disclosure System and Improve Transparency (also through the supply chain reporting practices)

Audit Program Test of Controls and Transactions for Sales and Cash Receipts

Heat Pump Field Trials and Implications for Design

Chain Linking and Annual Update of Weights for Producer Price Indices for Services in Italy

City of San Clemente Water Usage Report

TABLE B 59. Manufacturers new and unfilled orders, [Amounts in millions of dollars; monthly data seasonally adjusted]

1 PEW RESEARCH CENTER

San Francisco Housing Authority (SFHA) Leased Housing Programs as of January 31, 2016

Finding the Right Tool for your Purpose. Using Data to Show Improvement and the Need for Improvement

IPRO 355. Enhanced Vision System for Construction Safety

CT Regulation: EMA role

Phase 1 Benzene Studies Quick Update

Compliance Training and the Board

Climate Change & Urbanization Have Changed River Flows in Ontario

Strategic publication planning. Margaret Haugh MediCom Consult

Cattle Outlook. January, 2018

T2-T2S Consolidation T2-T2S Consolidation

Campus Solutions 9.2 Upgrade Functional Testing Kickoff. panthersoft.fiu.edu

Storage as a Transmission Asset:

Meeting System Needs with Demand Response - Work Plan and Technical Panel Timelines

Agricultural Producers. Price Index. Hotel Price Index

Workday Network. Patti Rowe Julie Vlier Jane Zbyszynski

Source Water Protection Integrating with Existing Watershed and Water Management Frameworks

HKU announces 2017 Q4 HK Macroeconomic Forecast

The Digital Economy and the state of play in national accounting

Using etechnologies to Increase Efficiency and Quality in Regulatory Operations

Company: Rockwell Collins Industry: Aerospace & Defense Category: Technology

DATE: JANUARY 24, 2018 HERITAGE VALLEY POLICY ADVISORY COMMITTEE (HVTAC) HEATHER MILLER, TRANSIT PLANNER KEY PERFORMANCE INDICATORS (KPI) REPORT

Florida Courts E-Filing Portal Update. September 2016

Rainwater tank study of new homes

Seed potato physiological age and crop establishment

Transcription:

Open Data Open Access to Scientific Data The perspective of a researcher Prof. Dr. Tom Coenye Tom.Coenye@UGent.be

Open Data Shared Data Not necessarily the same! Data deposition (open/shared) Data available in Supporting Information Files (open/shared but depending on OA status of publication) Data made available upon request (shared) Data available from third party (shared)

Data deposition Data can deposited in an appropriate repository Public repository Other Own repository/website Institutional archive Advantages of using public repositories over others Continued access over time is guaranteed An accession number, DOI, is assigned (~traceability) No cost for researcher Streamlined submission process «Quality label»

Data deposition Data can deposited in an appropriate public repository Generalist repositories accept multiple data types For example Dryad

Data deposition Data can deposited in an appropriate public repository Subject-specific repositories examples from the Life Sciences ArrayExpress or GEO data on gene expression GenBank, EMBL, DDBJ nucleic acid sequence data PDB protein structures

Public data deposition nothing new! Size of GenBank database 1982-2014 (#entries) 200000000 180000000 160000000 140000000 120000000 100000000 80000000 2000000 1500000 1000000 60000000 40000000 20000000 500000 0 0 jun/14 dec/82 nov/84 aug/86 sep/87 Oct 1988 dec/89 Mar 1991 jun/92 jun/93 apr/94 feb/95 dec/95 Oct 1996 aug/97 jun/98 jun/99 apr/00 feb/01 dec/01 Oct 2002 aug/03 jun/04 apr/05 feb/06 dec/06 Oct 2007 aug/08 jun/09 apr/10 feb/11 dec/11 Oct 2012 aug/13 dec/82 nov/84 aug/86 sep/87 Oct 1988 dec/89 Mar 1991 jun/92 jun/93 apr/94 feb/95 dec/95 Oct 1996 aug/97

Public data deposition nothing new! Size of PDB Archive (1975-2014) (#entries)

Why would I share my data? Because I have to Journal University Funding agency Because it increases the visibility of my research

Same story as for OA? Differences between OA and subscription minimal at best Too early to tell.

Why would I share my data? Because it is good scientific practice and discourages QRP and scientific fraud Because it allows re-use of my data By other scientists By other stakeholders (policy makers, ) Brings data that were obtained with public funding in the public domain

Why would I not share my data? Cost (but as far as I know all public repositories in Life Sciences are free of charge for the researcher) Time and effort (!!) Overly complicated systems demanding a lot of time/effort will discourage deposition of data Duplication of efforts should be avoided Competition «I don t want others to be able to re-use my data!» Don t see added value «What s in it for me?»

Public data deposition a guarantee for quality or GIGO? http://mibbi.sourceforge.net/portal.shtml

http://www.rdml.org/miqe.php

http://biofomics.org/

By re editing the original chromatogram data we found that approximately 40% of the grey seal mtdna haplotype sequences posted in GenBank contained errors. a significantly different outcome was observed when using the uncorrected dataset based on the GenBank haplotypes. We therefore suggest disregarding the existing GenBank data and instead using the correct haplotypes reported here. Our study serves as an illustrative example reiterating the importance of quality control through every step of a research project, from data generation to interpretation and submission to an online repository. Errors conducted in any step may lead to biased results and conclusions, and could impact management decisions.

What needs to be deposited and how? ALL data? Raw data? Processed data? Selection of raw and/or processed data? Minimal dataset? PLoS One : the dataset used to reach the conclusions drawn in the manuscript with related metadata and methods, and any additional data required to replicate the reported study findings in their entirety What about biological material? Cell lines, tissues, animals, purified biomolecules,... Mandatory deposition in BRCs? Similar issues likely with chemicals etc.

Open Data Open Access to Scientific Data The perspective of a researcher Prof. Dr. Tom Coenye Tom.Coenye@UGent.be