Simplicity, Complexity and Modelling in Clinical Trials

Similar documents
Response, quality and variation: what drug development may be missing

Heterogeneity Random and fixed effects

Alternative Trial Designs

On being Bayesian Namur 13 October 2016

Missing Data Handling in Non-Inferiority & Equivalence Trials

How To Design A Clinical Trial. Statistical Analysis. Gynecologic Cancer InterGroup

Citation Statistics (discussion for Statistical Science) David Spiegelhalter, University of Cambridge and Harvey Goldstein, University of Bristol We

Session 4: Statistical considerations in confirmatory clinical trials II

Group Sequential Monitoring of Clinical. Trials with Multiple Endpoints. Christopher Jennison, Dept of Mathematical Sciences, University of Bath, UK

University of Bath, UK

Model Selection, Evaluation, Diagnosis

GEN-003. Positive Phase 2b Clinical Efficacy Results. Immunotherapy Candidate for Genital Herpes. 12-Month Top-line Results

Bios 6648: Design & conduct of clinical research

Bios 6648: Design & conduct of clinical research

Implementing Current Regulatory Guidance: An Industry Perspective

Appendix 5: Details of statistical methods in the CRP CHD Genetics Collaboration (CCGC) [posted as supplied by

Equipment and preparation required for one group (2-4 students) to complete the workshop

Chapter 9. Regression Wisdom. Copyright 2010 Pearson Education, Inc.

Correlation and Simple. Linear Regression. Scenario. Defining Correlation

Friday, September 12th, :00-11:00 am New Research Building Auditorium Refreshments will be provided at 9:45am. Gang Han, Ph.D.

Lean Strategy Execution

The Role of a Clinical Statistician in Drug Development By: Jackie Reisner

From Professional HR. Evidence-Based People Management & Development. Chapter 4. Educating the professional, evidence-based manager.

BIOSTATISTICAL METHODS

The Application of Survival Analysis to Customer-Centric Forecasting

AP Statistics Scope & Sequence

(Draft) Guideline on multiplicity issues in clinical trials

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

BIOSTATISTICAL METHODS

Facilitating Antibacterial Drug Development: Bayesian vs Frequentist Methods

FDA Perspective on Clinical Trial Design for Rare Diseases

Adaptive Trials and Trial Simulation. Kert Viele CTMC 2018

Department of Economics, University of Michigan, Ann Arbor, MI

Clinical Study Synopsis for Public Disclosure

Capability on Aggregate Processes

An adaptive design for dose-response using the Normal Dynamic Linear Model (NDLM)

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

David M. Rocke Division of Biostatistics and Department of Biomedical Engineering University of California, Davis

Evaluating Diagnostic Tests in the Absence of a Gold Standard

By the end of this lecture you should be able to explain: Some of the principles underlying the statistical analysis of QTLs

Practical Regression: Fixed Effects Models

ACC: Review of the Weekly Compensation Duration Model

Cost Accounting The Foundation of Management Control

Fritz has the test management software on the screen, and a scribbled requirements document in his hand. Verify that Pass/Fail

Kenneth J. Wilkins & Elizabeth C. Wright Biostatistics Program, Office of the Director NIDDK/NIH

Model-based Designs in Oncology Dose Finding Studies

LCA in decision making

BIOEQUIVALENCE TRIAL INFORMATION FORM (Medicines and Allied Substances Act [No. 3] of 2013 Part V Section 39)

Introduction to QTL mapping

Non-Inferiority Trials: What are they and why are they so difficult?

BIOSTATISTICS AND MEDICAL INFORMATICS (B M I)

Non-parametric optimal design in dose finding studies

An Application of Categorical Analysis of Variance in Nested Arrangements

BIOSTATISTICAL METHODS FOR TRANSLATIONAL & CLINICAL RESEARCH

Statistical Competencies for Clinical & Translational Science: Evolution in Action

Visions on the role of Statisticians in the Pharmaceutical Industry. Merete Jørgensen President EFSPI VP Biostatistics, Novo Nordisk A/S

DOES BENTON COUNTY, OREGON, NEED A NEW JAIL? Version Three 1 By John H. Detweiler

ECONOMICS 103. Topic 2: Specialization & Trade

Evolve or die: the urgent need to streamline randomized trials

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions

Draft agreed by Scientific Advice Working Party 5 September Adopted by CHMP for release for consultation 19 September

Bayesian Designs for Early Phase Clinical Trials

Biotech Patents in Europe

MANY important decisions are based on measurement results. Some of. Measurement Uncertainty. Chapter 1

Examination of Cross Validation techniques and the biases they reduce.

Disclaimer This presentation expresses my personal views on this topic and must not be interpreted as the regulatory views or the policy of the FDA

Indirect Comparisons in Economic Evaluations: Experience from the Common Drug Review Julie Blouin, Karen M Lee

SESSION 810 Friday, April 15, 10:15am - 11:15am Track: Industry Insights. Do What You Do Best, Every Day! Session Description. Speaker Background

VQA Proficiency Testing Scoring Document for Quantitative HIV-1 RNA

OPTIMIZATION AND CV ESTIMATION OF A PLATE COUNT ASSAY USING JMP

Page 78

How to Categorize Operational Losses?

Understanding Simulations and Their Value in Clinical Trial Planning. William Meurer, MD, MS Scott Berry, PhD

Chapter 12. Sample Surveys. Copyright 2010 Pearson Education, Inc.

Animal Research Ethics Committee. Guidelines for Submitting Protocols for ethical approval of research or teaching involving live animals

Improve Alerting Accuracy

PROJECT EVALUATION A PRACTICAL GUIDE. Part One: Building Evaluation into Project Planning

How to Randomize? Dean Karlan Yale University. povertyactionlab.org

TickITplus Implementation Note

Is Confirmatory PK/PD Modeling Possible?

Conducting Legal Performance Appraisals. Leader s Guide

8. Keys to effective decision-making

MARK SCHEME for the May/June 2010 question paper for the guidance of teachers 9772 ECONOMICS. 9772/02 Paper 2 (Essays), maximum raw mark 75

Use of modified intention to treat analysis in studies of direct oral anticoagulants and risk of selection bias.

APPRAISAL OF CLINICAL TRIALS. Su May Liew CEBM Oxford

Chapter 8 Script. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts.

Proof of Concept Vs Proof of Value

More-Advanced Statistical Sampling Concepts for Tests of Controls and Tests of Balances

Evaluation of Diagnostic Agents: a SAS Macro for Sample Size Estimation Using Exact Methods

The combination of factors and their levels describe the battlespace conditions.

BIOEQUIVALENCE TRIAL INFORMATION

Campaigns - 5 things you need to know. 27 Signs You Need A New Agency. What the AdWords Update Means for Your Paid Search Strategy

The CONSORT Statement II

Bios 6648: Design & conduct of clinical research

How to reduce bias in the estimates of count data regression? Ashwini Joshi Sumit Singh PhUSE 2015, Vienna

Industrial Engineering Prof. Inderdeep Singh Department of Mechanical and Industrial Engineering Indian Institute of Technology, Roorkee

Economic evaluations in cancer clinical trials

Performance criteria

Technology appraisal guidance Published: 22 January 2014 nice.org.uk/guidance/ta303

Transcription:

Simplicity, Complexity and Modelling in Clinical Trials Stephen Senn (c) Stephen Senn 1

Acknowledgements This work is partly supported by the European Union s 7th Framework Programme for research, technological development and demonstration under grant agreement no. 602552. IDeAl However, some of the ideas grew out of work on the EPSRC Simplicity Complexity and Modelling project EP/E018173 (c) Stephen Senn 2

An apology Although I have worked for many years on clinical trials, my only involvement with cancer has been occasional membership of data safety monitoring boards Thus, my examples are not taken from trials in cancer The relevance (or not) to cancer of anything I have to say I leave for others to judge (c) Stephen Senn 3

Outline Examples where we have a considerable gain by increasing complexity Examples where we do better to be simple Examples where more complex designs and modelling can teach us to be simpler Recommendations and conclusions (c) Stephen Senn 4

General thesis: as complex as necessary but no more For complexity Complex allocation needs complex analysis Baseline covariates carry useful information Some apparently simple transformations mislead and destroy information and should be avoided For simplicity Some complex models have unfortunate side-effects Overfitting can reduce capacity to predict Complex models can hide dangerous implicit assumptions (c) Stephen Senn 5

For complexity Where we lose information by being too simple (c) Stephen Senn 6

Failure to allow for design in analysis Survey by Kahan and Morris 2012 Survey of trials published in BMJ, JAMA, NEJM & Lancet in 2010 Kahan and Morris, BMJ, 2012 (c) Stephen Senn 7

Findings as regard analysis Balancing by centre Strategy Number Percent Adjusted in primary analysis 31 26% Adjusted in secondary analysis 4 3% Did not adjust 68 57% Unclear 17 14% Total 120 100% Balancing by prognostic factors Strategy Number Percent Adjusted for all in primary analysis 40 36% Adjusted for some in primary analysis 4 4% Adjusted in secondary analysis 10 9% Didnot adjust 45 41% Unclear 12 11% Total 111 100% Did not adjust + unclear = 71% Did not adjust + unclear = 52% (c) Stephen Senn 8

In summary K & M Found fewer than 50% of clinical trials (in leading journals) that balanced by centre or prognostic factor declared that the main analysis took account of this An opportunity for increasing efficiency is being missed For linear models, the standard errors will be larger than they should be For non-linear models, effective treatments will have estimates biased towards the null This is something of a scandal (c) Stephen Senn 9

Change from baseline Waste in the name of simplicity A common habit for true baselines is to use them to construct a change-score by simple subtraction Assuming equal variances at baseline and outcome this increases the variance unless the correlation is greater than 0.5 Analysis of covariance (invented 1931) is (asymptotically) better than either raw scores or change scores (c) Stephen Senn 10

Responder dichotomies Criminal waste in the name of simplicity Responder dichotomies compound the change score crime by replacing them by a binary score If the cut-point is at the median, the sample size must be increased by 100 π 2 1 57% For other cut-points it is worse Encourages false belief in causal differences between responders and non-responders To be picked up later (c) Stephen Senn 11

For simplicity Where we lose information by being too complex (c) Stephen Senn 12

Repeated measures analysis Much repeated measures analysis using mixed models could be simply replaced by a summary measures approach This will often be nearly as efficient In fact for simple correlation structures and complete data will be fully efficient Helps understanding Furthermore some repeated measures approach implicitly violate intention to treat. See Senn, Stevens and Chaturvedi, 2000 Bamia, White, Kenward, 2013 (c) Stephen Senn 13

How analysis of repeated measures can violate ITT What everybody agrees is unacceptable It is common advice that you should not correct for post randomisation covariates For example, measures of the form Y 3 -Y 1 instead of Y 3 -Y 0 What everybody assumes is fine Ordinary least squares estimates of slopes But suppose that you have three post randomisation measurements at equal intervals The ordinary least squares measure of the slope is (Y 3 -Y 1 )/2 (c) Stephen Senn 14

An example of the wrong sort of complexity: Thall and Vail, 1990 Repeated seizures for 59 patients over 4 two week periods Compares two treatments Placebo and progabide Has been cited 461 times by 2017 according to Google Scholar Cites Leppik et al (1985) for the data Seven different covariance models proposed by Thall & Vail Very many different models proposed since, the predicted mean seizure rate for the progabide group is either higher or lower than that for the placebo group, accordingly as the baseline count does or does not exceed a critical threshold. This suggests that progabide may be contraindicated for patients with high seizure rates. Thall and Vail p666 (c) Stephen Senn 15

But data and what they mean are important The data are not from Leppik et al 1985 but 1987 It should be obvious from the patient numbers that it is a two centre trial, but nobody appears to notice this The division into four two-week periods has no clinical meaning whatsoever Modelling this as a correlated series is pointless and possible misleading (c) Stephen Senn 16

In my view the analysis of data as four visits is pointless and we might as well analyse the totals. Leppik et al. (1987), using all the data of the original crossover trial, found no convincing evidence of a treatment effect and I am suspicious of any analyses of the first-period data only, including those of Lee and Nelder and Thall and Vail (1990), that do. Fitting total seizures as a function of centre, age and base-line seizure in addition to treatment using either Poisson regression and allowing for overdispersion or a negative binomial model, or using the square root of the number of seizures in a linear model, I find no convincing evidence of a treatment effect. (c) Stephen Senn 17

Increasing the number of covariates in a linear model Adding predictive covariates to a model makes the residual error smaller But it makes the design matrix somewhat less well-conditioned Second order efficiency is also affected Fewer degrees of freedom for estimating the error variance Less favourable t-distribution for confidence intervals Eventually as we add covariates we lose Problem in small trials FP7 HEALTH 2013-602552 18

Complexity that may yield simplicity How being complex can sometimes yield insights that lead to simplicity (c) Stephen Senn 19

A complex design in asthma Dose Formulation of Formoterol ISF MTA Nothing 0 µg Placebo 6 µg ISF6 MTA6 12 µg 1SF12 MTA12 24 µg ISF24 MTA24 Senn, S. J., Lillienthal, J., Patalano, F., & Till, M. D. (1997). An incomplete blocks cross-over in asthma: a case study in collaboration. In J. Vollmar & L. A. Hothorn (Eds.), Cross-over Clinical Trials (pp. 3-26). Stuttgart: Fischer. Parallel assay Cross-over Incomplete blocks Seven treatments Five periods Twenty-one sequences Forced expiratory volume in one second (FEV 1 ) 18 time-points over 12 hours Log AUC of FEV 1 as main outcome (c) Stephen Senn 20

Results Perfect dose response 6µg, 12µg, 24µg within each formulation Big surprise is complete separation of formulations Formulations not at all equivalent MTA 24µg appears to be less potent than ISF 6µg (c) Stephen Senn 21

Analysis of Contrasts ISF (Reference) MTA (Test) Contrast 6 12 24 6 12 24 Placebo Active 1/6 1/6 1/6 1/6 1/6 1/6-1 Formulation -1/3-1/3-1/3 1/3 1/3 1/3 0 Dose -1/2 0 1/2-1/2 0 1/2 0 Parallelism -1 0 1 1 0-1 0 Curvature -1 2-1 -1 2-1 0 Opposing curvature -1 2-1 1-2 1 0 (c) Stephen Senn 22

Implications As regards comparing formulations The formulations are clearly not equipotent The difference between formulations is a great as the difference between doses But the model adequacy contrasts are all non-significant Linear (in the log dose) appears to work A careful complicated design killed the new formulation But there is more The fact that patients have been measured many times enables us to say something about individual response Consider a common (very stupid) definition of response 15% increase in FEV 1 above baseline Now look at responders 12 hours after treatment for two of the formulations (c) Stephen Senn 23

The case for personalised medicine There seem to be a number of patients who respond to B and not to A and vice versa Clearly if we can find predictive characteristics of them we can improve treatment Next stop, precision medicine (c) Stephen Senn 24

The case against personalised medicine A is ISF 12µg, the second most potent of the six formulations and doses tested B is ISF 24µg the most potent of the six formulations and doses tested It is biologically extremely implausible that patients could respond to 12µg and not to 24µg Yet apparently 8 out of 71 patients did Conclusion: naïve simple views of causality and response aren t good enough and more complex design and analysis is needed (c) Stephen Senn 25

Responder analysis is the work of the devil Any statistician who collaborates in this crime deserves to languish in data-analysis hell (c) Stephen Senn 26

Example of Atrial Fibrillation Such patients are at higher risk of stroke Meta-analysis (reproduced in Hart et al 2007)concluded that warfarin has a beneficial protective effect But there is a risk of intracranial bleeding Who should get warfarin? (c) Stephen Senn 27

(c) Stephen Senn 28

(c) Stephen Senn 29

The line gives the prediction if the common log-odds ratio estimate is applied to the control group rate Estimate based on 6 v 3 cases only (c) Stephen Senn 30

Recommendations and conclusions Reminding ourselves why we do this (c) Stephen Senn 31

Conclusions The appropriate degree of complexity is a matter of judgement The key to getting the right degree is maintaining a sense of purpose Does the complexity reflect pharmacology etc to the degree needed? Have we followed through: analysis that reflects design and design that serves the analysis needed? Are we doing it to increase our understanding of the effects of treatment? Are we being complex in the right places Elaborate models of responder dichotomies are pointless Measurement matters (c) Stephen Senn 32

Recommendation Always ask yourself this: Am I really interested in finding out about the effects of treatment? (c) Stephen Senn 33