Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels

Similar documents
Financial inclusion metrics for the real world How a spatial understanding of financial inclusion can contribute to meaningful measurement and impact

Terms of Reference for i2i Credit Pilot Study in Zimbabwe. June 2017

majority, plurality Polls are not perfect Margin of error Margin of error

Chapter 4: Foundations for inference. OpenIntro Statistics, 2nd Edition

International Program for Development Evaluation Training (IPDET)

Chapter 19. Confidence Intervals for Proportions. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Best Practices for Reporting American Community Survey in Municipal Planning Chittenden County Regional Planning Commission, October 2018

Energy Efficiency Impact Study

Davidson Trahaire Corpsych

To provide a framework and tools for planning, doing, checking and acting upon audits

Contact: Version: 2.0 Date: March 2018

8 ways to make your survey useless if you are making any of these mistakes with your surveys, you might as well throw away the data and start all over

Case 2.1. American Idol: A Big Hit for Marketing Research?

Performance criteria

EXPERT REBUTTAL REPORT of HENRY S. FARBER In Connection With. Chen-Oster v. Goldman Sachs July 29, 2014

IT Audit Process. Michael Romeu-Lugo MBA, CISA November 4, IT Audit Process. Prof. Mike Romeu

Campaign Skills Trainer s Guide. Module 6 Message Development Creating Powerful and Persuasive Messages

If you are using a survey: who will participate in your survey? Why did you decide on that? Explain

Suppose we wanted to use a sample of 25 students to estimate the average GPA of all students. Now suppose we choose our sample by random sampling and

Data Collection Instrument. By Temtim Assefa

Capital Regional District Public Participation Framework

Sawtooth Software. Sample Size Issues for Conjoint Analysis Studies RESEARCH PAPER SERIES. Bryan Orme, Sawtooth Software, Inc.

Noel-Levitz Satisfaction-Priorities Surveys Interpretive Guide

Sustainable data collection: Mobile Modes

AP Statistics: Chapter 4.1, Pages Summarized by Lauren Chambers, Logan Wagner, Jack Thompson

audit monitoring 2010

Tutorial Segmentation and Classification

InvestXD: Putting women s investment needs at the centre

ADMINISTRATION OF QUALITY ASSURANCE PROCESSES

Becoming the Value Added Partner: The Evolving Role of the CFO. kpmg.com

Environmental Studies. Course Code Number and Abbreviation. Course Description. Instructional Units and Pacing Plans

Watson-Glaser III Critical Thinking Appraisal (US)

Chapter 8 Project Monitoring and Control It s important to have a schedule, but it s more important to use it

Leading Change is a young, dynamic consultancy focused on strategy execution.

Copyright 2017, Ontario Energy Association

FACT SHEET ON QUANTITATIVE SURVEY METHODOLOGIES APRIL 2011

One in Three Americans (36%) Likely to Use Sharing Economy Services on Vacation This Summer

IBM Kenexa Talent Insights

7 Statistical characteristics of the test

Section E3 Social sector dynamics

Scheduling for Success with Critical Chain. Abstract

How Much Do We Know About Savings Attributable to a Program?

The economic value of improved air quality in urban Africa: a contingent valuation survey in Douala, Cameroon

Noel-Levitz Interpretive Guide Satisfaction-Priorities Surveys Interpretive Guide

CEO S Report. Jacqueline Gold CBE CEO - Ann Summers Ltd

Board Evaluation. Introduction. The Requirement for Board Evaluation. Further Guidance. Regulation at a Glance. Supporting Principles

Introduction. Contents

GENDER PAY GAP REPORTING 2018.

The Financial and Insurance Advisor s Guide to Content Writing

Module - 01 Lecture - 03 Descriptive Statistics: Graphical Approaches

Draft agreed by Scientific Advice Working Party 5 September Adopted by CHMP for release for consultation 19 September

1 PEW RESEARCH CENTER

Gush vs. Bore: A Look at the Statistics of Sampling

Economic and Social Council

The Real Value of Joining a Local Chamber of Commerce

Selecting a Sample Size

Using Natural Language Processing and Machine Learning to Forecast Legislative Outcomes

ISO whitepaper, January Inspiring Business Confidence.

Partnership working across UK Public Services. Dr Ailsa Cook, Outcome Focus

ENERGY, WATER AND CLEAN AIR: WHAT KIND OF LEADERSHIP DO AMERICANS WANT?

PROJECT MANAGEMENT - BUSINESS CASES AND GATEWAYS BY KEN GARRETT

RE: Internal Control Integrated Framework: Guidance on Monitoring Internal Control Systems Discussion Document

Best practice guide for using statistics in communications. These guidelines will cover:

The top five ways to develop an objective, informative white paper

Drafting Election Statements and Reports. A Field Guide for Election Monitoring Groups

Exploring Good Clinical Practice guidance in clinical trials meeting summary

SAMPLING The Local Landscape Fall 2014

BUSS1020. Quantitative Business Analysis. Lecture Notes

Jeffco Public Schools

Nottinghamshire Police Equality Analysis

GUIDANCE ON CHOOSING INDICATORS OF OUTCOMES

Employee Engagement: getting the best from your people A Roevin recruitment guide

Watson Glaser III (US)

Lessons Learnt in the Area of Stakeholder Dialogue to Strengthen National Competencies for Radioactive Waste Disposal. Lecture #1

LECTURE 10: CONFIDENCE INTERVALS II

Embedding equality and diversity into the design and delivery of Higher Apprenticeships

Employment Report for University of Hertfordshire (incorporating diversity analysis)

Statistical Sampling in Healthcare Audits and Investigations

Objectives The objectives behind the preparation of this document, including the list of key principles, are:

Public Opinion on Wind Energy in Alberta and Ontario

Energy savings reporting and uncertainty in Measurement & Verification

Measuring Impact. Advanced Course, July 6 th, 2011 Rachel Glennerster

A12 CALA Policy on Reference Methods Revision 1.4 July 6, 2018

Six Stages for Exec Coach Brokering

Writing a Request for Proposals for an E-learning Solution

Ethical Principles for City Attorneys 1 Adopted October 6, 2005 City Attorneys Department Business Session

Analyzing stakeholders and power to identify advocacy targets. Stakeholder Analysis 2

AP Stats ~ Lesson 8A: Confidence Intervals OBJECTIVES:

Statistical Pay Equity Analyses: Data and Methodological Overview

CHAPTER 21A. What is a Confidence Interval?

Untangling Correlated Predictors with Principle Components

Is Behavioral Energy Efficiency and Demand Response Really Better Together?

Gender Pay Gap Report 2017

The State of the UK Ticketing Industry 2013: Analysing the trend towards direct, online ticket buying

NEPEAN-BLUE MOUNTAINS SHARED OUTCOMES PILOT PROJECT. Project Learnings Document

Dynamic Pricing Works in a Hot, Humid Climate

MARKETING RESEARCH PROCESS. RESEARCH DESIGN SECONDARY DATA RESOURCES.

Chapter 8 Script. Welcome to Chapter 8, Are Your Curves Normal? Probability and Why It Counts.

ISPC Commentary on the resubmission of the proposal CRP3.6: Dryland cereals (Revision of February 2012)

Citation Statistics (discussion for Statistical Science) David Spiegelhalter, University of Cambridge and Harvey Goldstein, University of Bristol We

Transcription:

Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels October 2017 Established by Technical Note

Authors Grant Robertson and Bobby Berkowitz, Data4FI, insight2impact About insight2impact Insight2impact i2i is a resource centre that aims to catalyse the provision and use of data by private and public-sector actors to improve financial inclusion through evidence-based, data-driven policies and client-centric product design. i2i is funded by the Bill & Melinda Gates Foundation in partnership with The MasterCard Foundation. For more information: Visit our website at www.i2ifacility.org. Email Mari-Lise du Preez (i2i s Partnerships Manager) at mari-lise@i2ifacility.org. Call us on +27 21 913 9510.

Contents Introduction 2 Error reporting approach and tool 3 Competing estimates 3 Tracking change 3 Confidence intervals as the appropriate tool 3 Confidence intervals in practice 4 What to create confidence intervals for 4 Some important concepts 5 Caveats in interpreting the MoE 6 What affects the size of the MoE? 6 Calculating MoE on sub-groups 6 Practical application 7 Conclusion 10 Bibliography 11 I 1

Introduction This note introduces a tool and approach to assist financial inclusion users in better understanding, interpreting and using their data. Adoption of this tool will enable the entire financial inclusion community to produce better research. This will increase credibility among users, as well as onlookers. Financial inclusion users often have to choose between the reported figures of rival surveys. They also often want to understand whether and how the financial inclusion situation is changing. Users may take the figures at face value. Doing this may create confusion and lead to inappropriate interpretation and interventions. Researchers need to ask whether the differing estimates from different surveys are really different and whether they are simply from an acceptable range of estimates. 2 I Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels

Error reporting approach and tool Competing estimates Findex is a World Bank published survey of financial inclusion estimates in many countries around the world. There are other syndicated studies such as FII. Many countries also do their own bespoke demand-side surveys of financial inclusion. Competing surveys lead to competing estimates of financial inclusion indicators. When these differ and users do not have the tools to compare the differences, users may lose trust. This undermines reporting on financial inclusion in the first place. The question researchers need to ask is whether the differing estimates from different surveys are really different and whether they are simply from an acceptable range of estimates. Tracking change Financial inclusion surveys are repeated to understand how key indicators move over time. This provides policymakers and other users with targets to set, as well as feedback on their initiatives. Users of this data look for trends in changes. These changes do not always move in the direction expected. It is important for users to understand that these unexpected changes may not be due to underlying phenomenon but are often due to the nature of survey data and implicit random error in survey estimates of underlying phenomena. The same principle applies here: researchers need to ask whether the differing estimates over time are really different and whether they are simply from an acceptable range of estimates. If the estimates are truly different, there will be no overlap in the confidence intervals. Confidence intervals as the appropriate tool There is a large debate in the scientific research community around the efficacy of significance testing and the over-reliance on the p-value to make claims about important scientific findings. There have been calls to scrap significance testing completely or to reduce the standard p-value from 0.05% to 0.01%. There have also been calls to consider alternative approaches such as reporting: Effect sizes Bayesian estimates Likelihoods Confidence intervals 1 We feel that the use of confidence intervals is most appropriate for financial inclusion surveys, as very little significance testing is conducted and results are usually presented as point estimates, which can be misleading. As such, this note will provide detail on this method. When results of financial inclusion surveys are reported on, they often include statements like 55% of the population have a bank account. An estimate from a survey like this is unlikely to exactly equal the true population quantity of interest due to sampling and non-sampling error. It is thus more accurate to think about an estimate as an interval as opposed to an exact point estimate. 1 https://www.vox.com I 3

Confidence intervals in practice In this note, we are interested in sampling error and how it can be quantified and used to report financial inclusion estimates more accurately. In a probability survey, every unit in the population has some known positive probability of being selected for the sample, and the probability of any particular sample being chosen can be calculated. It is this property that allows us to calculate the extent of sampling error present in our estimates. This estimate of the error is often called the Confidence Interval or Margin of Error (MoE). The Confidence Interval allows us to specify the intervals (or range) of values around an estimate with a certain level of confidence (usually 95%). What to create confidence intervals for To keep things simple, only the most important indicators should be compared using confidence intervals. This will depend on the country but can be things like headline financial inclusion, as well as key interventions looked at. In addition, a clear explanation should be provided to each confidence interval done. This is to ensure that confidence intervals are not done as a simple box-ticking exercise but the purpose and meaning is clearly understood by a broad audience. As an example, the result above should be reported on as 55% (+/-3%) of the population have a bank account. This simply means that, should we draw repeated samples from the same population, we are 95% certain that the true estimate of people with bank accounts will fall between 52% and 58%. An often overlooked fact is that there is still a 5% chance that the estimate will fall outside of this range, i.e. lower than 52% and higher than 58%. The Confidence Interval allows us to specify the intervals (or range) of values around an estimate with a certain level of confidence (usually 95%). 4 I Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels

Some important concepts Sampling error Each time we draw a new sample from a population, the estimate of interest will differ from the true population value. This is because each new sample is different. The difference in estimates brought about by chance is called the sampling error. Confidence Interval Statisticians use a confidence interval to express the degree of uncertainty associated with a sample statistic. A confidence interval is an interval estimate combined with a probability statement (confidence level) (http://stattrek.com/statistics/). It basically is the range of possible estimates generated by an estimating process that would 95% of the time contain the true value of the parameter being estimated. The confidence interval is the range into which the true population parameter will fall, assuming a given confidence level (Malhotra & Birk, 2003). Confidence intervals are preferred to point estimates and to interval estimates, because only confidence intervals indicate (a) the precision of the estimate and (b) the uncertainty of the estimate. Confidence Level Some confidence intervals include the true population parameter, while others don t. In survey sampling, different samples can be randomly selected from the same population, and each sample can often produce a different confidence interval. Some of these samples will include the true population parameter and some will not (http://stattrek.com/statistics/). The confidence level is the probability that a confidence level will include the population parameter (Malhotra & Birks, 2003). Usually a 95% confidence interval is set, which means that 95% of the sampled confidence intervals will include the true value. Margin of Error The margin of error expresses the maximum expected difference between the true population parameter and a sample estimate of that parameter. To be meaningful, the margin of error should, like the confidence interval, be qualified by a confidence level. The margin of error is simply half the confidence interval. If the confidence interval is 10% with a confidence level of 95%, then the margin of error is +5% of the estimate and -5% of the estimate. For example, on a sample size of 400 with a confidence interval of 9.8% and a proportion of 50%, the MoE will be +-4.9%. This means that the true value will fall between 45.1% and 54.9%, 95% of the time the population is sampled. I 5

Caveats in interpreting the MoE The importance of sample design Demand-side surveys of financial inclusion typically involve idiosyncratic survey study designs. This happens whenever a survey uses existing population database references to design or create the survey s structure, as well as weight or extrapolate the sample results to make broader population inferences. This matters because all decisions used to design a survey create their own kinds of error estimates. Any confidence interval needs to take the structure of error into account. Each survey has its own unique design and thus its own unique confidence interval calculation. Confidence intervals should therefore be individually calculated. There is simple computer software readily available for doing this, with the appropriate parameters to be identified and inputted. It is important to note that the MoE does not take into account non-sampling related sources of error and non-response error and is thus by definition an underestimate of the true total error in a survey. All sources of potential error/bias should be considered when interpreting an estimate. What affects the size of the MoE? There are two key aspects that determine the margin of error: 1. The larger the sample the lower the MoE will be. It should be noted that an increase in sample size has diminishing returns for the MoE. 2. The more extreme the estimate the lower the MoE will be, e.g. a 10% estimate and 90% estimate have a lower MoE than a 50% estimate using the same sample size. The absolute size of the population has very little to do with the MoE. Calculating MoE on sub-groups Margins of error typically are calculated and reported on for surveys overall but should also be calculated again when a subgroup of the sample is considered. Some surveys do not require every respondent to answer every question, and sometimes only certain demographic groups are analysed. In such cases the sample size and extremeness of the estimates will vary. The MoE will therefore also vary. 6 I Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels

Practical application In the charts below, the margin of error has been calculated individually for each FinScope Access Strand based on the realised sample size and actual proportion achieved. The blue bar is the actual estimate of the number of people in each strand, whereas the grey bar is this estimate plus the estimated margin of error and the orange block the estimate minus the MoE. The Formal strand has been excluded. If we look at the Formal Other strand, we can see that we are 95% confident that the true estimate falls between 5.33% and 11.17%. If we take the excluded strands results and break them down by gender, we can see that the MoE increases from 2.82% (both males and females included) to 4.18% (for males only) and the result for males being excluded thus ranges from as low as 10.67% to as high as 19.03%. It is very important to communicate this band or interval continuality when citing estimates, especially when dealing with small sample sizes and proportions that centre around 50%, as that is where the MoE is the largest. South Africa Access Strand 2016 20% 4.18% 4.50% 15% 4.00% 3.50% 10% 3.08% 2.92% 2.94% 2.82% 3.00% 5% 2.50% 0% 8.25% 5.33% 11.17% Formal other 3.16% 0.22% 6.10% 3.16% 0.22% 6.10% 14.85% 10.67% 19.03% 8.30% 5.23% 11.38% Informal only Excluded Excluded male Excluded female 2.00% Estimate % Lower MoE Upper MoE MoE I 7

Alternative way to represent the confidence interval There are many ways to show the interval for key estimates. South Africa Access Strand 2016 20% 15% 10% 5% 0% Formal other Informal only Excluded Excluded male Excluded female 8 I Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels

South Africa Access Strand 2016 20% 15% 10% 5% 8.25% 3.16% 11.46% 14.85% 8.30% 0% Formal other Informal only Excluded Excluded male Excluded female I 9

Conclusion It is important that the users of statistics realise that there is an interval around any estimate (mean or proportion). These intervals can be very small or very large depending on a number of factors. These factors should be identified and considered when interpreting findings. The authors recommend that the interval and MoE be reported on multiple times in a report, especially for key indicators and when the sample size (base) changes due to sub-group analyses. 10 I Reporting financial inclusion research results using confidence intervals, margins of error and confidence levels

Bibliography http://www.aapor.org/education-resources/ Election-Polling-Resources/Margin-of-Sampling- Error-Credibility-Interval.aspx http://www.scor.qc.ca/en_calculez.html http://stattrek.com/statistics/dictionary. aspx?definition=confidence%20level http://stattrek.com/statistics/dictionary. aspx?definition=confidence%20level https://www.ncbi.nlm.nih.gov/pmc/articles/ PMC2689604/ https://www.ncbi.nlm.nih.gov/pmc/articles/ PMC4877414/ https://www.vox.com/science-andhealth/2017/7/31/16021654/p-values-statisticalsignificance-redefine-0005 Malhotra, N.K. & Birks, D.F. (2003). Marketing Research. An applied Approach. Prentice Hall. I 11

How to find us: Get involved. Contact us. Grant Robertson E: grant@i2ifacility.org Bobby Berkowitz E: bobby@i2ifacility.org Damola Olowade E: damola@i2ifacility.org Shirley Jeoffreys-Leach E: shirley@i2ifacility.org +27 11 315 9197 i2ifacility.org @i2ifacility /insight2impact /insight2impact /i2ifacility