Lithium-Ion Battery Analysis for Reliability and Accelerated Testing Using Logistic Regression

Size: px
Start display at page:

Download "Lithium-Ion Battery Analysis for Reliability and Accelerated Testing Using Logistic Regression"

Transcription

1 for Reliability and Accelerated Testing Using Logistic Regression Travis A. Moebes, PhD Dyn-Corp International, LLC Houston, Texas

2 Biography Dr. Travis Moebes has a B.S. in Mathematics with Engineering Applications - University of Texas-Austin, a M.S. in Applied Mathematics from Sam Houston State University - Huntsville, and a PhD in Mathematics - Analytic Topology from the University of Houston. He was nominated in 1984 for the Fields Medal (Mathematics, International Union of Mathematicians, WARSAW, POLAND) for his work in applying combinatory to Topology and Geophysics. His over 100 publications includes papers in Mathematics, Geophysics, Computer Science, Education and Reliability Analysis in the American Mathematical Society, IEEE, NASA Journals, and RAMS. He worked for SAIC at NASA for 25 years as a Reliability Engineer. He joined Dyn-Corp International, LLC in 2012 where he works in reliability analysis development for aviation at NASA/JSC/AOD in Houston. ASTR 2013 Oct 9-11, San Diego, CA

3 Purpose Report on BASICS and Methods To Perform Logistic Regression For Obtaining Probabilities of Failures On Lithium-Ion Battery Cells That Have Gone Through Cycle-Down/Up Accelerated Testing Give A Road Map To Follow Good Analysis Practices Through The Steps Of Performing Logistic Regression Using SAS Enterprise Miner (EM) and TIBCO Miner(TM) Demonstrate How Statistical Based Algorithms Improved Accelerated Testing

4 Topics Testing for the significance of the logistic regression model Insightful Miner (IM) and SAS EM Lithium-Ion Battery Cell Test Data Vendor 1, Vendor 2, Vendor 3 Logistic Regression Assessment results for Selection of Vendor

5 The Logistic Regression Model In logistic regression, you model the probability of a binary event occurring as a linear function of a set of independent variables. Logistic regression models are a special type of linear model in which the dependent variable is categorical and has exactly two levels.

6 The Logistic Regression Model Linear Regression Analysis A linear model provides a way of estimating a dependent variable Y, conditional on a linear function of a set of independent variables, X1, X2... Xp. Mathematically, this is written as:

7 The Logistic Regression Model continued In this equation, the terms are the coefficients βi of the linear model; the intercept of the model is β0 and e is the residual. Estimates of the coefficients,, are computed from the training data from which an estimate of the dependent variable, is computed by substituting the estimated coefficients into Equation for Y above. An estimate of the residual, is then the difference between the observed dependent variable and its estimate.

8 The Logistic Regression Model continued Logistic regression is a predictive analysis, like linear regression, but logistic regression involves prediction of a dichotomous dependent variable. The predictors can be continuous or dichotomous, just as in regression analysis, but ordinary least squares regression (OLS) is not appropriate if the outcome is dichotomous.

9 The Logistic Regression Model continued Whereas the OLS regression uses normal probability theory, logistic regression uses binomial probability theory. This makes things a bit more complicated mathematically, so we will only cover this topic fairly superficially (believe me; I'm mixing it with ease!).

10 The Logistic Regression Model continued Chi-square and Logistic Regression Because the binomial distribution is used, we might expect that there will be a relationship between logistic regression and chi-square analysis. It turns out that the 2 X 2 contingency analysis with chi-square is really just a special case of logistic regression, and this is analogous to the relationship between ANOVA and regression

11 The Logistic Regression Model continued Chi-square and Logistic Regression(continued) With chi-square contingency analysis, the independent variable is dichotomous and the dependent variable is dichotomous. We can also conduct an equivalent logistic regression analysis with a dichotomous independent variable predicting a dichotomous dependent variable.

12 The Logistic Regression Model continued Chi-square and Logistic Regression(continued) Logistic regression is a more general analysis, however, because the independent variable (i.e., the predictor) is not restricted to a dichotomous variable. Nor is logistic regression limited to a single predictor. Let's take an example. Coronary heart disease (CHD) is an increasing risk as one's age increases. We can think of CHD as a dichotomous variable (although one can also imagine some continuous measures of this). For this example, either a patient has CHD or not. If we were to plot the relationship between age and CHD in a scatter plot, we would get something that looks like this: (next slide)

13 The Logistic Regression Model continued Chi-square and Logistic Regression(continued) Figure 1: Plot of CHD by AGE

14 The Logistic Regression Model continued Chi-square and Logistic Regression(continued) We can see from the graph that there is somewhat of a greater likelihood that CHD will occur at older ages. But this figure is not very suitable for examining that. If we tried to draw a straight (best fitting) line through the points, it would not do a very good job of explaining the data. One solution would be to convert or transform these numbers into probabilities.

15 The Logistic Regression Model continued Chi-square and Logistic Regression(continued) The y values can only be 0 or 1, so an average of them will be between 0 and 1 (.2,.9,.6 etc.). This average is the same as the probability of having a value of 1 on the y variable, given a certain value of x (notated as P(y xi). So, we could then plot the probabilities of y at each value of x and it would look something like this: (next slide)

16 The Logistic Regression Model continued Chi-square and Logistic Regression(continued) Figure 2 Cumulative Probability Curve For The Logistic Distribution

17 Testing for the significance of the logistic regression model IM and SAS EM TIBCO Miner- continued The p-value for each t-statistic indicates if the corresponding coefficient is significant in the model. In general, if the p-value is less than 0.05 the t- statistic is greater than This suggests that the coefficients are significant. In Figure 3 below, the small p-values for Start implies the term is very significant and the variables Age and Number contribute to the model but less so. Generally, a test for the intercept is uninformative since we rarely expect the regression surface to intersect with the origin.

18 Testing for the significance of the logistic regression model IM and SAS EM TIBCO Miner- continued Figure 3 TIBCO Miner Work flow Diagram

19 Testing for the significance of the logistic regression model IM and SAS EM TIBCO Miner- continued Figure 4 Model Assessment output From TM

20 Testing for the significance of the logistic regression model IM and SAS EM TIBCO Miner- continued In Figure 4 above, the small p-values for Start implies the term is very significant and the variables Age and Number contribute to the model but less so. Generally, a test for the intercept is uninformative since we rarely expect the regression surface to intersect with the origin.

21 Testing for the significance of the logistic regression model IM and SAS EM SAS Enterprise Miner SAS Enterprise Miner uses the Likelihood Ratio Test for Global Null Hypothesis to evaluate the entire model. See Figure 6 below. The Null hypothesis is that are no significant variables in the model whose variation explains the variation in the dependent variable. One may reject the Null Hypothesis if Pr is less than (95% confidence) and Beta = 0. Models with higher Likelihood Ratio Chi-Square statistics are the better models.

22 Testing for the significance of the logistic regression model IM and SAS EM SAS Enterprise Miner Figure 5 : SAS EM Work Flow Diagram

23 Testing for the significance of the logistic regression model IM and SAS EM SAS Enterprise Miner - continued Figure 6: The SAS EM display for the Logistic Regression node SAS EM also displays t-statistics for term importance with p- values

24 Lithium-Ion Battery and Data Fourteen-hundred rows by 52 columns of Electrovaya Cell Acceptance Data were obtained from JSC/EP. A portion of the data appears below in Figure 7 Figure 7 Snapshot of the Vendor 1 Cell Acceptance Data

25 Lithium-Ion Battery and Data Figure 7 Snapshot of the Vendor Cell Acceptance Data ASTR 2013, Oct. 9-11, San Diego, CA

26 Lithium-Ion Battery and Data - continued The LIB No. column refers to the battery, the Module No. refers to the Modules containing the cells, and the Serial column refers to the cells. The P/F column indicates if a particular cell (serial number) passed or failed the vendor test. We assume the possibility of building a Logistic Regression classification model with this data using P/F as the dependent parameter and all other parameters as possible independent parameters.

27 Lithium-Ion Battery and Data - continued Our scope was limited to the Logistic Regression classification for model determination. Other models such as Classification Trees and Classification Neural Networks were not used at this time. Our study was also limited to the above data set. Different data sets and models are planned for the future.

28 The Lithium Battery Lithium-Ion Battery and Data - continued Figure 8 Cells (cell 5) and (cell 4) Damage Due To Debris

29 Lithium-Ion Battery and Data - continued JSC/EP tests showed there are 2 separate internal shorts, both on bottom corners of cell #11577occurred. JSC/EP The short tests at GZ showed affected several there cell fold are layers 2 more separate deeply than internal the more superficial shorts, damage both caused on by bottom short at the opposite corners of on cell # #11577occurred. The short at GZ affected several cell fold layers more deeply than the more superficial damage caused by short at the opposite corner on cell #11577.

30 Analysis and Assessment results Data from five cells selected by JSC/EP were processed through three model options to determine the best model and to indicate a known cell that failed. The output of the best model showed good acceptability statistics and an indication of the failed cell as less acceptable than the other cells. All results were similar in both TM and EM processing and model building.

31 Analysis and Assessment results-continued Option 1 All Rows and Columns As Input A total of 1400 rows of data and all usable columns (53) were used as input to initiate the process of finding a best model. There were 52 independent variables. Option 2. This option was like Option 1, except six independent variables were dropped because of less significant statistical indications based on p-values and t-statistics from Option 1. Option 3. This option was also like Option 1, except deleting the following: Columns with too many blanks, namely the soft short test parameters with the exception of column ACR8C. Non-relevant columns according to JSC Engineering (EP). Relevant columns with good t- Statistics were kept. Categorical Columns. These had less than desirable t-statistics. ASTR 2013, Oct. 9-11, San Diego, CA

32 Analysis and Assessment results-continued R-Square LR Ratio Pr/Beta /0.0 Option 1 Option /0.0 Option /0.0 TM SAS EM SAS EM Figure 8 Model Evaluation Analysis results from both TM and SAS EM.

33 Analysis and Assessment results-continued Model Analysis Results - TM Figure 9 Pass/Fail Output From TM and Option 3. Predict.prob is the model prediction that the cell should pass vendor tests. Predict.class is the model indication of Pass or Fail.

34 Analysis and Assessment results-continued Model Analysis Results SAS EM Figure 10 Pass/Fail Output From SAS EM and Option 3. Predict.prob is the model prediction that the cell should pass vendor tests. Predict.class is the model indication of Pass or Fail.

35 Analysis and Assessment results-continued Our best statistical model showed that cells11577 and in module FGM-91 and battery 1010 had a probability of 0.79 to pass. This was the lowest probability of passing than any other cells in the module. Cell showed very similar results. All other cells showed a 0.90 probability or higher to pass. Acceptance Statistics were good.

36 Analysis and Assessment results-continued Further Logistic Regression Models were applied to more Electrovaya Battery Test Cell Data. More low probability indicators were found and confirmed by hardware testing Electrovaya was dropped as the vendor. Sony (MoliJ) and Motorola (LV) were chosen as the two possible vendors to furnish the Lithium-Ion battery cells. The use of the LR Models were used to cherry pick out the potential cells that may fail without returning to more hardware, non-destructive testing.

SPSS Guide Page 1 of 13

SPSS Guide Page 1 of 13 SPSS Guide Page 1 of 13 A Guide to SPSS for Public Affairs Students This is intended as a handy how-to guide for most of what you might want to do in SPSS. First, here is what a typical data set might

More information

Business Quantitative Analysis [QU1] Examination Blueprint

Business Quantitative Analysis [QU1] Examination Blueprint Business Quantitative Analysis [QU1] Examination Blueprint 2014-2015 Purpose The Business Quantitative Analysis [QU1] examination has been constructed using an examination blueprint. The blueprint, also

More information

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney

= = Intro to Statistics for the Social Sciences. Name: Lab Session: Spring, 2015, Dr. Suzanne Delaney Name: Intro to Statistics for the Social Sciences Lab Session: Spring, 2015, Dr. Suzanne Delaney CID Number: _ Homework #22 You have been hired as a statistical consultant by Donald who is a used car dealer

More information

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data

= = Name: Lab Session: CID Number: The database can be found on our class website: Donald s used car data Intro to Statistics for the Social Sciences Fall, 2017, Dr. Suzanne Delaney Extra Credit Assignment Instructions: You have been hired as a statistical consultant by Donald who is a used car dealer to help

More information

After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and

After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and techniques and explain them Now the difference between descriptive

More information

Distinguish between different types of numerical data and different data collection processes.

Distinguish between different types of numerical data and different data collection processes. Level: Diploma in Business Learning Outcomes 1.1 1.3 Distinguish between different types of numerical data and different data collection processes. Introduce the course by defining statistics and explaining

More information

Continuous Improvement Toolkit

Continuous Improvement Toolkit Continuous Improvement Toolkit Regression (Introduction) Managing Risk PDPC Pros and Cons Importance-Urgency Mapping RACI Matrix Stakeholders Analysis FMEA RAID Logs Break-even Analysis Cost -Benefit Analysis

More information

GETTING STARTED WITH PROC LOGISTIC

GETTING STARTED WITH PROC LOGISTIC PAPER 255-25 GETTING STARTED WITH PROC LOGISTIC Andrew H. Karp Sierra Information Services, Inc. USA Introduction Logistic Regression is an increasingly popular analytic tool. Used to predict the probability

More information

GETTING STARTED WITH PROC LOGISTIC

GETTING STARTED WITH PROC LOGISTIC GETTING STARTED WITH PROC LOGISTIC Andrew H. Karp Sierra Information Services and University of California, Berkeley Extension Division Introduction Logistic Regression is an increasingly popular analytic

More information

APPLICATION OF SEASONAL ADJUSTMENT FACTORS TO SUBSEQUENT YEAR DATA. Corresponding Author

APPLICATION OF SEASONAL ADJUSTMENT FACTORS TO SUBSEQUENT YEAR DATA. Corresponding Author 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 APPLICATION OF SEASONAL ADJUSTMENT FACTORS TO SUBSEQUENT

More information

Forecasting Introduction Version 1.7

Forecasting Introduction Version 1.7 Forecasting Introduction Version 1.7 Dr. Ron Tibben-Lembke Sept. 3, 2006 This introduction will cover basic forecasting methods, how to set the parameters of those methods, and how to measure forecast

More information

Advanced Tutorials. SESUG '95 Proceedings GETTING STARTED WITH PROC LOGISTIC

Advanced Tutorials. SESUG '95 Proceedings GETTING STARTED WITH PROC LOGISTIC GETTING STARTED WITH PROC LOGISTIC Andrew H. Karp Sierra Information Services and University of California, Berkeley Extension Division Introduction Logistic Regression is an increasingly popular analytic

More information

Prediction of Used Cars Prices by Using SAS EM

Prediction of Used Cars Prices by Using SAS EM Prediction of Used Cars Prices by Using SAS EM Jaideep A Muley Student, MS in Business Analytics Spears School of Business Oklahoma State University 1 ABSTRACT Prediction of Used Cars Prices by Using SAS

More information

Multiple Regression. Dr. Tom Pierce Department of Psychology Radford University

Multiple Regression. Dr. Tom Pierce Department of Psychology Radford University Multiple Regression Dr. Tom Pierce Department of Psychology Radford University In the previous chapter we talked about regression as a technique for using a person s score on one variable to make a best

More information

Logistic Regression and Decision Trees

Logistic Regression and Decision Trees Logistic Regression and Decision Trees Reminder: Regression We want to find a hypothesis that explains the behavior of a continuous y y = B0 + B1x1 + + Bpxp+ ε Source Regression for binary outcomes Regression

More information

Getting Started with HLM 5. For Windows

Getting Started with HLM 5. For Windows For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 About this Document... 3 1.2 Introduction to HLM... 3 1.3 Accessing HLM... 3 1.4 Getting Help with HLM... 3 Section 2: Accessing

More information

Who Are My Best Customers?

Who Are My Best Customers? Technical report Who Are My Best Customers? Using SPSS to get greater value from your customer database Table of contents Introduction..............................................................2 Exploring

More information

THE CONCEPT OF CONJOINT ANALYSIS Scott M. Smith, Ph.D.

THE CONCEPT OF CONJOINT ANALYSIS Scott M. Smith, Ph.D. THE CONCEPT OF CONJOINT ANALYSIS Scott M. Smith, Ph.D. Marketing managers are faced with numerous difficult tasks directed at assessing future profitability, sales, and market share for new product entries

More information

4.3 Nonparametric Tests cont...

4.3 Nonparametric Tests cont... Class #14 Wednesday 2 March 2011 What did we cover last time? Hypothesis Testing Types Student s t-test - practical equations Effective degrees of freedom Parametric Tests Chi squared test Kolmogorov-Smirnov

More information

Session 7. Introduction to important statistical techniques for competitiveness analysis example and interpretations

Session 7. Introduction to important statistical techniques for competitiveness analysis example and interpretations ARTNeT Greater Mekong Sub-region (GMS) initiative Session 7 Introduction to important statistical techniques for competitiveness analysis example and interpretations ARTNeT Consultant Witada Anukoonwattaka,

More information

Week 4 Lecture 10 We have been examining the question of equal pay for equal work for several weeks now; but have been somewhat frustrated with the

Week 4 Lecture 10 We have been examining the question of equal pay for equal work for several weeks now; but have been somewhat frustrated with the Week 4 Lecture 10 We have been examining the question of equal pay for equal work for several weeks now; but have been somewhat frustrated with the equal work part. We suspect that salary varies with grade

More information

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved.

AcaStat How To Guide. AcaStat. Software. Copyright 2016, AcaStat Software. All rights Reserved. AcaStat How To Guide AcaStat Software Copyright 2016, AcaStat Software. All rights Reserved. http://www.acastat.com Table of Contents Frequencies... 3 List Variables... 4 Descriptives... 5 Explore Means...

More information

Churn Prevention in Telecom Services Industry- A systematic approach to prevent B2B churn using SAS

Churn Prevention in Telecom Services Industry- A systematic approach to prevent B2B churn using SAS Paper 1414-2017 Churn Prevention in Telecom Services Industry- A systematic approach to prevent B2B churn using SAS ABSTRACT Krutharth Peravalli, Dr. Dmitriy Khots West Corporation It takes months to find

More information

QUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column?

QUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column? QUESTION 1 When a Belt Poka-Yoke's a defect out of the process entirely then she should track the activity with a robust SPC system on the characteristic of interest in the defect as an early warning system.

More information

Getting Started With PROC LOGISTIC

Getting Started With PROC LOGISTIC Getting Started With PROC LOGISTIC Andrew H. Karp Sierra Information Services, Inc. 19229 Sonoma Hwy. PMB 264 Sonoma, California 95476 707 996 7380 SierraInfo@aol.com www.sierrainformation.com Getting

More information

Correlation and Simple. Linear Regression. Scenario. Defining Correlation

Correlation and Simple. Linear Regression. Scenario. Defining Correlation Linear Regression Scenario Let s imagine that we work in a real estate business and we re attempting to understand whether there s any association between the square footage of a house and it s final selling

More information

Statistical Modelling for Social Scientists. Manchester University. January 20, 21 and 24, Modelling categorical variables using logit models

Statistical Modelling for Social Scientists. Manchester University. January 20, 21 and 24, Modelling categorical variables using logit models Statistical Modelling for Social Scientists Manchester University January 20, 21 and 24, 2011 Graeme Hutcheson, University of Manchester Modelling categorical variables using logit models Software commands

More information

RESULT AND DISCUSSION

RESULT AND DISCUSSION 4 Figure 3 shows ROC curve. It plots the probability of false positive (1-specificity) against true positive (sensitivity). The area under the ROC curve (AUR), which ranges from to 1, provides measure

More information

Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2. B. Rosner, 5/09/17

Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2. B. Rosner, 5/09/17 Topics in Biostatistics Categorical Data Analysis and Logistic Regression, part 2 B. Rosner, 5/09/17 1 Outline 1. Testing for effect modification in logistic regression analyses 2. Conditional logistic

More information

ADVANCED DATA ANALYTICS

ADVANCED DATA ANALYTICS ADVANCED DATA ANALYTICS MBB essay by Marcel Suszka 17 AUGUSTUS 2018 PROJECTSONE De Corridor 12L 3621 ZB Breukelen MBB Essay Advanced Data Analytics Outline This essay is about a statistical research for

More information

Credit Card Marketing Classification Trees

Credit Card Marketing Classification Trees Credit Card Marketing Classification Trees From Building Better Models With JMP Pro, Chapter 6, SAS Press (2015). Grayson, Gardner and Stephens. Used with permission. For additional information, see community.jmp.com/docs/doc-7562.

More information

Quantitative Methods

Quantitative Methods THE ASSOCIATION OF BUSINESS EXECUTIVES DIPLOMA PART 2 QM Quantitative Methods afternoon 4 June 2003 1 Time allowed: 3 hours. 2 Answer any FOUR questions. 3 All questions carry 25 marks. Marks for subdivisions

More information

Statistical Modelling for Business and Management. J.E. Cairnes School of Business & Economics National University of Ireland Galway.

Statistical Modelling for Business and Management. J.E. Cairnes School of Business & Economics National University of Ireland Galway. Statistical Modelling for Business and Management J.E. Cairnes School of Business & Economics National University of Ireland Galway June 28 30, 2010 Graeme Hutcheson, University of Manchester Luiz Moutinho,

More information

Haplotype Based Association Tests. Biostatistics 666 Lecture 10

Haplotype Based Association Tests. Biostatistics 666 Lecture 10 Haplotype Based Association Tests Biostatistics 666 Lecture 10 Last Lecture Statistical Haplotyping Methods Clark s greedy algorithm The E-M algorithm Stephens et al. coalescent-based algorithm Hypothesis

More information

Statistics, Data Analysis, and Decision Modeling

Statistics, Data Analysis, and Decision Modeling - ' 'li* Statistics, Data Analysis, and Decision Modeling T H I R D E D I T I O N James R. Evans University of Cincinnati PEARSON Prentice Hall Upper Saddle River, New Jersey 07458 CONTENTS Preface xv

More information

3 Ways to Improve Your Targeted Marketing with Analytics

3 Ways to Improve Your Targeted Marketing with Analytics 3 Ways to Improve Your Targeted Marketing with Analytics Introduction Targeted marketing is a simple concept, but a key element in a marketing strategy. The goal is to identify the potential customers

More information

Identification of Rogue Tools and Process Stage Drift by using JMP Software Visualization and Analytical Techniques

Identification of Rogue Tools and Process Stage Drift by using JMP Software Visualization and Analytical Techniques September 16 th, 2010 Identification of Rogue Tools and Process Stage Drift by using JMP Software Visualization and Analytical Techniques Jim Nelson Engineering Data Specialist, Freescale Semiconductor,

More information

Salford Predictive Modeler. Powerful machine learning software for developing predictive, descriptive, and analytical models.

Salford Predictive Modeler. Powerful machine learning software for developing predictive, descriptive, and analytical models. Powerful machine learning software for developing predictive, descriptive, and analytical models. The Company Minitab helps companies and institutions to spot trends, solve problems and discover valuable

More information

Predictive Modeling using SAS. Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN

Predictive Modeling using SAS. Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN Predictive Modeling using SAS Enterprise Miner and SAS/STAT : Principles and Best Practices CAROLYN OLSEN & DANIEL FUHRMANN 1 Overview This presentation will: Provide a brief introduction of how to set

More information

Linear model to forecast sales from past data of Rossmann drug Store

Linear model to forecast sales from past data of Rossmann drug Store Abstract Linear model to forecast sales from past data of Rossmann drug Store Group id: G3 Recent years, the explosive growth in data results in the need to develop new tools to process data into knowledge

More information

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS

Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Logistic Regression, Part III: Hypothesis Testing, Comparisons to OLS Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals heavily

More information

CUSTOM DECISION SUPPORT, LLC Telephone (831) Pharmaceutical Pricing Research

CUSTOM DECISION SUPPORT, LLC Telephone (831) Pharmaceutical Pricing Research CUSTOM DECISION SUPPORT, LLC Telephone (831) 854-2256 gene@lieb.com http://www.lieb.com Pharmaceutical Pricing Research Custom Decision Support, LLC (CDS) provides pricing analytic services to both commercial

More information

Choosing the best statistical tests for your data and hypotheses. Dr. Christine Pereira Academic Skills Team (ASK)

Choosing the best statistical tests for your data and hypotheses. Dr. Christine Pereira Academic Skills Team (ASK) Choosing the best statistical tests for your data and hypotheses Dr. Christine Pereira Academic Skills Team (ASK) ask@brunel.ac.uk Which test should I use? T-tests Correlations Regression Dr. Christine

More information

Econometric Analysis Dr. Sobel

Econometric Analysis Dr. Sobel Econometric Analysis Dr. Sobel Econometrics Session 1: 1. Building a data set Which software - usually best to use Microsoft Excel (XLS format) but CSV is also okay Variable names (first row only, 15 character

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

Creative Commons Attribution-NonCommercial-Share Alike License

Creative Commons Attribution-NonCommercial-Share Alike License Author: Brenda Gunderson, Ph.D., 2015 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution- NonCommercial-Share Alike 3.0 Unported License:

More information

Profit Optimization ABSTRACT PROBLEM INTRODUCTION

Profit Optimization ABSTRACT PROBLEM INTRODUCTION Profit Optimization Quinn Burzynski, Lydia Frank, Zac Nordstrom, and Jake Wolfe Dr. Song Chen and Dr. Chad Vidden, UW-LaCrosse Mathematics Department ABSTRACT Each branch store of Fastenal is responsible

More information

JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING

JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING JMP TIP SHEET FOR BUSINESS STATISTICS CENGAGE LEARNING INTRODUCTION JMP software provides introductory statistics in a package designed to let students visually explore data in an interactive way with

More information

SOCIAL MEDIA MINING. Behavior Analytics

SOCIAL MEDIA MINING. Behavior Analytics SOCIAL MEDIA MINING Behavior Analytics Dear instructors/users of these slides: Please feel free to include these slides in your own material, or modify them as you see fit. If you decide to incorporate

More information

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration

KnowledgeSTUDIO. Advanced Modeling for Better Decisions. Data Preparation, Data Profiling and Exploration KnowledgeSTUDIO Advanced Modeling for Better Decisions Companies that compete with analytics are looking for advanced analytical technologies that accelerate decision making and identify opportunities

More information

Habitat as a predictor of Warbler usage.

Habitat as a predictor of Warbler usage. 1. Introduction: Habitat as a predictor of Warbler usage. Carl James Schwarz (cschwarz@stat.sfu.ca) Over the past few years, the BC MOE has completed detailed habitat evaluations at 223 Black-throated

More information

Copyright 2013, SAS Institute Inc. All rights reserved.

Copyright 2013, SAS Institute Inc. All rights reserved. IMPROVING PREDICTION OF CYBER ATTACKS USING ENSEMBLE MODELING June 17, 2014 82 nd MORSS Alexandria, VA Tom Donnelly, PhD Systems Engineer & Co-insurrectionist JMP Federal Government Team ABSTRACT Improving

More information

AN ANALYSIS OF CUSTOMERS SATISFACTION AND FACTORS INFLUENCING THE INTERNET BANKING

AN ANALYSIS OF CUSTOMERS SATISFACTION AND FACTORS INFLUENCING THE INTERNET BANKING CHAPTER V AN ANALYSIS OF CUSTOMERS SATISFACTION AND FACTORS INFLUENCING THE INTERNET BANKING 5.1 INTRODUCTION Banking industry is also one of the predominant industries adopting technologies which are

More information

MAS187/AEF258. University of Newcastle upon Tyne

MAS187/AEF258. University of Newcastle upon Tyne MAS187/AEF258 University of Newcastle upon Tyne 2005-6 Contents 1 Collecting and Presenting Data 5 1.1 Introduction...................................... 5 1.1.1 Examples...................................

More information

Learn What s New. Statistical Software

Learn What s New. Statistical Software Statistical Software Learn What s New Upgrade now to access new and improved statistical features and other enhancements that make it even easier to analyze your data. The Assistant Let Minitab s Assistant

More information

9.7 Getting Schooled. A Solidify Understanding Task

9.7 Getting Schooled. A Solidify Understanding Task 35 9.7 Getting Schooled A Solidify Understanding Task In Getting More $, Leo and Araceli noticed a difference in men s and women s salaries. Araceli thought that it was unfair that women were paid less

More information

Penalizing Your Models: An Overview of the Generalized Regression Platform Michael Crotty, SAS Institute; Clay Barker, SAS Institute

Penalizing Your Models: An Overview of the Generalized Regression Platform Michael Crotty, SAS Institute; Clay Barker, SAS Institute Paper RIV-151 Penalizing Your Models: An Overview of the Generalized Regression Platform Michael Crotty, SAS Institute; Clay Barker, SAS Institute ABSTRACT We provide an overview of the Generalized Regression

More information

AP Statistics Scope & Sequence

AP Statistics Scope & Sequence AP Statistics Scope & Sequence Grading Period Unit Title Learning Targets Throughout the School Year First Grading Period *Apply mathematics to problems in everyday life *Use a problem-solving model that

More information

Code Compulsory Module Credits Continuous Assignment

Code Compulsory Module Credits Continuous Assignment CURRICULUM AND SCHEME OF EVALUATION Compulsory Modules Evaluation (%) Code Compulsory Module Credits Continuous Assignment Final Exam MA 5210 Probability and Statistics 3 40±10 60 10 MA 5202 Statistical

More information

Overview of Statistics used in QbD Throughout the Product Lifecycle

Overview of Statistics used in QbD Throughout the Product Lifecycle Overview of Statistics used in QbD Throughout the Product Lifecycle August 2014 The Windshire Group, LLC Comprehensive CMC Consulting Presentation format and purpose Method name What it is used for and/or

More information

Ensemble Modeling. Toronto Data Mining Forum November 2017 Helen Ngo

Ensemble Modeling. Toronto Data Mining Forum November 2017 Helen Ngo Ensemble Modeling Toronto Data Mining Forum November 2017 Helen Ngo Agenda Introductions Why Ensemble Models? Simple & Complex ensembles Thoughts: Post-real-life Experimentation Downsides of Ensembles

More information

Multilevel Modeling and Cross-Cultural Research

Multilevel Modeling and Cross-Cultural Research 11 Multilevel Modeling and Cross-Cultural Research john b. nezlek Cross-cultural psychologists, and other scholars who are interested in the joint effects of cultural and individual-level constructs, often

More information

The Dummy s Guide to Data Analysis Using SPSS

The Dummy s Guide to Data Analysis Using SPSS The Dummy s Guide to Data Analysis Using SPSS Univariate Statistics Scripps College Amy Gamble April, 2001 Amy Gamble 4/30/01 All Rights Rerserved Table of Contents PAGE Creating a Data File...3 1. Creating

More information

CREDIT RISK MODELLING Using SAS

CREDIT RISK MODELLING Using SAS Basic Modelling Concepts Advance Credit Risk Model Development Scorecard Model Development Credit Risk Regulatory Guidelines 70 HOURS Practical Learning Live Online Classroom Weekends DexLab Certified

More information

Logistic Regression with Expert Intervention

Logistic Regression with Expert Intervention Smart Cities Symposium Prague 2016 1 Logistic Regression with Expert Intervention Pavla Pecherková and Ivan Nagy Abstract This paper deals with problem of analysis of traffic data. A traffic network has

More information

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction

Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction Paper SAS1774-2015 Predictive Modeling Using SAS Visual Statistics: Beyond the Prediction ABSTRACT Xiangxiang Meng, Wayne Thompson, and Jennifer Ames, SAS Institute Inc. Predictions, including regressions

More information

HIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS

HIMSS ME-PI Community. Quick Tour. Sigma Score Calculation Worksheet INSTRUCTIONS HIMSS ME-PI Community Sigma Score Calculation Worksheet INSTRUCTIONS Quick Tour Let s start with a quick tour of the Excel spreadsheet. There are six worksheets in the spreadsheet. Sigma Score (Snapshot)

More information

Telecommunications Churn Analysis Using Cox Regression

Telecommunications Churn Analysis Using Cox Regression Telecommunications Churn Analysis Using Cox Regression Introduction As part of its efforts to increase customer loyalty and reduce churn, a telecommunications company is interested in modeling the "time

More information

Continuous Improvement Toolkit

Continuous Improvement Toolkit Continuous Improvement Toolkit ANOVA Managing Risk PDPC Pros and Cons Importance-Urgency Mapping RACI Matrix Stakeholders Analysis FMEA RAID Logs Break-even Analysis Cost -Benefit Analysis PEST PERT/CPM

More information

Categorical Data Analysis

Categorical Data Analysis Categorical Data Analysis Hsueh-Sheng Wu Center for Family and Demographic Research October 4, 200 Outline What are categorical variables? When do we need categorical data analysis? Some methods for categorical

More information

Quadratic Regressions Group Acitivity 2 Business Project Week #4

Quadratic Regressions Group Acitivity 2 Business Project Week #4 Quadratic Regressions Group Acitivity 2 Business Project Week #4 In activity 1 we created a scatter plot on the calculator using a table of values that were given. Some of you were able to create a linear

More information

An Application of Categorical Analysis of Variance in Nested Arrangements

An Application of Categorical Analysis of Variance in Nested Arrangements International Journal of Probability and Statistics 2018, 7(3): 67-81 DOI: 10.5923/j.ijps.20180703.02 An Application of Categorical Analysis of Variance in Nested Arrangements Iwundu M. P. *, Anyanwu C.

More information

PROBLEMS OF WORLD AGRICULTURE

PROBLEMS OF WORLD AGRICULTURE Scientific Journal Warsaw University of Life Sciences SGGW PROBLEMS OF WORLD AGRICULTURE Volume 3 (XVIII) Warsaw University of Life Sciences Press Warszawa 28 Pawe Kobus 1 Chair of Agricultural Economics

More information

Application of Intelligent Methods for Improving the Performance of COCOMO in Software Projects

Application of Intelligent Methods for Improving the Performance of COCOMO in Software Projects Application of Intelligent Methods for Improving the Performance of COCOMO in Software Projects Mahboobeh Dorosti,. Vahid Khatibi Bardsiri Department of Computer Engineering, Kerman Branch, Islamic Azad

More information

Model-Based Fiber Network Expansion Using SAS Enterprise Miner and SAS Visual Analytics

Model-Based Fiber Network Expansion Using SAS Enterprise Miner and SAS Visual Analytics Paper 2634-2018 Model-Based Fiber Network Expansion Using SAS Enterprise Miner and SAS Visual Analytics ABSTRACT Nishant Sharma, Charter Communications Charter Communications is the second largest cable

More information

5 CHAPTER: DATA COLLECTION AND ANALYSIS

5 CHAPTER: DATA COLLECTION AND ANALYSIS 5 CHAPTER: DATA COLLECTION AND ANALYSIS 5.1 INTRODUCTION This chapter will have a discussion on the data collection for this study and detail analysis of the collected data from the sample out of target

More information

Do Customers Respond to Real-Time Usage Feedback? Evidence from Singapore

Do Customers Respond to Real-Time Usage Feedback? Evidence from Singapore Do Customers Respond to Real-Time Usage Feedback? Evidence from Singapore Frank A. Wolak Director, Program on Energy and Sustainable Development Professor, Department of Economics Stanford University Stanford,

More information

CONTRIBUTORY AND INFLUENCING FACTORS ON THE LABOUR WELFARE PRACTICES IN SELECT COMPANIES IN TIRUNELVELI DISTRICT AN ANALYSIS

CONTRIBUTORY AND INFLUENCING FACTORS ON THE LABOUR WELFARE PRACTICES IN SELECT COMPANIES IN TIRUNELVELI DISTRICT AN ANALYSIS CONTRIBUTORY AND INFLUENCING FACTORS ON THE LABOUR WELFARE PRACTICES IN SELECT COMPANIES IN TIRUNELVELI DISTRICT AN ANALYSIS DR.J.TAMILSELVI Assistant Professor, Department of Business Administration Annamalai

More information

intelligent world. GIV

intelligent world. GIV Huawei provides industries aiming to development, as well that will enable the ecosystem to truly intelligent world. GIV 2025 the direction for ramp up the pace of as the foundations diverse ICT industry

More information

Hierarchical Linear Modeling: A Primer 1 (Measures Within People) R. C. Gardner Department of Psychology

Hierarchical Linear Modeling: A Primer 1 (Measures Within People) R. C. Gardner Department of Psychology Hierarchical Linear Modeling: A Primer 1 (Measures Within People) R. C. Gardner Department of Psychology As noted previously, Hierarchical Linear Modeling (HLM) can be considered a particular instance

More information

Who Is Likely to Succeed: Predictive Modeling of the Journey from H-1B to Permanent US Work Visa

Who Is Likely to Succeed: Predictive Modeling of the Journey from H-1B to Permanent US Work Visa Who Is Likely to Succeed: Predictive Modeling of the Journey from H-1B to Shibbir Dripto Khan ABSTRACT The purpose of this Study is to help US employers and legislators predict which employees are most

More information

SAS/STAT 14.1 User s Guide. Introduction to Categorical Data Analysis Procedures

SAS/STAT 14.1 User s Guide. Introduction to Categorical Data Analysis Procedures SAS/STAT 14.1 User s Guide Introduction to Categorical Data Analysis Procedures This document is an individual chapter from SAS/STAT 14.1 User s Guide. The correct bibliographic citation for this manual

More information

USING R IN SAS ENTERPRISE MINER EDMONTON USER GROUP

USING R IN SAS ENTERPRISE MINER EDMONTON USER GROUP USING R IN SAS ENTERPRISE MINER EDMONTON USER GROUP INTRODUCTION PAT VALENTE, MA Solution Specialist, Data Sciences at SAS. Training in Economics and Statistics. 20 years experience in business areas including

More information

LEAN SIX SIGMA BLACK BELT CHEAT SHEET

LEAN SIX SIGMA BLACK BELT CHEAT SHEET GreyCampus LEAN SIX SIGMA BLACK BELT CHEAT SHEET 2 Includes formulas: what they are, when to use them, references CONTENTS ANOVA DOE (DESIGN OF EXPERIMENTS) One Factor At a Time (OFAT) Comparison Randomization

More information

MODELING THE EXPERT. An Introduction to Logistic Regression The Analytics Edge

MODELING THE EXPERT. An Introduction to Logistic Regression The Analytics Edge MODELING THE EXPERT An Introduction to Logistic Regression 15.071 The Analytics Edge Ask the Experts! Critical decisions are often made by people with expert knowledge Healthcare Quality Assessment Good

More information

STAT 350 (Spring 2016) Homework 12 Online 1

STAT 350 (Spring 2016) Homework 12 Online 1 STAT 350 (Spring 2016) Homework 12 Online 1 1. In simple linear regression, both the t and F tests can be used as model utility tests. 2. The sample correlation coefficient is a measure of the strength

More information

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur

Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Applied Multivariate Statistical Modeling Prof. J. Maiti Department of Industrial Engineering and Management Indian Institute of Technology, Kharagpur Lecture - 19 Tutorial - ANOVA (Contd.) (Refer Slide

More information

STATISTICS PART Instructor: Dr. Samir Safi Name:

STATISTICS PART Instructor: Dr. Samir Safi Name: STATISTICS PART Instructor: Dr. Samir Safi Name: ID Number: Question #1: (20 Points) For each of the situations described below, state the sample(s) type the statistical technique that you believe is the

More information

Acceptance Sampling. CD Tutorial. Tutorial Outline

Acceptance Sampling. CD Tutorial. Tutorial Outline Acceptance Sampling CD Tutorial 2 Tutorial Outline SAMLING LANS Single Sampling Double Sampling Sequential Sampling OERATING CHARACTERISTIC (OC) CURVES RODUCER S AND CONSUMER S RISK AVERAGE OUTGOING QUALITY

More information

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 02 Data Mining Process Welcome to the lecture 2 of

More information

C-14 FINDING THE RIGHT SYNERGY FROM GLMS AND MACHINE LEARNING. CAS Annual Meeting November 7-10

C-14 FINDING THE RIGHT SYNERGY FROM GLMS AND MACHINE LEARNING. CAS Annual Meeting November 7-10 1 C-14 FINDING THE RIGHT SYNERGY FROM GLMS AND MACHINE LEARNING CAS Annual Meeting November 7-10 GLM Process 2 Data Prep Model Form Validation Reduction Simplification Interactions GLM Process 3 Opportunities

More information

Masters in Business Statistics (MBS) /2015. Department of Mathematics Faculty of Engineering University of Moratuwa Moratuwa. Web:

Masters in Business Statistics (MBS) /2015. Department of Mathematics Faculty of Engineering University of Moratuwa Moratuwa. Web: Masters in Business Statistics (MBS) - 2014/2015 Department of Mathematics Faculty of Engineering University of Moratuwa Moratuwa Web: www.mrt.ac.lk Course Coordinator: Prof. T S G Peiris Prof. in Applied

More information

SAS BIG DATA ANALYTICS INCREASING YOUR COMPETITIVE EDGE

SAS BIG DATA ANALYTICS INCREASING YOUR COMPETITIVE EDGE SAS BIG DATA ANALYTICS INCREASING YOUR COMPETITIVE EDGE SAS VISUAL ANALYTICS STATE OF THE ART SOLUTION FOR FASTER, SMARTER DECISIONS AIMED AT THE MASSES Data visualization Approachable analytics Robust

More information

Comparative analysis on the probability of being a good payer

Comparative analysis on the probability of being a good payer Comparative analysis on the probability of being a good payer V. Mihova, and V. Pavlov Citation: AIP Conference Proceedings 1895, 050006 (2017); View online: https://doi.org/10.1063/1.5007378 View Table

More information

Use Multi-Stage Model to Target the Most Valuable Customers

Use Multi-Stage Model to Target the Most Valuable Customers ABSTRACT MWSUG 2016 - Paper AA21 Use Multi-Stage Model to Target the Most Valuable Customers Chao Xu, Alliance Data Systems, Columbus, OH Jing Ren, Alliance Data Systems, Columbus, OH Hongying Yang, Alliance

More information

CHAPTER 4, SECTION 1

CHAPTER 4, SECTION 1 DAILY LECTURE CHAPTER 4, SECTION 1 Understanding Demand What Is Demand? Demand is the willingness and ability of buyers to purchase different quantities of a good, at different prices, during a specific

More information

Week 10: Heteroskedasticity

Week 10: Heteroskedasticity Week 10: Heteroskedasticity Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ARR 1 Outline The problem of (conditional)

More information

Session 31, Statistical Techniques for Fund Mapping and Other Applications. Moderator: Douglas L. Robbins, FSA, MAAA. Presenter:

Session 31, Statistical Techniques for Fund Mapping and Other Applications. Moderator: Douglas L. Robbins, FSA, MAAA. Presenter: Session 31, Statistical Techniques for Fund Mapping and Other Applications Moderator: Douglas L. Robbins, FSA, MAAA Presenter: Douglas L. Robbins, FSA, MAAA Statistical Techniques for Fund Mapping and

More information

Chapter 5 Regression

Chapter 5 Regression Chapter 5 Regression Topics to be covered in this chapter: Regression Fitted Line Plots Residual Plots Regression The scatterplot below shows that there is a linear relationship between the percent x of

More information

4»IEEE POWER SYSTEMS .*ISK ASSESSMENT OF. Models, Methods, and Applications. Wiley. Ph.D., Fellow, IfE, CAE, EIC SECOND EDITION

4»IEEE POWER SYSTEMS .*ISK ASSESSMENT OF. Models, Methods, and Applications. Wiley. Ph.D., Fellow, IfE, CAE, EIC SECOND EDITION .*ISK ASSESSMENT OF POWER SYSTEMS Models, Methods, and Applications SECOND EDITION Wenyuan Li, Ph.D., Fellow, IfE, CAE, EIC Chongqing University, China BC H> Canada, n IEEE PRESS SERIES ON POWER ENGINEERING

More information