Automated Test Assembly for COMLEX USA: A SAS Operations Research (SAS/OR) Approach

Size: px
Start display at page:

Download "Automated Test Assembly for COMLEX USA: A SAS Operations Research (SAS/OR) Approach"

Transcription

1 Automated Test Assembly for COMLEX USA: A SAS Operations Research (SAS/OR) Approach Dr. Hao Song, Senior Director for Psychometrics and Research Dr. Hongwei Patrick Yang, Senior Research Associate

2 Introduction Automated test assembly (ATA) is the process of automating test form construction through constrained optimization (vs. manual assembly) Improved effectiveness and efficiency for constructing multiple parallel test forms Improved psychometric quality: Increased form comparability and less variation Targeting at population ability to assure more accuracy of pass/fail decisions at the cut score

3 Introduction In this ATA demonstration, we choose an optimization program PROC OPTMODEL, part of Statistical Analysis Software Operations Research (SAS/OR), as the tool for ATA SAS is the official statistical analysis platform at NBOME SAS is an industry standard product in mathematical and statistical computing Important to operational work related to COMLEX-USA as a high-stakes licensure test designed to protect the public Note: Operations research deals with the application of advanced analytical methods to help make better decisions

4 Three Fundamental Components In the ATA work, we utilize the technique of mixed/pure integer linear/nonlinear programming. Three fundamental components need to be established: Decision variables Constraints Including both content and psychometrics constraints Objective function(s)

5 Decision Variables The decision variables we define here are binary variables in the form of 0 s and 1 s indicating the inclusion or exclusion of each item in each test form: x if = 1, if item i is assigned to form f x if = 0, otherwise Here, i = 1, N, f = 1, M, N is the total number of items in the item pool and M is the total number of forms to be assembled

6 Constraints Constraints are test specifications that need to be met. Typical constraints include: To restrict the test length to be exactly n items for form f N i=1 x if = n To ensure that item i is selected no more than once across all M forms 0 M i=1 x if 1 To limit the number of items on a certain topic (say, OPP items, or set of enemy items, etc.) to be between l and u on any given form Let t be a binary indicator variable with 1 indicating the item falling into the topic and 0 otherwise. Then, l x if t i u N i=1

7 Objective Function(s) Finally, the objective function is formulated by requiring the test information function (TIF) of each assembled form be as close to the target value as possible at the cut score θ = θ c : N Minimize x I i θ c x if T i θ c i=1 Careful consideration is given to keep examinations comparable over years

8 Measures of Test Quality Basically, test information function or TIF tells us how well the test is doing in estimating ability over the whole range of ability scores Given ability θ, a higher value in TIF indicates that the test is doing a better job

9 Data and Constraints Applied In development of the ATA engine, one-level data was used with the target latent ability cut score of θ = θ c Data sources of anchor, operational and pretest items The following criteria are specified as constraints in this ATA demonstration Blueprint Dimension 1 criteria Blueprint Dimension 2 criteria Life stage in Clinical presentations Number of items in a test form etc.

10 New ATA Forms vs. Previous Forms: TIF by Ability

11 New ATA Forms vs. Previous Forms The newly assembled ATA forms (in RED) are graphically presented when compared with a set of forms (in BLUE) assembled using the traditional manual assembly method Figure 1 presents an overlay of both groups of graphs by plotting one statistic against a wide range of ability levels across all assembled forms values of test information functions by ability levels from [(-4), (+4)]

12 New ATA Forms vs. Previous Forms Within each group of graphs, there is very good equivalency among forms The graphs are all closely overlapped with each other The new ATA forms noticeably demonstrate less variability among them around the cut score θ = θ c than do the traditional forms

13 New ATA Forms vs. Previous Forms Across the two groups of graphs, for a major portion of the continuum, the new ATA forms show relatively high test information function values than those traditional forms

14 New ATA Forms vs. Previous Forms In sum, the new and the traditional forms are reasonably comparable with each other in terms of equivalency within their respective group The new ATA forms can be better tailored to the candidate ability, around the cut score θ = θ c in particular

15 Impact Analysis for Classification Accuracy To further evaluate the new ATA forms, we have conducted an impact analysis via a simulation study using the empirical administration data Assuming the same cohort of candidates were to take the newly assembled ATA forms, we would compare their between-year examination scores and pass/fail decisions

16 Impact Analysis for Classification Accuracy

17 Impact Analysis for Classification Accuracy Figure 2 plots the newly estimated ability θ values after equating (vertical axis) against their previous estimates (horizontal axis) for two select ATA forms In each plot, the points fall around a 45 reference line, indicating the newly equated ability estimates tend to be identical to their previously obtained values

18 Impact Analysis for Classification Accuracy Besides, in each scatterplot, almost completely overlapped with the 45 reference line is an ordinary least squares regression line with the equated ability estimates as the dependent variable and the previous estimates as the predictor Additional, convincing evidence supportive of the new, equated ability estimates from the ATA forms

19 Impact Analysis for Classification Accuracy

20 Impact Analysis for Classification Accuracy Table 1 provides a cross-tabulation of two sets of classification results from the same classification criterion for measuring ability Based on the actual data from first-time candidates in one recent administration cycle Based on the data simulated from the ATA forms when administered to the same group of candidates above

21 Impact Analysis for Classification Accuracy Depending on which form it is, the passing rate from ATA ranges from 91.52% to 92.33% across all ATA forms, highly comparable across forms Close to the actual passing rate of 92% Depending on which form it is, the failing rate from ATA ranges from 7.67% to 8.48% across all ATA forms

22 Impact Analysis for Classification Accuracy As for the sensitivity statistic, its estimate ranges from 97.25% to 98.03% across all ATA forms Definition: Proportion of truly qualified candidates who actually pass the examination As for the specificity statistic, its estimate ranges from 78.55% to 81.80% across all ATA forms Definition: Proportion of truly unqualified candidates who actually fail the examination

23 Conclusions The ATA approach is preferred over the manual assembly approach because More equivalent with a reduction in the variability among forms over the continuum of candidate ability More rigorous psychometrics and content properties Higher on the test information function along a major portion of the ability continuum More content constraints being factored into form assembly As accurate as traditional forms in terms of scoring and classifying candidates

24 Conclusions In actual form assembly, we go even further in an effort to keep our strong commitment to the public Numerous communications among the Test Development and the Psychometrics and Research Teams, and external Subject Matter Experts on both content and psychometrics issues To enhance form equivalency to the greatest possible extent Flexibility for adding other content and psychometrics constraints whenever needed Multiple stages of ATA where feedback from item and form review meetings can be factored into each stage

25 Conclusions This small scale study is based on rigorous mathematics optimization procedures implemented in an industry standard software package and has demonstrated the ATA as one of many ongoing innovations at NBOME A *demonstration* of the ATA work only Not to be viewed as reflecting the full process typically used in a real ATA project at NBOME

26 References Choe, E. M. & Denbleyker, J. (2014). Quality psychometrics of Common Block Assembly: Summary report. Chicago, IL: National Board of Osteopathic Medical Examiners (NBOME). Crocker, L. & Algina, J. (1986). Introduction to classical and modern test theory. Orlando, FL: Harcourt Brace Jovanovich, Inc. Kalinowski, K. (2015). COMAT form assembly instructions for Chicago, IL: National Board of Osteopathic Medical Examiners (NBOME). Lathrop, Q. N. (2015). cacirt: Classification Accuracy and Consistency under Item Response Theory. R package version Linacre, J. M. (2007) How to simulate Rasch data. Rasch Measurement Transactions, 21(3), Papadimitriou, C. H., & Steiglitz, K. (1998). Combinatorial optimization: Algorithms and complexity. Mineola, NY: Dover Publications. Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Danish Institute for Educational Research, Copenhagen. Reif, M. (2014). PP: Estimation of person parameters for the 1,2,3,4-PL model and the GPCM. R package version Rudner, L. M. (2001) Computing the expected proportions of misclassified examinees. Practical Assessment, Research & Evaluation, 7(14), 1 5. Rudner, L. M. (2005) Expected classification accuracy. Practical Assessment Research & Evaluation, 10(13), 1 4. Schrijver, A. (2003). Combinatorial optimization. NYC, NY: Springer. van der Linden, W. J. (2005). Linear models for optimal test design. NYC, NY: Springer. Woo, A., & Gorham, J. L. (2010). Understanding the impact of enemy items on test validity and measurement precision. Journal of Clear Exam Review, 21(1),

27 Feel Free to Follow-Up with Questions! If you have any remaining questions, please do not hesitate to contact Dr. Hao Song or Dr. Hongwei Patrick Yang, via or phone: for Dr. Hao Song: Phone number: extension 294 for Dr. Hongwei Patrick Yang: Phone number: extension 290

28 And, finally, on behalf of NBOME THANK YOU! 2013 NBOME

An Integer Programming Approach to Item Bank Design

An Integer Programming Approach to Item Bank Design An Integer Programming Approach to Item Bank Design Wim J. van der Linden and Bernard P. Veldkamp, University of Twente Lynda M. Reese, Law School Admission Council An integer programming approach to item

More information

Designing item pools to optimize the functioning of a computerized adaptive test

Designing item pools to optimize the functioning of a computerized adaptive test Psychological Test and Assessment Modeling, Volume 52, 2 (2), 27-4 Designing item pools to optimize the functioning of a computerized adaptive test Mark D. Reckase Abstract Computerized adaptive testing

More information

Audit - The process of conducting an evaluation of an entity's compliance with published standards. This is also referred to as a program audit.

Audit - The process of conducting an evaluation of an entity's compliance with published standards. This is also referred to as a program audit. Glossary 1 Accreditation - Accreditation is a voluntary process that an entity, such as a certification program, may elect to undergo. In accreditation a non-governmental agency grants recognition to the

More information

National Council for Strength & Fitness

National Council for Strength & Fitness 1 National Council for Strength & Fitness Certified Personal Trainer Examination Annual Exam Report January 1 to December 31, 2016 March 24 th 2017 Exam Statistical Report Copyright 2017 National Council

More information

What are the Steps in the Development of an Exam Program? 1

What are the Steps in the Development of an Exam Program? 1 What are the Steps in the Development of an Exam Program? 1 1. Establish the test purpose A good first step in the development of an exam program is to establish the test purpose. How will the test scores

More information

Computer Adaptive Testing and Multidimensional Computer Adaptive Testing

Computer Adaptive Testing and Multidimensional Computer Adaptive Testing Computer Adaptive Testing and Multidimensional Computer Adaptive Testing Lihua Yao Monterey, CA Lihua.Yao.civ@mail.mil Presented on January 23, 2015 Lisbon, Portugal The views expressed are those of the

More information

An Automatic Online Calibration Design in Adaptive Testing 1. Guido Makransky 2. Master Management International A/S and University of Twente

An Automatic Online Calibration Design in Adaptive Testing 1. Guido Makransky 2. Master Management International A/S and University of Twente Automatic Online Calibration1 An Automatic Online Calibration Design in Adaptive Testing 1 Guido Makransky 2 Master Management International A/S and University of Twente Cees. A. W. Glas University of

More information

A Strategy for Optimizing Item-Pool Management

A Strategy for Optimizing Item-Pool Management Journal of Educational Measurement Summer 2006, Vol. 43, No. 2, pp. 85 96 A Strategy for Optimizing Item-Pool Management Adelaide Ariel, Wim J. van der Linden, and Bernard P. Veldkamp University of Twente

More information

Psychometric Issues in Through Course Assessment

Psychometric Issues in Through Course Assessment Psychometric Issues in Through Course Assessment Jonathan Templin The University of Georgia Neal Kingston and Wenhao Wang University of Kansas Talk Overview Formative, Interim, and Summative Tests Examining

More information

SEE Evaluation Report September 1, 2017-August 31, 2018

SEE Evaluation Report September 1, 2017-August 31, 2018 SEE Evaluation Report September 1, 2017-August 31, 2018 Copyright 2019 by the National Board of Certification and Recertification for Nurse Anesthetists (NBCRNA). All Rights Reserved. Table of Contents

More information

A standardization approach to adjusting pretest item statistics. Shun-Wen Chang National Taiwan Normal University

A standardization approach to adjusting pretest item statistics. Shun-Wen Chang National Taiwan Normal University A standardization approach to adjusting pretest item statistics Shun-Wen Chang National Taiwan Normal University Bradley A. Hanson and Deborah J. Harris ACT, Inc. Paper presented at the annual meeting

More information

UK Clinical Aptitude Test (UKCAT) Consortium UKCAT Examination. Executive Summary Testing Interval: 1 July October 2016

UK Clinical Aptitude Test (UKCAT) Consortium UKCAT Examination. Executive Summary Testing Interval: 1 July October 2016 UK Clinical Aptitude Test (UKCAT) Consortium UKCAT Examination Executive Summary Testing Interval: 1 July 2016 4 October 2016 Prepared by: Pearson VUE 6 February 2017 Non-disclosure and Confidentiality

More information

Innovative Item Types Require Innovative Analysis

Innovative Item Types Require Innovative Analysis Innovative Item Types Require Innovative Analysis Nathan A. Thompson Assessment Systems Corporation Shungwon Ro, Larissa Smith Prometric Jo Santos American Health Information Management Association Paper

More information

An Introduction to Psychometrics. Sharon E. Osborn Popp, Ph.D. AADB Mid-Year Meeting April 23, 2017

An Introduction to Psychometrics. Sharon E. Osborn Popp, Ph.D. AADB Mid-Year Meeting April 23, 2017 An Introduction to Psychometrics Sharon E. Osborn Popp, Ph.D. AADB Mid-Year Meeting April 23, 2017 Overview A Little Measurement Theory Assessing Item/Task/Test Quality Selected-response & Performance

More information

Assembling a Computerized Adaptive Testing Item Pool as a Set of Linear Tests

Assembling a Computerized Adaptive Testing Item Pool as a Set of Linear Tests Journal of Educational and Behavioral Statistics Spring 2006, Vol. 31, No. 1, pp. 81 100 Assembling a Computerized Adaptive Testing Item Pool as a Set of Linear Tests Wim J. van der Linden Adelaide Ariel

More information

ESTIMATING TOTAL-TEST SCORES FROM PARTIAL SCORES IN A MATRIX SAMPLING DESIGN JANE SACHAR. The Rand Corporatlon

ESTIMATING TOTAL-TEST SCORES FROM PARTIAL SCORES IN A MATRIX SAMPLING DESIGN JANE SACHAR. The Rand Corporatlon EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT 1980,40 ESTIMATING TOTAL-TEST SCORES FROM PARTIAL SCORES IN A MATRIX SAMPLING DESIGN JANE SACHAR The Rand Corporatlon PATRICK SUPPES Institute for Mathematmal

More information

Understanding the Dimensionality and Reliability of the Cognitive Scales of the UK Clinical Aptitude test (UKCAT): Summary Version of the Report

Understanding the Dimensionality and Reliability of the Cognitive Scales of the UK Clinical Aptitude test (UKCAT): Summary Version of the Report Understanding the Dimensionality and Reliability of the Cognitive Scales of the UK Clinical Aptitude test (UKCAT): Summary Version of the Report Dr Paul A. Tiffin, Reader in Psychometric Epidemiology,

More information

Conjoint analysis based on Thurstone judgement comparison model in the optimization of banking products

Conjoint analysis based on Thurstone judgement comparison model in the optimization of banking products Conjoint analysis based on Thurstone judgement comparison model in the optimization of banking products Adam Sagan 1, Aneta Rybicka, Justyna Brzezińska 3 Abstract Conjoint measurement, as well as conjoint

More information

Evaluating the use of psychometrics

Evaluating the use of psychometrics Evaluating the use of psychometrics It is now readily accepted that the effective use of psychometric instruments can add value to an organisation. When used in selection, in conjunction with a competency

More information

STAAR-Like Quality Starts with Reliability

STAAR-Like Quality Starts with Reliability STAAR-Like Quality Starts with Reliability Quality Educational Research Our mission is to provide a comprehensive independent researchbased resource of easily accessible and interpretable data for policy

More information

Effects of Selected Multi-Stage Test Design Alternatives on Credentialing Examination Outcomes 1,2. April L. Zenisky and Ronald K.

Effects of Selected Multi-Stage Test Design Alternatives on Credentialing Examination Outcomes 1,2. April L. Zenisky and Ronald K. Effects of Selected Multi-Stage Test Design Alternatives on Credentialing Examination Outcomes 1,2 April L. Zenisky and Ronald K. Hambleton University of Massachusetts Amherst March 29, 2004 1 Paper presented

More information

The computer-adaptive multistage testing (ca-mst) has been developed as an

The computer-adaptive multistage testing (ca-mst) has been developed as an WANG, XINRUI, Ph.D. An Investigation on Computer-Adaptive Multistage Testing Panels for Multidimensional Assessment. (2013) Directed by Dr. Richard M Luecht. 89 pp. The computer-adaptive multistage testing

More information

Equating and Scaling for Examination Programs

Equating and Scaling for Examination Programs Equating and Scaling for Examination Programs The process of scaling is used to report scores from equated examinations. When an examination is administered with multiple forms like the NBCOT OTC Examination,

More information

Investigating Common-Item Screening Procedures in Developing a Vertical Scale

Investigating Common-Item Screening Procedures in Developing a Vertical Scale Investigating Common-Item Screening Procedures in Developing a Vertical Scale Annual Meeting of the National Council of Educational Measurement New Orleans, LA Marc Johnson Qing Yi April 011 COMMON-ITEM

More information

Test-Free Person Measurement with the Rasch Simple Logistic Model

Test-Free Person Measurement with the Rasch Simple Logistic Model Test-Free Person Measurement with the Rasch Simple Logistic Model Howard E. A. Tinsley Southern Illinois University at Carbondale René V. Dawis University of Minnesota This research investigated the use

More information

IBM Workforce Science. IBM Kenexa Ability Series Computerized Adaptive Tests (IKASCAT) Technical Manual

IBM Workforce Science. IBM Kenexa Ability Series Computerized Adaptive Tests (IKASCAT) Technical Manual IBM Workforce Science IBM Kenexa Ability Series Computerized Adaptive Tests (IKASCAT) Technical Manual Version 1.0.1 UK/Europe Release Date: October 2014 Copyright IBM Corporation 2014. All rights reserved.

More information

HOGAN BUSINESS REASONING INVENTORY

HOGAN BUSINESS REASONING INVENTORY HOGAN BUSINESS REASONING INVENTORY O V E R V I E W G U I D E HBRI Introduction The Hogan Business Reasoning Inventory (HBRI) evaluates a person s ability to solve problems and make business-related decisions

More information

Redesign of MCAS Tests Based on a Consideration of Information Functions 1,2. (Revised Version) Ronald K. Hambleton and Wendy Lam

Redesign of MCAS Tests Based on a Consideration of Information Functions 1,2. (Revised Version) Ronald K. Hambleton and Wendy Lam Redesign of MCAS Tests Based on a Consideration of Information Functions 1,2 (Revised Version) Ronald K. Hambleton and Wendy Lam University of Massachusetts Amherst January 9, 2009 1 Center for Educational

More information

Fusion Analytical Method Validation

Fusion Analytical Method Validation Fusion QbD Software System Fusion Analytical Method Validation The Only Software That Has It All! 100% aligned with FDA/ICH Quality by Design (QbD) guidances! Can be used for LC and Non-LC methods (e.g.

More information

HOGAN BUSINESS REASONING INVENTORY

HOGAN BUSINESS REASONING INVENTORY HOGAN BUSINESS REASONING INVENTORY O V E R V I E W G U I D E HBRI INTRODUCTION The Hogan Business Reasoning Inventory (HBRI) evaluates a person s ability to solve problems and make business-related decisions

More information

Conditional Item-Exposure Control in Adaptive Testing Using Item-Ineligibility Probabilities

Conditional Item-Exposure Control in Adaptive Testing Using Item-Ineligibility Probabilities Journal of Educational and Behavioral Statistics December 2007, Vol. 32, No. 4, pp. 398 418 DOI: 10.3102/1076998606298044 Ó 2007 AERA and ASA. http://jebs.aera.net Conditional Item-Exposure Control in

More information

Using a Performance Test Development & Validation Framework

Using a Performance Test Development & Validation Framework Using a Performance Test Development & Validation Framework James B. Olsen Russell W. Smith Cristina Goodwin Alpine Testing Solutions Presentation Overview Present a useful performance test development

More information

STATE OF THE ART ANALYTICS

STATE OF THE ART ANALYTICS STATE OF THE ART ANALYTICS Udo Sglavo Analytic Solutions Manager SAS Technology Practice 1 STANDARD REPORTS Answer the questions: What happened? When did it happen? Example: Monthly or quarterly financial

More information

9.7 Getting Schooled. A Solidify Understanding Task

9.7 Getting Schooled. A Solidify Understanding Task 35 9.7 Getting Schooled A Solidify Understanding Task In Getting More $, Leo and Araceli noticed a difference in men s and women s salaries. Araceli thought that it was unfair that women were paid less

More information

Test Development. and. Psychometric Services

Test Development. and. Psychometric Services Test Development and Psychometric Services Test Development Services Fair, valid, reliable, legally defensible: the definition of a successful high-stakes exam. Ensuring that level of excellence depends

More information

A Comparison of Item-Selection Methods for Adaptive Tests with Content Constraints

A Comparison of Item-Selection Methods for Adaptive Tests with Content Constraints Journal of Educational Measurement Fall 2005, Vol. 42, No. 3, pp. 283 302 A Comparison of Item-Selection Methods for Adaptive Tests with Content Constraints Wim J. van der Linden University of Twente In

More information

Reliability & Validity

Reliability & Validity Request for Proposal Reliability & Validity Nathan A. Thompson Ph.D. Whitepaper-September, 2013 6053 Hudson Road, Suite 345 St. Paul, MN 55125 USA P a g e 1 To begin a discussion of reliability and validity,

More information

Examination Report for Testing Year Board of Certification (BOC) Certification Examination for Athletic Trainers.

Examination Report for Testing Year Board of Certification (BOC) Certification Examination for Athletic Trainers. Examination Report for 2017-2018 Testing Year Board of Certification (BOC) Certification Examination for Athletic Trainers April 2018 INTRODUCTION The Board of Certification, Inc., (BOC) is a non-profit

More information

Solving Business Problems with Analytics

Solving Business Problems with Analytics Solving Business Problems with Analytics New York Chapter Meeting INFORMS New York, NY SAS Institute Inc. December 12, 2012 c 2010, SAS Institute Inc. All rights reserved. Outline 1 Customer Case Study:

More information

Validity and Reliability Issues in the Large-Scale Assessment of English Language Proficiency

Validity and Reliability Issues in the Large-Scale Assessment of English Language Proficiency Validity and Reliability Issues in the Large-Scale Assessment of English Language Proficiency The 5 th International Conference on ELT in China Beijing, China May 21, 2007 Richard J. Patz, Ph.D. CTB/McGraw-Hill

More information

Field Testing and Equating Designs for State Educational Assessments. Rob Kirkpatrick. Walter D. Way. Pearson

Field Testing and Equating Designs for State Educational Assessments. Rob Kirkpatrick. Walter D. Way. Pearson Field Testing and Equating Designs for State Educational Assessments Rob Kirkpatrick Walter D. Way Pearson Paper presented at the annual meeting of the American Educational Research Association, New York,

More information

Fusion Analytical Method Validation

Fusion Analytical Method Validation Fusion QbD Software Platform Fusion Analytical Method Validation The Only Software That Has It All! 100% aligned with FDA/ICH Quality by Design (QbD) guidances! Can be used for LC and Non-LC methods (e.g.

More information

A Production Problem

A Production Problem Session #2 Page 1 A Production Problem Weekly supply of raw materials: Large Bricks Small Bricks Products: Table Profit = $20/Table Chair Profit = $15/Chair Session #2 Page 2 Linear Programming Linear

More information

7 Statistical characteristics of the test

7 Statistical characteristics of the test 7 Statistical characteristics of the test Two key qualities of an exam are validity and reliability. Validity relates to the usefulness of a test for a purpose: does it enable well-founded inferences about

More information

The Effects of Model Misfit in Computerized Classification Test. Hong Jiao Florida State University

The Effects of Model Misfit in Computerized Classification Test. Hong Jiao Florida State University Model Misfit in CCT 1 The Effects of Model Misfit in Computerized Classification Test Hong Jiao Florida State University hjiao@usa.net Allen C. Lau Harcourt Educational Measurement allen_lau@harcourt.com

More information

Item Analysis of National Examination Council Senior School Certificate Examination Economics Objective Tests

Item Analysis of National Examination Council Senior School Certificate Examination Economics Objective Tests International Journal of Innovative Education Research 3 (4):23-30, Oct.-Dec. 2015 SEAHI PUBLICATIONS, 2015 www.seahipaj.org ISSN: 2354-2942 Item Analysis of National Examination Council Senior School

More information

Logistic Regression with Expert Intervention

Logistic Regression with Expert Intervention Smart Cities Symposium Prague 2016 1 Logistic Regression with Expert Intervention Pavla Pecherková and Ivan Nagy Abstract This paper deals with problem of analysis of traffic data. A traffic network has

More information

A Gradual Maximum Information Ratio Approach to Item Selection in Computerized Adaptive Testing. Kyung T. Han Graduate Management Admission Council

A Gradual Maximum Information Ratio Approach to Item Selection in Computerized Adaptive Testing. Kyung T. Han Graduate Management Admission Council A Gradual Maimum Information Ratio Approach to Item Selection in Computerized Adaptive Testing Kyung T. Han Graduate Management Admission Council Presented at the Item Selection Paper Session, June 2,

More information

Telecommunications Churn Analysis Using Cox Regression

Telecommunications Churn Analysis Using Cox Regression Telecommunications Churn Analysis Using Cox Regression Introduction As part of its efforts to increase customer loyalty and reduce churn, a telecommunications company is interested in modeling the "time

More information

SURVEY OF SOFTWARE FOR THE TEST QUALITY ANALYSIS. Varazdat Avetisyan

SURVEY OF SOFTWARE FOR THE TEST QUALITY ANALYSIS. Varazdat Avetisyan 82 SURVEY OF SOFTWARE FOR THE TEST QUALITY ANALYSIS Varazdat Avetisyan Abstract: A test method of checking and evaluating the knowledge is one of the most reliable and promising ways to increase educational

More information

Linear model to forecast sales from past data of Rossmann drug Store

Linear model to forecast sales from past data of Rossmann drug Store Abstract Linear model to forecast sales from past data of Rossmann drug Store Group id: G3 Recent years, the explosive growth in data results in the need to develop new tools to process data into knowledge

More information

Quadratic Regressions Group Acitivity 2 Business Project Week #4

Quadratic Regressions Group Acitivity 2 Business Project Week #4 Quadratic Regressions Group Acitivity 2 Business Project Week #4 In activity 1 we created a scatter plot on the calculator using a table of values that were given. Some of you were able to create a linear

More information

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics & Data Mining Modeling Using R Dr. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 02 Data Mining Process Welcome to the lecture 2 of

More information

Multiple Choice (#1-9). Circle the letter corresponding to the best answer.

Multiple Choice (#1-9). Circle the letter corresponding to the best answer. !! AP Statistics Ch. 3 Practice Test!! Name: Multiple Choice (#1-9). Circle the letter corresponding to the best answer. 1. In a statistics course, a linear regression equation was computed to predict

More information

Percentiles the precise definition page 1. Percentiles and textbook definitions confused or what?

Percentiles the precise definition page 1. Percentiles and textbook definitions confused or what? Percentiles the precise definition page 1 Percentiles and textbook definitions confused or what? Take the following definitions HyperStat Online: A percentile rank is the proportion of scores in a distribution

More information

After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and

After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and After completion of this unit you will be able to: Define data analytic and explain why it is important Outline the data analytic tools and techniques and explain them Now the difference between descriptive

More information

Creative Commons Attribution-NonCommercial-Share Alike License

Creative Commons Attribution-NonCommercial-Share Alike License Author: Brenda Gunderson, Ph.D., 2015 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution- NonCommercial-Share Alike 3.0 Unported License:

More information

ITEM RESPONSE THEORY FOR WEIGHTED SUMMED SCORES. Brian Dale Stucky

ITEM RESPONSE THEORY FOR WEIGHTED SUMMED SCORES. Brian Dale Stucky ITEM RESPONSE THEORY FOR WEIGHTED SUMMED SCORES Brian Dale Stucky A thesis submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment of the requirements for the

More information

Mastering Modern Psychological Testing Theory & Methods Cecil R. Reynolds Ronald B. Livingston First Edition

Mastering Modern Psychological Testing Theory & Methods Cecil R. Reynolds Ronald B. Livingston First Edition Mastering Modern Psychological Testing Theory & Methods Cecil R. Reynolds Ronald B. Livingston First Edition Pearson Education Limited Edinburgh Gate Harlow Essex CM20 2JE England and Associated Companies

More information

Disentangling Prognostic and Predictive Biomarkers Through Mutual Information

Disentangling Prognostic and Predictive Biomarkers Through Mutual Information Informatics for Health: Connected Citizen-Led Wellness and Population Health R. Randell et al. (Eds.) 2017 European Federation for Medical Informatics (EFMI) and IOS Press. This article is published online

More information

Worker Types: A New Approach to Human Capital Management

Worker Types: A New Approach to Human Capital Management Worker Types: A New Approach to Human Capital Management James Houran, President, 20 20 Skills Employee Assessment 20 20 SKILLS ASSESSMENT 372 Willis Ave. Mineola, NY 11501 +1 516.248.8828 (ph) +1 516.742.3059

More information

A Test Development Life Cycle Framework for Testing Program Planning

A Test Development Life Cycle Framework for Testing Program Planning A Test Development Life Cycle Framework for Testing Program Planning Pamela Ing Stemmer, Ph.D. February 29, 2016 How can an organization (test sponsor) successfully execute a testing program? Successfully

More information

proficiency that the entire response pattern provides, assuming that the model summarizes the data accurately (p. 169).

proficiency that the entire response pattern provides, assuming that the model summarizes the data accurately (p. 169). A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute

More information

Applying Tabu Search to Container Loading Problems

Applying Tabu Search to Container Loading Problems Applying Tabu Search to Container Loading Problems A. Bortfeldt and H. Gehring, FernUniversität Hagen Abstract: This paper presents a Tabu Search Algorithm (TSA) for container loading problems with a container

More information

Potential Impact of Item Parameter Drift Due to Practice and Curriculum Change on Item Calibration in Computerized Adaptive Testing

Potential Impact of Item Parameter Drift Due to Practice and Curriculum Change on Item Calibration in Computerized Adaptive Testing Potential Impact of Item Parameter Drift Due to Practice and Curriculum Change on Item Calibration in Computerized Adaptive Testing Kyung T. Han & Fanmin Guo GMAC Research Reports RR-11-02 January 1, 2011

More information

Operational Check of the 2010 FCAT 3 rd Grade Reading Equating Results

Operational Check of the 2010 FCAT 3 rd Grade Reading Equating Results Operational Check of the 2010 FCAT 3 rd Grade Reading Equating Results Prepared for the Florida Department of Education by: Andrew C. Dwyer, M.S. (Doctoral Student) Tzu-Yun Chin, M.S. (Doctoral Student)

More information

PASSPOINT SETTING FOR MULTIPLE CHOICE EXAMINATIONS

PASSPOINT SETTING FOR MULTIPLE CHOICE EXAMINATIONS WHITE WHITE PAPER PASSPOINT SETTING FOR MULTIPLE CHOICE EXAMINATIONS CPS HR Consulting 241 Lathrop Way Sacramento, CA 95815 t: 916.263.3600 f: 916.263.3520 www.cpshr.us INTRODUCTION An examination is a

More information

Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Band Battery

Glossary of Terms Ability Accommodation Adjusted validity/reliability coefficient Alternate forms Analysis of work Assessment Band Battery 1 1 1 0 1 0 1 0 1 Glossary of Terms Ability A defined domain of cognitive, perceptual, psychomotor, or physical functioning. Accommodation A change in the content, format, and/or administration of a selection

More information

A Statistical Comparison Of Accelerated Concrete Testing Methods

A Statistical Comparison Of Accelerated Concrete Testing Methods Journal of Applied Mathematics & Decision Sciences, 1(2), 89-1 (1997) Reprints available directly from the Editor. Printed in New Zealand. A Statistical Comparison Of Accelerated Concrete Testing Methods

More information

Mining for Gold gets easier and a lot more fun! By Ken Deal

Mining for Gold gets easier and a lot more fun! By Ken Deal Mining for Gold gets easier and a lot more fun! By Ken Deal Marketing researchers develop and use scales routinely. It seems to be a fairly common procedure when analyzing survey data to assume that a

More information

Rounding a method for estimating a number by increasing or retaining a specific place value digit according to specific rules and changing all

Rounding a method for estimating a number by increasing or retaining a specific place value digit according to specific rules and changing all Unit 1 This unit bundles student expectations that address whole number estimation and computational fluency and proficiency. According to the Texas Education Agency, mathematical process standards including

More information

Test Development: Ten Steps to a Valid and Reliable Certification Exam Linda A. Althouse, Ph.D., SAS, Cary, NC

Test Development: Ten Steps to a Valid and Reliable Certification Exam Linda A. Althouse, Ph.D., SAS, Cary, NC Test Development: Ten Steps to a Valid and Reliable Certification Exam Linda A. Althouse, Ph.D., SAS, Cary, NC ABSTRACT The intent of a certification program is to evaluate the knowledge and skills of

More information

QUANTITATIVE COMPARABILITY STUDY of the ICC INDEX and THE QUALITY OF LIFE DATA

QUANTITATIVE COMPARABILITY STUDY of the ICC INDEX and THE QUALITY OF LIFE DATA QUANTITATIVE COMPARABILITY STUDY of the ICC INDEX and THE QUALITY OF LIFE DATA Dr. Kseniya Rubicondo - November 2016 Table of Contents Introduction...p.3 Methodology. p.4 Analysis and Key Findings. p.5

More information

1. Contingency Table (Cross Tabulation Table)

1. Contingency Table (Cross Tabulation Table) II. Descriptive Statistics C. Bivariate Data In this section Contingency Table (Cross Tabulation Table) Box and Whisker Plot Line Graph Scatter Plot 1. Contingency Table (Cross Tabulation Table) Bivariate

More information

Technical Report: June 2018 CKE 1. Human Resources Professionals Association

Technical Report: June 2018 CKE 1. Human Resources Professionals Association Technical Report: June 2018 CKE 1 Human Resources Professionals Association 3 August 2018 Contents Executive Summary... 4 Administration... 5 Form Setting... 5 Testing Window... 6 Analysis... 7 Data Cleaning

More information

Diagnostic Online Math Assessment: Technical Document. Published by Let s Go Learn, Inc.

Diagnostic Online Math Assessment: Technical Document. Published by Let s Go Learn, Inc. Diagnostic Online Math Assessment: Technical Document Published by Let s Go Learn, Inc. Table of Contents Diagnostic Online Math Assessment Specifications... 3 ADAM: K-7 & DOMA: Basic Math Skills...3 DOMA:

More information

Setting Standards. John Norcini, Ph.D.

Setting Standards. John Norcini, Ph.D. Setting Standards John Norcini, Ph.D. jnorcini@faimer.org Overview Scores and standards Definitions and types Characteristics of a credible standard Who sets the standards, what are the characteristics

More information

ational ssessment ollaboration Annual Technical Report

ational ssessment ollaboration Annual Technical Report N ational ssessment ollaboration 2017 Annual Technical Report TABLE OF CONTENTS OVERVIEW... 4 1. EXAM DEVELOPMENT... 5 Blueprint and test specifications... 5 Exam format... 7 Exam content... 7 Content

More information

ational ssessment ollaboration Annual Technical Report

ational ssessment ollaboration Annual Technical Report N ational ssessment ollaboration 2016 Annual Technical Report TABLE OF CONTENTS OVERVIEW... 2 1. EXAM DEVELOPMENT... 6 Exam content... 6 Exam format... 7 Blueprint and test specifications... 7 Table 1:

More information

Linear Programming: Basic Concepts

Linear Programming: Basic Concepts Linear Programming: Basic Concepts Irwin/McGraw-Hill 1.١ The McGraw-Hill Companies, Inc., 2003 Introduction The management of any organization make Decision about how to allocate its resources to various

More information

The Examination for Professional Practice in Psychology: The Enhanced EPPP Frequently Asked Questions

The Examination for Professional Practice in Psychology: The Enhanced EPPP Frequently Asked Questions The Examination for Professional Practice in Psychology: The Enhanced EPPP Frequently Asked Questions What is the Enhanced EPPP? The Enhanced EPPP is the national psychology licensing examination that

More information

Analysis and Modelling of Flexible Manufacturing System

Analysis and Modelling of Flexible Manufacturing System Analysis and Modelling of Flexible Manufacturing System Swetapadma Mishra 1, Biswabihari Rath 2, Aravind Tripathy 3 1,2,3Gandhi Institute For Technology,Bhubaneswar, Odisha, India --------------------------------------------------------------------***----------------------------------------------------------------------

More information

Evolving Control for Micro Aerial Vehicles (MAVs)

Evolving Control for Micro Aerial Vehicles (MAVs) Evolving Control for Micro Aerial Vehicles (MAVs) M. Rhodes, G. Tener, and A. S. Wu Abstract This paper further explores the use of a genetic algorithm for the purposes of evolving the control systems

More information

Evolutionary Algorithms

Evolutionary Algorithms Evolutionary Algorithms Evolutionary Algorithms What is Evolutionary Algorithms (EAs)? Evolutionary algorithms are iterative and stochastic search methods that mimic the natural biological evolution and/or

More information

Copyright c 2009 Stanley B. Gershwin. All rights reserved. 2/30 People Philosophy Basic Issues Industry Needs Uncertainty, Variability, and Randomness

Copyright c 2009 Stanley B. Gershwin. All rights reserved. 2/30 People Philosophy Basic Issues Industry Needs Uncertainty, Variability, and Randomness Copyright c 2009 Stanley B. Gershwin. All rights reserved. 1/30 Uncertainty, Variability, Randomness, and Manufacturing Systems Engineering Stanley B. Gershwin gershwin@mit.edu http://web.mit.edu/manuf-sys

More information

Advanced analytics at your hands

Advanced analytics at your hands 2.4 Advanced analytics at your hands Today, most organizations are stuck at lower-value descriptive analytics. But more sophisticated analysis can bring great business value. TARGET APPLICATIONS Business

More information

Score Reporting: More Than Just Pass/Fail. Susan Davis-Becker, Alpine Testing Solutions Sheila Mauldin, NCCPA Debbra Hecker, NAWCO

Score Reporting: More Than Just Pass/Fail. Susan Davis-Becker, Alpine Testing Solutions Sheila Mauldin, NCCPA Debbra Hecker, NAWCO Score Reporting: More Than Just Pass/Fail Susan Davis-Becker, Alpine Testing Solutions Sheila Mauldin, NCCPA Debbra Hecker, NAWCO Overview Review professional standards Provide guidance on considerations

More information

Assessing first- and second-order equity for the common-item nonequivalent groups design using multidimensional IRT

Assessing first- and second-order equity for the common-item nonequivalent groups design using multidimensional IRT University of Iowa Iowa Research Online Theses and Dissertations Summer 2011 Assessing first- and second-order equity for the common-item nonequivalent groups design using multidimensional IRT Benjamin

More information

Assessing first- and second-order equity for the common-item nonequivalent groups design using multidimensional IRT

Assessing first- and second-order equity for the common-item nonequivalent groups design using multidimensional IRT University of Iowa Iowa Research Online Theses and Dissertations Summer 2011 Assessing first- and second-order equity for the common-item nonequivalent groups design using multidimensional IRT Benjamin

More information

Prescriptive Analytics for Facility Location: an AIMMS-based perspective

Prescriptive Analytics for Facility Location: an AIMMS-based perspective Prescriptive Analytics for Facility Location: an AIMMS-based perspective Dr. Ovidiu Listes Senior Consultant AIMMS Analytics and Optimization Outline Analytics for Facility Location AIMMS Analytics Platform

More information

(1960) had proposed similar procedures for the measurement of attitude. The present paper

(1960) had proposed similar procedures for the measurement of attitude. The present paper Rasch Analysis of the Central Life Interest Measure Neal Schmitt Michigan State University Rasch item analyses were conducted and estimates of item residuals correlated with various demographic or person

More information

Issues surrounding conversion of paperand-pencil to computerized testing

Issues surrounding conversion of paperand-pencil to computerized testing Issues surrounding conversion of paperand-pencil to computerized testing Industrial/Organizational Solutions, Inc. July 24, 2012 Presented at the 2012 conference of the International Personnel Assessment

More information

Linking Current and Future Score Scales for the AICPA Uniform CPA Exam i

Linking Current and Future Score Scales for the AICPA Uniform CPA Exam i Linking Current and Future Score Scales for the AICPA Uniform CPA Exam i Technical Report August 4, 2009 W0902 Wendy Lam University of Massachusetts Amherst Copyright 2007 by American Institute of Certified

More information

A Simulation-based Multi-level Redundancy Allocation for a Multi-level System

A Simulation-based Multi-level Redundancy Allocation for a Multi-level System International Journal of Performability Engineering Vol., No. 4, July 205, pp. 357-367. RAMS Consultants Printed in India A Simulation-based Multi-level Redundancy Allocation for a Multi-level System YOUNG

More information

Final Examination. Department of Computer Science and Engineering CSE 291 University of California, San Diego Spring Tuesday June 7, 2011

Final Examination. Department of Computer Science and Engineering CSE 291 University of California, San Diego Spring Tuesday June 7, 2011 Department of Computer Science and Engineering CSE 291 University of California, San Diego Spring 2011 Your name: Final Examination Tuesday June 7, 2011 Instructions: Answer each question in the space

More information

PSS E. High-Performance Transmission Planning Application for the Power Industry. Answers for energy.

PSS E. High-Performance Transmission Planning Application for the Power Industry. Answers for energy. PSS E High-Performance Transmission Planning Application for the Power Industry Answers for energy. PSS E architecture power flow, short circuit and dynamic simulation Siemens Power Technologies International

More information

3 Ways to Improve Your Targeted Marketing with Analytics

3 Ways to Improve Your Targeted Marketing with Analytics 3 Ways to Improve Your Targeted Marketing with Analytics Introduction Targeted marketing is a simple concept, but a key element in a marketing strategy. The goal is to identify the potential customers

More information

Core vs NYS Standards

Core vs NYS Standards Core vs NYS Standards Grade 5 Core NYS Operations and Algebraic Thinking -------------------------------------------------------------------------- 5.OA Write / Interpret Numerical Expressions Use ( ),

More information

Smarter Balanced Adaptive Item Selection Algorithm Design Report

Smarter Balanced Adaptive Item Selection Algorithm Design Report Smarter Balanced Adaptive Item Selection Algorithm Design Report Preview Release 16 May 2014 Jon Cohen and Larry Albright American Institutes for Research Produced for Smarter Balanced by the American

More information

Know Your Data (Chapter 2)

Know Your Data (Chapter 2) Let s Get Started! Know Your Data (Chapter 2) Now we each have a time series whose future values we are interested in forecasting. The next step is to become thoroughly familiar with the construction of

More information