EVALUATION OF PROBABILITY OF MARKET STRATEGY PROFITABILITY USING PCA METHOD

Similar documents
Variable Selection Methods for Multivariate Process Monitoring

Comparison of Different Independent Component Analysis Algorithms for Sales Forecasting

Contributed to AIChE 2006 Annual Meeting, San Francisco, CA. With the rapid growth of biotechnology and the PAT (Process Analytical Technology)

Modeling of competition in revenue management Petr Fiala 1

2 Maria Carolina Monard and Gustavo E. A. P. A. Batista

Testing the Dinosaur Hypothesis under Empirical Datasets

How to estimate the model of sustainable profit and corporate social responsibility

Copyright. Daifeng Wang

MANAGEMENT OF EMISSIONS PERMITS: THREATS OF THE EUROPEAN EMISSIONS TRADING SYSTEM

Churn Prediction Model Using Linear Discriminant Analysis (LDA)

M. Han, Int. J. Sus. Dev. Plann. Vol. 10, No. 1 (2015)

MdAIR Spring Institute. April 28, 2006

Demand Forecasting of New Products Using Attribute Analysis. Marina Kang

Perception of the Value of Library Services Provided by Municipal Library of Prague

Using Previous Knowledge for Stock Market Prediction Based on Fundamentalist Analysis with Fuzzy-Neural Networks

On utility of temporal embeddings for skill matching. Manisha Verma, PhD student, UCL Nathan Francis, NJFSearch

DATA PREPROCESSING METHOD FOR COST ESTIMATION OF BUILDING PROJECTS

Runs of Homozygosity Analysis Tutorial

Multivariate Methods for Evaluating Building Energy Efficiency

Reconsidering landfill monitoring with the aid of multivariate data analysis

Level versus Variability Trade-offs in Wind and Solar Generation Investments: The Case of California

Telematics System dedicated for Provisioning of On-Line Information about Predicted Occupancy of Highway Parking Areas

TRANSFORMATION OF HR ORGANIZATIONAL STRUCTURES IN CZECH ORGANIZATIONS

Uncovering the Small Community Structure in Large Networks: A Local Spectral Approach

) ln (GDP t /Hours t ) ln (GDP deflator t ) capacity utilization t ln (Hours t

Getting Started with OptQuest

2016 INFORMS International The Analytics Tool Kit: A Case Study with JMP Pro

Application the Queuing Theory in the Warehouse Optimization Jaroslav Masek, Juraj Camaj, Eva Nedeliakova

Game Theory Approach to Solve Economic Dispatch Problem

Accuracy in Powder Diffraction IV NIST, Gaithersburg MD, USA April 22-25, Uwe König Thomas Degen PANalytical B.V. Almelo, The Netherlands

FINANCIAL DISTRESS CRITERIA DEFINED BY CLUSTERING OF LONGITUDINAL DATA

Elias Pereira A QUARTERLY COINCIDENT INDICATOR FOR THE CAPE VERDEAN ECONOMY

Risk Management User Guide

Exploring Similarities of Conserved Domains/Motifs

Population differentiation analysis of 54,734 European Americans reveals independent evolution of ADH1B gene in Europe and East Asia

Study on Talent Introduction Strategies in Zhejiang University of Finance and Economics Based on Data Mining

Mathematical modeling - food price analysis

Near-Balanced Incomplete Block Designs with An Application to Poster Competitions

Package FSTpackage. June 27, 2017

A Beginner s Guide to Factor Analysis: Focusing on Exploratory Factor Analysis

KRONOS ESP SYSTEM MANAGER SELF SERVICE QUICK REFERENCE. Manager Home Page. Sample Manager Home Page

MMAP Genomic Matrix Calculations

Analysis of User Behavior in a Public Library using Bibliomining

Overview of Statistics used in QbD Throughout the Product Lifecycle

Integrating Market and Credit Risk Measures using SAS Risk Dimensions software

Supply Chain MICROSOFT BUSINESS SOLUTIONS DEMAND PLANNER

MODELING OF TRANSPORT DEMAND IN RELATION TO COMPETITION ON TRANSPORT MARKET

COMPARISON OF LOGISTIC REGRESSION MODEL AND MARS CLASSIFICATION RESULTS ON BINARY RESPONSE FOR TEKNISI AHLI BBPLK SERANG TRAINING GRADUATES STATUS

A LONG - TERM CONTRACTS PORTFOLIO MANAGEMENT CONFERENCE THE EUROPEAN ELECTRICITY MARKET SEPTEMBER 20-22, 2004, LODZ, POLAND

The Role of Start-Ups in Structural Transformation

CROSS-DOCKING: SCHEDULING OF INCOMING AND OUTGOING SEMI TRAILERS

ERP Reference models and module implementation

Global Reliability Indices Calculation Using Monte Carlo Simulation

Better ACF and PACF plots, but no optimal linear prediction

Tutorial Segmentation and Classification

Strategic and Financial Risk Model for Small and Medium Enterprises

Modelling of Water Pollution in Urban Areas with GIS and Multivariate Statistical Methods

Survey of Behavioral Segmentation Methods

The Accounting Methods of the Local Government Department Output by Factor Analysis

Reliability demonstration for complex redundant systems in railway applications

COMP6053 lecture: Principal components analysis.

ANNEX 5: PRINCIPAL COMPONENTS ANALYSIS AND THE WELLBEING INDEX

An Approach for Damage Identification and Optimal Sensor Placement in Structural Health Monitoring by Genetic Algorithm Technique

Getting Started with HLM 5. For Windows

QUESTION 2 What conclusion is most correct about the Experimental Design shown here with the response in the far right column?

Managing risks in a multi-tier supply chain

INVENTORY MANAGEMENT BASED ON ABC ANALYSIS AS A PART OF LOGISTICS MANAGEMENT IN FIRM *

Principal component and factor analytic models in international sire evaluation

PRICING ADMINISTRATION GUIDE

Tutorial Segmentation and Classification

MANUFACTURING PROCESS MANAGEMENT USING A FLEXIBLE MODELING AND SIMULATION APPROACH. Duilio Curcio Francesco Longo Giovanni Mirabelli

ANALYSIS OF THE FACTORS OF LOCALIZATION OF THE COMPANY DEVELOPMENT USING MCDM METHODS IN PRACTICE

OVERCOMING THE UNCERTAINTY IN THE DU-PONT GRAPH OF PROFITABILITY

Archive of SID. The Law of One Price and the Cointegration of Meat Price in the Global Market: the Case of Iran s Market. Abstract

Effects of MIG process parameters on the geometry and dilution of the bead in the automatic surfacing

An Assessment of Deforestation Models for Reducing Emissions from Deforestation & forest Degradation (REDD)

PRACTICAL EXPERIENCE OF THE EFQM MODEL IMPLEMENTATION IN THE CONDITIONS OF PUBLIC UNIVERSITY.

Predicting Corporate Influence Cascades In Health Care Communities

Oracle. SCM Cloud Administering Pricing. Release 13 (update 17D)

Evaluation next steps Lift and Costs

Latent Semantic Analysis for large document sets.

Hyperspectral Subspace Identification and Endmember Extraction by Integration of Spatial-Spectral Information

Data Mining in CRM THE CRM STRATEGY

TDWI strives to provide course books that are contentrich and that serve as useful reference documents after a class has ended.

Analysis of Radio Field Requirement for RFID Systems Use in Warehouses

Introduction to Information Systems Fifth Edition

Linear Cost, Revenue, Profit, Supply, and Demand

CUSTOMER PROFITABILITY DOCUMENTATION UPDATES

Prediction of Heart Disease Using Cleveland Dataset: A Machine Learning Approach

IFRS VERSUS CZ GAAP: INFLUENCE OF CONSTRUCTION CONTRACTS ON FINANCIAL INDICATORS

Optimization of the Maintenance Process Using Genetic Algorithms

Selection of optimal reference genes for normalization in quantitative RT-PCR

APPLICATION OF TIME-SERIES DEMAND FORECASTING MODELS WITH SEASONALITY AND TREND COMPONENTS FOR INDUSTRIAL PRODUCTS

Stock data models analysis based on window mechanism

Data Processing of Thermal Power Plants Based on Dynamic Data Reconciliation

A Production Problem

Time Series Models for Business and Economic Forecasting

Classification of Bank Customers for Granting Banking Facility Using Fuzzy Expert System Based on Rules Extracted from the Banking Data

Alternative Methods for Business Process Planning

Chapter 1 INTRODUCTION TO SIMULATION

Transcription:

EVALUATION OF PROBABILITY OF MARKET STRATEGY PROFITABILITY USING PCA METHOD Pavla Kotatkova Stranska Jana Heckenbergerova Abstract Principal component analysis (PCA) is a commonly used and very powerful method for simplification of multivariate data. In the presented contribution, it is, however, utilized in a novel way. The methodology described in the paper provides tables of probabilities of market strategy profitability. Values of the first two principal components are utilized in evaluation of conditional probability. The resulting probability tables can help to detect buying signals for profitable trade opportunities or can provide a warning against a high financial loss. A time period of two weeks before an actual buying signal is analyzed. After one week of holding, the share is sold and the resultant relative price differences determine the group of profitability (high profit, profit, loss, high loss). For each group, tables of occurrences and conditional probabilities are evaluated using the PCA method. The whole methodology is tested on a case study that takes in account one year long dataset (1 st Aug 2014 14 th Aug 2015) of L-3 Communications Holding shares. Key words: profitability, conditional probability, principal component analysis JEL Code: C38, G11, G17 Introduction Analysis of the financial market is very large topic and one needs a lot of knowledge and experience for its full understanding you. Most of the investors, however, do not attempt to fully understand all the analyses and they prefer to use one or two methods that are considered most effective (Lo et al., 2000), (Neely et al., 1997), (Byun et al., 2015). Each method is based on a different approach and has various prerequisites for successful investing. Nevertheless, all methods have one thing in common; they are trying to understand the current situation in the market and to predict whether the prices or shares on financial market will grow or decline. According to Bauer and Dahlquist (1998), the main motto of the majority of trades is "cheap 752

acquisition of a financial asset or his precious sales". Based on scientific studies Lee et al. (2017), Lynch et al. (2000), Lo et al. (2000) and Neely et al. (1997), all kinds of statistical methods are applied nowadays to build reliable prediction model, e.g. principal component analysis (PCA). So we can conclude that prediction of prices future development is a constantly evolving area. PCA is a popular method in the area of financial analyses; in particular it is used to define and to quantify the factors that influence changes in the prices of securities or shares (Gemperline, 2006). Utilization of PCA for evaluation of profit probability is, however, novelty in this research field. The prerequisite is classification of profit levels for selected trading strategy so each trading day, consequently, belong to one profit class. Based on data precedent selected trading day, principal components are evaluated. As a result from whole dataset, we are getting conditional probabilities of profit class occurrences based on principal component values. Proposed methodology is similar to method for prediction of wind ramp events, which was presented and validated in (Heckenbergerova, 2014), (Heckenbergerova, 2015) and (Heckenbergerova, 2016). This paper consist five main sections. After introduction, principal component analysis is briefly described. Second section summarizes methodology that is used in case study presented in the section 3. Major conclusions with direction of future work are provided in the last section. 1 Principal Component Analysis The basic idea of principal component analysis (PCA) is the reduction of the number of variables by using the so-called the principal components. Each principal component is formed as linear combination of the original variables and it is characterized by covariance of all the variables. Therefore most significant principal components can summarize knowledge about the default variables while preserving all the original information (Lo et al., 2000), (Neely et al., 1997), (Gradojevic and Lento, 2015). According to Rencher (2002), PCA is based on a covariance matrix of the p variables Xj. From its eigenvalues and eigenvectors, new variables Zj (principal components) are derived that are uncorrelated and sorted according to decreasing variance (eigenvalues). 753

Let us assume that all data from selected dataset are forming N-dimensional random vector Y ( Y, Y2,, Y 1 N X is then in form: ) and moreover let us consider the time frame of length n. Observation matrix Y1 Y2 Yn Y2 Y3 Yn 1 X, (1) YN n YN n1 YN where each row corresponds to one time frame of simulation and each column is a random vector X j of given time index. In the next step eigenvalues and eigenvectors of covariance matrix var X are evaluated using singular value decomposition: var X SVD (2) where S is an orthogonal (n n)-matrix with columns of singular vectors, V is a diagonal (n n)-matrix corresponding to eigenvalues of X and columns of matrix XD are the principal components of matrix X (Wall et.al, 2003). 2 Methodology Methodology of evaluation of conditional probability tables can be summarized in following steps: 1) Dataset (time series of share prices) selection; 2) Market strategy selection (holding period); 3) Sorting of trading days based on profitability (creation of profitability classes); 4) Creation of observation matrix X with selected length of time frame prior trade; 5) Evaluation of principal components; 6) Division of principal components ranges to create support rectangles and table axes; 7) Evaluation of occurrence frequencies on each support rectangle from whole dataset and from each profitability class; 8) Evaluation of conditional probability for each support rectangle and creation of conditional probability tables. Resulting tables show us conditional probability of the future trade profitability based on actual principal component values. 754

3 Case study 1.1 Data and Market Strategy Let us take in account the one year long dataset of L-3 Communications Holding shares (1 st Aug 2014 14 th Aug 2015). Our market strategy is selling the share after one week of holding (5 business days). For each trading day relative profitability (RP) is evaluated using formula: ( Psell P RP P buy buy ), (3) where Pbuy is buying price for selected trading day and Psell is share price on selling day. Based on these values, data are divided into four groups: HP (high profit) for profitability greater than 0.5; P (profit) for RP values between 0 and 0.5; L (loss) for negative profitability between -0.5 and 0; and HL (high loss) for profitability below -0.5 threshold. Table 1 summarizes the size of profitability classes and Figure 1 shows time series of L-3 Communications Holding shares with high profit and high loss trades highlighted. All figures are generated in Matlab using our own scripts. Tab. 1: Profitability classes Type of class Size of class whole dataset 262 HP (high profit) 43 P (profit) 84 L (loss) 106 HL (high loss) 23 755

Fig. 1: Share prices of L-3 Communications Holding from 1 st Aug 2014 till 14 th Aug 2015 augmented with high profitable (triangle up) and high loss (triangle down) trades 1.2 Principal Component Analysis Let us assume that all data from selected dataset are forming N-dimensional random vector Y Y, Y,, Y ) and moreover let us consider the time frame of length n 10, corresponding to ( 1 2 N two weeks horizon before buying signal (10 business days). Observation matrix X, based on (1), has then 10 columns and 252 rows. In the next step eigenvalues and eigenvectors of covariance matrix var X are evaluated using singular value decomposition. As a result we are getting principal components of observation matrix X. First three principal components (PCA1, PCA2 and PCA3) are illustrated in Figure 2, where color and shape of triangles signalize different profitability classes (green-hp, yellow- P, magenta-l, red-hl). Components PCA1 and PCA2 can explain together 39% respectively 52% of the total variability of data, therefore they are chosen as significant ones. 756

Fig. 2: Three principal components for L-3 Communications Holding dataset 1.3 2D Tables of Trade Profit Conditional Probabilities Let us divide the PCA axes into 12 parts so that the whole graph area is covered by 2D blocks (rectangles). Frequency tables for HP, P, L and HL events are created by the number of occurrences in the corresponding 2D block. They are captured in following figure 3. Fig. 3: Frequencies of profit classes occurrence on selected 2D blocks. 757

Conditional probability value in selected rectangle for any profitability class is defined as fraction of profitability class frequency and all data frequency in that block. Figure 4 shows column graphs of evaluated conditional probabilities. We can see that for high value of first principal components PCA1, there is significant probability (close to one) that buying now and selling in a week will generate high loss. We can also conclude, that low values of PCA1 (less than 30) shows very low risk of high loss. Fig. 3: Conditional probabilities of trade profitability given by the 2D block of principal components 758

Conclusion This paper presents a principal component analysis of share prices time series to evaluate probability of selected market strategy profit. Illustrated methodology uses share prices of L-3 Communications Holding from 1.August 2014 till 14.August 2015. Market strategy is selling the share after one week of holding. Two weeks horizon before actual buying signal has been analyzed to see whether trade profitability is predictable. Principal components, frequencies of occurrence and conditional probabilities of trade profitability have been evaluated using described methodology. Gained results show that low values of first principal component are not in favor with trade with high loss. In contrary, high values of PCA1 will result in high loss with high probability. Therefore the resulting probability tables can help to detect future buying signals for profitable trade opportunities or can provide a warning against a high financial loss. Our goal in our further research is validation of presented methodology. More and current datasets will be taken in account and validation will be performed on last quarter of selected databases. References BAUER, R. J., DAHLQUIST J. R. (1998). Technical Market Indicators: Analysis and Performance. Wiley Trading. ISBN 978-0-471-19721-8. BYUN, H. W., BAEK, S., AHN, J.J., OH, K. J., KIM, T.Y. (2015). Using a principal component analysis for multi-currencies trading in the foreign Exchange market. Ingelligent Data Analysis. 19(3), pp. 683-697. DOI 10.3233/IDA-150738. HECKENBERGEROVA J., MUSILEK P., JANATA M., WANG J. (2015). Analysis of Wind Ramp Events Using PCA of 3D Tables of their Conditional Probabilities. 16th International 759

Scientific Conference on Electric Power Engineering-EPE 2015. Kouty nad Desnou, pp. 63-66. ISBN 978-146736788-2. GRADOJEVIC, N., LENTO, C. (2015). Multiscale analysis of foreign Exchange order flows and technical trading profitability. Economic Modelling. 47(1), pp. 156-165. ISSN 0264-9993. DOI 10.1016/j.econmod.2015.02.028. HECKENBERGEROVA J., MUSILEK P., MAREK J., RODWAY J. (2014). Principal Component Analysis for Evaluation of Wind Ramp Event Probability. Electrical Power and Energy Conference-(EPEC 2014). Canada: Calgary, pp 208 212. ISBN 978-147996038-5. HECKENBERGEROVA, J., MUSILEK, P., JANATA, M. (2016). Sensitivity Analysis of PCA method for Wind Ramp event Detection. 16th International Conference on Environment and Electrical Engineering (EEEIC).Italy: Florence. doi:10.1109/eeeic.2016.7555892. LEE, S. ENKE, D., KIM, Y. (2017). A relative value trading system based on a correlation and rough set analysis for the foreign exchange futures market. Engineering Applications of Artificial Inteligence. 61(1). ISSN 0952-1976. DOI 10.1016/j.engappai.2017.02.014 LO, A. W., MAMAYSKY, H., WANG, J. (2000). Foundation of Technical Analysis: computational algorithms, statistical inference, and empirical implementation. National Bureau of Economic Research. LYNCH, D. F., KELLER, S. B., OZMENT, J. (2000). The effects of logistics Capabilities and Strategy on firm performance. Journal of Business Logistics. pp. 47-68. NEELY, CH., WELLER, P., DITTMAR, R. (1997). Is Technical Analysis in the Foreign Exchange Market Profitable?: A Genetic Programming Approach. Journal of Financial and Quantitative Analysis. 4(32), pp. 405-426. RENCHER, A. C. (2002). Methods of Multivariate Analysis. New York: John Wiley & Sons, Inc. ISBN 0-471-41889-7. DOI: 10.1002/0471271357.ch12 GEMPERLINE, P. J. (2006). Practical Guide To Chemometrics, New York: CRC Press. ISBN 978-1-57444-783-5. DOI 10.1201/9781420018301.ch4 WALL, M., RECHTSTEINER, A., ROCHA, L. (2003). Singular value decomposition and Principal component analysis. Kluwer: Norwell, MA. Contact Pavla Kotatkova Stranska, Ing. Ph.D. Masaryk Institute of Advanced Studies, MIAS School of Business Department of Economic studies Czech Technical University in Prague Kolejní 2637/2A, 160 00 Praha 6 e-mail: pavla.kotatkova.stranska@cvut.cz Jana Heckenbergerova, Mgr. Ph.D. Faculty of Economics and Adminitration, Institute of Mathematics and Quantitative Methods, University of Pardubice, Studentska 95, 532 10 Pardubice, Czech Republic email: jana.heckenbergerova@upce.cz 760