Session 4 Case Study of Modern Approach to Lapse Rate Assumption - - PDF document

session 4 case study of modern approach to lapse rate
SMART_READER_LITE
LIVE PREVIEW

Session 4 Case Study of Modern Approach to Lapse Rate Assumption - - PDF document

SOA Predictive Analytics Seminar Malaysia 27 Aug. 2018 | Kuala Lumpur, Malaysia Session 4 Case Study of Modern Approach to Lapse Rate Assumption Wing Wong, FSA, MAAA Stanley Hsieh Case Study of Modern Approach to Lapse Rate Assumption


slide-1
SLIDE 1

SOA Predictive Analytics Seminar – Malaysia

27 Aug. 2018 | Kuala Lumpur, Malaysia

Session 4 Case Study of Modern Approach to Lapse Rate Assumption

Wing Wong, FSA, MAAA Stanley Hsieh

slide-2
SLIDE 2

Case Study of Modern Approach to Lapse Rate Assumption

WING WONG / STANLEY HSIEH

27 August, 2018

slide-3
SLIDE 3

Table of Contents

  • Why machine learning for lapse study?...................................3
  • Machine learning preparation………………………………..…14
  • Machine learning model…………………………..……………..21
  • Case study – analysis of outcome……………………………..30
  • Machine learning tool……………………………….…………...40
  • Q & A………………………………………………………………43

2

slide-4
SLIDE 4

Why machine learning for lapse study?

slide-5
SLIDE 5

What is Machine Learning?

  • Use statistic to give computer ability to learn
  • Let the algorithm do the job to improve the prediction

4

slide-6
SLIDE 6

What is Machine Learning?

5

Two m mai ain tas asks

Supervised learning

  • Learning a function with input and output
  • Labeled training data set is used to learn a function
  • This function can be used to map new examples

Unsupervised learning

  • Learning a function describing the structure of unlabeled data
slide-7
SLIDE 7

What is Machine Learning?

6

Two m mai ain tas asks

Regression

  • To predict

“continuous”

  • utcomes

Classification

  • To predict “discrete”

classes

Training set

  • For training machine learning

model

Validation set

  • For machine learning model

adjustment

Testing set

  • For prediction and testing

prediction power

slide-8
SLIDE 8

What Impacts Lapse Rate?

  • What are the attributes affecting lapse rate?
  • Only one attribute or more attributes?
  • Should it be really time dependent?
  • Different product types?
  • Sales channel or even sales office, sales person?
  • Social economic trends impact?
  • Other factors we don’t normally think of?

7

slide-9
SLIDE 9

Traditional Experience Study

  • Traditional way of lapse rate experience study usually contains

a few dimensions only:

  • Often times, the result by the above dimensions look volatile.

Should more dimensions be considered? What are those? How can we find them easier?

8

Sales channel Gender Product type Policy year Premium mode

slide-10
SLIDE 10

Business Impact by Lapse Rate

  • It is really, really hard to sell an insurance policy. Have we tried

upmost to prevent lapse?

9

slide-11
SLIDE 11

Business Impact by Lapse Rate

Profit and Loss

  • High volatility of lapse rate estimation may cause high volatility
  • f profit and loss, especially after the implementation of

IFRS17, significant difference of actual lapse realized and expected lapse becomes the source of profit and loss

Market influence

  • The ability to monitor and retain insurance policies may

influence the domination of market share and corporate reputation

Customer value

  • When high value policies are sold, preventing policies from

surrender is the key to keep customer value or company value

10

slide-12
SLIDE 12

Business Impact by Lapse Rate

Marketing strategy

  • When knowing the possible lapse behaviors resulting from

specific product types, sales behaviors, policyholders’ features, non-policyholders’ features, or other factors, insurance companies can have better position on making marketing strategy for policy sales

Product design

  • Lapse rate plays a key role when pricing a product and

determining the profitability of a product. Accurate estimation of lapse rate becomes important when implementing business plan

Risk management and ALM

  • Asset and liability management and risk capital management

heavily relies on the accuracy of cash flow projection. Hence, lapse rate prediction is extraordinarily crucial for the management decision

11

slide-13
SLIDE 13

Linking Machine Learning with Lapse Study

  • Supervised learning X Y
  • Binary classification problem:

Y = 1 for Surrender = 0 for Non-surrender

  • Combine policy related data with economic data to enrich data
  • Algorithm learns from information of data
  • Select an appropriate machine learning model

12

slide-14
SLIDE 14

Benefit of Machine Learning Approach

Higher prediction power More dimensions to determine lapse behaviors More automatic assumption making process Improve short term money management

13

slide-15
SLIDE 15

Machine learning preparation

slide-16
SLIDE 16

Project Flow

15

Problem Definition Data Investigation Modeling Analysis

slide-17
SLIDE 17

Data, Resource and Business Impact

  • Data availability
  • Cost of data purchase or collection
  • Privacy issue / legal issue
  • Data quality
  • Consistency over time regarding definitions
  • Mindful of “garbage in, garbage out”
  • Enough data counts
  • Enough variable (attribute) counts
  • Dealing with missing date – apply common methodologies
  • Investment in data infrastructure

16

slide-18
SLIDE 18

How to succeed?

  • Start from small and realistic goals, and build from the success

to make it bigger

  • Cooperate with subject matter experts
  • Understand the implementation needs of the model, such as

purpose, cost, time frame of each prediction, or resource supported

17

slide-19
SLIDE 19

Data Types & Variable Types

  • Independent Variable (X):
  • Policy Related Data:

premium balance, channel mode…etc

  • Economic Index:

GDP, stock index, inflation, real-estate price…etc

  • Dependent Variable (Y):

Y = 1 for Surrender and Y = 0 for Non-Surrender

18

slide-20
SLIDE 20

Quality of Data & Data Collections

  • Source of Data: Internet? Agent?
  • Why do we have missing data?
  • There is no value in learning constant data
  • Some data is recorded recently so there is lack of historical data
  • Communication with data engineer for data cleaning
  • Actuarial Perspective is important for variables selection

19

slide-21
SLIDE 21

Data Cleaning Techniques & Transformation

  • Select a threshold for excluding variable with too many missing

data

  • Mean Imputation – by filling data mean to missing observations
  • We can use feature engineering to create variables
  • Categorical variable has to be transformed into factors

20

slide-22
SLIDE 22

Machine learning model

slide-23
SLIDE 23

Machine learning – Model

Generalized Linear Model Decision Tree Random Forest Gradient Boosting Machine

22

slide-24
SLIDE 24

Generalized Linear Model

  • Result can be interpreted by coefficients of variables
  • Link Function and Distribution – logit and binomial for binary

classification

  • Classical Way – By using statistical test for model significance
  • Machine Learning Way – By feeding more variables for

prediction power

  • Regularization: To control overfitting of GLM
  • Regularization tool: Ridge (L2-norm) vs Lasso (L1-norm)
  • LASSO is widely more popular due to its penalty character

23

slide-25
SLIDE 25

24

slide-26
SLIDE 26

Decision Tree

  • Decision boundary is drawn to capture non-linear trend
  • Key idea of algorithm: recursive binary splitting
  • Measure impurity of node by Gini Index

25

Policy = 200 Y=90 N=110 Policy = 120 Y = 70 N=50 Policy = 80 Y = 20 N =60

Algorithm goes through the variables to find the variable that has lower Gini index as this variable classifies lapse behavior more distinguishably.

slide-27
SLIDE 27

26

slide-28
SLIDE 28

Random Forest

  • Start from idea of bagging – resampling and bootstrapping
  • Searches for the best feature among a random subset of

features – to de-correlate the trees

  • Trees can be implemented by parallel computation

27

slide-29
SLIDE 29

28

slide-30
SLIDE 30

Gradient Boosting Machine (GBM)

  • G(x) = F(X) + h(x) + ……
  • F(X) = weaker learner
  • Residuals = y – F(X)
  • Residuals is trained in the direction of gradient descent
  • Add the trained residuals to weaker learner then repeat this

process

  • Train a “bad” tree first then train its residual to make it a better

tree

  • Generally, a powerful machine learning model

29

slide-31
SLIDE 31

Case study – analysis of outcome

slide-32
SLIDE 32

Outcome

  • Class Probability:

p0 = Non-surrender probability and p1 = Surrender Probability

  • Optimal Threshold – Threshold that optimally decide whether

each policy will surrender next quarter

31

Predict p0 p1 0.99 0.01 0.90 0.10 1 0.11 0.89 0.91 0.09 0.87 0.13 0.88 0.12 1 0.12 0.92

slide-33
SLIDE 33

Metrics

  • To evaluate performance of model
  • To prevent overfitting
  • MSE (Mean Square Error):

It can be used to evaluate numeric prediction like stock price prediction

  • AUC (Area under Curve):

This is what we used for the case study which is a classification problem.

32

slide-34
SLIDE 34

AUC (Area under Curve)

33

AUC = 0.95

  • AUC stands for Area under the ROC (Return of Characteristics) Curve
  • Points on ROC is the False Positive Rate and True Positive Rate at certain threshold
slide-35
SLIDE 35

Hyper-Parameter Tuning

  • Maximum Variables Allows in a GLM :

Tradeoff between model explanation and model prediction

  • Depth of Tree:

Is deeper the tree better the model?

  • Number of Trees in a Forest:

Is more trees in a forest better the model?

  • Number of Sequential Estimators for GBM:

How many time should we repeat sequential training?

  • Grid Search vs Random Search:

A tradeoff between efficiency and accuracy

34

slide-36
SLIDE 36

AE Ratio

  • Gives some sense of model performance in one dimensional space
  • However, machine learning model should capture all dimensions’ performance

35

It is not easy to tell which method is better here as models are compared in one- dimensional space

slide-37
SLIDE 37

Two-Way Lift Chart

  • Vertical axis is A/E ratio and horizontal axis is the ratio of machine learning model prediction to

experience study prediction

  • AE Ratio approach but capture more dimensions
  • Better model is determined by whether the line is close to 1 throughout the range of horizontal axis

36

ML shows better result here as the chart consider

  • verall dimensions
slide-38
SLIDE 38

Variable Importance Chart

37

Variables

  • Variable Importance: Calculate the relative

influence of the variable in a machine learning model

  • It can be used to look at the variable that

has higher influence on classifying surrender policy

  • We can find the variable that is not

considered by traditional experience study

slide-39
SLIDE 39

More on the Case Study

38

Method AUC on Testing Data Performance Increase by Experience Study 0.70 N/A GLM 0.81 16% Decision Tree 0.80 12% Random Forest 0.87 21% GBM 0.95 29%

  • Adopt GLM for model explanation while it has shown reasonable prediction power
  • Random forest and GBM has shown better prediction power than decision tree
  • Adopt GBM for most accurate prediction as it has the best prediction power
slide-40
SLIDE 40

Some “Learnings” from the Study

  • Machine learning suggested new dimensions not commonly

looked at before in traditional experience study:

  • Machine learning can derive a lapse function for each policy,

which can be used for:

  • Lapse simulation
  • Value of customer calculation
  • Continuously monitoring lapse behavior with up-to-date data

and updated model

39

Consider the top 3 important variables Amount of policy an agent sells affect lapse rate? Re-examine the important variables for agents management

slide-41
SLIDE 41

Machine learning tool

slide-42
SLIDE 42

41

They are all Open-Source

slide-43
SLIDE 43

Machine Learning – Tool

Distributed data storage to store and distribute big data

Hadoop

Data processor for data cleaning process

Spark

Package for machine learning with big data

H2o.ai

Object-oriented programming language that implement Spark and H2o.ai

Python

Web-based computing interface for modelling and visualization

Jupyter Notebook

42

slide-44
SLIDE 44

Q & A

slide-45
SLIDE 45