这是用户在 2024-10-20 22:15 为 https://app.immersivetranslate.com/word/ 保存的双语快照页面,由 沉浸式翻译 提供双语支持。了解如何保存?

Content I

基于ahp总结的风险损失模型

随着极端天气和自然灾害的发生,财产保险变得越来越昂贵。但保险公司的盈利能力并没有同步提高。这导致了业主负担能力和保险公司盈利能力的危机。因此,财产保险的可持续性受到了影响。我们的目的是适当地定位财产保险。使保险公司能够承受未来的索赔,并继续以健康的方式增长。

对于问题1,为了确定是否在极端天气和自然灾害频繁的地区投保,我们首先对收集到的数据进行预处理,如自然灾害造成的风险、脆弱性和财产损失。然后,基于层次分析法的思想,我们构建了一个CRITIC-KNN风险水平模型,并将其与成熟的灾害模型相结合,以获得每种灾害的年损失数据。两者共同构成了风险损失模型,CRITIC目标熵权法(以2020年为例)获得的权重分别为37.42%、12.92%、18.47%、16.82%和14.37%。选取美国俄亥俄州和中国四川进行模型验证,发现两者的WRI指数分别为3.86和5.88。然而,考虑到两者的财产损失情况,不建议投保。

对于问题3,通过改进模型,我们将社区财产的文化和经济价值添加到不符合保险条件的社区中。为社区提供保护机制,以保护其财产。首先,通过CRITIC-KNN算法,我们根据指标权重5.68计算了泸定大桥的总加权得分。然后,我们添加了文化和经济价值影响因素。我们重新使用CRITIC为影响因素分配权重值,并根据指标权重计算总加权分数4.48。随着分数的下降,风险水平降低,表明需要现场保护。否则,建议在另一个位置保护它。原始模型的计算结果在WRI类别中为5.68,风险等级为高风险。在WRI类别和中等风险水平下,改进模型的计算结果为4.48。这一地标对该地区具有巨大的文化和经济价值,应就地保护。

内容

1介绍 1

1.1背景 1

1.2对该问题的重述 2

1.3我们的工作 2

2模型准备 3

2.1.假设和正当理由 3

2.2符号 3

3、模型的建立及解决方案 4

3.1 Task1 4

3.1.1、数据分析和预处理:4

3.1.2模型建立:5

3.1.3模型求解 7

3.1.4对结果的分析与评价 10

3.2 Task2 11

3.2.1 模型建立: 12

3.2.2模型解决 14

3.2.3分析和结果评价 15

3.3 Task3 16

3.3.1 模型建立: 16

3.3.2模型解决 16

3.3.3分析和结果评价 18

3.4任务4:拯救泸定桥 18

3.4.2型号来源: 18

3.4.2模型解决 19

3.4.3分析和结果评价 19

4敏感性分析 20

5.对模型的评价 21

5.1优势 21

5.2弱点 21

6参考资料 21

7附录 1

Content

1 Introduction 1

1.1 Background1

1.2 Restatement of the Problem2

1.3 Our Work 2

2 Model Preparation3

2.1 Assumptions and Justification3

2.2 Notations3

3 Model Establishment and Solutions 4

3.1 Task1 4

3.1.1 Data analysis and preprocessing 4

3.1.2 Model Establishment: 5

3.1.3 Model Solving 7

3.1.4 Analysis and Evaluation of results 10

3.2 Task211

3.2.1 Model Establishment:12

3.2.2 Model Solving 14

3.2.3 Analysis and Evaluation of results 15

3.3 Task316

3.3.1 Model Establishment:16

3.3.2 Model Solving 16

3.3.3 Analysis and Evaluation of results 18

3.4 Task4Saving the Luding Bridge18

3.4.1 Model Source: 18

3.4.2 Model Solving 19

3.4.3 Analysis and Evaluation of results 19

4 Sensitivity analysis20

5 Evaluation of the model 21

5.1 Strengths 21

5.2 Weaknesses21

6 Reference21

7 Appendix1

Page 1 of 24

1 介绍

1.1 背景

近年来,极端天气和自然灾害频繁发生。它给人类社会和自然环境带来了巨大的影响。极端天气 和自然灾害不仅造成了大量的人员伤亡和财产损失,而且对当地的经济和社会发展产生了深远的 影响。它导致了保险财产成本的上升,这导致了保险公司的盈利能力和业主的财务能力的危机。 因此,有必要加强对各地区极端天气和自然灾害的研究和预测,以更好地应对风险和保护财产。

Figure 1:Global Disaster Map

从保险公司的角度来看,由于极端天气和自然灾害造成的财产损失,保险公司需要支付巨额索赔 。为了补偿索赔成本的增加,保险公司可能会提高保费,这会影响保险市场的稳定。由于索赔成 本和保费的增加,保险公司可能会影响其承保能力,减少了其在市场上的承保范围和数量。保险 公司应加强风险管理,提高其对灾难的适应能力,并与政府和其他组织更密切地合作。

从业主的角度来看,随着极端天气和自然灾害的频繁发生,业主可能会更加意识到风险管理和保 险的重要性。他们可以选择更全面的保险计划或采取其他风险管理措施。极端天气和自然灾害可 能使业主遭受财产损失或完全损失,从而增加业主的风险。业主还需要更加重视风险管理,并采 取适当措施保护其财产。

从被保险财产的角度来看,由于极端天气和自然灾害的频繁发生,对被保险财产的损害更加严重 。为了加强对被保险对象的保护,必须实现长期的保护。提倡开展气候相关政府培训,提高世界 人民意识,促进生态周期建设,实现可持续发展。

Page 5 of 24

1.2 Restatement of the Problem

Considering background information and restricted conditions identified in the problem statement, we need to solve the following problems.

Problem: 1:Extreme weather events and natural disasters increase the risk of property damage.Insurance companies need higher premiums to cover risks, and the risk of claims payouts increases.For sustainable development of the property insurance industry, develop a model for insurance companies to determine when they can and cannot underwrite.Demonstrate the model through two regions in different continents that have experienced extreme weather or natural disasters.

Problem: 2:The insurance environment is undergoing with the gradual increase in extreme weather events and natural disasters.Adjust your model to provide appropriate advice for future real estate decisions.Determine whether sites are suitable for development and how to build properties more resistant to disasters.

Problem 3:Develop a protection model to determine the level of protection for properties not recommended for insurance due to cultural and economic factors.

Problem 4:Choose a coordinate location and use the model you have developed to evaluate that location.Based on the evaluation results, write a proposal to the community about the future development plans, timeline, and development costs for the site.

Figure 2:The Overall Process Of The Problem

1.3 Our Work

首先,针对问题 1,为了确定在极端天气和自然灾害频繁发生的地区是否应该进行承保,我们建立了保险模型,对收集到的数据进行预处理,例如自然灾害造成的风险、脆弱性、财产损失等。通过 CRITIC-KNN 算法,为影响因素分配权重值。我们使用指标权重计算两个区域的总加权分数。最后,确定是否应为该站点投保。

其次,改进问题 1 中设计的模型。在选定的区域建造房地产并使其具有抗灾能力将改变脆弱性影响因素。然后,我们使用 ARIMA 时间序列模型来预测影响因子的未来值。最后,通过 CRITIC-KNN 算法,我们获取所选区域的未来加权分数。根据评估结果,向社区撰写关于该站点未来开发计划、时间表和开发成本的建议。

接下来,对于问题 3,继续改进模型。通过将社区内财产的文化和经济价值因素纳入,为不符合保险资格的社区提供保护机制。使用 CRITIC-KNN 算法,计算改变影响变量后的总加权得分,以确定如何保护社区中的财产。

2模型准备

2.1假设和正当理由

·假设1:将极端天气和自然灾害的发生时间限制在2011年至2020年期间。

论证1:限制时间的目的是为了简化问题,使模型更易于管理,并提高计算效率。极端天气和自然 灾害的时间范围太宽,导致了一些计算。过去距离太遥远的数据可能会对预测结果产生不利影响

·假设2:假设极端条件和自然灾害类型包括暴雨、干旱、地震、洪水、山体滑坡、火山爆发和 野火。

理由2:极端条件和自然灾害有多种形式。根据我们的数据处理,全球模型中包含的8个自然灾害 和极端天气事件都是发生频率较高的事件。使我们的模型更普遍地适用。

·假设3:经济正处于一种自然发展的状态

理由3:首先,我们认为国际贸易不受国际形势的影响,而国际形势对经济有重大影响。其次,我 们假设没有类似的经济危机情况,且经济的异常波动不利于该数据的应用,这将会导致较大的误 差。

2.2符号

symbol

meaning

Y t

使用中的时间序列

φ 1-φp(AR)

前一个值和过去的p时间点之间的关系

θ 1-θq (MA)

前一个值与q过去时间点的误差之间的关系

C

常数项

Yp

序列中每个阶样本的自相关函数。

εt

在时间点t处的误差项

3 模型的建立及解决方案

3.1 Task1

关于问题1,我们在联合国官方网站和各大洲和国家的政府门户网站上获得了2011年至2020年期间 五大洲约180个国家自然和气候灾害的频率、损失和伤亡的数据。通过这些数据,分析国家的综合 风险指数,对其风险进行分类,结合当地的财产损失,综合判断当地的风险和损失,构建风险损 失模型,帮助保险公司做出更好的商业决策。

1.13 数据分析和预处理:

首先,在本期中,我们主要被称为、处理和分析了三个数据集。

第一个数据集是关于全球灾害风险的年度技术报告,世界风险报告主要关注与灾害风险管理相关 的各种主题,以及用于评估世界上许多国家的极端自然事件灾害风险的指数。世界风险指数由某 一地区的极端自然灾害和暴露水平以及该区域的社会脆弱性组成。社会脆弱性分为某一地区对极 端自然事件的易感性、缺乏应对能力和缺乏适应性。

该指标从灾害发生的可能性、灾害发生前、灾害发生中、灾害发生后的因素、多个方面评价某一 地区的极端自然事件灾害指数,对区域灾害的评价具有重要意义。一个国家在世界风险指数上的 指数得分越高,其国家的灾害风险就越高。该数据集全面地展示了世界各国的灾害风险,因此我 们使用来自世界风险报告的数据集作为我们研究的基础。

图 3:全局灾难分类 图 4 :灾难趋势图

在此数据集的基础上,我们使用其他两个数据集的灾害损失和频率的详细数据来整合和优化数据,并使用这些数据来分析每个大洲和国家的各种灾害的比例,以及每个国家每次灾害的经济损失。这是我们主要使用的三个数据集,我将详细介绍我们的数据预处理步骤和一些重要的数据可视化成果。

由于数据集中缺乏某些信息,我们使用Python在预处理过程中搜索和修改数据集中的离群值,并 删除缺失的值。使该数据集更加准确和全面,提高数据的准确性和价值,并促进减少后续机器学 习中的错误。在第一个数据集中,我们需要主要关注诸如“世界资源研究所”、“暴露 ”、“脆 弱性 ”、“敏感性 ”、“响应能力不足 ”和“适应能力不足 ”等条目。这些条目科学地分析了在 客观和主观条件下的局部风险指数和风险水平,并充分考虑了时间和空间的重要性。

通过数据可视化,我们可以得到以下图像:

图5:暴露水平与风险评分之间的相关性分析

This graph tells us that there is a strong correlation between exposure level and risk score. Of course, the correlation coefficients we obtained later also confirm this point. In addition to the strong correlation between exposure level and risk, we obtained a strong correlation between the exposure index and the number of disasters that occurred in the area by integrating the data table. This reflects the correctness of our data integration, and we can predict the number of disasters that occurred in the year through the change pattern of the index.

Of course, with the frequency of disasters occurring, each continent and country will have a tendency to predict what kind of disasters will occur, how many times such disasters will occur, and the risks will be reduced accordingly. It is not only an economic issue, but also a safety issue.The following graph are the

disaster "tendency" graphs obtained after data integration:

3.1.2 Model Establishment:

For such problems, we can convert evaluation problems into classification problems, which can better directly determine their risk level. For higher and very high-risk areas, we can ignore their insurance coverage, but for medium, low, and very low-risk areas, we need to combine local disaster losses to make a final judgment on whether to insure locally. This can effectively avoid complex numerical calculations.

According to the characteristics of the data, it has temporal characteristics and can be divided into training and testing sets by year. After integrating and optimizing the dataset, it can be used for regression analysis

Page 6 of 24

using the KNN algorithm in machine learning. It determines the nearest neighbor samples based on their eigenvalues, calculates distances, and makes classification decisions based on their labels. The core idea of the KNN algorithm is to assume that neighboring samples that are most similar to the target sample have similar labels.As a representative algorithm of supervised learning, the approximate steps are as follows:

·Exploratory analysis of EDA data ·Feature engineering

·Splitting the dataset into training and testing sets

· Training model: Use the training set to train the model

·Test set: Use the obtained training model to test the test set6. Model evaluation: accuracy

For KNN algorithm, there are three unique elements compared to other machine learning algorithms that can provide more accurate instances in the training set.

·Distance: This question uses Euclidean distance

·K value: Using loss function to reduce K value

·Classification Decision: Weighted Optimization CRITIC

(1)

(2)

Table 1 Objective entropy weight analysis

Adaptability

correlation coefficient p value

0.934 0

Coping ability

correlation coefficient p value

0.91 0

Inductive

correlation coefficient

0.949

ability

p value

0

When dividing levels, we used Analytic Hierarchy Process (AHP), which is a quantitative method used to handle complex decision-making problems. It decomposes the problem into different levels of criteria and sub criteria, and then compares them pairwise to determine their importance weights. In this issue, the risk index can be divided into two criteria: objective and subjective. That is, the exposure index and vulnerability belong to the first level, and the second level is the three components of the vulnerability index: sensing ability, coping ability, and adaptability.

The AHP-KNN algorithm uses the weight vector of AHP as the weight attribute of the KNN algorithm to improve classification decision-making. Specifically, it first uses AHP to determine the importance weight of each feature and applies these weights to the feature vectors in the KNN algorithm. Then, by calculating the distance between the target sample and each sample in the training set, classification decisions are made based on the nearest neighbor label.

Page 7 of 24

Use Analytic Hierarchy Process to determine the weight of each factor. The Analytic Hierarchy Process allows us to establish a judgment matrix by comparing the importance of each factor pairwise. With the help of expert opinions or relevant statistical data, we can rate each factor to reflect its relative importance to other factors. Through the feature vector method, we can calculate the weight of each factor and represent it as w1, w2, w3, w4, and w5

Use the K-nearest neighbor algorithm to calculate the disaster risk index.

Assuming we have a training dataset that includes data on exposure, vulnerability, susceptibility, lack of response and adaptation in different regions, as well as corresponding disaster risk level labels.Specifically, we will use the following formula to calculate the disaster risk index for a certain region:

RiskIndex = KNN(w1 * x1, w2 * x2, w3 * x3, w4 * x4, w5 * x5 ) (3)

x1, x2, x3, x4, and x5 represent the values of exposure, vulnerability, susceptibility, lack of coping ability, and lack of adaptability in the region.

w1, w2, w3, w4, and w5 are the weights of the corresponding factors, respectively.By combining the AHP method to determine weights and the K-nearest neighbor algorithm for disaster risk index calculation, various factors can be comprehensively considered to obtain a relatively accurate disaster risk assessment result.

In the process of reducing the k-value, we found that the Analytic Hierarchy Process (AHP) did not achieve ideal prediction results because there were always human factors in the process of dividing levels, and some correlations may be ignored, leading to a decrease in accuracy and poor model performance.

So we chose an objective entropy weighting method, CRITIC, which is a method used to evaluate models. This method conducts correlation analysis on multiple indicators of model performance to determine the importance of each indicator to model performance.

The CRITIC method includes the following steps: ·Determine evaluation

·indicatorsEvaluation model

·Calculate correlation coefficient:

·Determine the importance of indicators ·Analysis results

3.1.3 Model Solving

In the KNN algorithm, the exposure index, vulnerability index, sensitivity index, lack of coping ability index, and lack of adaptability index are used as independent variables, while the wri category is used as the dependent variable. The training set ratio is set to 0.8, the number of K neighboring samples is set to 5, and the sample voting weight is set to proportional voting weight. The neighbor search method is automated, and the distance calculation is performed using the Euclidean distance calculation method for KNN modeling.The final model achieved an accuracy of 45.95% on the test set, an accuracy (comprehensive) of 62.93%, a recall (comprehensive) of 45.95%, and an F1 score (comprehensive) of 0.49. The model effect is relatively poor.

Using the CRITIC model, assuming that the dataset consists of n data samples, m indicators are defined based on this dataset and abstracted into mathematical language as follows: X1, X2... Xn ...

1 ·Data normalization processing

Page 8 of 24

The types of indicators are usually divided into four types: positive indicators, negative indicators, intermediate indicators, and interval indicators.

The calculation method for normalizing positive indicators is:

The calculation method for normalizing negative indicators is:

The calculation method for normalizing intermediate indicators is:

The calculation method for normalizing interval indicators is:

2 Calculate the carrying capacity of information ·The formula for calculating volatility is

(4)

(5)

(6)

(7)

(8)

CRITIC evaluates and decides how to allocate weights based on the differences in values within each indicator. It is believed that the larger the standard deviation, the more information the indicator can reflect, and therefore, more weight should be assigned to this indicator.

·

The formula for calculating conflict degree is

(9)

The CRITIC method believes that the larger the value of the linear positive correlation coefficient between indicators, the smaller the conflict, and therefore, smaller weights should be assigned.

·The carrying capacity of information and specific calculation methods Calculate indicator weights

Table 2:Factor indicator weighting table

% 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020

Page 9 of 24

Exposure

30.59

33.36

33.70

32.69

32.78

32.66

33.59

34.64

37.59

37.42

Vulnerability

14.48

13.89

13.77

14.09

14.05

13.89

13.98

13.82

12.92

12.92

Receptivity(L)

21.54

21.24

21.25

20.24

20.03

20.54

20.62

19.06

18.50

18.47

Coping(L)

18.68

18.41

17.87

17.95

18.15

18.73

18.16

16.65

16. 14

16.82

Adapt(L)

14.71

13. 11

13.41

15.03

14.99

14.18

13.64

15.82

14.85

14.37

Figure 6 Empowerment Comparison Chart

According to the CRITIC model, assign weights to variables and recalculate the k value based on the test results to verify the effectiveness of CRITIC.The accuracy is 80.73%

Figure 7 Model evaluation renderings

The above is the "risk" section of the risk loss model we want to establish, and the process of establishing the other "loss" section is as follows:

Through data integration, we found that the disaster losses in a region are related to multiple aspects, such as the number of deaths, damage level, duration, and very important types of disasters. Due to the large number of missing values in the disaster insurance amount data from 2011 to 2020, which cannot be supplemented by algorithms, we can only determine an economic risk in this region based on its losses.

Disaster modeling, also known as cat modeling, is the process of using computer-aided calculations to estimate the potential losses caused by catastrophic events such as hurricanes and earthquakes.

Table 3 Model results analysis table2011-2020

Page 10 of 24

storm

Drought

Earthquake

Extreme weather

2011

5691 334

17.04

814.2 17 47.89

23030 30

767.67

5087.2148 84

60.56

2012

6859 346

19.82

2548 21 121.3

1853.6314 27

68.65

8573.2579 90

95.26

2013

10659.5 332

32. 11

108.7 9 12.08

911.2859 29

31.42

5238.8364 10

49.89

2014

11695.8 320

36.55

1100.5 18 61. 14

717.4 26

27.59

4011.4072 99

40.52

2015

15698.5 380

41.31

1981.23 28 70.76

602.7852 23

26.21

3305.1767 11

28.01

2016

10335.5 325

31.80

355.4 15 23.69

3299.45 30

109.98

4511.1315 84

53.70

2017

2368 276

8.58

242.2 7 34.60

276.4338 19

14.55

12211.8 85

143.67

2018

2698 282

9.57

525.366 13 40.41

220.5 20

11.03

5925.0181 84

70.54

2019

3685 316

11.66

625.621 9 69.51

205. 1 21

9.77

2142.2743 95

22.55

2020

3759.5 306

12.29

765.365 15 51.02

196.2058 20

9.81

4522.6041 96

47. 11

Flood

Landslide

Volcanic

Wildfire

2011

7075.7047 156

45.36

0.73 17 0.04

10.4 6

1.73

313.7 6

52.28

2012

2579.0538 136

18.96

9.65 13 0.74

96.854 1

96.85

100 10

10.00

2013

5478.2566 149

36.77

17.5692 11 1.60

15.400645 3

5.13

107.24 4

26.81

2014

3624.0242 135

26.84

27.3 15 1.82

18.6 6

3.10

25.9 12

2.16

2015

2108.63 160

13.18

0.8 20 0.04

60 6

10.00

343.982 10

34.40

2016

5738.235 161

35.64

72.5 13 5.58

19.8 2

9.90

628.7 13

48.36

2017

1577.8682 114

13.84

0.63 25 0.03

86 7

12.29

101.9 10

10.19

2018

1743.6152 109

16.00

87.8036 13 6.75

56.3564 6

9.39

2274.5 12

189.54

2019

1599.503 106

15.09

68.5915 16 4.29

18.6 8

2.33

768.7 10

76.87

2020

1586.495 99

16.03

26.1605 18 1.45

18.6 8

2.33

626.5 13

48.19

The output of the cat model is an estimate of the model's prediction of losses related to a specific event or set of events. When running a random model, the output is a random loss distribution or a set of events that can be used to create a loss distribution. The loss distribution can be calculated as the possible maximum loss and the average annual loss.

3.1.4 Analysis and Evaluation of results

In response to this issue, we have selected Asia and the Americas as the two continents, China and the United States in Asia, which have rich landforms and large land areas. In addition, the two countries are in different development situations. China is a developing country and the United States is a developed country, and there are certain differences in the level of disaster response between the two countries. Overall, the vulnerability index is more informative and representative.

By analyzing the frequency and types of disasters that occur in each continent and country, we can infer whether there is a disaster situation and level in that area at a certain time. To determine whether to insure in that area, we first use the first CRITIC-KNN risk assessment model based on the Analytic Hierarchy Process to obtain the WRI index and types for China and the United States:

Table 4 KNN Risk Assessment WRI Index and Classification

Page 11 of 24

country

wri

explore

Vulnerability

Receptivity

Coping ability

Adaptability

year

United States

3.76

12.25

30.68

16.35

48.24

27.46

2021

China

5.87

14.29

41.08

21.64

71.42

30.17

2021

Specific classifications can be obtained through weighted distance:

Table 5 Weighted distance analysis evaluation scale scale

year

Wc

Ec

Vc

Sc

Cc

Ac

US

2021

Medium

Low

Very Low

Very Low

Medium

Medium

China

2021

Medium

Medium

Low

Medium

High

Medium

Then, combined with the disaster model, a comprehensive judgment is made, taking the disaster loss situation and risk level of the entire United States as an example:

Figure 8 Diagram of risk versus population density

The United States and China can obtain risk indices WRI of 3.76 and 5.87 through risk models, with a risk level of intermediate. Therefore, it is necessary to combine the second disaster model to determine their property losses. Based on the tendency of disasters and the losses caused by this tendency, it can be seen from the image that the loss index of Ohio in the United States is relatively high. Therefore, when planning insurance plans for Ohio in the United States, insurance companies should carefully consider and not insure.

For China, it is also necessary to consider the property loss index and property loss level. After data research, it can be found that China's property damage has been fluctuating around the world's property losses. Due to the frequent occurrence of disasters every year, taking Sichuan Province as an example, local disasters occur more frequently, but property losses are at a moderate level, and insurance needs to be carefully considered.

3.2 Task2

In the second question, we need to clarify a premise that our discussion object is already in line with the requirements of insurance companies for insurance, which means that their risk index classification

Page 12 of 24

evaluation and loss index classification evaluation are at a relatively low level, because adding a community will inevitably bring some population and ecological pressure to the local area. For example, an increase in population will lead to an increase in the "lack of sensitivity index". In other words, an increase in population will increase the local "entropy value", which is the degree of chaos.

In the initial stage of establishing the model, we need to know the local population density and economic situation, add new influencing factors to the first question model, objectively assign weights, adjust the k value, improve accuracy, and obtain a new risk loss model.

Compared to the first question, we need to pay more attention to one issue, "time", which means we need to predict and use the predicted data to determine how we should adjust our insurance plan. There are mainly three types of adjustments, of course, the adjustment of premiums and the strengthening of intervention are not conflicting, but the focus is different.

·Waiver ofInsurance

·

Increase reporting fees

·

Strengthen intervention and believe in reducing risk factors

In this question, we will take Beijing, China as an example. Objectively speaking, Beijing belongs to a lower risk area. Subjectively speaking, Beijing's vulnerability does not match its economic development status. For the capital, there is great room for improvement in perception, response, and adaptability. Beijing's vulnerability is closely related to local population density, economic development, and other factors.

Figure 9 Population density map of Beijing, China

3.2.1 Model Establishment:

The ARIMA model can use the historical information of the data itself to predict the future. This includes regression models (AR) and moving average models (MA). The AR model has the advantage of having data with long historical trends and can capture trends for prediction. However, due to its dependence on history, it may ignore the complexity of the real situation. Therefore, by combining the MA model, it can handle temporary sudden changes.

The ARIMA model assumes that label values fluctuate around a major trend over time, where the trend is influenced by historical labels and the fluctuation is influenced by accidental events over a period of time, and the major trend itself may not necessarily be stable.

Page 13 of 24

Basic steps:

·Perform stationarity processing on non-stationary time series data to obtain a stationary sequence ·Model research:

·Select Model ·Definite order

Partial autocorrelation coefficient (determining order): Considering the presence of random errors, the partial autocorrelation coefficient may not be strictly 0 after p-order delay, but fluctuates within a small range around 0.

·Fixed order optimization

The FPE criterion assumes that AR (p) is the fitting model, and 01 ......p are the

autocorrelation functions of each order sample in the sequence. The final prediction error can be expressed as

(10)

·Parameter estimation

A certain criterion estimates the parameter .

2

a 1 , a2 ......ap , σ

p+1. If the order p in the model is

also used as an estimated parameter, the number of parameters to be estimated will be p+2.

εt = Xt - a1Xt -1 - a2 Xt -2 - ...... - apXt -p

(11)

(12)

Least squares estimation (minimum sum f squared residuals) o

j = xj - (
1xj -1 +
2 xj -2 + ...... +
pxj -p )

(12)

Usually referred

j for residuals. Our optimization goal is to make the sum of squared residuals:

Obtain the following linear equation system:

Y = Xa + ε

The objective function of the sum of squared residuals can be expressed as: S (a) = (Y - Xa )T (Y - Xa ) = YTY - 2YTXa + aTXT Xa

Taking the derivative of parameter a from the above equation and setting it to 0 yields:

(14)

(15)

(16)

Page 14 of 24

Therefore, the least squares estimate of parameter a is:

= (XTX)-1X TY

At this point, the least squares estimate of the error variance is:

·Model validation

Including validity and significance testing of the model:

(17)

(18)

·Model prediction

The main purpose of time series analysis is to predict the next time series value Xt+1 or the sequence value Xt+k after k steps, given a time series X1, X2... Xt

3.2.2 Model Solving

When solving this problem, we will continue to use the machine learning algorithm from the first problem, but due to the addition of future developments, we need to establish a machine learning based time series prediction model.

In this section, in addition to focusing on time, we also need to adjust the insurance model for the situations that arise. In this section, we need to predict the level of risk losses in the area and judge whether to insure or increase or decrease premiums based on the new level of risk losses.

This chart shows the changes in insurance premiums in the Beijing region from 2006 to 2016 (approximately 800 cases per year):

Figure 102006~2016 Beijing premium change chart

In the first question, we observed a stable trend of WRI values over time using violin plots and scatter plots, without significant fluctuations. In this question, we analyzed and predicted WRI index, exposure index, and vulnerability using a time series model.

The following data is presented using the WRI index as an example:

Based on the variable WRI, when the difference is of order 0, the significance P-value is 0.000 * *, showing significance at the horizontal level. Rejecting the null hypothesis, this series is a stationary time series. When the difference is of 1st and 2nd order, the significance P-value is 0.000 * * *, showing significance at the horizontal level. Rejecting the null hypothesis, this sequence is a stationary time series.

Page 15 of 24

Verify the model:

Based on the variable WRI, the model formula is as follows:

y(t) = 0.716 + 0.906* y(t -1) - 0.265*ε(t -1) 19

The goodness of fit R of the model ² At 0.678, the model performs well. The predicted results for the next 7 years (2021-2027) in Beijing are 1.331711410151208, 1.9230514728671209, 2.4588635560156,2.9443618461786567,3.384270904589377,3.782871649040758,and4.1440430103471 1, respectively.

Of course, this completely disregards factors such as population and economic pressure. However, according to the reality, the addition of communities will inevitably have consequences for local vulnerability. Therefore, we have introduced a new influencing factor, "lack of urban capacity," as a factor that constitutes urban vulnerability. This is also why we continue to use the idea of the first level analysis method, and based on the population density of Beijing, The level of economic development is once again using the CRITIC objective weighting method to re assign weights to influencing factors.

The empowerment results are shown in the following figure:

Figure 11Weights of six factors

Using the new weights to recalculate the WRI index and substituting it into the time series model again, the result is as follows: the goodness of fit R of the model ² A score of 0.869 indicates excellent performance. The predicted WRI index for Beijing in the next 7 years is 5.33, 6.92, 4.45, 5.94, 5.38, 6.78, and 8.14, respectively.

3.2.3 Analysis and Evaluation of results

We have learned from the above data that currently, the density of communities within the third ring road of Beijing is relatively high, and the construction of communities will have a certain degree of damage to the stability of the community. Based on the predicted data, 2023 is more suitable for community investment and construction, but the next few years are not suitable for real estate development and construction, with higher risks and lower cost-effectiveness.

According to data, in recent years, premiums in Beijing have shown an upward trend, and the number of high premiums is increasing. Therefore, appropriately increasing scrapping can hedge against the high risks in Beijing.

Page 16 of 24

3.3 Task3

3.3.1 Model Establishment:

The Luding Bridge in Luding County, Sichuan Province, was selected as a research landmark.Due to the frequent earthquakes in the area where the Luding Bridge stands, the insurance model does not recommend that the Luding Bridge be insured.But the Luding Bridge is an essential embodiment of China's revolutionary spirit. The cultural and economic value of Luding Bridge makes leaders hesitate.

Figure 12Schematic diagram of the earthquake in Sichuan Province, China

Invoke the knn algorithm, Critic model, and disaster model in Problem 1.CRITIC uses the differences in the values within each indicator to evaluate and decide how to allocate weights.It is considered that the greater the standard deviation and the more information the indicator can reflect, the higher weight should be given to the indicator.The knn algorithm gives a score and risk level for the combined effect of risk and loss.

Then, the original model is improved by adding the two indicators of culture and economy.CRITIC recalculates the weights of each index.Using the knn algorithm obtaine the score and risk level.Comparing the results of the original and improved models, a lower WRI category score and risk level indicates that the landmark is valuable to the site and should be preserved in place, and vice versa.

3.3.2 Model Solving

The weights of each index of Luding Bridge in Luding County, Sichuan Province were obtained with CRITIC.

Table 6:Weight distribution table of each index of Luding Bridge, Luding County, Sichuan Province

%

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

Exposure

29.56

32.39

32.75

32.65

32.73

31.62

33.56

35.63

36.62

34.39

Vulnerable

14.51

13.86

14.72

14.13

14.10

13.93

14.01

12.92

12.89

13.95

Receptivity

22.54

19.26

22.23

18.26

20.13

21.54

21.61

17.07

18.54

18.47

Coping(L)

19.65

19.39

18.89

18.92

17.10

18.75

17.16

18.55

17.13

17.85

Adapt(L)

13.74

15. 11

11.41

16.04

15.94

14.16

13.65

15.82

14.84

15.34

Page 17 of 24

According to the statistical data and disaster model, the number of earthquakes and their losses in Luding County, Sichuan Province were obtained.

Table 7 Data table of the number of earthquakes and their losses in Luding County, Sichuan Province

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

503

0

5115

2220

423.25 751.3 0

730.48

49281.05 175.63

1

0

5

3

1

1

0

2

5

1

503.00 0.00

1023.00 740.00 423.25 751.30 0.00

365.24 9856.21 175.63

The knn algorithm is used to obtain the scores and risk levels of each indicator.

Table 8: Scores for each indicator and a table of risk levels

year

wri

explore

Vulnerability

Receptivity

Coping ability

Adaptability

LDQ

2025

5.68

25.52

42.48

30.25

49.54

30.26

year

Wc

Ec

Vc

Sc

Cc

Ac

LDQ

2025

High

Med

Med

Low

Med

Med

After the above assessment, it can be concluded that Luding Bridge in Luding County, Sichuan Province is a high-risk area and is not recommended for underwriting.

Cultural and economic indicators were added to the model, the weights of each indicator were recalculated with CRITIC, and the results obtained from the disaster model were used to derive new scores and risk levels through the knn algorithm.

Table 9 New Percentage of Disasters by Year

%

2011

2012

2013

2014

2015

2016

2017

2018

2019

2020

Exposure

20.44

25.39

22.75

23.63

22.73

24.52

24.46

25.31

26.62

24.39

Vulnerable

12.51

9.62

19.52

12. 11

9.06

11.13

11.81

9.61

12.38

13.95

Receptivity

16.43

11.26

15.23

14.22

15.10

15.54

11.41

15.03

13.51

15.45

Coping(L)

13.65

13.29

11.76

10.92

13.07

12.65

9. 14

9.53

10.13

12.41

Adapt(L)

8.64

12.9

11.37

11.04

11. 14

10.16

13.15

10.82

14.41

13.31

Culture(L)

18.21

20. 12

19.23

19.32

18.78

17.56

18.57

19.45

17.65

16.23

Eco(L)

10. 12

8.24

10. 14

9.56

10. 12

9.44

11.45

10.54

5.32

4.26

Page 18 of 24

The KNN algorithm is used to calculate the score and risk level after adding cultural and economic indicators.

Table 10New indicator score and risk level table (2025)

wri

explore

V(L)

R(L)

C(L)

A(L)

C(L)

E(L)

LDQ

4.48

13.52

19.48

20.25

39.54

20.26

13.75

17.25

Wc

Ec

Vc

Sc

Cc

Ac

Cc

Ec

LDQ

Med

Low

Low

Low

Med

Med

Low

Low

After the re-evaluation of cultural and economic indicators, the wri score was reduced, and the risk level was changed from High to Medium.

3.3.3 Analysis and Evaluation of results

The risk assessment of Luding Bridge in Luding County, Sichuan Province was carried out by the risk loss model established in Problem 1, and the result was that underwriting was not recommended.However, the economic and cultural value of the Lugou Bridge itself and the role of the local area are very large.Therefore, cultural and economic indicators were added to the original risk model, the weights were recalculated, and the scores and risk levels were recalculated and compared with the calculation results of the original model.

The calculation result of the original model is 5.68 in WRI category, and the risk level is high risk.The calculated result of the improved model is 4.48 in WRI category and medium risk level.This landmark has great cultural and economic value to the area and should be preserved in situ.

3.4 Task4Saving the Luding Bridge

3.4.1 Model Source:

In the fourth question, we will continue to use the optimized CRITIC-KNN model from the third question, and add an ARIMA time prediction model on this basis, so that our model can predict the risk loss types of the Luding Bridge in the future more accurately and respond to changes in various data.

Taking the data from 2025 mentioned in the third question as an example, after adding cultural and economic factors, the weight structure of WRI has undergone some changes. Therefore, the changes in other factors have great practical significance for the protection of Luding Bridge. For example, the lack of adaptability index of Luding Bridge has decreased, and the decrease is greater than the lack of adaptability index score. This indicates that we should pay attention to Luding Bridge in Sichuan, How to improve his coping ability, such as having more rescue teams, improving rescue efficiency, reducing casualties, and thus reducing the lack of responsiveness index in the area.

Of course, in the third question, we already know that due to its enormous cultural and economic value, we have adopted a policy of on-site protection for the Luding Bridge in Sichuan. The relevant initiatives will make decisions based on local weather conditions, disaster tendencies, cultural and economic factors, pay attention to the three time periods before, during, and after the disaster, and propose comprehensive suggestions to protect the Luding Bridge.

Page 19 of 24

3.4.2 Model Solving

The value of the region is often directly proportional to the local risk index, especially when cultural and economic factors are included in the local risk index. We can use the ARIMA model to predict the WRI, and obtain an R2 value of 0.786. The model is effective, and the WRI values for the next ten years (2021- 2030) are 4.41, 5.56, 4.55, 4.34, 4.48, 4.57, 5.04, 5.45, 5.56, 6.43. Using the same method, we can obtain exposure values, vulnerability values, lack of sensing ability values, lack of coping ability values, lack of adaptability values, as well as newly added cultural and economic values.

Table 11Indices of various factors

WRI

expose

delicate

Pre(L)

Res(L)

Adop(L)

Cul(L)

Eco(L)

2021

4.41

8.30

11.95

12.43

24.27

12.43

17.75

20.52

2022

5.56

9.21

13.27

13.79

26.94

13.80

15.45

17.86

2023

4.55

10.59

15.26

15.86

30.98

15.87

14.68

16.97

2024

4.34

11.76

16.94

17.61

34.38

17.62

14.55

16.82

2025

4.48

13.52

19.48

20.25

39.54

20.26

13.45

15.55

2026

4.57

13.76

23.69

22.07

39.58

21.07

13.98

16.16

2027

5.04

14.01

28.80

24.06

39.62

21.91

13.61

15.73

2028

5.45

14.26

35.03

26.22

39.66

22.79

12.89

14.90

2029

5.56

14.52

42.59

28.58

39.70

23.70

12.56

14.52

2030

6.43

14.78

51.79

31.16

39.74

24.65

12.43

14.37

3.4.3 Analysis and Evaluation of results

From the model results, it can be seen that the WRI index shows a slow upward trend, indicating that the risk index of the region will continue to rise without intervention. Before 2025, the exposure index of the location of Luding Bridge in Sichuan Province has changed significantly and is showing an upward trend. Faced with the objective reality that cannot be changed, we suggest that the local area can build a "smart city" to reduce the vulnerability of the city.

From the following five aspects:

·

Prediction: According to the data, the level of technological development in Luding County, Sichuan

Province is limited, and the risk of lacking inductivity has increased significantly, requiring high attention. By developing the predictive ability of cities and fully integrating next-generation scenario prediction, we can better capture the potential of disasters. Our model can be optimized and upgraded, and machine learning based disaster models can be developed. For Lugou Bridge, a policy of on-site protection can be adopted. So, from the government's perspective, more funds can be invested in repairing and strengthening the Lugou Bridge, which can also continuously protect the cultural and economic value of the Luding Bridge (tourism industry). On the public side, it is necessary to raise individual risk awareness. For residents in the Sichuan earthquake zone, suitable materials can be selected to reinforce houses and reduce casualties; You can also build shock-absorbing corners at home to protect yourself in a timely manner. There are many similar measures, mainly focused on cultivating awareness, which are conducive to reducing the local "lack of sensitivity index".

·

Coping ability: The ability of the area to cope with risks will stabilize after 2025, but the risks are at a

high level and require some attention. In order to respond to disasters in a timely manner, private rescue

Page 20 of 24

teams can be organized to provide real-time rescue, combined with government rescue, to enhance rescue capabilities. You can organize and participate in earthquake drills regularly, and respond in an orderly manner when disasters occur.

·

Adaptability: With the changes in local population, economy, and climate, although the psychological

adaptability of residents will gradually increase, it cannot be ignored that the difficulty of adaptation increases. This requires the local government to make full use of "smart cities" to timely pay attention to the local population and economic situation, adjust the post disaster reconstruction plan in a timely manner, and refine the post disaster reconstruction plan. Strengthen the connection between the city and other cities, help each other, and achieve maximum resource utilization.

·

Cultural: For Lugou Bridge, its cultural and economic value can be said to complement each other. In

terms of culture, Luding County must actively promote the cultural background and value of Lugou Bridge, improve cultural influence, and minimize cultural deficiency. According to predictions, Luding County has done well in this regard, but there is still room for development because the cultural weight in value evaluation is high, which will have a significant impact on the WRI index.

·

Economy: To reduce the lack of economic index in the area, it is necessary to ensure the local cultural

value. Therefore, the tourism industry should be developed to promote culture while also increasing tourism revenue and promoting economic development; It is also possible to strengthen connections with other scenic spots, leverage integration effects, and elevate the local economic level. In addition to culture and economy, adaptability and economy are also closely related. For example, in addition to investing in post disaster reconstruction, the government can also distribute some consumption vouchers to stimulate residents' consumption and accelerate the recovery process.

This is our suggestion based on time, and we have developed a more detailed plan for the landmarks of the area, which is presented in the appendix.

Figure 13Performance chart of impact factors

4 Sensitivity analysis

Firstly, use the Analytic Hierarchy Process to determine the weight of each factor.The Analytic Hierarchy Process allows us to establish a judgment matrix by comparing the importance of each factor pairwise.We can rate each factor to reflect its relative importance to other factors. Through the feature vector method, we can calculate the weight of each factor.Through the feature vector method, we can calculate the weight of each factor.

Then, by combining the AHP method to determine weights and the K-nearest neighbor algorithm for disaster risk index calculation, we found that the Analytic Hierarchy Process (AHP) did not achieve ideal prediction results during the k-value reduction process.

So we chose an objective entropy weighting method, CRITIC. CRITIC evaluates and decides how to allocate weights based on the differences in values within each indicator.

From this, we have identified various indicators that have different impacts on the model.

Page 1

5 Evaluation of the model

5.1 Strengths

When using the k-nearest neighbors (KNN) algorithm, we encountered the issue of sample imbalance. To address this, we employed the Analytic Hierarchy Process (AHP) and Objective Entropy Weighting method to determine the weight ratios for the imbalanced samples. After several iterations of analysis, we finally established reasonable weight ratios, which we incorporated into the KNN algorithm. The results showed a significant improvement in mitigating the inherent limitations of the KNN algorithm.

The Objective Entropy Weighting method is suitable for multi-indicator comprehensive evaluation problems and is not limited by the number of indicators. It can be flexibly applied to different fields and scales, including cases involving a large number of indicators. When analyzing disaster risks, it can effectively assist us in analyzing and calculating the weight ratios of various factors.

For the optimized dataset, we avoid the original data missing anomaly, which can better lay the foundation for our research. In the new dataset, sorted by year, we find a temporal regularity in the frequency of disasters, excluding those sudden and catastrophic disasters. This helps us a lot to make reasonable predictions about the future.

For the index assessment of disaster risk and the prediction of future risk in a certain area, the data set we optimize and the model we choose can make up for some of the shortcomings of each other, so as to obtain a more accurate assessment of disaster risk and provide more rigorous standards for our judgment.

5.2 Weaknesses

The biggest drawback of the model we are using is the occurrence of sudden catastrophic events. These events not only greatly interfere with our initial data analysis but also pose significant challenges in predicting future disasters. Currently, our model lacks effective solutions to address the sudden occurrence of major catastrophes.

6参考资料

[ 1]刘辉、宋广斌、曹雪礼。四川省泸定市地震滑坡的风险评估与管理。《工程地质学学报》,

23(6):1009-1019。 (2015)

[2]UNISDR(联合国国际减灾战略)。关于减少灾害风险的全球评估报告:气候变化中的风险和 贫困。联合国(2009).

[3]昆鲁特,H.,米歇尔-克尔詹,E。与天气作战:在灾难发生的新时代管理大规模风险。麻省 理工学院出版社。 (2009).

[4]BotzenW。J.,范登伯格,J。C.荷兰应对气候变化和洪水的保险:现在、未来以及与其他国 家的比较。风险分析,28(2),413-426。 (2008).

[5] Clarke, D., & 锯尼 B.保险和气候变化:对经合组织全球保险市场趋势项目的贡献。经合 组织出版。 (2013).

[6] Michel-Kerjan,E.,拉奇基,P。A. (2011).保险在降低直接风险中的作用:洪水保险的案 例。《风险与不确定性杂志》,43(2),147-164 (2011).

[7] Raschky, P.A., & 施瓦泽,R。欧洲的恐怖主义风险保险:实证研究的政策含义。《风险与 不确定性杂志》,47(3),303-325。 (2013).