Agricultural Risk Pricing in Senegal

The purpose of this article is to determine the pure premium to be paid by the Senegalese farmer insured at conventional risks. Using the general linear model (GLM), the frequency and severity of different types of risks to farmers were determined. They depend positively on the type of risk and the parameters of the estimated models are all significant. We have shown that the health risks, locusts (wild locusts), wild animals and ducks have higher claims than climatic events (rainfall deficit, floods). Health risks, floods and rainfall deficits are extreme phenomena whose probability of achievement is low. This explains the low premiums of these risks. For better pricing, the insurance company will need to consider the type of risk to which each insured is most exposed and determine the corresponding premium. This segmentation will determine the correct premium.


Introduction
The role of agriculture as the main engine of economic growth in Senegal has made this sector, since the 1980s, a priority in the various strategic documents of economic policies that have succeeded one another. Thus, there have always been questions in the various economic policy strategies, to make the agricultural sector efficient enough to develop the rural world and bring economic growth.
The long-term vision in the agricultural sector is defined by the Agro-Sylvo-Pastoral Orientation Law (ASPO) developed for the period 2004-2024 and which bases the agro-sylvo-pastoral development policy constitutes the basis of the development and implementation of operational programs. Ninety five percent (95%) of agriculture production depends on rainfall conditions. Overwintering is often characterized by late settlement, poor spatial and temporal distribution, and early rains in many parts of the country. In addition, the rainy season is rel-atively short in Senegal, 3 to 4 months in the year (for an average of about 600 mm/70 days), which is a major constraint to agricultural practices based on the availability of water. The consequence is that cereal production and industrial crops are very erratic. The results of agricultural campaigns in Senegal are characterized by instability of production even though it has experienced significant leaps.
This situation hides very large disparities from one zone to another. Pastures have also been affected, particularly in the northern regions of the country (Saint-Louis and Louga). The agricultural sector is therefore confronted with a multitude of risks, among which those related to climate hazards, sanitary and phytosanitary diseases and market fluctuations.
In this context, the challenge is to pursue the implementation of the strategic guidelines in terms of protection, in particular: risk control, preventive management of water resources, the dissemination of good farming practices, the integration of adaptation to climate change in agricultural projects, the pursuit of the establishment of an agricultural insurance system, and this by adopting a differentiated approach, adapted to each production sector and aimed as much at producing productive and productive agriculture as solidarity farming.
The problematic of the effects of climate change and the possible multiplication of unusual climatic disorders, reinforce the need to improve farm protection mechanisms against hazards by setting up agricultural insurance. Agricultural insurance is insurance against one or more of the following losses: 1) the loss of production of designated agricultural products resulting from one of the designated risks; 2) the loss suffered when seeding or planting is prevented by one of the designated risks; 3) the loss of designated agricultural products resulting from one of the designated risks; 4) loss of income from designated agricultural products resulting from a designated risk; 5) any regulatory loss.
The National Company of Agriculture Insurance of Senegal (NCAIS) uses index insurance and its objectives include: reducing farmers' vulnerability to hazards, increasing agricultural production and food security and stabilizing and growing farmers' incomes.
Index insurance is a relatively innovative insurance approach that compensates for asset losses or working capital losses primarily on the basis of a pre-determined index (for example, the level of rainfall). This, as a result of bad weather or natural disasters, without requiring the use of traditional services of experts in claims assessment. Before the start of the insurance period, a statistical index is developed to measure deviations from normal by parameters such as precipitation, temperature, magnitude of earthquake, speed of crop yield or livestock mortality rates.
The insurance systems against calamities rely for the most part on an index. They only cover one or more risks that can lead to a bad harvest, and the compensation of the insured depends on an objective trigger that is easy to follow.
Many are insurance systems against excess and bad weather rather than direct property.
An important problem facing the insurer is pricing. It is even more important that after 10 years of experience, NCAIS has recorded a large amount of information on the behavior of these subscribers. Experimenting with other rainfall indices on existing products or on new products or the application of experience-based pricing will allow each risk to be assigned a fair and equitable premium. This premium for a period depends solely on the (unknown) loss distribution of this risk for the period [1].
The main purpose of this article is to determine the pure premium according to the nature of the claim. To achieve this goal, cost and frequency modeling using the general linear model will be adopted.
This article is divided into three sections: the first is devoted to the literature review on agricultural insurance, the second the source of the data, the third to the methodological framework and the fourth to the econometric estimation and interpretation of the results.  In the individual crop insurance program, three important issues arise; namely adverse selection, moral hazard and high administrative cost. These problems weaken the actuarial performance of the crop insurance program and do not attract a large number of farmers. In this situation, Miranda offers crop insurance based on the performance of the zone of acceptance. In this insurance program, in the insured zone, farmers receive the same amount of compensation per insured hectare because of crop losses and pay the same premium to the insurer.

Literature Reviews
The regional yield crop insurance program significantly reduces adverse selection problems and significantly reduces administrative costs and finally eliminates moral hazard issues. He emphasized that regional crop insurance is an important tool for risk reduction in agricultural insurance policy. Empirical analysis shows that wheat and sorghum producers would prefer a crop insurance or disaster assistance program in addition to the government's commodity program. They also mentioned that disaster assistance is better than taking out a crop insurance policy.
Risk-averse managers of grain sorghum in south-central Kansas and northwestern Kansas prefer individual crop insurance instead of regional crop insurance to produce crops with relatively higher risk yields. On the other hand, farmers growing low-risk crops (wheat) in south-central Kansas are more likely to prefer zone crop insurance. If adverse selection and moral hazard continue to be a problem for the individual crop insurance program, a subsidized regional crop insurance plan could be an alternative. In fact, the crop insurance system of the subsidized zones would prevent moral hazard and reduce the problems of adverse selection. Here again, farmers prefer the preferred risk of crop insurance [5] find that the barriers to insurance are lack of respect for economic frameworks, problems in the statistical system, lack of competition in the service sector, and lack of monitoring and evaluation. In addition, they found the threats that insurers face: they are inappropriate production entities, degraded lands, the lack of production standards and the existence of poor operating systems.
[6] [7] are the most comprehensive and recent studies on crop insurance and land use issues in the United States. They use a combination of econometric and simulation techniques and improve the earlier literature by focusing on marginal lands (a critical part of the northern plains that comprise much of the Prairie Basin region). By distinguishing between converted grassland types and using field data rather than county-level data. Their findings are consistent with previous literature that the effect of subsidized crop insurance in marginal land cropping is statistically significant but low at less than 1 percent.
In particular, [7] estimate that the effect of crop prices is much larger than Crop Insurance subsidies on marginal land conversion. They find that a 5% decrease in the crop insurance premium subsidy rate results in 0.6% of insured cropland being converted to non-cropland. While a 5% decrease in crop prices results in the conversion of 1.01% of cropland insured to non-cropland.
Beyond the weak expansionary effect on conversion of grassland to cropland, crop insurance has offsetting effects on cropland in the form of less use of other risk-reduction strategies, such as intensive chemicals. Empirical results from the Great Plains suggest that farmers who purchase crop insurance use fewer chemical inputs [8]. Similar results were obtained in [9] for Iowa corn. [8] [9] refuted the contradictory results of the earlier study by [10]. [8] concluded that environmental consequences should not be the basis of efforts to persuade legislators to terminate the crop insurance program. [11] in his study on "The Performance of Nigerian Food Crop Producers in Imo State, South-East Nigeria", they Evaluate the Production Performance of Food Crops of Farmers Who Have Adopted the Crop Production Regime. Agricultural insurance introduced in 1984 and the influence of socio-economic characteristics on farmers' production. Primary and secondary data were used in the study. Primary data were obtained from 77 food crop producers selected by simple random sampling from a list of 145 farmers under the Imo State Insurance Scheme. The Z-test and the multiple regression model were used to determine the impact and influence of socio-economic characteristics such as age, agricultural experience, education, etc. on farmers' production, respectively. The Z-test of the impact of the program on farmers 'production showed that there was a significant change in farmers' output after insurance. The results of the analyzes of the socio-economic characteristics of the farmers interviewed showed that the majority (66.23%) of the sampled farmers are men. It also showed that the majority (46.75%) of sampled farmers were in the 41 to 50 age group. In addition, more than 70% of insured farmers had secondary and higher education. The Z-test showed that farmers' agricultural production had increased after the application of the insurance scheme. Average agricultural production was 16.01 metric tons before insurance but [13] analyze the willingness to pay for cocoa price insurance in Ghana in the cocoa industry using the contingent valuation method on data collected from 201 cocoa farmers in Bibiani-Anhiawso-Bekwai District, Ghana. A constrained model is used to determine the factors influencing farmers' adoption of cocoa price insurance and the premiums that farmers are willing to pay. Empirical results of the study reveal that farmers' interest in cocoa price insurance was affected by the variety of explanatory variables such as marital status, number of years of cocoa culture, level of education, household size, farm size, ownership of farmland for agriculture, age of cocoa plantation, age squared of cocoa plantation, farmers aware of insurance scheme and income of the cocoa farm. On the other hand, the premium that farmers were willing to pay is heavily influenced by marital status, achievements, use of farmland for agricultural purposes, raising farmers' awareness of the insurance scheme. On average, cocoa farmers are willing to pay between 9.3% and 10.5% of the value of the option they intend to receive as a bonus based on value. The study recommends paying particular attention to farmer insurance education. [14] analyze performance gaps in the context of crop insurance. They estab-

The Model of the Cumulative Amount of Claims
Define abbreviations and acronyms the first time they are used in the text, even after they have been defined in the abstract. In practice, an insured can be at the origin of 0, one or more of claims. Note Y ij the cost of the insured's first claim, X i the annual charge for the insured, and K i the number of claims for this insured.
The number K i is a random variable and the costs Y ij are also random variables. The total random charge for the insurer is the sum of the losses attributable to each policyholder. We can stop looking at the identification of claims specific to each insured by asking: where na represent the number of insured. We can then rewrite: By renumbering the claims, leaving aside the insured who gave them birth. Subject to making two assumptions, the expectation and variance of the claim burden can be calculated.

The Assumptions of the Model
Two hypotheses are formulated: the first on the independence and the stationarity of the costs of disaster and the second on the independence of the frequencies and the costs.
Hypothesis 1: Independence and stationarity of claims costs The random variables Y ij are independent and identically distributed (iid).
This hypothesis requires that discounted values (by a carefully chosen rate of "inflation") be considered for the amounts of claims observed over long periods.
Hypothesis 2: Frequency-cost independence The common distribution of Y ij does not depend on the value taken by K i .
This assumption is not always verified in reality (hence the interest of the tariff zones which "decorrelate" frequency and cost of claims).

The Parameters of the Model
The hypotheses set out above make it possible to obtain interesting properties of expectancy and variance: -Expectation of the loss burden The pure premium is given by the following formula: What is often expressed, for a contract, by pure premium = frequency x average cost.
-Variance of the claim burden

The Generalized Linear Model
The purpose of this section is to predict the frequency/claim load (N/Y) for a client. The methodology used consists in finding the link between (N or Y) with the explanatory variables available at the level of the database. In other words, find the linear predictor β ( 1 2 , , , p β β β ) that corresponds to the following relation:

Constructive Assumptions for the Application of Linear Regression
The estimation of the conditional expectation of the frequency or the load amounts to identifying the function φ such that: ( ) This writing assumes a linear model. This assumption comes from the fact that the estimation of a function on R k is too complex numerically. However, the behavior of the frequency and the burden of the incident is not linear. The costs of claims, for example, when they materialize, follow a very asymmetrical density clearly non-Gaussian. Often, the data also show a constant coefficient of variation σ/μ rather than a constant variance (fundamental property in the linear model). ( ) 1 2 , , , p X X X X ϕ β

= =
We then need a link function g to establish the linear link between μ and the explanatory variables X: With a(·), b(·) and c(·) specific functions. The parameter θ is called the natural parameter of the exponential family. The parameter φ is called the dispersion parameter. This is a nuisance parameter that does not depend on y i observation.

Restoring Linearity: Link Function
Now that N or Y can follow any exponential law, we need an appropriate link function g that can link them to a linear predictor. There are several link functions, the ones we use frequently is the canonical link function. That is to say the function g which makes it possible to relate the expectation to the natural parameter θ:g(μ) = θ. Each of the laws of the exponential family has its own canonical bonding function.

Data Source
The primary source of data that we have available to model risks, and build a segmented tariff is the base formed from the information collected in the subscription forms. The data on the insured persons come from the National Company of Agricultural Insurance, they are stored in a base called "Slip of claims". They are observed over two years (2016 to 2017) and concerns 491 insurance companies. The variables are: Policy number: this field will link the "insured" database with that of the contracts as well as that of the claims. The customer number "customer number": it corresponds to the customer reference assigned to the subscription of the contract. The use of this variable will be important because it will make it possible to erase duplicates and to avoid counting twice the same person in our study (this will be our primary key). The registration number "registration number" allows you to follow the order in which customers have subscribed to an insurance policy. The effective date is the date from which the contract takes effect. The due date is when the contract ends. The region variable includes the 14 regions of Senegal; the Department (Depart) comprises the 44 departments of Senegal, the common variable and rural community determines the public authority or the subscriber of the insurance policy resides and where is located the insured property. The insured variable gives the natural person or the group having subscribed to a policy. The variable "Invoice" materializes the number of the customer's invoice following the payment of the premium. The variable "branch" gives the branch in which the insurance policy is taken out, it comprises 4 modalities: harvest, breeding, poultry, equipment. The variable "sup" gives the area of the insured field in the harvest branch. The "Assured Value" data reflects the CFA franc value of the insured asset. "NS" gives the number of losses suffered by any policy. "MS" gives the cost of the claim. The variable "type of loss" gives the nature of the claim and includes 19 terms.

The Variables of the Model
The variables selected are nine (9) including two endogenous variables (the number of claims and the average cost of a claim) and seven exogenous variables, five of which are dichotomous (1: if the risk was realized on the insured and 0: otherwise) and a quantitative in this case the area. The extent of damage to agricultural production is in complex relation with the dynamics and density of local population, the diet and the body size of the responsible species, as well as with the capacity of reception of the environment.

2) Wild duck (CS)
Damage caused by waterbirds in crops is high. Farmers know the risk, but cannot predict the location of the damage, although the risks are higher at the center of the trap, near a pond for example, than at the edges. , or in case of delay in the drainage of rice fields or in the vicinity of faults in the crop (free water spots). In fact, the risk of damage to ducks is similar to that of hail in temperate countries or the risk of migrating locusts in tropical areas Finally, the perception of the damage is more acute if the harvests are bad: the pests then take an indispensable part of the food of the peasants and their family and not only a part of the surplus of harvest when the cultivation conditions (pluviometry, floods) have been good.

3) Rainfall deficit (DP)
In most Sahelian countries, rainfall deficit agricultural production. Vegetation is at risk of rapid degradation due to overexploitation. For many years, many regions have suffered from an exceptional rainfall deficit. Any form of drought comes from a rainfall deficit. This dependence weakens the Senegalese economy and makes it vulnerable to fluctuating commodity prices and rainfall deficits.
However, it is possible that the level of production is barely average in some regions, especially those with a rainfall deficit.

4) Flood (INOND)
A flood is a temporary flood, natural or artificial, of a space by water. Flooding is one of the main natural hazards in the world; it is the natural disaster causing the most damage to the crops. During flooding, we also face disturbances and losses in food production. A difficult situation for farmers, especially since there is no flood insurance.

5) Health risks (RS)
The exact contours of the health risk are difficult to pin down. Diseases (animal and plant) are obviously part of it. Pests (insects, nematodes, rodents, etc.) can also be attached to them. The same is true for micro-organisms and chemical substances that, when they exceed a certain threshold, threaten the food security of consumers, even if they do not lead to a quantitative loss of production, or even a decline in production. The apparent quality of the products. Unlike be divided into two groups: 1) "exceptional" diseases, in which there is no effective treatment and against which it can only be controlled by destroying the infected plants; 2) "common" pathologies which, occasionally, can lead to substantial production losses but which can be controlled by curative or preventive treatments.

6) Desert Locust (CP)
Locust swarms have for centuries been a threat to agricultural production in West Africa. The livelihoods of the population may be affected by this voracious insect. The Desert Locust is potentially the most dangerous locust pest because of the ability of swarms to fly rapidly over long distances. It is two to five generations a year.

7) The area (SUP)
The area used by farmers has an impact on the frequency of agricultural claims. Indeed the more the cultivated area increases the more the number of claims will tend to increase.

Endogenous Variables
The modeling focuses on two variables: the number of claims and the average cost of a claim.

1) The number of claims (NS)
The number of claims is the number of times the insured has suffered a particular claim. The number of claims is a count variable.
2) The average cost of the incident (CMS) The average cost of the claim is the indemnity paid by the insurer following the realization of a given type of claim. The average cost of the claim is a variable belonging to R+.

Modeling of the Frequency
The law of Poisson is a law that applies to the modeling of phenomena whose occurrence is not very frequent or rare compared to the size of the population concerned. Events within the study population must be independent. Poisson's law is fundamental in modeling the number of claims for property and casualty insurance risks. It is in a way the basic law.
This property is called equi-dispersion.
When equi-dispersion is not respected, that is to say, when we have an overdispersion, we consider a quasi-Poisson law, such as where φ dispersion parameter. It is a parameter to estimate. Under certain assumptions it is shown that the process of the number of claims is a Poisson process. The law of Poisson can be constructed from a single hypothesis: the probability of occurrence of a disaster in the near future is proportional to the envisaged duration and does not depend on past observations. It also has the advantage of requiring only one parameter l. The law of Poisson is therefore of "natural" use in insurance.
The purpose here is to determine whether the ratio of the number of patients to the number of exposures, n i /N i , is approximately constant or not according to risk. We suppose that the count Y i = n i follows a mean Poisson distribution: ( )

Modeling the Average cost of Claims 1) The normal log law
The normal log law is a law that allows modeling of approximately symmetrical or asymmetric data to the right. A random variable X follows a normal log law when its logarithm follows a normal distribution. The probability density of this law is written as follows: The variance is: The average cost regression model derived from log-normal, taking into account risk variables, is as follows: 2) The Gamma Law A real random variable follows a gamma law of parameters γ and a, if and only if its probability density is given by the following formula: Hence the moments of the real random variable X are: The average:

Adjustment Test
The quantile-quantile diagram allows a graphical appreciation of the fit of an observed distribution to a theoretical model. On this graph, the y-axis carries the quantiles i x of the observed distribution, while the x-axis carries the corresponding quantiles of the theoretical law. The cloud of points align with the first bisector when the proposed theoretical distribution is a good representation of the observations. It should be noted that the appreciation of the alignment of points along the bisector can be considered subjective. All the deviations from the alignment (ends with curvature, distant points) can be identified and analyzed.
Quantile-quantile diagrams are plotted for any adjustment by a continuous law whose distribution function is strictly increasing, that is to say a law whose distribution function is bijective over the interval corresponding to non-zero values of the density function and not having "holes". We will show the application for normal, log-normal and gamma laws etc.
The histograms show the average cost of claims adjusted to a law, if the distribution of the variable is consistent with the curve of each of the predetermined laws (Figure 1). The chart below tests the distribution to which the average cost for difference law adjusts. We find that the average cost adjusts to the   log-normal law and the gamma law.
To determine if the data follows a log-normal or gamma law, we compare the QQ-Plot diagrams (Figure 2). The regression on a plane of the empirical and theoretical quantiles of the different distributions shows that the average distribution of costs adjusts to the gamma and lognormal distribution.
The adequacy tests carried out by comparing the distribution functions lead to the conclusion that the average cost of claims also depends on several other variables.
Indeed, the regressions between the theoretical and empirical quantiles show that almost all the points are on the first bisector. This shows the empirical dis-

The Empirical Model
Three models are estimated, a Poisson model for the frequency of claims, a lognormal model and a gamma model for the average cost of a claim (see Table 2).
We have taken as a reference the risk "accident". The parameters of the three models are estimated by maximum likelihood and the estimation of the va- The coefficients of the Poisson regression for each of the variables, as well as standard errors, are robust and are shown in the table below. The coefficient for the area is 0.34. This means that increasing the cultivated area by 10% leads to an increase in the number of claims by 3.4%. The coefficient for the variable "wild animals" is the expected difference between this risk and the reference risk (Aphid Invasions). Compared with aphid invasions, the log of the number of expected casualties with wild animals increases by approximately 0.401. The number of claims increases by exp (0.401) = 1.493. For wild ducks, the log number of claims increased by 1.425, giving rise to 4.15 claims. For locusts, we note an increase of 0.848 log of the number of claims or 2.33 claims. The log of the rainfall deficit records a 1.648 or 5.196 claims. The loss ratio is very high with the floods. Indeed, an increase in the log of the accident number of 2.585 compared to accidents is recorded, ie 13.26 claims. For diseases, the log number of claims increased by 1.872, reflecting an increase in the number of claims compared to accidents at exp (1.872) = 6.50.
To choose between two models (gamma and log-normal). We will use the deviance statistics, the Pearson statistic, and the AIC and BIC criteria of both models. Pearson's deviance and statistics are measures of the quality of fit of a generalized linear model. Or rather, it's measures of the wrong fit. Higher values indicate an adjustment. We note that the deviance and the Pearson statistic of the log-normal law are higher than those of the gamma law. In addition, the values of the AIC and BIC criteria for the gamma law are lower than those of the lognormal law. Therefore we have opted for the model of the gamma law presented by the From the above equation it can be deduced that farmers exposed to health risks, desert locusts and grain-eating birds have a greater risk of loss-making than others. Therefore, if an insured suffers a health risk, his log-cost expectation increases by 3.26 compared to an aphid invasion, so his cost is multiplied by exp (3.26) = 26.049. Conversely, if an insured person exposes himself to Desert Locusts, the expectation of his increases by 2.25, so his cost is multiplied by exp (2.25) = 9.48 compared to an invasion of aphids. For policyholders invading grain-eating birds, the cost logarithm increases by 1.898, a multiplication of the cost of exp (1.898) = 6.72 compared to an accident. Exposure to wild animals increases the logarithm of the cost of 1.275, a multiplication of the cost of 3.578. Flooding causes a multiplication of the cost of 3.553, wild ducks 3.016; the rainfall deficit of 2.232. In terms of area, a 10% increase in area leads to an increase in the cost of 6.82%.
In conclusion, we note that the health risks, locust invasions (Desert Locust), wild animals and ducks have higher claims than climatic events (rainfall deficits, floods).

Determination of the Pure Premium
The pure premium is determined by multiplying the probability of loss by the  cost of the loss (Table 3). If the premium is subsidized by 50%, the net premium to be paid by the insured is given in the last column of the table. We found that Desert Locusts, grain-eating birds, wild animals and wild ducks are the most common. The pure premium varies according to the risk to which the insured is exposed.
The pure premium is evaluated for invasions at 68,873 CFA francs for carnivorous birds, 57,103 for wild birds, 20,239 for wild ducks, 34,822 for desert locusts, and 19,865 for aphids, respectively. In terms of climatic phenomena such as floods and precipitation deficits, the pure premiums are estimated at 1607 CFA francs and 6179 respectively, while the health risks are 9482 CFA francs.
Health risks, floods and rainfall deficits are extreme phenomena whose probability of achievement is low. This explains the low premiums of these risks.

Conclusions
The purpose of this article was to determine the pure premium to be paid by the (wild locusts), wild animals and ducks have higher claims than climatic events (rainfall deficit, floods). Health risks, floods and rainfall deficits are extreme phenomena whose probability of achievement is low. This explains the low premiums of these risks. For better pricing, the insurance company will need to consider the type of risk to which each insured is most exposed and determine the corresponding premium. This segmentation will determine the correct premium.
This study has some limitations: other risks such as bushfires, drought and performance risk could be incorporated as variables in the model; here the aggregated cost of claims is considered.
In order to improve this model, we propose to add to the model the variables, such as: the geographical area (region, department, etc.), the client's claims history, as any customer is likely to suffer a similar loss. A previously committed; the density of the fields in the region because it is obvious that the probability of disaster of a customer living in a zone with a lot of fields is greater than that of a customer living in an area where the fields are few. Another proposed improvement perspective is to combine several predictive analysis algorithms to make the predictions and not rely solely on the generalized linear regression.
This study could be improved by determining the pure premium depending on each type of crop and the risks to which the crop is exposed. The pure premium could also be determined based on the yield deficit. Indeed, the fixing of a reference yield calculated from, for example, the yields observed over the last five years and set as a trigger, would make it possible to calculate the loss suffered by the farmer if his yield for a given crop is below the reference.

Conflicts of Interest
The author declares no conflicts of interest regarding the publication of this paper.