Elaboration of Explanatory Factors of Accidents in Cameroon by Factorial Correspondence Analysis

The aim of this paper is to examine the causes of 
road accidents in Cameroon. The Douala-Yaounde highway was chosen as the case 
of study. Available field data recorded from the year 2006 to 2011, have 
enabled the analysis of each accident. The method used here is the factorial 
correspondence analysis; which aims to bring in a small number of dimensions, 
most of the initial information, focusing 
not on the absolute values, but the correspondence between the 
variables, that is to say the relative values. From this analysis, it appears 
that, of the 906 accidents recorded during this period, top five causes account 
for nearly 83% of the information provided by the set of variables on the 
occurrence of road accidents. These causes are: driver inattention, lack of 
control, over speeding, improper overtaking and tire puncture. These results require involvement in the construction of road 
safety policies through training, sensitization and adequate repressions 
as well as administrative reforms and research policy in road safety.


Introduction
Today, road accidents remain a blight both for public authorities in all countries around the world and for international organizations. Indeed, according to the specifically in Africa, road accidents are the second leading cause of death after malaria. Thus, Africa accounts for about 27% of the 1.35 million deaths worldwide for just over 2% of vehicles [1]. This situation can be explained by a combination of several factors, the weight or importance of which varies from country to country.
In Cameroon, a middle-income country and therefore prone to road accidents [1], although the mortality rate has declined sharply from 28.1% in 2007 to 20.6% in 2015, road accidents remain the second leading cause of death after malaria [2]. In this context, although road accidents occur throughout the country, they are concentrated around three main roads due to the importance of the cities linked by them. As a result, road accidents have a very high socio-economic cost.
In human terms, there have been just over 1000 deaths and over 6000 injuries.
On the economic level, the economic losses suffered by Cameroon due to road accidents represent nearly 100 billion CFA francs per year, equivalent to 1% of the GDP of this period [3]. In the light of the above, it is becoming essential to conduct a study to identify the explanatory factors for road accidents in order to draw up a policy which, if it does not eliminate road accidents, will reduce them or provide a framework for their occurrence.

Literature Review
Road accident studies have shown that four main causal dimensions have been identified as contributing to the occurrence of an accident. These include driver behaviour, the environment, the vehicle and pedestrian behaviour. Each dimension includes the causes that are linked to it [4]. Several studies have been carried out for the causal analysis of accidents. Those based on the examination of reports or on the variation of behaviour according to causal explanations and beliefs, or those based on a quasi-experiment analysing the variation of causal explanations and attitude towards preventive measures, according to the situational relevance, personal relevance and severity of the accident, and these studies have given disparate results depending on the position of each analyst [4]. Causal explanations for accidents thus vary from one source to another depending on the analytical techniques used or the location.
In addition, several methods are commonly used for studying the causality of road accidents. Many of them are based on the collection of road accident data.
The principle here is to group the accidents according to their profile for a good understanding of their production [5]. This method usually leads to a subjective analysis. This is why we are moving towards Correspondence Factory Analysis.
According to Grangé et al. [6], the studies carried out in this framework are based on the analysis of contingency tables, which makes it possible to study the links between two qualitative variables. Here we then have the possibility of reducing the dimension arising from the existence of correlation between the variables.
In addition to descriptive statistical analysis, whose interest is recognised (quantification of the situations studied) but also some limitations (difficulty of crossing mul-Journal of Transportation Technologies tiple data and interpretation), Factor Correspondence Analysis enables all the data (circumstances and characteristics of the accidents) to be cross-referenced/taken into account and to highlight their dependence/independence (correlation strength and significance) [7] [8]. In view of all these performances, it becomes obvious for us to approach this method in the framework of the present study in order to limit the dispersion of results observed.

Methodology
In order to analyse the causalities factors of road accidents, we use Factor Correspondence Analysis (FCA), which aims to gather most of the initial information in a reduced number of dimensions, focusing not on absolute values but on the correspondences between the variables [7]. This reduction is all the more useful if the number of initial variables is high. The notion of "reduction" common to all factor techniques has the particularity of providing a common representation space for variables and individuals. The goal of the CFA is therefore to read the information contained in a multidimensional space by reducing the dimension of this space while retaining a maximum of the information contained in the original space. To do this, the AFC uses the reduction or frequency table as the basis for its reasoning. This method makes it possible to compare the distance between the different responses modalities to the variables selected on axes whose significance is determined by the variables that characterize them. The CFA is used to determine and prioritize all dependencies between the rows and columns of the table. The total variance explained allows us to appreciate the amount of information explained by a factorial axis. It defines axes that best summarize the information obtained from the selected variables.
In our study, given the heteroscedasticity of the variables, it seemed appropriate to conduct an analysis of the correlations of the variables using the correlation matrix. It represents the correlation coefficients calculated on several variables taken in pairs. To assess the relationships between the variables and the factor axes, we used the post-rotation component matrix based on the varimax method with Kaiser normalization.

Choice of Road Sample
The road sample is the Yaoundé-Douala highway which is 242 km long. This road has been chosen for its heavy traffic (6000 vehicles/day) due to the economic and political importance of the towns it connects.  That is why we can describe the results of each analysis separately [10]. Sections 3 and 4 (total length of 114 km + 20 km) are selected as a sample for their heavy traffic and especially the availability of police reports in the existing data in the said sections.

The Study Variables
From the accident reports, we have identified several causes of accidents which we group into variables for a good exploitation of the data. The measurement of these different variables and their designation in the simulation process is given in the following Table 1.  Table 2 and Figure 2).  Lack of control 1 when the information is significant and 0 in the contrary causemm Driver carelessness 1 when the information is significant and 0 in the contrary causeimp

Collection and Classification of Accidents from Police Reports
Bad parking 1 when the information is significant and 0 in the contrary casemst Brake failure 1 when the information is significant and 0 in the contrary causup Dangerous maneuvering 1 when the information is significant and 0 in the contrary causemda Mechanical failure 1 when the information is significant and 0 in the contrary causem Driver inattention 1 when the information is significant and 0 in the contrary causeina Wheel bursting 1 when the information is significant and 0 in the contrary causecl Excessive speed 1 when the information is significant and 0 in the contrary causevit Bad overtaking 1 when the information is significant and 0 in the contrary causemde Journal of Transportation Technologies

Treatment of Accident Reports
A grid of relevant data from police reports has been implemented from an adaptation of road traffic injury analysis bulletin used in France. A grid was filled for all the police reports of the considered section. A quality control consisted of the cross validation of the pre-filled grid. This process helped to ensure concordance between the information on grids and those on the corresponding minutes and messages.
The encoded information was entered into a designed database and analyzed using SPSS software (SPSS version 11.0).

Results and Discussion
From the correlation matrix (Table 3), it appears that there is no strong correlation between the variables in the analysis. Moreover, from the matrix of components after rotation (Table 4) and the table of total variance explained (   The correspondence factorial analysis allowed us to build a typology of accidents, thus constituting profiles. Without describing the causes of accidents, these profiles highlight the multi-causality of accidents on Cameroonian roads. Finally, the axes retained are: driver inattention, lack of control, excessive speed, poor passing and wheel bursting. These axes represent the main causes of road accidents in our sample (see Figure 3 and Table 5). The graph below illustrates their positioning. The points furthest away from the axes are those that have a high correlation with them.
We have thus proceeded to the elaboration of accident profiles on our line of study. These results reflect the national and indeed international trend in road accident causality studies and even their categorisation. These results are similar to those obtained by the psychologist Robert Ngueutsa [4] and BPA [11]. In Ngueutsa's study on the explanation of road accidents according to causes and analysts in Cameroon, he notes that speeding, dangerous overtaking and wheel bursts are ranked among the most important causes of road accidents. Furthermore, the study conducted by the BPA notes that lack of control and driver inattention are among the three predominant types of accidents on Swiss roads.

Conclusions
In this study, correspondence factor analysis is used to classify accidents according to their profile using the reports drawn up by the police forces. In the period from 2006 to 2011, 906 accidents were recorded. The objective here is not to analyse accidents according to their production but to be able to categorise them while retaining as much information as possible from the set of variables.
Although factor analysis does not make it possible to describe the causes of accidents, it does make it possible to develop an overall policy which, if accidents are not eliminated, would make it possible to reduce them or to control their occurrence in cases of force majeure. This policy provides guidance on certain groups of accidents or individuals that can range from awareness to enforcement, thereby reducing the number of accidents and fatalities on our roads.
Accident data sources can be used to establish the presence or absence of a number of factors that can modify the risk of an accident. A description is never an explanation, and the decision-maker is interested in the role of the factor under consideration in the accident, not whether it is present or absent. Therefore, it is necessary to be able to use other elements to produce useful information and to be able to move from describing to attempting to explain the mechanisms and the assessment of risk factors. It is becoming necessary to apply to accidents the methods used in all scientific approaches based on the analysis of epidemiological data if we want to control the phenomenon of accidents in our environment, although the availability and quality of the reports is a brake on this.