Genetic Diversity Analysis of Coffee ( Coffea arabica L . ) Germplasm Accessions Growing in the Southwestern Saudi Arabia Using Quantitative Traits

Coffee (Coffea arabica) is an economically important crop cultivated on the terraced slopes and narrow valleys of the mountains in the south-western Saudi Arabia. The objective of the study was to evaluate genetic diversity among nineteen coffee accessions collected from different coffee growing areas based on quantitative morphological and pomological traits. Data of 17 quantitative traits were collected and subjected to various statistical analyses. The analysis of variance showed significant differences (P < 0.05) among the accessions for the majority of quantitative traits measured. A principal component analysis (PCA) and hierarchical cluster analysis (HCA) were used to determine the nature and degree of divergence among accessions. PCA showed that four principal components explained about 85.62% of the total variation. Differentiation of germplasm into different clusters was because of cumulative effects of a number of characters. Cluster and distance analysis of quantitative traits revealed the existence of five different groups. The maximum distance was seen between clusters IV and V (76.4) while the minimum was seen between I and II (16.13). Accessions like KSA6 and KSA8 from Tallen valley and KSA10 and KSA11 from Hada and Shada valleys collection are recommenced for the next breeding work as they are high yielder accessions compared to the others. However, it is suggested to complete the present results by additional assessment of the in situ collection and variety trials. These results are of great interest for future breeding programs for this species.


Introduction
Coffea, a member of the Rubiaceae family, has more than 70 species, but only two species, arabica (Coffea arabica L) and robusta (Coffea canephora Pierre), have commercial value.C. arabica is originated from Ethiopia and was propagated and dispersed all over the world from a low number of plants, which in addition to its autogamous reproduction led to a narrow genetic basis within Arabica coffee cultivars [1] [2].In the southwestern tip of the Arabian Peninsula which encompasses southwestern Saudi Arabia and Yemen, C. arabica L. has been cultivated for the last 3 to 4 centuries [3] on the terraced slopes and narrow valleys of the Mountains at altitude ranging mostly from 1200 to 1800 m above sea level [4].In south-western Saudi Arabia, trees of more than 100 years of age were found in traditional coffee growing areas in Jazan, Asir and Al-Baha regions.According to Fyfa Development Authority (FDA, governmental organism), the number of trees being about 78,000 with 84% was in Addayer district (Jazan region).These trees produce about 500 tonnes of coffee beans each year.
Still the number of trees is most likely underestimated; since FDA operates a large nursery which offers annually thousands of seedlings for free for the local farmers (more than 40,000 coffee seedlings were distributed in 2016 alone).
Therefore, Saudi Arabia continues to produce some of the best coffees in the world [5] [6], as coffee is grown under mostly organic practices without use of synthetic fertilizers, pesticides or herbicides and growers often use goat manure and stone mulching [7].Regrettably, coffee genetic resources are under threat mostly due to the scarcity of water for irrigation and the erosion of the terraces and the spread of the cultivation of guat (Catha edulis) [7].Besides, coffee farmers have replaced coffee by other higher priority monoculture crops and/or there has been a shift in coffee cultivation from productive to marginal sites [7].
To reduce such genetic erosion, efforts to collect and conserve Saudi coffee germplasm were carried out by the Centre for Environmental Research and Studies (CERS, Jazan University) in cooperation with Fyfa Development Authority (FDA) and about 20 germplasm accessions have been collected from different coffee growing areas of the southwestern Saudi Arabia and maintained at the experimental field of the FDA.In fact, our aim is to build gene pool by collecting coffee accessions of various geographical origins and centres of diversity and so to guarantee preservation of co-adapted gene complexes [8] [9].Interestingly, the germplasm collected from the southwestern Saudi Arabia, where stressful conditions prevail, could provide a base material for developing cultivars tolerant to environmental stresses.
The economic value of a population is allied to morphology, agronomic performance, seed quality and nutritional qualities as well as on the efficient use of native germplasm [10].Still, the convenient use of the potential held in germplasm collections needs detailed knowledge about the collections, including genetic diversity studies, evaluation and classification [9] [10].Despite the appropriate environmental conditions, the productivity and production of coffee per unit area in the highlands of the southwestern Saudi Arabia remain very low as compared to world average.This is mainly attributed to lack of adaptable cultivars for each ecological zone of the region.Previous reports have valued the extent of genetic diversity of Arabica coffee germplasm collected from different locality in Ethiopia [11] [12].However, no systematic diversity analysis has been carried out to quantify and verify the level of genetic diversity of Arabica coffee in the southwestern Saudi Arabia.A good understanding of genetic variability is a useful tool in the genetic improvement of the crop.This will certainly boost the identification and use of interesting genes in breeding program.In fact, crop improvement through breeding depends on the disposal of genetic variability as well as on how easily this variability could be fixed in genotypes [13] [14].
In the southwestern Saudi Arabia, coffee has been for centuries propagated mainly from seeds, thus we expect that most of what growers here call cultivars are actually populations of a multitude of landraces with a degree of genetic variability.The relative isolation between coffee valleys could be considered as an outstanding occasion for the fixation of characters and varieties in these regions.
Therefore the coffee gene pool might be unique to this region.It is, therefore, interesting to safeguard.Besides the only information we have concerning coffee accessions in the southwestern Saudi Arabia is that almost all Saudi and Yemeni coffee types arises from ancient, "heirloom" varieties of Coffea arabica first naturalized hundreds of years ago [4] [15] [16].The characteristics of "Typica" variety seem to prevail over those of "bourbon" in these regions.Keeping this in view, this present study aimed to estimate the genetic diversity among coffee germplasm accession based on several quantitative traits.Combined multivariate analyses including the principal component analysis (PCA) and hierarchical cluster analysis (HCA) were employed to achieve such objective.

Description of the Study Area and Plant Material
We carried out an-area-wide survey during 2015-2016 and examined several coffee accessions from 19 localities in the southwestern Saudi Arabia where the species was known to be cultivated (Figure 1).The main coffee producing valleys with old plantations visited were depicted in the Table 1.These sites have similar environmental conditions.In the most coffee farm we visited, most soils are sandy-loam, with slightly alkaline pH (7.5 to 8).
The Mountains of the southwest Saudi Arabia receive the highest rainfall in the country.The highest average annual rainfall recorded in this area includes 577 mm at Fyfa Mountain and 357 mm at Abha.Since the area falls within the boundaries of an arid region, rainfall is always inconsistent.Most of the rain is between April and September.Occasional showers may also fall during the cold months.Still the west facing slopes receive the maximum amount of rainfall, while in the eastern slopes, the intensity of the rain decreases and rains are unpredictable throughout the year.The climate of the mountains is characterized by cold winters and cool summer.Climatic data achieved from the Fyfa Meteorological Station during a period from 1984 to 1998 shows that the highest in June to 79% in July, while in winter it ranges from 74.1% in December to 70.9% in March.

Data Collection
As many as possible the existing variability in several morphological and pomological quantitative traits was recorded (Table 2).Such traits concern the tree (plant height (m), and plant yield (kg)), leaves (length (cm), width (cm) and leaf area (cm 2 )), fruits (100 of fruits weight (g), length (mm), width (mm), and thickness (mm) and geometric diameter (mm)), and beans (50 beans weight (g), length (mm), width (mm), and thickness (mm), and percent seed defects (%)).Most samples were a mixture of fruits picked from more than one tree.The bulk of fruits samples were collected before January.The color of flushes and adult leaves was also used as a qualitative trait.The quantitative traits were complemented with more information provided by the farmers, such as the local names of the different coffee types and productivity.For the estimation of the productivity, a number of trees from each germplasm accession were harvested.Coffee production in each coffee farm was divided by the number of plants harvested per farm, thus the average production of a plant was gotten.Afterwards, the average production of each plant per farm was multiplied by the number of plants per hectare, which allowed obtaining the productivity, in kg•ha −1 , in each population.

Data Analysis
A digital Vernier caliper with 0.01 mm accuracy (Series 500, Mitutoyo, Japan) Table 1.Geographical origin of the studied coffee (Coffea arabica L.) germplasm accessions used in the study.The main coffee producing valleys with old plantations in Jazan, Asir and Al-Baha regions were visited.Coffee plants were cultivated on the terraced slopes and narrow valleys of the Mountains at altitude ranging mostly from to 1800 m above sea level.was used to measure the dimensions of the 100 samples of coffee berries and beans.The length (L), width (W) and thickness (T) measurements were taken.
The geometric diameter (FGD) of the fruits was calculated as follow according to Mohsenin [17].

FGD LWT =
As well the fresh mass of 100 coffee fruits and 50 coffee beans were determined.The data collected were processed by different methods of statistical analysis using the SAS software [18].Analysis of variance (ANOVA) was used to assess the significant differences by studied traits.Genetic divergence between clusters was determined using the generalized Mahalanobis D 2 statistics.Mahalanobis [19] developed this method to determine divergence prevailing among population in terms of generalized group distance [20].The generalized distance between any two population is given by formula: where, D 2 p = total generalized distance based on p characters.Xi and Xj are the p mean vectors of accessions i and j, respectively.

S p p =
× pooled error variance-covariance matrix [19].Testing the significance of D 2 values obtained for a pair of clusters were taken as the calculated value of Chi-square (χ 2 ) and tested against the tabulated value of χ 2 for p degrees of freedom (p is the number of characters) [21] at appropriate probability level that was considered.
In order to assess the extent of genetic variation and percentage similarity within the accessions and its nature, the collected data were then subjected to a multivariate analysis by PCA followed by HCA.PCA was achieved using correlation matrix in order to study the relationship among quantitative traits that are correlated among each other's by converting into uncorrelated traits called PCs [22].
The PCA produced Eigen-vectors and factor scores that were used respectively to measure the relative discriminative power of the axes and their associated characters.The HCA constitutes a simple and less demanding model and is employed to summarize the position of accessions by sorting them into distinct groups [23].Clustering was done using the proc cluster of SAS [18] by employing the method of average linkage clustering called Un-weighted pair group methods with arithmetic average (UPGMA).

Analysis of Variance
Analysis of variance for the measured quantitative morphological and pomological traits reveal a significant (P < 0.05) differences among the coffee germplasm accessions for all traits (data not shown).Because of this all traits were included in multivariate analysis like cluster analysis and PCA.

Cluster Analysis
The 19 coffee germplasm accessions were classified into five distinct groups (Table 3, Figure 2).Cluster II had the highest number of accessions with eight accessions (44% of the total population) followed by cluster I with four accessions (22%).Clusters III had three accessions (17%), while IV and V had two (11%) and one (6%) accessions, respectively.The cluster I includes the accessions KSA15 and KSA16 from Rayda valleys while KSA18 and KSA19 from Wadi Ghill.These accessions were characterized by their low to medium productivity.They have bold fruits with medium to large discs and the beans were of large size.This type contains low percentages of elephant beans (beans defect).The new leaf flushes were bronze to light bronze, and adult leaves were of average size.According to our observation and talk with the coffee growers, these accessions have a relatively flexible branches, resulting in many stem and heads for old plants.Some coffee growers named some accessions as "Tufahi".The Cluster II includes accessions KSA11, KSA7, KSA10, KSA2, KSA4, KSA5, KSA12 and KSA17.These accessions were collected from Shada, Tallan, Hada, Khacher, Fyfa, Wadi Dafa, Maadi, and Rayda valleys.Most of these accessions were of great vigour (more than 6 m height), moderate productivity and their compact cylindrical growth.The new leaf flushes were green to brown and adult leaves were green and of medium size shape.The fruits were bold with medium discs and large irregularly formed seeds, due to high percentage of elephant beans (beans defect) present in several representative of this cluster.The fruits and seeds were smaller than those of the cluster I. Some representatives of this type were named by local coffee growers as "Tessawi".When the spacing was not enough, the branches of different plants inter-cross making a dense canopy.The cluster III includes KSA8, KSA1, and KSA3.All accessions of this cluster are highly productive.This type was seen in different valleys such as Tallan and Khacher in Jazan region.Trees were of medium size (about 4 m high).This type has smaller and more rounded fruits.The beans were of medium size with elliptic rounded shape.Seeds contain only a little proportion of elephant beans.The pulp was very sweet.The shape of young and old trees with one stem is conical, with relatively open growth.All the representative of these cluster had a green leaf flushes, and mature leaves were of average size.The stems are of great flexibility.Some representatives of this type were named by some farmers as "Kholani".
The Cluster IV includes the accessions KSA6 and KSA13.All accessions of this cluster are vigorous and very highly productive (the highest productive cluster).
The trees were slightly taller than those of the cluster III (4 -5 m height).It has a large elongated elliptic fruits, with small to medium sized discs.The beans were also large, regularly elongated elliptic with low fraction of elephant beans.New leaf flushes are brown to green, and adult leaves were large.The leaves were of narrow and pointed shape.The leaf surface is quite flat (this type has the highest leaf area).Some representatives of this type were named by some coffee growers "Dawairi or Dohaïri".Finally the cluster V includes only one accession KSA9.
This cluster has relatively the lowest vigour (short stature) and productivity.
This type was only found in Tallan valleys and with very limited number of trees.The tree shape was conical.Fruits and beans were small, elliptic rounded and very few defected beans (elephant beans) were present.Leaf flushes were light bronze, and old leaves were small and dark green.The most interesting character of this accession is it capacity to bears fruit all year round.This type is named "Udaini" by some coffee growers.
The distance between the clusters of the 19 coffee accessions based on 17 quantitative traits is given in the Table 4.The inter cluster distance analysis showed significance (P < 0.05) differences between all cluster.The minimum distance was recorded between cluster I and II (16.13) flowed by cluster I and IV (20.23).The highest inter cluster distance was shown between cluster IV and V flowed by cluster I and V (75.7).
The dendrogram drawn from the HCA shows that all the accessions were dis-

Principal Component Analysis
The principal component (PC) and percentage contribution of each component to the total variation in quantitative morphological and pomological traits were represented in the  3).

Discussion and Conclusions
Analysis of variance for the measured quantitative morphological and pomological traits reveals a significant (P < 0.05) differences among the coffee germplasm accessions for all traits.Therefore all traits were included in the multivariate analysis like HCA and PCA.Such variability in several interesting traits clearly evidenced the option to bring significant perfection for coffee yield and/or coffee fruits and beans characteristics through selection and crossing.Variability between Saudi coffee accessions may be credited either to the evolutionary trends or to the natural mutation occurring to the population [24] [25].The existence of board morphological diversity among Arabica coffee accessions was further confirmed by many authors who reported substantial morphological and agronomic variability among coffee germplasm accessions [11] [12] [26] [27] [28] [29].Still, the major variations observed for most of morphological traits in a population were the result of the combination of the genotypic and environmental effects [30].
Cluster analysis based on coffee quantitative traits grouped 19 coffee genotypes into five distinct clusters (Table 3, Figure 2), in which the first cluster consisted of four accessions (22%).The second cluster consisted of eight accessions (44%), the third cluster consisted of three accessions (17%), the fourth cluster contained two accessions (11%) and the fifth cluster consisted of only one accession (6%) from the total accessions.This indicates that coffee accessions of the same cluster group were at least morphologically similar.The distribution pattern of genotypes in five clusters confirmed the existence of diversity among the coffee accessions.Interestingly, our results showed that several coffee accessions were clustered together despite they were collected from dif- Maadi were grouped in cluster IV.This could be attributed to the unrestricted movement of the coffee seed from different valleys by man as well as wild animals [11] [31].The same results were reported by Seyoum et al. [32] for coffee accessions from Ethiopia.Besides, previous report showed that morphological variation was more significant as an indicator of genetic diversity in coffee than variation in geographic origin [33].Still our study reveal that coffee accessions collected from the same valley were clustered into different clusters which suggest a high genetic diversity within each collection area.The best example of this is Tallan valley where four different accessions were growing in the same place and under those same conditions and all of these accessions were clustered in four different clusters.This diversity could be exploited in order to increase the genetic base of coffee varieties.
The distance between the clusters of the 19 coffee accessions based on 17 quantitative traits is given in the Table 4.The inter cluster distance analysis showed significant (P < 0.05) differences between all clusters.The minimum distance was recorded between cluster I and II (16.13) flowed by cluster I and IV (20.23).The highest inter cluster distance was shown between cluster IV and V flowed by cluster I and V (75.7).This confirms that the coffee accessions among clusters were divergent from each other.The lowest inter cluster distance observed indicated that the accessions in these clusters were not genetically diverse or there was little genetic diversity between them.However, a high inter cluster distance indicated that there is a high chance for obtaining transgressive segregates and maximizing heterosis by crossing accessions belonging to different clusters as there is a higher chance that distinct accessions would contribute by unique desired alleles at different loci [34] [35].According to our results, the maximum recombination and segregation of the progenies is expected from crosses involving parents selected from cluster IV and V followed by cluster I and V and cluster II and V. Still, the breeder needs to specify his objectives in order to make best use of the characters where the characters are divergent.Falconer [36] reported that genetic diversity has probably arisen through diversity in origin (geographical separation), ancestral relationship, gene frequencies and morphology.These workers indicated that plants differing in either one or more of these factors would differ by a significant number of genes.
The present results showed that clusters III and IV consisted of 3 and 2 germplasm accessions respectively were the most interesting cluster regarding productivity and fruits and beans characteristics.The accessions grouped under the cluster IV gave the best characters of interest (highest productivity with big fruits and beans).This cluster is followed by the cluster III with also a very good productivity; while the fruits and beans were smaller than the cluster IV.The biggest fruits and beans were shown by the cluster I; however theses accessions were less reproductive (low to medium productivity).On the other hand, the cluster II consisted of the majority of the accessions (44%).This cluster was the most heterogeneous including accessions with moderate productivity and compact cylindrical growth.The fruits were bold with medium discs and large irregularly formed seeds, due to high percentage of elephant beans (beans defect) present in all representative of this type.However, cluster V consisted of only one accession (KSA9) has lowest productivity and fruits were of very small size.
Still this accession has an interesting character which is its capacity to bears fruit all year round.Therefore, we can conclude that different clusters have different breeding value that enable breeders to improve different traits and potential selection should be made based on the relative merits of each cluster for each trait depending on the objectives of the breeding program.
The dendrogram drawn from HCA shows the relationship between the 19 accessions based on the 17 quantitative traits (Figure 2).This graph showed that all the accessions were distinct from each other at 100% level of similarity, however at 85% level of similarity, five different clusters were formed.The first cluster one include the accession KSA9, the second one include KSA6, KSA13, and KSA18, while the third cluster include KSA3, KSA1 and KSA8, the fourth cluster include KSA4, KSA2, and KSA7 and finally the fifth cluster include KSA11, KSA10, KSA12, KSA5, KSA17, KSA16, KSA19 and KSA15 (the most heterogeneous cluster).These results corroborate our above results.
Eigenvalues, percentage of total variance, percent of total cumulative variance and eigenvectors for 17 morphological and pomological quantitative traits in 19 Saudi coffee accessions were given in Table 5. PCA was performed to assess the relative importance of each trait for characterization of accessions (Figure 3).
The results showed that about 85.62% of the variations present among accessions were explained by four principal components (PC).The first 2 principal components PC1 and PC2 with values of 51.0% and 15.0% respectively contributed more to total variations among coffee germplasm accessions.According to Chahal and Gosal [37], characters with largest absolute values closer to the unity within the first principal component (PC1) influence the clustering more than those with lower absolute values closer to zero.So, morphological and pomological traits mainly those contributed to the PC1 and PC2 played a major role in classifying coffee accessions into different clusters and should be considered in selecting diverse parents in crossing program.This finding is partly in agreement with the finding of Olika et al. [29] who have reported bean length, hundred bean weight, leaf length and leaf width contributed to the variation among Limmu coffee accessions.This finding is similar with the finding of Tikader et al. [38].
In conclusion, Saudi Arabia is endowed with immense potential of diverse coffee materials and suitable ecological conditions for coffee cultivation.The present study confirmed that coffee accessions showed variations for several morphological and pomological traits.Characterization of germplasm accessions based on 17 quantitative traits using the linkage method of Hierarchical cluster analysis resulted in grouping of the germplasm accessions into five clusters.
Most of the inter-cluster distances were significantly different indicating the presence of variability which can be exploited through selection and hybridization.The PCA reveals that the first four components accounted for 85.62% of the total variation.The traits with the largest values in the first two PCs played major role in clustering the accessions and should be considered for selection program.Characters with high variability are expected to provide high level of gene transfer during breeding programs [39].Interestingly, the germplasm collected from the southwestern Saudi Arabia, where stressful conditions prevail, could provide a base material for developing cultivars tolerant to environmental stresses.

Figure 1 .
Figure 1.Map of the study area showing sampling sites in the south-western Saudi Arabia (Jazan, Asir and Al-Baha regions).The study areas are marked with black stars.

Figure 3 .
Figure 3. Plots of the first and second components (a and the first and third component (b) resulting from a PCA of the coffee accessions using 17 quantitative traits.
ferent geographic locations (sometimes very far from each other), as shown in the clusters I, II, III and IV.For example accessions collected from different place such Rayda and Wadi Ghill were clustered in cluster I. Likewise accessions collected from Shada, Tallan, Hada, Khacher, Fyfa, Wadi Dafa, Maadi, and Rayda valleys were also clustered together in cluster II.Cluster III includes accessions from Tallan and Khacher.Finally accession collected from Tallan and

Table 2 .
Quantitative morphological and pomological traits used to characterize different accessions of Coffea arabica.

Table 3 .
Grouping of accessions of the C. arabica analyzed for 17 quantitative morphological and pomological traits and some qualitative traits, distributed in five groups.
Figure 2. UPGMA dendrogram depicting the genetic relationship of coffee germplasm based on 17 quantitative and qualitative characters evaluated in the field.The symbols used are indicated in the Table 1.

Table 5 .
The results showed that 4 PCs accounted for 85.62%

Table 4 .
Distance between clusters of the coffee germplasm accessions.

Table 5 .
Eigenvalue, factor scores and contribution of the first four principal component axes to variation in the coffee accessions.