Molecular Characterization of a Large Diversity of Bambara Groundnut Potyviruses (Vigna subterranea (L.) Verdc.) in Burkina Faso

Potyviruses are major constraints to grain legume production 
by causing significant yield losses. Potyviruses infecting Bambara groundnut (Vigna subterranea) 
were investigated in Burkina Faso. Leaf samples collected from three 
agroclimatic zones were subjected to RT-PCR and sequence analyses. Of a total 
of 135 samples, 36 (26.67%) were detected positive in RT-PCR tests using 
potyvirus universal primers. Analysis of full coat protein (cp) sequences from 
24 isolates revealed the occurrence of three groups of Bambara 
groundnut-infecting potyviruses. Virus isolates in group 1 shared 94.5% - 100% nucleotide (nt) 
identity with CABMV whereas those in group 2 and group 3 were distantly related Bean common necrosis virus (BCMNV) 
and Passiflora virus Ugandan which 
were their respective closest potyviruses. Group 2 shared 77.1% nt and 78.8% - 79.9% aa 
identity with BCMNV and group 3 shared 77.3% - 78.3% nt and 80.7% - 81.5% aa 
identity with Passiflora virus Ugandan. 
All three groups were confirmed by phylogenetic analyses. Taking into account 
potyvirus demarcation criteria, group 1 isolates belonged to CABMV species. 
Group 2 and group 3 were assigned to a potentially new potyviruses species and 
designated Bambara groundnut potyvirus 1 (BGPV1) and Bambara groundnut potyvirus 2 (BGPV2).


Introduction
Bambara groundnut is one of the most economical important legume crops in Burkina Faso. In addition to be a source of income for farmers, it is an important source of energy (387 kcal/100g) which can contribute to the fight against malnutrition and undernourishment [1]. Its high protein content, over 19% [2] makes it a food that can replace animal protein in poor populations. However, its production faces enormous constraints of which viral diseases are one of the most important. The main viruses reported globally on the plant include Cowpea mild mottle virus (CPPMV, carlavirus), Voandzeia mosaic necrosis virus (VMNV, tymovirus), Cowpea mottle virus (CPMoV, carmovirus), Southern bean mosaic virus (SBMV, sobemovirus), Cucumber mosaic virus (CuMV, cucumovirus) as well as a number of potyviruses, namely Cowpea aphid aphid-borne mosaic virus (CABMV), Bean common mosaic virus blackeye strain (BCMV-BlCM), Voandzeia distortion mosaic virus (VDMV), Peanut mottle potyvirus (PnMV) [3] [4]. Viruses of genus Potyvirus are major factors of the reduction of Bambara groundnut production by causing yield losses ranging between 13% and 100% in cowpea [5] [6]. Potyviruses are mostly transmitted by aphid vectors and through virus contaminated seeds, which makes their control difficult. In Burkina Faso, some studies on the characterization of CABMV and BCMV-BlCM were done in Bambara groundnut [4] [7]. However, very few virus isolates were included in these studies, which was a limited help in assessing potyvirus diversity. The objective of this study was to identify Bambara groundnut infecting potyviruses in Burkina Faso and characterize their molecular diversity based on the analysis of complete coat protein (cp) sequences.

Samples Collection
From 2016 to 2018 and during the August to October growing seasons, 135 symptomatic Bambara groundnut leaf samples were collected from farmers' fields across the three agroclimatic zones of Burkina Faso. Agroclimatic zones included the Sudan zone with annual rainfall of 900 -1100 mm, the Sudan-Sahel zone (600 -900 mm) and the Sahel zone (less than 600 mm). In order to cover virus diversity, sampling sites were separated by at least 3 km. One hundred samples were collected in the Sudan zone and the Sudan-Sahel zone (50 samples per zone) whereas 35 samples were from the Sahel zone. Leaf Samples were kept on ice before being transferred to the laboratory where they were stored at −80˚C before molecular analyses.

Total RNA Extraction
Total RNA from the 135 samples were extracted individually with Trizol (Invitrogen) according to the protocol described by [8]. Before the extraction, the mortars were first sterilized during 2 hours at 180˚C and then kept at −80˚C until use. Briefly, RNA extraction followed was done according to the following

Potyvirus Detection by RT-PCR
Detection of potyviruses was carried out in an RT-PCR assay using the universal potyvirus primers P077-F: ATGGTHTGGTGYATHGARAAYGG and P078-R: CARATGAARGCMGCAGCA [9]. Viral cDNAs were obtained from the ex-

Potyvirus Complete Coat Protein Gene Sequencing
Samples positive for potyvirus detection were used for amplification of the complete cp gene. Therefore, independent PCRs were performed using primer combinations P105-F/P078-R and P077-F/P106-R (Table 1) as described by [4]. Expected size fragments were sequenced, assembled and blasted against GenBank NCBI. New primers were designed (Table 1) based on the sequences obtained and taking into account closely related potyvirus sequences in GenBank database. Independent PCR amplifications were performed as described above using the newly designed primers and conditions indicated in Table 1. Amplicons of the expected sizes were directly sequenced using respective primer pairs.

Sequence Analysis
All sequences contigs were edited and assembled using DNAMAN software to A. E. Zongo et al.
generate complete CP sequences. The cp sequences were compared to GenBank database sequences by Blastn NCBI (BLAST, http://www.ncbi.nlm.nih.gov/blast). Nucleotide and protein sequence identities between among query and retrieved sequences were determined using SDT v1 software [10]. Relative synonymous codon usage (RSCU) was determined using Seqinr [11]. Phylogenetic trees were reconstructed from complete CP sequences using the MEGA 6 software [12]. After Clustal alignment, the Maximum Likelihood (ML) method was applied for the phylogenetic trees reconstruction using the TN93 + G + I and Protein JTT + G + I nucleotide substitution models, with a bootstrap of 1000 replications. The amino acid composition and the nucleotide diversity between the different groups of potyviruses identified by this study were determined with the Mega 6 software.

Potyvirus Detection in Bambara Groundnut
As illustrated in Figure 1, amplicons of expected size (327 bp) were obtained in RT-PCR tests for potyvirus detection in Bambara groundnut using potyvirus universal primers. The universal potyvirus primers [9] associated to the blast NCBI [13] analyses were effective in detecting potyviruses in 36 isolates out of 135 samples analyzed, indicating a virus prevalence of about 26.7%. Twenty of these isolates (20/50 samples) originated from Sudan zone (40% in prevalence), sixteen (16/50 samples) from the Sudan-Sahel zone (30% in prevalence), while no positive samples were detected in the Sahel zone. This distribution makes the Sudan zone the most infested by the Bambara groundnut potyviruses in the country.

Nucleotide Diversity in Potyviruses Coat Protein Sequences
Complete CP sequences were obtained from 24 isolates out of 36 positive samples for potyvirus detection. Full CP sequence could not be obtained from the remaining isolates due the insufficient quality of corresponding sequencing data.
The 24 full CP sequences were submitted to the Genbank database (Table 2).   Nucleotide and amino acid analysis of the coat protein sequences showed a high virus diversity. As shown in Table 3, three groups of potyvirus sequences designated group 1 (7 isolates), group 2 (4 isolates) and group 3 (13 isolates) were differentiated. Within group nucleotide identity was 94.6% -100%, 93.7% -100% and 94.2% -100% in group 1, group 2 and group 3, respectively. Higher amino acid identity appeared in each group, reaching 97.8% -100% in group 1, 97.7% -100% (group 2) and 97.8% -100% (group 3). Percentages of nucleotide and amino acid identity were significantly lower between groups. At nucleotide level, isolates in group 1 shared roughly 72% -74% identity with isolates of group 2 or isolates of group 3. Nucleotide identity between group 2 and group 3 ranged between 75.8% and 77.7%. Further between group comparisons indicated that both nucleotide and at amino acid identities showed similar trends. Isolates in the current study were also compared to the closest virus isolates from the Genbank database. As presented in Table 3 isolates in group 1 shared high sequence identity only with CABMV isolates at both nucleotide (94.6% -100%) and amino acid (97.8% -100%) levels. Percentages of nucleotide or amino acid identity with any closest isolate were less than 77%. Isolates in group 2 showed only 70.6% to 77.7% nt identity and less than 80% aa identity with any closest virus isolate outside the group. The highest nucleotide identity (76.9% -

Coat Protein Features and Relative Synonymous Codon Usage
The twenty amino acids were found in all three groups but CP sequences in Group 2 isolates consisted of 259 aa which was 16 less amino acids than in the other two groups (275 aa). All CP amino acid sequences displayed the N-terminal DAG motif which is conserved in most potyviruses. same preferences. Group 1 of potyvirus also differs from group 2 and group 3, preferably using GGG codons for glycine and UCU for Serine. Another distinction between the potyvirus groups was evident in the preferred codon usage for asparagine (AAC in group 1 versus AAU in groups 2 and 3) and by histidine (CAC in groups 1 and 2 with respect to CAU in group 3). The clearest divergences between the potyvirus groups were observed in groups that did not use specific codons for certain amino acids. This was exemplified by codon UGU (Cys) in group 1, codons UGC (Cys) and CGC (Arg) in group 2 and codons CGA (Arg) and UCC (Ser) in group 3.

Phylogenetic Diversity of Bambara Groundnut Potyviruses
Phylogenetic analysis performed on the nucleotide and protein sequences of the 24 isolates along with homologous sequences retrieved from GenBank clearly distinguished clades that corresponded to the three groups of potyviruses described previously ( Figure 2). In both nucleotide and protein based phylogenetic   [4]. Group 2 and group 3 formed two new clusters which did not include any already known virus isolate.
Therefore, group 2 and group 3 were referred to as Bambara groundnut potyvirus 1 (BGPV1) and Bambara groundnut potyvirus 2 (BGPV2), respectively. Figure 3 presents the geographical distribution of the three groups of potyviruses characterized on Bambara groundnut in this study. BGPV1 isolates were found in the Sudan zone only whereas both CABMV and BGPV2 isolates were found in the Sudan and Sudan-Sahel zones. However, discrepancies were observed between the two agroclimatic zones. CABMV isolates were most frequent in the Sudan Sahel zone (25% of the isolates) than in the Sudan zone (8.3%). By contrast, most BGPV2 isolates (29.2%) were found in the Sudan zone while only 4.2% of the isolates originated from the Sudan Sahel zone. Altogether, all three virus groups were found in the Sudan zone which held 70% of virus isolates.

Discussion
Viruses of the genus Potyvirus were clearly detected by RT-PCR from field samples of Bambara groundnut using widely used potyvirus universal primers [4] [9] [14] [15]. In addition, sequencing of the CP gene and sequence analysis confirmed that virus isolates belonged to the genus Potyvirus. Moreover, all protein sequences displayed at their N-terminal ends, the DAG motif which is known to be conserved in aphid-transmissible potyviruses [16] [17].  Virus isolates from group 1 shared more than 94% with CABMV at both nucleotide and amino acid levels. Consequently, they were considered as CABMV isolates. Phylogenetic analysis of CABMV isolates revealed some intraspecies diversity whereby isolates from this study split in two groups. This consistent with earlier studies on Bambara groundnut CABMV isolates [4].
Unlike group 1, Isolates in group 2 and group 3 were not closely related to any particular virus species. On the one hand, isolates from group 2 shared 77% nucleotide identity with BCMNV or Passiflora chlorosis virus but their amino acid identity with these two viruses did not exceed 79% which was clearly below the 82% demarcation limit. On the other hand, isolates from group 3 shared no more than 78.3% nt identity and 81.5% aa identity with their closest virus species (Passiflora virus Ugandan [J896003]). Altogether, Bambara groundnut potyviruses in groups 2 and 3 appeared as distinct potyviruses which are characterized for the first time. Thus, the names Bambara groundnut potyvirus 1 (BGPV1) and Bambara groundnut potyvirus 2 (BGPV2), respectively, have been proposed.
The distribution of Bambara groundnut potyviruses in agroclimatic zones was characterized by more viruses occurring in the Sudan zone. This is in agreement with results found in cowpea viruses [15]. In this study, no potyvirus was found in Bambara groundnut in the Sahel zone as previously reported by [4]. This is likely due to the rare occurrence of potyvirus infections in the Sahel zone where climatic conditions are less favorable to the buildup of aphid vector populations [7]. In addition, Bambara groundnut cultivation is less common in the Sahel zone more prone to drought.

Conclusion
Potyviruses infecting Bambara groundnut in Burkina Faso were characterized at molecular level. In addition to the CABMV, two putative new potyvirus species referred to as BGPV1 and BGPV2 were reported for the first time. All three viruses occurred in the more humid Sudan zone, whereas only two were found in the Sudan Sahel zone and none in the Sahel zone. Further molecular characterization of BGPV1 and BGPV2 isolates through the determination of full genomic sequences associated with the determination of key biological properties is needed for better description of Bambara groundnut potyviruses.
A. E. Zongo et al.