Plant Disease Severity Assessment Based on Machine Learning and Deep Learning: A Survey ()
1. Introduction
The world’s agricultural production suffers huge losses estimated between 20% and 40% every year [1] [2] . 40% to 50% of such losses are due to pest and diseases [3] [4] . These losses are both quantitative and qualitative and as a result, they are responsible for significant economic losses annually and affect the Gross Domestic Product (GDP) of countries. Plant pest and disease protection has become an important research area, due to its highly correlation with food security, climate change and environmental sustainability. It is an essential tool for precision agriculture.
However, knowledge of the disease severity is an essential factor in crop disease protection and management [4] [5] . Crop disease severity is the ratio of plant units with visible disease symptoms to the total plant unit (e.g. fruit, leaf) [6] . Diagnosis of crop disease and disease severity estimation are closely related. It is therefore essential to identify the disease severity stage as early as possible in order to remedy the yied loss. Traditional methods for disease severity estimatimation mostly rely on manual labor, which is labor-intensive, slow and highly subjective [7] [8] [9] . It requires plant protection experts to visit the field to identify the disease and determine its severity.
In recent years, thanks to advances in computer imaging technology and improved hardware performance, computer vision and artificial intelligence have been widely used in agriculture for plant species classification, disease identification and plant disease severity assessment [10] . Solutions proposed by researchers the last few years for disease severity estimation are based on images processing techniques (IPT), classical Machine Learning (ML) and Deep Learning (DL) algorithms [1] [7] [8] .
This paper is a review of plant disease severity assessment solutions proposed by researchers the last few years. The specific contributions of this paper include:
· Advantages of plant disease severity assessment.
· Analysis of proposed crop diseases severity estimation solutions based on IPT, ML or DL.
· Limitations of proposed solutions and potential challenges.
The paper is organized as follows: Section 2 identifies the advantages of plant disease severity assessment in agriculture, Sections 3 provides a critical analysis of the proposed solutions based on IPT, classical ML and DL algorithms, Section 4 shows the potential challenges in crop disease severity estimation and the last Section concludes the paper.
2. Advantages of Plant Disease Severity Assessment
In a field, a disease can spread rapidly over the entire crop batch and cause a field-wide epidemic, which is undoubtedly devastating. In order to effectively monitor and control such situations, it is important to specify earlier not only the type of disease, but also its severity, which is the ratio between the plant unit showing visible symptoms of the disease and the total plant unit [6] . For example, the ratio of diseased area to leaf area (see Figure 1 and Figure 2). This is why the precise quantification of crop diseases is an absolute necessity in agriculture. Thus, asssessing disease severity enables growers to rationalize disease control, for example by deciding on the right dose of fungicide and type of pesticide, as well as the time of day to spray [8] [11] . A reliable and accurate estimation of disease severity enables farmers to predict epidemics in their fields and yield losses, and to assess disease resistance in crop germplasm [12] [13] [14] . It can also help farmers with pesticide management, disease forecasting, spatio-temporal epidemic modeling and crop loss modeling [7] [15] [16] . Pesticide management plays an important role in protecting the environment by simultaneously reducing crop, soil and water pollution and avoiding pesticide residues in fruits
![]()
Figure 1. Visual scale of greasy spot severity in grape fruit leaves [17] .
![]()
Figure 2. Leaf images of four severity stages (healthy, early, middle and end) of apple black rot disease [18] .
[8] [19] . Early disease severity estimation is essential in global food security [18] .
However, traditionally, disease severity is determined manually by experts by estimating the visual surface area of lesions on plants (e.g. leaf, fruit, etc.). This process is slow, time-consuming, highly subjective and largely dependent on the level of experience of agronomists and farmers for visual scoring [1] [7] [20] [21] . Incorrect assessment of disease severity in plants can lead to erroneous conclusions, resulting in wasteful or inefficient use of pesticides, which can further exacerbate losses [15] [22] [23] [24] .
3. Automatic Disease Severity Assessment
In the literature, several solutions have been proposed for estimating the severity of plant diseases. These solutions can be divided into three types:
· Visual assessment, traditionally used by plant experts,
· Solutions based on hyperspectral imaging (image processing techniques) or ML,
· And more recently, solutions based on DL.
For the purposes of this paper, we worked on 47 articles whose work focused on estimating the severity of plant diseases. The articles were searched on Google Scholar, Springer Link, Web of Science, IEEE Xplore, Scientific Research, Frontiers, etc., using the keywords “disease-severity-assessment-plant”. Figure 3 shows the number of studies carried out on this topic every year, between 2008 and 2023.
These work concerns different crops, different diseases and different approaches for determining disease severity. Among the crops covered by the 47 reviewed articles, tomato and maize are the most treated (8 times). It is followed by tomato (8 times), wheat (5 times) and strawberry (4 times) (see Figure 4).
![]()
Figure 3. Graph showing the distribution of publication years for 47 articles based on the keywords “disease-severity-assessment-plant” from Google Scholar, Springer Link, Web of Science, IEEE Xplore, Scientific Research, etc.
![]()
Figure 4. Graph showing the number of times crops were treated in the 47 reviewed papers.
The definition of the severity grades or lavels is essential in diseases severity estimation. Based on the 47 reviewed articles, we can say there are three categories of severity grades namely qualitative grade, quantitative grade and direct calculation of the percentage of the disease lesions (see Figure 5). For example, [18] used Healthy, Early, Middle and End severity levels which are qualitatives. [25] used healthy (<0.1%), very low (0.1% - 5%), low (5.1% - 10%), high (10.1% - 15%) and very high (>15%) grades which are quantitatives. In [26] , the percentage of the disease lesions is directly calculated. In the following subsections, we take a closer look at solutions based on IPT and ML and DL algorithms.
3.1. Solutions Based on IPT
In agricultural research, IP technology has undergone significant development. Solutions based essentially on IP technology have been proposed by reseachers for assessing plant leaf disease severity. For instance, Wijekoon et al. [27] used multispectral images thresholding operation to calculate the ratio of infected area, lesion color index and severity index of soybean leaf infected by rust disease. Weizhong et al. used the Sobel operator to segment soybean rust disease and to determine the spot edge and disease severity of the plant, which is measured by calculating the quotient of disease spot area and leaf area [28] . In [29] , authors applied simple threshold and triangle thresholding methods to respectively segment the diseased leaf area and lesion region on the leaf. The results show an accuracy of 98.60% for estimating the severity of brown spot on soybean leaves. Thus IP technology to measure crop disease severity is convenient and accurate but the severity of the disease measured is depends upon segmentation of the image. Authors of [30] developed a mobile application based on image processing. This app is able to calculate the severity percentage of six different diseases with typical lesions of varying severity. Palma et al. proposed an approach for automatic quantitative assessment of disease severity based on leaf images, regardless of disease type. The proposed method is based on a highly
![]()
Figure 5. Graph showing the number of times categories of grades are used in the 47 reviewed papers.
efficient, noise-free positive nonlinear dynamic system that recursively transforms the image of the leaf until only the symptomatic patterns of the disease remain [31] .
3.2. Solutions Based on ML
In the literature, there are very few works based on classical ML algorithms for estimating crop disease severity. ML-based solutions often use IPT. IPT are used to improve the quality of the images used, while ML algorithms are used for image segmentation, feature extraction and image classification. For example, Owomugisha et al. [32] used image processing technics and classical ML algorithms such as linear SVC, KNN and Extra Trees to classify four cassava diseases (mosaic, brown streak, bacterial blight and green mite) and assess their severity on diseased leaves. They used a dataset of 7386 images divided into 5 severity levels ranged from 1 to 5. They obtained accuracy scores of 99%. Authors of [33] utilized ML models to detect and classify downy mildew (DM) disease severity in watermelon in five disease severity stages namely low, medium, high and very high. They used Hyperspectral watermelon leave images collected in laboratory and in the field by a UAV and implemented multilayer perceptron (MLP) and decision tree (DT). Results show that classification accuracy increased when the disease severity increased and the best classification results were obtained from the MLP method in high and very high severity stages (87% - 90%). Jiang et al. [34] used two unsupervised learning algorithms namely K-means clustering and spectral clustering and three supervised learning algorithms including SVM, RF, and KNN to assess the severity of wheat leaf stripe rust disease. They used a dataset of 400 samples splitted into height severity levels namely 1%, 5%, 10%, 20%, 40%, 60%, 80%, and 100%. RF model obtained the best assessment performance with an overall accuracy of 100%. Table 1 summarizes the above-mentioned works according to the year of publication, the crop concerned, the parts of the crop infected, the diseases treated, the disease severity grades, source of the dataset and the algorithms used.
3.3. Solutions Based on DL
Based on the 47 articles we worked on, the majority of solutions proposed for estimating plant disease severity are based on DL and essentially on Convolutional Neural Networks (CNNs). DL-based solutions can be divided into two categories: CNN-based solutions and CNN-based segmentation networks. These solutions generally follow the workflow described in Figure 6.
3.3.1. CNN-Based Solutions
During the last decade, several solutions based on CNN have been proposed in the filed of crop diseases diagnosis. The estimation of disease severity, an extension of disease diagnosis, is also an area in which several CNN-based solutions have been proposed in recent years. For instance, automatic disease severity assessement of plant based on CNN was first proposed in 2017 by Wang et al. [18] .
![]()
Table 1. Summary of solutions based on ML.
![]()
Figure 6. CNN-based solutions general flowchart.
Similarly, several authors have proposed plant disease severity assessment solutions based on well-known CNN such as VGG16, VGG19, ResNet18, ResNet50, ResNet101, Inception-V3, GoogLeNet, AlexNet, SqueezeNet, DenseNet121, MobileNetV2, NASNetMobile [4] [5] [12] [14] [15] [18] [24] . Some authors have proposed their own CNNs, which have given better results than well-known CNNs [11] [21] [23] [35] - [41] . Some solutions are based on the combination of CNN and classical ML algorithms such as Random Forest [19] and SVM [13] . Among these CNN-based solutions, several have used the principle of Multi-task learning [1] [4] [8] [10] [14] [25] [35] [37] [42] . Multi-task learning, as opposed to single-task learning, is a learning principle in which several linked tasks can be learned simultaneously [1] . Plant disease classification and disease severity estimation are two related tasks, which can be learned simultaneously using multitask learning. Multi-task learning is more advantageous than Single-task learning because it can reduce the risk of overfitting and lead to better generalization of a model [43] .
These CNN-based solutions have achieved extraordinary performances ranging from 70% to 99% accuracy.
Table 2 is a summarize of theses above-mentioned works according to the year of publication, type of architecture (single or multi task), crop concerned, parts of the crop infected, diseases treated, the disease severity grades, source of the dataset, models used and results obtained.
3.3.2. CNN-Based Segmentation Networks Solutions
CNN-based segmentation is widely used in object detection and localization. CNN-based segmentation networks have also been used to assess the severity of plant diseases and other related agricultural tasks. Of the 47 articles reviewed, seven used a CNN-based segmentation network, such as Unet [10] [12] , Mask R-CNN [49] [50] [51] , Faster R-CNN [8] [42] , SegNet [26] , DeepLav [26] and so on. They are based on segmentation of infected areas (of the leaf, fruit, etc.) to quantify disease severity (see Figure 7). For crop disease severity assessment. Su et al. [49] proposed a solution based on Mask-RCNN for the detection and severity estimation of Fusarium Head Blight (FHB) on wheat spikes. Authors defined fifteen severity grades (from grade 0 to grade 14). The proposed solution achieved accuracies of 77.76% and 98.81%, respectively for FHB detection and severity assessment. Chen et al. [20] developped a Deep Learning algorithm (BLSNet) based on Unet for rice leaf bacterial lesion segmentation and severity estimation. Goncalves et al. [26] used six CNNs namely Unet, SegNet, PSPNet, FPN, DeepLabV3 (Xception) and DeepLabV3 (MobileNetV2) to estimate the severity of Coffee leaf miner, soybean rust and wheat. They used a dataset for each of the three diseases. Results show that average precision values are ranged from 90.4% to 95.6% and recall values are ranged from 89.4% to 94.7%. Hu et al. [8] used Faster R-CNN and VGG16 for detection and severity assessment of tea leaf blight disease, respectively. Detection average precision and the severity grading accuracy improved by more than 6% and 9%, respectively, compared to existing solutions. Pillay et al. [51] applied Mask R-CNN to quantify the severity of common rust disease in maize leaf. The Mask R-CNN performed better than standard image processing algorithms more than 5%. Gerber et al. [51] examined automated tuning of Mask R-CNN parameters which are very numerious and use also a genetic algorithm (GA) in order to enhance performance achieved in their previous work [50] . Pan et al. [42] proposed a two-stage model including object detection by Faster R-CNN and few-shot learning by siamese network to estimate strawberry leaf scorch severity. The proposed two-stage method achieved the highest estimation accuracy of 96.67%. Table 3 is a summarize of theses above-mentioned works according to the year of publication, type of architecture (single or multi task), crop concerned, parts of the crop infected, diseases treated, the disease severity grades, source of the dataset, models used
and results obtained.
4. Limitations of Proposed Solutions and Potential Challenges
Results obtained by solutions based on IPT, classic ML and DL algorithms are very promising in crop disease severity estimation. They help to avoid yield losses, reduce production costs, ensure good disease management and so on.
However, these solutions have limitations that need to be overcome:
· For IPT-based solutions, quality of the images has a strong influence on image segmentation and, consequently, on the determination of the area infected by a disease. To obtain good results in quantifying disease severity, it is essential to use high-quality images (noiseless, without complex backgrounds, etc.).
· Estimating the severity of a crop disease from an image containing several leaves is not addressed in the reviewed works, but it is a situation that can occur in real life.
· The accuracy of severity classification increases with the severity of the crop disease. In other words, for solutions using quantitative severity levels, it is difficult to quantify disease at an early stage.
· For solutions using qualitative or quantitative severity grades, image labeling has a major impact on the classification result, and must therefore be carried out by an expert. A bad image labeling systematically leads to incorrect quantification of disease severity and, consequently, to poor a disease management.
· For solutions using the segmentation of disease lesions (on leaves, fruit, etc.), much of the edge information is lost, impacting the result of disease severity quantification.
· Crop diseases can affect leaves, fruits, flowers, panicles and so on. But until now, researchers have focused on assessing only the severity of leaf diseases.
![]()
Table 3. Summary of CNN-based segmentation networks.
![]()
Figure 7. Example of leaf disease severity quantification based on lesion segmentation [26] .
Estimating plant disease severity on other parts, such as fruits, would also be of great use in crop disease management.
5. Conclusions and Future Works
Diagnosing crop diseases goes hand in hand with assessing their severity. Estimating the severity of diseases is very useful for plant disease management. Several solutions based on IPT, ML or DL have been proposed by researchers to estimate the crop diseases severity. These solutions have achieved very impressive results, but have limitations that need to be taken into account.
For future work, we aim to propose a CNN-based solution to assess the severity of four mango fruit diseases namely anthracnose, Alternaria, black mould rot and stem and rot. This solution will be adapted to the reality of Africa in general, and Senegal in particular, since we will use the MangoFruitDDS dataset [52] containing images of the above-mentioned diseases and collected in an orchard located in Senegal.