This paper mainly presented a good simple and multi - linear regression model of verbs in the Quran book. This model, gives an analysis for the influence to frequency of words with the form (—un, ---) made by the frequency of plural present verbs (t—un, --- ) or (y—un, --- ), and models , and the relationship between independent variables and dependent variable by fitting a linear equation to the observed data with simple linear regression model. The matlab function is used for find ing the parameters of the linear regression model and plotting the fits. The results show that the parameters of the model are one vector (1, 1) and mean of dataset is (6, 7) . It s corresponding to the verb with input is frequency of the verb they enter and the frequency of enter (yadkolun dakilun), also other 17 points exist in the line and in the dataset of 387 verbs and their derivate verbs in Quran. The name of Allah ( ) showed when we use tree variables and plot it in 3D with option “Show Text” for a multi regression model.
The scripture of the Quran has been subjected to various intense mathematically based studies to reveal the protection mechanisms embedded in the composition of the Quran and to provide evidence of its credibility, authenticity and divinity see for instance [
Therefore, the development of the mathematical theory has been highly motivated and driven by the categorical recognition of the author that Allah may have embedded varying mathematical algorithms, equations and regression models for protecting the Quran, as well as to prove its divinity and to emphatically exclude any human influence on the manufacture of the Quran. Because Allah promises that the Quran will always be preserved and protected from any corruption such as addition or deletion or relocation of any of its verses from chapter to another. Therefore, unveiling any of these algorithms would help unlock many of the Quranic secrets, particularly those related to the Quran’s primary parameters like words and verbs as well as how the Quran’s design is related to the fit of linear regression.
Furthermore, this work is set to statistically and numerically validate and authenticate the first drawing of the Quran (Uthmanic manuscript) related statistics such as the total number of words and verbs of the Quran.
Regression analysis describes the relationship between a dependent variable and several independent variables.
Regression analysis describes the relationship between a dependent variable and several independent variables, for the estimation of the parameters model see for instance [
This paper is organized as follows: in Section 2, we give the initiation of linear regression; linear regression model in Quran with numerical results is given in Section 3.
Regression analysis is a statistical technique for estimating the relationship among variables which have reason and result relation. Main focus of univariate regression is analyses the relationship between a dependent variables X 1 , ⋯ , X n and one independent variable Y and formulates the linear relation equation between dependent and independent variable.
The simple linear regression model is the simplest regression model in which we have only one predictor X.
This model, which is common in practice, is written as
Y i = b + a X i + ε i , i = 1 , ⋯ , n ,
where
- Y i , X i are the values of the response and predictor variables in the trial, respectively;
- The unknown parameters: a is called the intercept, and b is the slope of the line;
- ε i is usually assumed to be iid (error) from N ( 0 , σ ε 2 ) specially for inference purposes (see for instance [
Then estimates of simple linear model’s parameters should be obtained accordingly, using some method like the ordinary least squares method, which relies on minimizing the sum of square of errors ∑ ε i 2 .
For the simple linear regression model the ordinary least squares estimations of a and b are
b ^ = Y ¯ − a ^ X ¯
and
a ^ = ∑ i ( X i − X ¯ ) ( Y i − Y ¯ ) / ∑ i ( X i − X ¯ ) 2
where X ¯ and Y ¯ are the mean of the variable X and the variable Y respectively.
The goodness R 2 of fit is defined as
R 2 = ∑ i ( Y i − Y ¯ ) 2 / ∑ i ( X i − X ¯ ) 2 .
Regression models with one dependent variable and more than one independent variable are called multi-linear regression (see for instance [
Multivariate regression analysis model is formulated as in the following:
Y = α 0 + α 1 X 1 + ⋯ + α n X n + ε
where
- Y is the dependent variable,
- X i is the independent variables,
- α i is the parameters,
- ε is the error.
The assumptions of multi-linear regression analysis are normal distribution, linearity, freedom extreme values and having no multiple ties between independent variables [
The linear model can be written as
Y = X α + ε
where
- Y ∈ R n is the vector of observations on the dependent variable,
Y = ( Y 1 , ⋯ , Y n ) T .
- X ∈ R n × R p + 1 is the matrix consisting of a column of ones and p column vectors of the observations on the independent variables, the form of X is
1 X 11 ⋯ X 1 p ⋮ ⋮ ⋱ ⋮ 1 X n 1 ⋯ X n p
- α ∈ R p + 1 is the vector of parameters to be estimated,
α = ( α 0 , α 1 , ⋯ , α p ) T .
- ϵ ∈ R n is the vector of random errors,
ϵ = ( ϵ 0 , ϵ 1 , ⋯ , ϵ n ) T .
The vector α is a vector of unknown constants to be estimated from the data by α ^ .
The normal equations [
X T X α ^ = X T Y .
If X T X has an inverse, then the unique solution of normal equations given by
α ^ = ( X T X ) − 1 ( X T Y ) .
The vector Y ^ of estimated means of the dependent variable Y for the values of the independent variables X 1 , ⋯ , X n in the dataset is computed as
Y ^ = X α ^ .
However, to express Y ^ as a linear function of Y. Thus,
Y ^ = [ X ( X T X ) − 1 X T ] Y .
In this section we conceder the following integer variables:
- X 1 the frequency of plural present verbs with a form (y―un, ي---ون) or there inverse (ly―un, لي---ون);
- X 2 the frequency of plural present verbs with a form (t―un, ت---ون) or there inverse (lt―un, لت---ون);
- X 3 the frequency of the verbs in the above form without (t,ت) or (y,ي ).
We use the software of Quran statistics [
Let Y 7 , Y 9 , Y 32 is the sum of frequency of all above derivative verbs with same form.
However, the dataset are given in
Since X 3 are not a frequency of plural present verbs [
Y ≈ a X + b ,
where a and b are the parameters of the model.
For estimate the parameters a and b, calculate the coefficient of correlation R, plotting a fit and test hypothesis of simple linear regression model we used the following matlab codes:
[r, m, b] = regression (X, Y);
plotregression (X, Y);
fitlm (X, Y);
Index | ( Y i , X 1 i , X 2 i ) | Index | ( Y i , X 1 i , X 2 i ) | Index | ( Y i , X 1 i , X 2 i ) | Index | ( Y i , X 1 i , X 2 i ) |
---|---|---|---|---|---|---|---|
1 | (143, 57, 83) | 2 | (141, 85, 56) | 3 | (94, 87, 7) | 4 | (91, 83, 8) |
5 | (74, 42, 32) | 6 | (74, 2, 0) | 7 | (71, 0, 0) | 8 | (66, 22, 24) |
9 | (51, 0, 0) | 10 | (48, 28, 20) | 11 | (45, 22, 19) | 12 | (43, 13, 14) |
13 | (41, 28, 5) | 14 | (38, 14, 20) | 15 | (37, 18, 19) | 16 | (36, 19, 7) |
17 | (32, 25, 7) | 18 | (29, 9, 19) | 19 | (29, 6, 10) | 20 | (28, 11, 17) |
21 | (26, 20, 2) | 22 | (25, 21, 4) | 23 | (25, 10, 12) | 24 | (25, 0, 1) |
25 | (23, 15, 6) | 26 | (23, 10, 13) | 27 | (23, 10, 2) | 28 | (22, 20, 2) |
29 | (22, 16, 6) | 30 | (21, 16, 5) | 31 | (21, 12, 9) | 32 | (21, 0, 0) |
33 | (20, 18, 2) | 34 | (20, 1, 0) | 35 | (18, 14, 4) | 36 | (17, 14, 3) |
37 | (16, 15, 1) | 38 | (16, 11, 4) | 39 - 40 | (16, 10, 6) | 41 | (15, 14, 1) |
42 | (15, 14, 1) | 43 | (14, 13, 1) | 44 | (14, 12, 2) | 45 | (14, 11, 3) |
46 | (14, 5, 1) | 47 | (13, 7, 6) | 48 | (13, 2, 11) | 49 | (12, 11, 1) |
50 | (12, 10, 2) | 51 | (12, 9, 3) | 52 | (12, 5, 7) | 53 - 54 | (12, 3, 9) |
55 | (11, 8, 3) | 56 | (11, 7, 4) | 57 | (11, 5, 6) | 58 | (10, 9, 1) |
59 | (10, 9, 1) | 60 - 61 | (10, 7, 3) | 62 | (10, 6, 4) | 63 | (10, 5, 5) |
64 | (10, 3, 7) | 65 | (9, 8, 1) | 66 | (9, 7, 2) | 67 | (9, 6, 3) |
68 | (9, 4, 5) | 69 | (9, 4, 4) | 70 | (9, 0, 1) | 71 - 73 | (8, 7, 1) |
74 | (8, 5, 2) | 75 | (8, 4, 4) | 76 | (8, 0, 8) | 77 | (8, 0, 1) |
78 - 82 | (7, 7, 0) | 83 - 85 | (7, 6, 1) | 86 | (7, 6, 1) | 87 | (7, 5, 2) |
88 - 89 | (7, 5, 0) | 90 | (7, 5, 2) | 91 | (7, 4, 3) | 92 | (7, 3, 0) |
93 | (7, 2, 2) | 94 | (7, 1, 0) | 95 - 96 | (6, 6, 0) | 97 - 99 | (6, 5, 1) |
100 | (6, 5, 0) | 101 | (6, 4, 2) | 102 - 104 | (6, 3, 3) | 105 | (6, 2, 0) |
106 | (6, 1, 1) | 107 | (6, 1, 4) | 108 - 113 | (5, 5, 0) | 114 - 117 | (5, 4, 1) |
118 | (5, 3, 2) | 119 | (5, 1, 2) | 120 - 121 | (5, 1, 4) | 122 - 129 | (4, 4, 0) |
130 - 137 | (4, 3, 1) | 138 - 141 | (4, 2, 2) | 142 | (4, 2, 1) | 143 | (4, 1, 0) |
144 - 145 | (4, 1, 3) | 146 - 162 | (3, 3, 0) | 163 - 168 | (3, 2, 1) | 169 - 173 | (3, 1, 2) |
174 - 175 | (3, 1, 1) | 176 | (3, 1, 0) | 177 - 178 | (3, 0, 1) | 179 - 212 | (2, 2, 0) |
213 - 217 | (2, 1, 0) | 218 - 230 | (2, 1, 1) | 231 - 235 | (2, 0, 2) | 236 | (2, 01) |
237 - 338 | (1, 1, 0) | 339 - 387 | (1, 0, 1) |
The results of regression m function is given as the parameters a = 1 and b = 1 the coefficient of correlation R = 0.91441 , see also
( Y 86 , X 86 ) is frequency of verbs of enter (yadkolun دخلون ، يدخلون dakilun).
Also there exist 17 point in dataset in the line fit their equation is Y = X + 1 , when we use the commend find ( Y = X + 1 ) there indexes in dataset are:
For more information’s about the correspondence of this points in the dataset see
Number of observations: 387, Error degrees of freedom: 384Root Mean Squared Error: 6.18R-squared: 0.836, Adjusted R-Squared 0.835F-statistic vs. constant model: 981, p-value = 1.2e−151Also in the
Index | Words in Arabic | Words in English | Translate in English | Frequency ( X 1 , X 2 , X 3 ) |
---|---|---|---|---|
18 | شكرون | Shakurun | (Be) grateful | (9, 19, 1) |
27 | ملكون | Malikun | (Are the) owners | (10 , 2, 1) |
38 | خافون | Kafun | They are afraid | (11, 4, 1) |
69 | شهدون | Shahidun | (Were) witnesses | (4, 4, 1) |
74 | جعلون | Jaelun | They made | (5, 2, 1) |
86 | دخلون | Dakhilun | Enter (it) | (6, 0, 1) |
99 | كتبون | Katabun | (Are) recorders | (5, 0, 1) |
107 | جهلون | Jahilun | Ignorance | (1, 4, 1) |
140 | حذرون | Hadharun | Forewarned | (2, 1, 1) |
171 | سلمون | Salamun | (Were) sound | (1, 1, 1) |
174 | غلبون | Ghalibun | Victorious | (1, 1, 1) |
213 | كيدون | Kidun | Scheme against me | (1, 0, 1) |
214 | ركعون | Rakaeun | (are) those who bow down | (1, 0, 1) |
216 | طوفون | Tufun | (As) moving about | (1, 0, 1) |
217 | خصمون | Khsimun | Argumentative | (1, 0, 1) |
230 | قومون | Kuamun | (Are) protectors | (1, 0, 1) |
236 | فكهون | Fakahun | (In) amusement | (0, 1, 1) |
The present dataset in this paper fined in Quran gives a good simple and multi linear regression model between the frequency of verbs with a form (―un, ون---) made by the frequency of plural present verbs (t―un, ت---ون) or (y―un, ي---ون) and there inverses.
The results show that the parameters of the model are ones vector (1,1) and mean of dataset is (6, 7). It corresponds to the verb point enter (yadkolun دخلون ، يدخلون dakilun), also other 17 points exist in the line and in the dataset of 387 verbs and their derivate verbs in Quran. The name of Allah (الله) showed when we use tree variables and plot it in 3D with option “Show Text”.
For future work, the estimation parameter of this dataset will be done by using l 1 norm and sub-gradient method, and comparing this model in other drawing of the Quran.
The author thanks Allah for this miracle dataset in Quran. Also we are indebted to the anonymous reviewers and editors of AJCM for many suggestions and stimulating comments to improve the original manuscript.
El Mouatasim, A. (2018) Simple and Multi Linear Regression Model of Verbs in Quran. American Journal of Computational Mathematics, 8, 68-77. https://doi.org/10.4236/ajcm.2018.81006