1. Introduction
Malignant (cancer) cells can be formed in the tissues of the cervix, the lower, narrow end of the uterus to result in cervical cancer. Cervical cancer, predominantly attributable to infection, usually develops slowly over time and is the second [1] [2] most common cancer in women worldwide, and is a leading cause of morbidity and mortality in women. Each year about 265,700 women die from cervical cancer worldwide while approximately 527,600 new cases are diagnosed [3] . Human papillomavirus is considered to be one of the most important risk factors in the development of cervical cancer while sexual transmission is the predominant route of HPV infection. Treatment options for patients with cervical cancer depend on several factors and include surgery or a concurrent chemo-radiotherapy regimen consisting of cisplatin-based chemotherapy with external beam radiotherapy and brachytherapy. A large and consistent body of studies (case series, case-control studies, cohort studies, and intervention studies) documented a relationship between a human papillomavirus (HPV) infection, particularly the oncogenic subtypes such as HPV 16 and 18, and the development of human cervical cancer. In the absence human papillomavirus (HPV) viral DNA, human cervical cancer appears not to develop. Thus far, most studies conducted identified human papillomavirus as key risk factors of human cervical cancer. Even if the research in relation to the etiology of human cervical cancer has made substantial progress, a cause or the cause of human cervical cancer is still not identified.
2. Material and Methods
Human cervical cancer can be a deadly disease too. Identifying the cause of cervical cancer of is of strategic importance in public health.
2.1. Search Strategy
For the questions addressed in this paper, was searched Pubmed for case-control studies conducted in any country and published in English which investigated the relationship between human papilloma virus and cervical cancer at least by polymerase chain reaction (PCR). The search in Pubmed was performed while using medical key words like “case control study” and “human papilloma virus” and “cervical carcinoma” and “PCR DNA” et cetera. The articles found where saved as a *.txt file while using Pubmed support (Menu: Send to, Choose Radio Button: File, Choose Format: Abstract (text). Click button “create file”). The created *.txt file was converted into a *.pdf file. The abstracts where studied within the *.pdf file. Those articles were considered for a review which provided access to data without any data access barrier.
2.2. The Data of the Studies Analyzed
Novel laboratory techniques [4] (Southern Blot hybridization, Immunohistochemistry (IHC), introduced by Coons [5] in 1941, In-situ hybridization (ISH), described in the year 1969 by Joseph G. Gall [6] , Fluorescent ISH (FISH), RNA in situ hybridization (RNA ISH), Polymerase chain reaction (PCR), Nested PCR, Quantitative polymerase chain reaction (QPCR) et cetera) can improve our understanding of the pathogenesis of diseases. In principle, it is possible to distinguish between benign and malignant cell populations (Immunohistochemistry (IHC)) or to distinguish virus in tumor cells from virus in non-tumor cells (In situ hybridization (ISH)) et cetera. Still, false positive or false negative results or bias is possible. In the light of thoughts like these, the data of the studies analyzed are presented by table (Table 1). The meaning of the abbreviations at, bt, ct, dt, Nt of Table 1 (Table 1) and Table 2 (Table 2) are explained by a 2 by 2-table (Table 3).
![]()
Table 1. Human papillomavirus PCR DNA and human cervical cancer due to the studies analysed.
![]()
Table 2. Without a human papillomavirus infection no human cervical cancer.
![]()
Table 3. The sample space of a contingency table.
2.3. Statistical Analysis
All statistical analyses were performed with Microsoft Excel version 14.0.7166.5000 (32-Bit) software (Microsoft GmbH, Munich, Germany). In order to simplify the understanding of this article, to increase the transparency for the reader and to correct some of the misprints of former publications, several of the following lines are repeated word by word and taken from former publications.
2.3.1. The 2 × 2 Table
The 2 × 2 table in this article is defined [15] - [37] in general more precisely (Table 3) as follows.
In general it is
,
,
,
and
. Equally, it is
. In this context, it is
,
or in other words
while p(At) is not defined as p(at). In the same context, it is
and equally that
. Furthermore, the joint probability of At and Bt is denoted by
. Thus far, it is
or in other words it follows that
. Define
, the famous Einstein’s term under conditions of probability theory and we obtain
. In general, it is
. These relationships are viewed by the table (Table 4) as follows.
2.3.2. Independence
In the case of independence of At and Bt it is
(1)
2.3.3. Necessary Condition (Conditio Sine qua Non)
The mathematical formula of the necessary condition relationship (conditio sine quam non) [15] - [37] of a population was defined as
(2)
and used to proof the hypothesis: without At no Bt . In particular it is
![]()
Table 4. The probabilities of a contingency table.
(3)
Scholium.
The study design and other factors can have an impact on bias with respect to the necessary condition. A different question worth asking concerns the relationship between the independence of an event At (a condition) and another event Bt (conditioned) and the necessary condition relationship. A fundamental question worth considering at this stage is whether is it possible that an event At is a necessary condition of event Bt an even if the event At (a necessary condition) is independent of an event Bt (the conditioned). In this context, the conditio sine qua non was defined as
(4)
or as
(5)
Under conditions where an event At is independent of an even Bt it is equally true that
(6)
Rearranging equation before it is
(7)
or
(8)
or
(9)
Only under conditions where p(At) = 1, theoretically it is possible to treat At as a necessary condition of Bt even if At is independent of Bt and vice versa, otherwise not. In other words, it is very difficult to treat a statistically significant conditio sine qua non relationship as very convincing if at the same time an event At is independent of and event Bt and vice versa. While discussing the statistical significance of results with respect to a necessary condition (or a sufficient condition or a necessary and sufficient condition), such or similar arguments should be considered. Due to an inappropriate study design or other sources of possible bias, the statistical significance of a conditio sine qua non relationship should be treated with very great cautious if evidence is provided that at the same time the same investigated parameters are independent of each other.
2.3.4. The X2 Goodness of Fit Test of a Necessary Condition
Under some circumstances, the rule three and other methods can be used to test the significance of a necessary condition. In this publication, the chi-square [38] goodness of fit test was used to determine whether sample data are consistent with a hypothesized (theoretical) distribution of a necessary condition. In particular, the hypotheses can take the following form.
H0: The sample distribution does agree with the hypothetical (theoretical) distribution of a necessary condition.
HA: The sample distribution does not agree with the hypothetical (theoretical) distribution of a necessary condition.
The X2 Goodness-of-Fit Test can be shown schematically as
(10)
The degrees of freedom are calculated as N − 1. Interestingly, if there is no discrepancy between an observed and a theoretical distribution at all, then the value of the calculated X2 = 0. As the discrepancy between an observed and the theoretical distribution of a necessary condition becomes larger, the X2 becomes larger. This X2 values are evaluated by the known X2 distribution. An adjustment (Yate’s correction for continuity) can be used when there is one degree of freedom. When there is more than one degree of freedom, the same adjustment is not used. Applying this to the formula above, we find the X2 Goodness-of-Fit Test with continuity correction shown schematically as
(11)
Under circumstances, where the term (|Observedt − Expectedt|) is less than 1/2, the continuity correction should be omitted. The theoretical (hypothetical) distribution of a necessary condition is shown schematically by the 2 × 2 table (Table 5).
The theoretical distribution of a necessary condition (conditio sine qua non) is determined by the fact that ct = 0. The X2 Goodness-of-Fit Test with continuity correction of a necessary condition (conditio sine qua non) is calculated as
![]()
Table 5. The theoretical distribution of a necessary condition (conditio sine qua non).
(12)
or more simplified as
(13)
Under these circumstances, the degree of freedom is d.f.
. The conditio sine qua non model can be used widely and is one of the new and appropriate methods of analysis of binary outcome variables. In this context, meta-analysis and systematic reviews aims to combine effects estimated from several studies to achieve greater precision of the conclusions drawn and can provide us with more convincing and reliable evidence of some special aspects of medicine. In meta-analysis the heterogeneity between the studies can be modelled via the additive properties of the chi square distribution too. In general, let Xt denote n independent random variables which follow a chi-square distribution. The sum of these independent chi-square variates is itself a chi-square variate which is known as the additive property of independent chi-squares. There may be disadvantages in the use of the chi-square-goodness-of-fit test. Still, the chi square distribution, a continuous probability distribution, is related to the standard normal distribution and is a simple and good measure of model adequacy. However, a particular concern with the use of the chi-square-goodness-of-fit test is a priori justified if expected cell frequencies of a 2 × 2 table are too small (all are less than one).
2.3.5. The Mathematical Formula of the Causal Relationship k
Huxley [38] and Darwin [39] claimed more than a century ago that humans share recent common ancestors with the African apes. Modern molecular methods have spectacularly confirmed their prediction. Genomic divergences between humans and other hominoids and especially our closest living evolutionary relatives the common chimpanzee (Pan troglodytes) and bonobo (Pan paniscus or pygmy chimpanzee) are very small but not zero. Ebersberger et al. [40] , Fujiyama et al. [41] and other sequenced the chimpanzee genome. According to Ebersberger et al. “the chimpanzee genome were sequenced and compared to corresponding human DNA sequences ... the average sequence difference is low (1.24%)” [40] . The Chimpanzee Sequencing and Analysis Consortium calculated “the genome-wide nucleotide divergence between human and chimpanzee to be 1.23%” [42] and confirmed results from other and more limited studies. In other words, the difference between chimpanzee genome and compared to corresponding human DNA sequences is very small. Still there is a difference and this very small difference makes the difference. A chimpanzee is not a human being, a human being is not a chimpanzee. Even if both are similar and “relatives” both are equally not the same. The relationship between the mathematical formula of the causal relationship k [15] - [37] and the closest existing mathematical relatives, Pearson’s measures of relationships, is similar to the circumstances aforementioned. In contrast to Pearson’s product-moment correlation coefficient [43] or to Pearson’s Phi [44] Coefficient (Mean Square Contingency Coefficient et cetera), the mathematical formula of the causal relationship k [15] - [37] is defined at every single event, at every single Bernoulli trial t, as
(14)
where RUt denotes the cause and 0Wt denotes the effect while the chi-square distribution [45] can be applied to determine the significance of causal relationship k. This small difference makes the difference. Only under conditions where the probability of events is constant from trial to trial, we can extrapolate from one Bernoulli trial to N Bernoulli trials with some consequences one of which is that
(15)
or that
(16)
or at the end
(17)
where N is the sample size,
,
,
,
,
. Several factors can have an impact on the calculated causal relationship k with the potential of bias.
Scholium.
Firstly, the relationship between condition and cause has an impact on the causal relationship k. A proper and deeper analysis of the relationship between cause and condition is beyond the scope of this article and can be found in literature [15] - [37] . We will be concerned with the latter sort of entity in this article from a pragmatically point of view. In the hope of casting light on the tricky problems of the relationship between condition and cause, the concept of independence is of use too. The question whether an event At can be a (necessary, sufficient, necessary and sufficient) condition of an event Bt even if both are independent of each other, is already answered few lines before. Still, under which circumstances can we treat an event as a cause or as the cause of another event? Can an event be a cause of another event without being a (necessary, sufficient, necessary and sufficient et cetera) condition of the same event? The concept of this article is restricted on its capacity to bring high degrees of conceptual exactness and rigour to questions like these but not incapable. Most authors who have written on the question of the relationship between condition and cause came to different conclusions. Currently still worthy of consideration is the remark of von Bar.
“Die erste Voraussetzung, welche erforderlich ist, damit eine Erscheinung als die Ursache einer anderen bezeichnet werden könne, ist, daß jene eine der Bedingungen dieser sein. Würde die zweite Erscheinung auch dann eingetreten sein, wenn die erste nicht vorhanden war, so ist sie in keinem Falle Bedingung und noch weniger Ursache. Wo immer eine Kausalzusammenhang behauptet wird, da muß er wenigstens diese Probe aushalten… Jede Ursache ist nothwendig auch eine Bedingung eines Ereignisses; aber nicht jede Bedingung ist Ursache zu nennen.” [46]
Translated into English:
“The first requirement, which is required, thus that something could be called as the cause of another, is that the one has to be one of the conditions of the other. If the second something had occurred even if the first one did not exist, so it is by no means a condition and still less a cause. Wherever a causal relationship is claimed, the same must at least withstand this test… Every cause is necessarily also a condition of an event too; but not every condition is cause too.”
From this statement, it could appear that there is a gap between what is a cause and what is a condition. Is it possible to generalize this finding? In probabilistic approaches to causation, it is obvious that a cause of an event is equally a condition of the same event too. Clearly, the same relationship must not be given the other way too. A condition of an event must not equally be a cause of the same event. In summary, the objections build on the contradiction between condition and cause are no longer justified. A cause is a condition of an event too but not necessarily vice versa. A condition of an event must not be equally the cause of the same event. Thus far, like other fundamental concepts, the concepts of necessary conditions, the concepts of sufficient conditions and the concepts of necessary and sufficient conditions can be one of the handy tools to determine precisely whether a causal relationship is significant or not. A study which provides evidence of a significant causal relationship k without at the same time providing evidence of a significant necessary condition, or of a significant sufficient condition or of a significant necessary and sufficient condition should be treated with some cautious.
Secondly, a proper study design is necessary to use the mathematical formula of the causal relationship k with confidence otherwise bias is possible. For example, the probabilities of two events within a population are know precisely and shown schematically by the 2 × 2 table (Table 6).
The causal relationship k (Table 6) is calculated as k = +0.314800094. Using the conditional probabilities, we obtain the following picture (Table 7).
Now we perform a study A with a sample size of n = 10,000. The verum group is
and the placebo group is
. We do expect that the probabilities within the sample are the same like in the population. Under these conditions we obtain the following picture (Table 8).
Thus far, this study has provided the following data (Table 9).
Calculating the causal relationship k under these conditions, we obtain
(18)
![]()
Table 6. The probabilities within a population.
a causal relationship k of k = +0.991031209. The study with the sample size n = 10,000 should have obtained a causal relationship k = +0.314800094 while the same obtained a causal relationship k = +0.991031209. This example demonstrates that the study design as such can be a source of bias if inappropriate measures are taken. To reduce the bias, it makes sense to consider the prevalence of a factor/an event within the population as much as possible as an essential part of study design. According to the data above, the prevalence of a cause within the population is p(RUt) = 0.999. The sample size of the study is still n = 10,000. Under these assumptions, the sample size of the verum group should be considered as
while the sample size of the placebo group is
(Table 10).
The causal relationship k (Table 10) is calculated as k = +0.314800094. The situation does not really change if a case control study is regarded. If the same problem is analysed within a case control approach, this should not have any systematic influence on the probabilities (Table 11) of the sample studied.
Let us assume that the sample size of this case control study B is n = 1000, with 500 cases and 500 controls. We obtain the following data (Table 12).
The data of this case control study (Table 12) are demanding that the causal relationship k is equal to k = +0.229415734 while the same causal relationship k should be equal to k = +0.314800094 what motivates us to affirm the following conclusion. Inappropriate study design can lead to severe bias. Given the difficulty of the problems as associated with study design it is useful to adopt a strategy of extreme caution under conditions when the data of a study provide evidence of a significant cause effect relationship but fails in the same respect to provide some evidence of a significant necessary condition relationship or of a significant sufficient condition relationship or of a significant necessary and sufficient condition relationship otherwise conclusions may run into difficulties.
2.3.6. The Chi Square Distribution
The chi-squared distribution [38] is a widely known distribution and used in hypothesis testing, in inferential statistics or in construction of confidence intervals. The critical values of the chi square distribution are visualized by Table 13.
2.3.7. The X2 Goodness of Fit Test of a Causal Relationship k
Under some circumstances the chi-square [45] goodness of fit test can be used to
![]()
Table 10. The impact of study design on the causal relationship k.
![]()
Table 11. The data of the case control study B.
![]()
Table 12. The data of the case control study B.
![]()
Table 13. The critical values of the chi square distribution (degrees of freedom: 1).
test the significance of a causal relationship. Under conditions where the probability of events is constant from trial to trial, we expect a constant causal relationship kt. In other words, at each Bernoulli trial t it is
(19)
Performing N Bernoulli trials (Sample size N), the basic relationship will not change. It follows that
(20)
or that
(21)
Simplifying equation we obtain
(22)
Multiplying equation by itself it is
(23)
or
(24)
Dividing equation by N * |1| = N, we obtain
(25)
or
(26)
or the X2 value as
(27)
The chi square (X2) statistic can be used to investigate whether the observed distribution of the causal relationship differ from the theoretical expected distribution of the causal relationship. The table (Table 8) contains the critical values of the chi-square distribution (degrees of freedom, df = 1). Upper-tail and lower-tail critical values of the chi-square distribution with v degrees of freedom are provided by software packages.
3. Results
3.1. Without the Presence of Human Papillomavirus DNA No Human Cervical Cancer
Claims.
Null hypothesis:
The presence of human papillomavirus DNA is a necessary condition (a conditio sine qua non) of human cervical cancer. In other words, the sample distribution agrees with the hypothetical (theoretical) distribution of a necessary condition.
Alternative hypothesis:
The presence of human papillomavirus DNA is not a necessary condition (a conditio sine qua non) of human cervical cancer. In other words, the sample distribution does not agree with the hypothetical (theoretical) distribution of a necessary condition.
The significance level (Alpha) below which the null hypothesis will be rejected is alpha = 0.05.
Proof.
The data reviewed by this article which investigated the relationship between the presence of human papillomavirus DNA and human cervical cancer are viewed by the table (Table 1). Altogether, 8 studies with N = 7657 cases and controls were meta-analyzed while the level of significance was alpha = 0.05. Under these circumstances (degrees of freedom = 8, alpha = 0.05) the calculated Chi-square value (Chi-square Goodness-of-Fit Test with continuity correction) of a necessary condition (conditio sine qua non) is equal to X2 Calculated (SINE) = 9.070460764. The critical Chi square can be obtained (degrees of freedom = 8, alpha = 0.05) as X2 Critical = 15.50731306. In particular, due to the data of the studies meta-analyzed, human papillomavirus DNA and human cervical cancer are not independent (degrees of freedom = 1, Chi-square value calculated = 4009.949276) of each other. The detailed calculations are shown by the table (Table 2). Furthermore, the calculated Chi square value of the necessary condition (X2 Calculated (SINE) = 9.070460764) is less than the critical Chi square value (X2 Critical =15.50731306). Due to this evidence, we do not reject the null hypothesis in favor of the alternative hypotheses. The data as published by studies presented by the table (Table 1) do support our Null hypothesis that the sample distribution of the studies analyzed agrees with the hypothetical (theoretical) distribution of a necessary condition. In point of fact, the presence of human papillomavirus DNA is a necessary condition (a conditio sine qua non) of human cervical cancer. In other words, without the presence of human papillomavirus DNA no human cervical cancer.
Q. e. d.
3.2. Human Papillomavirus Is the Cause of Human Cervical Cancer
Claims.
Null hypothesis: (no causal relationship, k = 0)
There is no causal relationship between human papillomavirus and human cervical cancer.
Alternative hypothesis: (causal relationship, k ¹ 0)
There is a causal relationship between human papillomavirus and human cervical cancer.
Conditions.
Alpha level = 5%. The two tailed critical Chi square value (degrees of freedom = 1) for alpha level 5% is 3.841458821.
Proof.
The data for this hypothesis test are illustrated in the 2 × 2 table (Table 14).
The causal relationship k (HPV DNA, Human cervical cancer) is calculated [15] - [37] as
The value of the test statistic k = +0.723669245 is equivalent to a calculated [15] - [37] chi-square value of
The chi-square statistic, uncorrected for continuity, is calculated as X² = 4009.949276 and thus far equivalent to a p value of p < 0.00001. The calculated chi-square statistic exceeds the critical chi-square value of 3.841458821 (Table 13). Consequently, we reject the null hypothesis and accept the alternative hypotheses. There is a highly significant causal relationship between (k = +0.723669245, p value < 0.00001) human papillomavirus and human cervical cancer. The result is significant at p < 0.00001.
Q. e. d.
4. Discussion
The nature of the relationship between HPV and cervical cancer has been exhaustively investigated over more than 20 years. Several studies which have unequivocally shown that HPV-DNA can be detected in about 95% to 100% of adequate
![]()
Table 14. The causal relationship between HPV and cervical cancer.
specimens of cervical cancer, while there was no significant variation in HPV positivity among countries and support the claim that HPV is a necessary condition cervical cancer. The most reviews available have consistently concluded that there is a strong evidence of an association between HPV and cervical cancer [47] [48] . Still, the cause or a cause of cervical cancer still remains unclear and is not identified without any doubt. To re-evaluate the role of HPV in the etiology of cervical cancer, we re-analyzed some outstanding HPV DNA polymerase chain reaction based case control studies. In point of fact, the studies presented (Table 2) support strong evidence for the hypothesis that HPV is a necessary condition cervical cancer. In other words, without HPV infection no cervical cancer. Only the study of Eluf-Neto et al., 1994 [7] failed on this point. In this context, PCR technology is highly sensitive. Still, contaminated specimens may have induced false (positive) results, particularly in the earliest PCR based studies. Thus far, ignoring factors like varying inclusion criteria, the possibility of contaminated specimens, the dependence of detection rates of HPV using different HPV type-specific PCR primers some detailed investigations of few cervical cancer specimens that appeared to be HPV-DNA negative suggest that these were largely false negatives [49] [50] . With the development of technology and science, the methods for detecting HPV DNA should become increasingly sensitive. It is reasonable to assume that the detection rates of HPV using special HPV type PCR primers may be higher compared with those using other PCR primers. Future and more precise studies should avoid contamination as much as possible while taking the aforementioned and other factors into account. In particular, all studies presented provide strong evidence (Table 2) that there is a highly significant (Table 14) cause-effect relationship between HPV and cervical cancer. As with other sciences, general topics relating to matters like methodology and explanation are still very much present and numerous potential limitations can be acknowledged in the present meta/re-analysis of the studies above. More than that, the sample size of the studies analyzed was equal to N = 7657 while a highly precise and accurate molecular PCR-technology was used to investigate the relationship between HPV and CC which cannot be ignored. Besides of all, as long as other studies are not able to provide a better and more convincing explanation of the etiology of human cervical cancer it appears to be more than justified to accept the following and inescapable conclusion.
5. Conclusion
Human papillomavirus is the cause of human cervical cancer.