Validation of the DSP2 Tool in a Contemporary Identified Skeletal Collection from Northeastern Brazil

The Diagnose Sexuelle Probabiliste v.2 (DSP2) is an accurate tool used for estimating the sex of an individual through the os coxae. The goal of this study was to verify the applicability of the DSP2 tool in a skeletal sample from Northeastern Brazil and to attest for its precision, accuracy, and reliability. The sample was composed of 301 os coxae from the Center for Studies in Forensic Anthropology of the College of Odontology from the University of Pernambuco, in Pernambuco, Brazil. The results reveal that it was possible to correctly estimate the sex of 83.7% of the total sample. The error rate was 0.4%, and the percentage of undetermined individuals varied according to the combination of measurements used. The results demonstrated a high index of accuracy and a low error rate, indicating that DSP2 is a reliable tool for sex estimation applied to this studied Brazilian population.


Introduction
The biological profile of skeletal remains performed when forensic anthropological analysis is essential for the identification of individuals (Krishan et al., 2016).
The os coxae display their sexual dimorphism pattern throughout the entire Advances in Anthropology modern human population, which does not occur with other skeletal elements, such as the cranium, which is population specific. Therefore, the os coxae are a primary choice for studies regarding sex estimation (Brůžek & Murail, 2006, Murail et al., 2005Quatrehomme et al., 2017;Brůžek et al., 2017). This generalized display of high sexual dimorphism indicates that the use of population-specific methods and formulas is unnecessary to estimate the sex of an individual when using the os coxae (Murail et al., 2005).
Among the different existing methodologies for sex estimation in the literature, the Diagnose Sexuelle Probabiliste v.2 (DSP2) should be highlighted as an accurate tool, with higher levels of reliability and precision (Brůžek et al., 2017).
Recently, DSP2 was tested in independent populations, including in Brazil.
However, considering the continental size of the country and its known racial admixture (Native Brazilians, Europeans, Africans, and Asians) (Oliveiraet al., 2012), it is crucial to validate the tool in different regions of Brazil to cover for its population variation.
The goal of this study was to verify the applicability of the DSP2 on a contemporary identified skeleton collection from Northeastern Brazil, to analyze the precision and reliability of the method as a tool for sex estimation in forensic anthropology to be used in the country.

Materials and Methods
The sample was composed of 301 human os coxae in a good state of preservation An individual with trauma, pathologies, or morphological anomalies in the os coxae, were excluded from the study. Individuals considered to be non-adults, under 21 years old, with still incomplete development of the skeleton, or with os coxae with a bad state of preservation, which could be brought some prejudice to the measurements, were also excluded.
The age of male individuals varied between 21 -99 years of age, with a mean age of 60 years old. The age of female individuals varied between 21 -109 years of age, with a mean age of 65 years old. Only the left os coxae were used for the analysis, and they should provide a minimum of 04 variables to be measured.
Seven groups of variables (method variation) were tested for the precision of Correlation Coefficient (ICC) and the confidence interval of 95% were used to evaluate the inter-and intra-observer replicability. Of the 25 os coxae evaluated, the inter-observation happened between two independent observers, and the intra-observation was done by the same observer within a one-month interval between observations.
The descriptive statistic was performed using measurements of the absolute frequency, percentages, means, standard deviations, and amplitude. The numeric variables were tested for normality through the Shapiro-Wilk test. A Mann-Whitney test was then performed to compare the ten pelvic measurements between males and females. A Chi-square test (χ 2 ) was used to compare the frequency of sex estimation (% of sex estimation) among the seven methods (variables groups). The Chi-square test (χ 2 ) and Fisher's exact test were used to compare the most probable sex estimation on inconclusive cases. For all the analyses, the significance was 5%.
Diagnose Sexuelle Probabiliste v2 (DSP 2) The Diagnose Sexuelle Probabiliste v2 is a method of sex estimation developed by Murail et al. (2005). It was later improved by the same authors, Brůžek et al. (2017), and it has a global population metric database as a reference, originated from Europe, Africa, North America, and Asia (Murail et al., 2005). Murail et al. (2005) wanted to validate the hypothesis that the pelvic bones follow a typical sexual dimorphic pattern that is shared among all the modern human population, independent of the geographic region of origin. The authors also had a goal to develop a tool capable of diagnosing the sex and capable of being effectively replicated in different populations.
Therefore, DSP2 contains ten variables groups (anthropological measurements) to estimate sex through the os coxae. Also, one can calculate the sex probability, using any combination of at least four of these proposed variables  Because not every measurement needs to be used to estimate sex through the DSP2 tool, and the possibility of working with a variety of variables combinations, the tool is applicable even when a skeletal element is fragmented (Murail et al., 2005;Quatrehomme et al., 2017;Brůžek et al., 2017). However, the authors clarified that the method's performance is positively correlated to the number of variables used (Murail et al., 2005).
The classic discriminant functional analysis measures the accuracy in classifying the sexes, whereas the DSP2 also provides the reliability of this distinction (Mestekova et al., 2015). Accuracy is understood as the percentage of skeletal remains in which sex is estimated correctly; in the sample, the method is developed. Reliability is evaluated by testing the method in independent populations (Brůžek & Murail, 2006). Hence, when the confidence level equal or superior to 0.95 is not achieved, the individual is considered as undetermined sex (Murail et al., 2005;Brůžek et al., 2017;Mestekova et al., 2015).

Results and Discussion
The goal of this study was to validate the DSP2 method in a population sample from Northeastern Brazil to consolidate the tool's use in the practice of forensic anthropology in the country. Currently, independent studies were performed applying DSP2 in European and Brazilian populations. ICC values varied from 0.926 (IIMT) to 0.996 (SCOX), indicating excellent intra-observer replicability (Table 1). For inter-observer replicability, only IIMT (ICC = 0.837) showed a value inferior to 0.90, still demonstrating excellent replicability between independent observers.
The comparative analysis of the ten pelvic measures between females and males is shown in Table 2. SA was the only measure that did not present a statistically significant difference between the sexes (P = 0.0059), following the findings of Mestekova et al. (2015). This result is also partially supported by Machado et al. (2018), who did not find statistically significant differences be-   corroborates with the idea that os coxae have a characteristic pattern of sexual dimorphism in different geographic regions.
A posterior probability superior to 0.95 was used as a threshold for sex classification. The resulting sex estimation was compared to the known data of each individual.  % sex estimation = frequency of specimens in which sex was estimated. % undetermined = frequency of samples in which sex estimation could not be performed. % accuracy rate = frequency of samples in which sex estimation was correct in relation to the total performed. % accuracy= frequency of samples in which sex estimation was performed correctly from the total of samples possible to be estimated.
Only one error on sex classification was spotted among the entire sample (classified as female, and which was proven to be morphologically male), resulting in a low error rate (0.4%). The result was more satisfactory than the one found by Machado et al. (2018), who had five (9.43%) pelvic bones misclassified as males and seven (14%) as females. The high precision rate in this study follows the ones presented by Murail et al. (2005), who showed a precision between 98.7% to 99.63% and an error rate of <2%.  % sex estimation = frequency of specimens in which sex was estimated. % undetermined = frequency of samples in which sex estimation could not be performed. % accuracy rate = frequency of samples in which sex estimation was correct in relation to the total performed. % accuracy = frequency of samples in which sex estimation was performed correctly from the total of samples possible to be estimated.
concluded that using the group of first eight variables, more than 90% of individuals can be sexed, and by using only the four best variables, more than 87% of individuals can be sexed. achieving 100% when the best combination of four variables was used. This result shows the high level of precision and reliability of DSP2. The comparative analysis of the sex estimation rate and gross accuracy are shown in Figure 1 and The undetermined index varied between 4.7% and 58%, according to the number of variables available. The detailed comparative analysis using Chi-Square is shown in Table 5.
Whenever all ten variables (M1) were used, the undetermined index was 6%.
Different lowercase letters show statistically significant differences between methods (P < 0.05, through Chi-Square).   According to Chapman et al. (2014) When the variable VEAC is excluded (M5), the study reached 4.7% undetermined index, the lowest among the groups tested. For the group of the first eight variables (M6), the undetermined index was 5.3%, higher than the 2.56% found by Chapman et al. (2014). The worst combination of four variables (M7) had an undetermined index of 39%, following findings by Machado et al. (2018)  These results demonstrate that, as seen in the studies by Mestekova et al. (2015) and Quatrehomme et al. (2017), M7 is not necessarily the worst possible combination of four variables, as M4 presents a higher level of undetermined sex estimation. It is important to note that when the variable VEAC (M5), or SIS and VEAC (M6), are excluded, the undetermined index is lower than when all ten variables are used (M1). Table 6 shows the comparison of percentages of undetermined pelvic bones among data from the literature and the current study. The analysis presents that the most critical disadvantage of the method is the number of undetermined individuals, depending on the number of variables available. This result happens because of the high threshold for sex classification (0.95), instead of using the usual 0.5. Conversely, the use of such a high threshold reduces the error rate significantly, allowing for better precision and reliability.
The analysis of inconclusive cases is shown in Table 7. The methods M1, M5,   M6 obtained 100% of the correct result in case the DSP2 adopted a probability superior to 0.5. The lowest percentage of correct results was observed for M3 and M7, which had 71.4% and 72.3% of correct classifications, respectively.
However, these differences were not statistically significant, as shown in Table 8.

Conclusion
This study proved that the DSP2 tool has a high replicability rate, and it applies to Brazilian populations. The results showed a high index of accuracy and low error rate, indicating that DSP2 is a reliable tool for sex estimation in forensic anthropology. The best variable combinations (the ones with lower levels of undetermined results) were the first nine or eight (M5 and M6); however, the worst combination was composed of the four variables of the central bone parts (M4).
If the level of undetermined cases is considered to be satisfactory when below 10%, every combination can be used, except for M4 and M7. The best combination of four variables (M3), showed a high index of sexual estimation and 100% accuracy. Thus, they can be used in this Brazilian sample. It was observed that although the tool can be used in fragmented bone, the method should be used with caution when only four variables of the central bone parts are available.
Female individuals were more accurately classified. In sum, this study confirms the validity, reliability, and accuracy of the DSP2 tool.