Reliability and Validity of a Chinese Version of the Students ’ Attitudes toward Physical Education Scale and Its Related Factors

This study aimed to examine the reliability and validity of a Chinese version of the Students’ Attitudes toward Physical Education Scale (SPEA), and to ascertain the associations between student attitudes toward physical education and gender, grade levels, school, and area. In 2013, a crosssectional study was conducted in eight middle schools in Beijing, China. A sample of 1793 students was enrolled. Cronbach’s alpha and intraclass correlation coefficients (ICC) were performed to evaluate the scale’s internal and test-retest reliability. Construct validity was assessed with confirmatory factor analysis. Linear mixed models were used to examine the associations of attitude scores with gender, grade, school, and area (urban and suburban). Our results showed high reliability (Cronbach’s alpha = 0.90, ICC = 0.94; 95% CI, 0.92 0.96) of the SPEA. For the construct validity, results of confirmatory factor analysis showed an acceptable model fit. Overall, the mean of the attitude scores was 75.5 ± 13.4 out of 100. The attitude scores were found to be statistically different between grade levels, schools, and areas. No differences were found for gender. This study provides evidence that the SPEA is acceptable for the assessment of the attitude toward physical education in Chinese students. The school plays a more important role in the student attitudes than school locations (urban or suburban) in our study. For better understanding of students’ attitudes toward physical education, further studies that focus on the environment of physical education in schools, and use more representative samples of schools from various locations are warranted.


Introduction
There is increasing evidence that childhood physical activity is positively associated with a range of beneficial childhood health and fitness outcomes (Hills et al., 2011;Pate & O'Neill, 2008).School physical education is seen as a societal institution with a responsibility to promote physical activity in youth that reaches most schoolaged youths.Physical education classes can provide youths with substantial amounts of physical activity, which may be particularly important for those who have little or no physical activity in daily life (Gallahue & Donnelly, 2003).
Understanding student attitudes toward physical education can aid teachers in improving teaching content and forming positive attitudes toward physical education among students (Cothran & Ennis, 1998).Emerging evidence has shown that students who show a more positive attitude toward physical activity in physical education are more likely to participate in physical activity outside of school (Portman, 1995;McKenzie, 2003).
Different scales for assessing student attitudes have been reported in studies of physical education.In a review of attitude research in physical education, Silverman and Subramaniam (1999) found that these instruments either failed to follow proper instrument validation steps or used attitude theory as a guiding framework in assessing the attribute, which may have contributed to the mixed findings of attitude studies in the physical education area (Biddle & Mutrie, 2001).After realizing the inadequacy of these instruments, a two-component instrument grounded in attitude theory was developed and validated (Subramaniam & Silverman, 2000;Subramaniam & Silverman, 2007).The Students' Attitudes toward Physical Education Scale (SPEA) measures students' enjoyment (affect) and perceived usefulness (cognition) of physical education in their present school with their current physical education teachers (Subramaniam & Silverman, 2000).
Until now, the SPEA has not been available in Chinese.Development of a Chinese version of the SPEA would allow Chinese investigators to participate in international research studies when this scale is proposed.With this instrument developed to successfully measure the construct in a physical education setting, researchers can draw sound conclusions about student attitudes (Subramaniam & Silverman, 2000).
The main objective of this study was to examine the reliability and validity of a Chinese translation of the scale in students.The secondary objectives were to use the Chinese SPEA to measure middle school students' attitudes toward physical education and to ascertain the relationships between student attitudes toward physical education and gender, grade level, school, and area (urban or suburban).

Design and Setting
This observational cross-sectional study was conducted in 8 middle schools in Beijing, China.

SPEA
The 20-item SPEA was designed to measure student attitudes toward physical education on affect and cognition (Subramaniam & Silverman, 2000).The two components were labeled Enjoyment and Usefulness, respectively.Each component had two sub-domains: the physical education teacher and the curriculum.Response choices were on a 5-point Likert response scale ranging from "strongly disagree" (1) to "strongly agree" (5), with a total possible score of 100 points for overall attitude.A higher score indicates a more positive attitude.The validation of the instrument included an elicitation study, a pilot study, a content validity study, and a reliability and validity study in middle school students.Validation of scores from the instrument demonstrated psychometric evidence of validity and reliability (Subramaniam & Silverman, 2000).
Translation and Adaptation of the SPEA We received permission from the original author to use of the SPEA.A forward and back translation was carried out to confirm accuracy.The instrument was independently translated into Chinese by a graduate student majoring in English and a medical doctor, both of whom were fluent in English and Chinese.Two forward translations were synthesized into one Chinese version by the first author of this manuscript.Culture adaptation was achieved during a consensus meeting attended by experts in school health.The reverse translation from Chinese to English was carried out by another medical doctor who was not involved in developing the initial version.The original and the back-translated English versions were compared and inconsistencies were resolved through consensus meetings.The Chinese version was finalized when there were no disputes or new sugges-tions.

Participants
Participants for this study were students in grades 7, 8, and 9 from eight middle schools in two school districts in Beijing, China.
First, 2 school districts were contacted to participate in this study (1 from urban districts and 1 from suburban districts); Then, four middle schools were contacted and requested to participate in this study from each selected district.In each school, 2 classes were randomly selected from each grade.The students who underwent physical education were invited to participate in this study.
A total of 1793 students from 8 schools consented to participate.Of these, 1717 completed the questionnaires.The mean age was 13.3 ± 1.0 years: 51.8% (883 of 1704) were boys, and 47.9% (823 of 1717) were from suburban schools.

Procedures
The principals of selected schools were informed about the objectives and procedures of the study.Then the principals introduced the study team to the school nurses, physical education teachers, and students.The instrument was administered during class recess in the classroom.The school nurses who have received training for this study were available to assist if students had questions on completing the instrument.It took participants an average person 15 minutes to finish the scale.For test-retest reliability, the participants in one of the schools completed the SPEA a second time 2 weeks later, giving a final sample of 146 students for test-retest data for the Chinese SPEA (67 girls, 79 boys; mean age 13.6 ± 1.1 years).We were not able to include the other schools for a second measurement because of a lack of available time slots.

Ethical Considerations
The ethical review boards in the Beijing Center for Disease Control and Prevention and Waseda University granted permission for us to conduct this study.Informed consent was obtained from the principals, teachers, students, and their custodians prior to data collection.An information sheet describing the study was available for all students.Before instrument administration, the students were informed that the survey was voluntary and that they could choose to participate or not.The questionnaire was completed anonymously by students.

Data Management and Statistical Analysis
Data were double-entered and cross-checked using Epi Info version 6 statistical software.Descriptive statistics such as means and standard deviations were used where appropriate.
The scale's internal consistency reliability was determined by calculating Cronbach's alpha.Internal reliability is acceptable if Cronbach's alpha coefficient is greater than 0.70 (Bland & Altman, 1997).Test-retest reliability was evaluated using the intraclass correlation coefficient (ICC) and an ICC value of 0.40 represents moderate, 0.60 good, and 0.80 high agreement (Wilson et al., 2000).
Construct validity was assessed with confirmatory factor analysis (CFA) using the maximum likelihood estimation method.Based on previous findings and a preliminary construct validity analysis of the Chinese SPEA, both the two-factor clusters (Enjoyment and Usefulness) and sub-factor clusters models (enjoyment teacher (ET), enjoyment curriculum (EC), usefulness teacher (UT), and usefulness curriculum (UC)) of the SPEA were estimated.
A number of indices of model fit were proposed (Browne & Cudeck, 1993;MacCallum & Hong, 1997).In this study, the root mean square error of approximation (RMSEA), Comparative Fit Index (CFI), Normed Fit Index (NFI), Goodness of Fit Index (GFI), and Adjusted Goodness of Fit Index (AGFI) were used as indexes of model fit.For the RMSEA, values of less than 0.05 are considered to be indicative of close fit, values of 0.08 demonstrate a fair fit, and values greater than 0.10 indicate poor fit (Browne & Cudeck, 1993).For the CFI and NFI, values greater than 0.90 are considered to be indicative of an acceptable model fit (Suhr, 2006).Values for GFI and AGFI are from zero to one with values closer to one indicating a better model fit (Byrne, 1989).
We used linear mixed models to examine the associations of attitude scores with gender, grade, school, and area (urban or suburban).These models accounted for potential correlations among students within the same school.Statistical analyses were conducted using SAS version 9.3 (SAS Institute, Inc., Cary, North Carolina).

Internal Consistency and Test-Retest Reliability
Cronbach's alpha for the Chinese SPEA was 0.90 for the complete instrument, representing an acceptable internal consistency.Retests for reliability were completed by 146 students who completed the first questionnaires.The ICC was 0.94 (95% CI, 0.92 -0.96) for the Chinese SPEA total score.The ICC of individual items ranged from 0.63 to 0.91.All of these ICCs were in the good to excellent reliability range.

Construct Structure
We first tested the two primary factors (Enjoyment and Usefulness) with CFA.In the first analysis, RMSEA = 0.206, CFI = 0.720, NFI = 0.718, GFI = 0.82, and AGFI = 0.416.The analyses yielded low values of fit.Thus, we further analyzed the two sub factors (physical education teacher and curriculum) of each primary factor (Enjoyment and Usefulness).In the second CFA, results showed that RMSEA = 0.099, CFI = 0.929, NFI = 0.925, GFI = 0.83, and AGFI = 0.79.The new fit indices displayed a relatively better fit when compared to the previous analysis.The maximum likelihood factor loadings for all items were greater than 0.30, furthering support this model (Table 1).

Students' Attitudes toward Physical Education
The descriptive data for students' attitudes toward physical education in gender, grade, school, area (urban or suburban), and all are shown in Table 2 to Table 4. Overall, the mean of the attitude scores was 75.5 ± 13.4 out of 100.The overall values ranged from 70.5 ± 14.7 to 85.4 ± 11.4 at school levels (Table 2).As shown in Table 3, descriptive data showed that students in suburban schools had a higher mean score, and students in grade 9 had a lower mean score than those students in the other two grades.The overall values were 18.9 ± 4.5 for ET, 19.8 ± 4.2 for EC, 17.8 ± 2.6 for UT, and 19.0 ± 4.2 for UC out of a total possible score of 25.Means and standard deviations for the subscales are reported in Table 4.
The covariance parameter estimates of linear mixed models indicated that 9.5% of the variability in student physical education attitude was accounted for by the schools (Z = 1.77,P = 0.038) and were not significant for grade or area.The overall attitude scores were found to be statistically different between grades, schools, and areas.Compared to model 2, model 3 showed that the school played a more important role in student attitude than school location (urban, suburban) in our study (Table 5).The interaction effects of grade × school were also statistically significant (P < 0.05).There were no significant differences for gender.A further subscale analysis was carried out.For UT and ET, attitude scores were found to be statistically different between schools and areas (P < 0.05).For UC and EC, attitude scores were found to be statistically different between grades, schools, and areas (P < 0.05).

Discussion
This study validates the Chinese version of SPEA for use in Chinese students.The associations of students' attitudes toward physical education with gender, grade levels, schools, and areas were also examined.
Our results showed high reliability (Cronbach's alpha = 0.90, ICC = 0.94; 95% CI, 0.92 -0.96) of the SPEA.For construct validity, factor analysis results indicated that the two primary factor structure (Enjoyment, and    Usefulness) was a low fit to the observed data.However, a further subscale (UT, ET, UC, and UT) analysis showed a relatively better model fit.It was acknowledged that, in confirmatory factor analysis, any model can be rejected if the sample size is large enough, because hypothesized models are best regarded as approximations to reality rather than exact statements of truth (Marsh et al., 1988).Previous studies have shown that the NFI, GFI, and AGFI values had high sensitivity toward sample size and argued that RMSEA value was relatively independent of sample size (Marsh et al., 1988;Fan & Sivo, 2007;Widaman & Thompson, 2003).From this perspective, in this study, the subfactor structure is acceptable.This was further supported by the factor loadings which were greater than 0.30 for all items.
In their previous study, Subramaniam and Silverman (2000) demonstrated that the instrument produced valid scores for both primary factors and two of the subfactors (teacher and curriculum).Although the potential reasons for the difference in the results are unclear, one possible explanation might be caused by culture-based differences in students' perceptions of teachers (Adams & Sargent, 2009;Roberts & Tuleja, 2008).As education in China is much more "teacher centered and content oriented" than in Western countries, which feature "student centered and learning oriented" education, the educational experiences of students in China may lead Chinese students to be sensitive to the differences in teacher and curriculum.Further study is needed to validate the structure of the SPEA.
Students' attitudes toward physical education have been studied.Several findings in this study are consistent with previous studies.Similar to findings from Subramaniam and Silverman (2007), our participants indicated moderately positive attitudes toward physical education, and no gender difference was found for attitude scores.Our findings also agreed with those showing decline in attitudes toward physical education as students progressed in grade level (Subramaniam & Silverman, 2007;Prochaska, 2003).In our study, students in grade 9 had a lower score than those in the other two grades.Studies from Western countries indicated the decline in attitude maybe due to the repetitive nature of activities: students in upper grades may become bored when teachers use the same activities and sports every year (Carlson, 1995).The difference between grades was also attributed to the developmental differences (cognitive and social-emotional) among these groups (Subramaniam & Silverman, 2007;Chatterjee, 2013).These two explanations may be applied to our study.However, we also need to notice the education culture difference between China and Western countries (Adams & Sargent, 2009;Roberts & Tuleja, 2008;Sun et al., 2012).In our participants, students in grade 9 have to face an important transitional examination to enter senior high school.The high academic burden and pressure due to high parental expectations and fierce competitions with their peers during the final year may hinder their enjoyment of physical education classes.
Several studies showed that students in urban schools had more positive attitudes toward physical education than those in rural schools (Abraham, 2013;Chatterjee, 2013).It was supposed that urban schools have more opportunity for running physical education programs properly as there is more funding, facility, and equipment (Chatterjee, 2013).On the contrary, in our study, students from the suburban district had a higher score compared to their peers in the urban district.One possible explanation is that students in urban areas experience more stress from parents to obtain good grades, peer pressure, a competitive environment in the schools, and teacher demand for good grades, which may influence their attitudes toward physical education (Taragar, 2009).Since we did not survey the environment of physical education in school and out of school and only a small number of schools participated in this study, these findings are not easy to interpret.Therefore, a recommendation for future research is that environmental factors of physical education and physical activity in and out of schools should be surveyed, and more schools from various locations (urban, suburban, and rural) need to be involved in the future.
The mixed linear model results in our study showed that the school plays a more important role in student attitudes than school location (urban, suburban).Student attitudes varied from one school to another, ranging from 70.5 ± 14.7 to 85.4 ± 11.4.Previous studies have indicated that the school environment such as status of physical education in the curriculum, curriculum content, teacher behavior, class atmosphere, and facilities, was related to students' attitudes toward physical education (Bibek, 2007;Trudeau & Shephard, 2005).Thus, further investigation of the difference of environment for physical education between schools may be useful to help local schools properly design and implement physical education.
The findings of this study indicate that the reliability and validity of SPEA are acceptable for the assessment of attitude toward physical education in Chinese students.Attitude scores differed between grade levels, schools, and areas.Unresolved issues and challenges have been identified in the present study when attempting to ana-lyze students' attitudes toward physical education.To better understand students' attitudes toward physical education, further study is warranted with a view toward environment of physical education in and out of schools and more representative samples of schools from various locations.

Table 1 .
Instrument items and confirmatory structure coefficients.

Table 2 .
Overall means and standard deviations for school.

Table 3 .
Overall means and standard deviations for gender, grade, and region level.
Note: Values are means, and standard deviations.

Table 4 .
Means and standard deviations for gender, region and grade level for enjoyment (teacher and curriculum) and perceived usefulness (teacher and curriculum).
Note: Values are means, standard deviations.

Table 5 .
Estimates from mixed linear models predicting students' attitudes.
* Statistically significant, P < 0.05; Values based on SAS Proc Mixed.Entries show parameter estimates with standard errors in parentheses.