Abstract
The Zurich Claudication Questionnaire (ZCQ) has been translated and validated in multiple languages, but few studies have verified the measurement performance of the Chinese version (Ch-ZCQ). This study assessed the performance of the Ch-ZCQ in patients with lumbar spinal stenosis (LSS) undergoing non-surgical treatment, covering the reliability, validity, responsiveness and minimal clinically important difference (MCID) of its two dimensions, symptom severity (SS) and physical function (PF). Internal consistency and test-retest reliability were good. The content validity index was 0.764. Structural validity was good, with moderate model fit. The two ZCQ dimensions correlated well with each other, and the ZCQ correlated strongly with the Oswestry Disability Index (ODI) and moderately with the 12-item Short Form Health Survey Version 2 (SF-12v2). For discriminative validity, ZCQ scores differed significantly between disability severity groups. The SS and PF dimensions showed moderate effect sizes (ES) of 0.46 and 0.35, low standardized response means (SRM) of 0.34 and 0.25, and a change rate (CR) of 10% each, indicating a low level of change. The MCID was −0.21 (95% CI −0.36 to −0.05) for SS and −0.16 (95% CI −0.36 to −0.03) for PF. This study shows the Ch-ZCQ to be a reliable and valid tool that can evaluate the effectiveness of non-surgical treatments for patients with LSS. However, to optimize its application in the Chinese population, further refinement is needed to address the ceiling/floor effects of some items.
Background
Lumbar spinal stenosis (LSS) is defined by diminished space for the neural and vascular elements in the central canal of the lumbar spine secondary to degenerative changes of the facet joints, ligaments, vertebrae, and intervertebral discs1. Symptoms of neurogenic claudication include pain in the buttocks and lower extremities provoked by walking or prolonged standing1. About 60–80% of people experience low back pain at some time in their lives2, and LSS is one of the most prevalent low back pain conditions. Treatment options range from non-surgical approaches, such as medication, electrophysical agents, manual therapy, general exercises and spinal stabilization exercises3, to surgery.
The Zurich Claudication Questionnaire (ZCQ), also known as the Swiss Spinal Stenosis Measure or the Brigham Spinal Stenosis Questionnaire, was developed in 1996 by Stucki et al.4. It is an 18-item, self-administered, disease-specific questionnaire that consists of three domains: symptom severity (SS), physical function (PF), and satisfaction (SAT)4. The SAT domain of the original ZCQ is evaluated only after surgery to gauge how satisfied patients are with their care; it does not measure how therapy has changed outcomes. Therefore, several trials have used only the SS and PF domains of the ZCQ for outcome evaluation, particularly for non-surgical therapies5.
For patients with LSS, the ZCQ has been shown to be a reliable and precise disease-specific questionnaire, and it has been translated and validated in several languages6,7,8,9. The Chinese national clinical practice guidelines for LSS10 and the core outcome sets for LSS clinical trials11 propose the use of the Chinese version of the Zurich Claudication Questionnaire (Ch-ZCQ)6, which was culturally adapted for Chinese patients in 2014. Despite this, an assessment based on the Consensus-based Standards for the Selection of Health Status Measurement Instruments (COSMIN) checklist found that the low quality of the evidence made the ZCQ difficult to use in LSS trials12. Moreover, there have been few attempts to confirm the validity of the ZCQ in the Chinese population, particularly with regard to responsiveness or the MCID for the SS and PF domains.
Thus, the objective of this research was to assess the measurement properties of the Ch-ZCQ in LSS patients receiving non-surgical therapy. The COSMIN checklist was used to assess these properties, which included reliability, validity, responsiveness, MCID, and ceiling/floor effects.
Methods
A single-center validation study was performed. The COSMIN methodology checklist13 was used to validate the Ch-ZCQ.
The study was approved by the Ethics Committee of Dongzhimen Hospital of Beijing University of Chinese Medicine, approval No.: DZMEC-KY2017-128.
Patients
Patients diagnosed with LSS14, aged between 50 and 85 years, and receiving 3 months of non-surgical treatment (acupuncture, cupping therapy, tuina, epidural steroid injection, interferential electrical therapy, hot compress, and oral analgesics) were included after providing informed consent. Patients were excluded if they had severe cauda equina syndrome, spinal fracture, lumbar tuberculosis, spinal tumors or rheumatoid arthritis, severe hematopoietic, cardiovascular, or endocrine system diseases, cancer, severe anxiety, severe depression, postoperative spine pain, or vascular intermittent claudication, or if they were menstruating or lactating women. The study was conducted from 2021 to 2022 in inpatient and outpatient settings.
The sample size was first set at five times the number of ZCQ variables (12 items, giving 60 patients), which rose to at least 75 LSS patients after accounting for potential dropouts. In addition, confirmatory factor analysis (CFA) requires a minimum of 100 patients15. Taking these two factors into account, we required a minimum of 100 patients.
Measurements
The Ch-ZCQ6, Oswestry Disability Index (ODI)16 and 12-item Short Form Health Survey Version 2 (SF-12v2)17 were administered before treatment, and 1.5-3 months after treatment.
The Ch-ZCQ consists of the Symptom Severity (SS) and Physical Function (PF) domains, with 12 items in total. All responses are reported on a Likert-type scale. As in the original instrument18, it includes 7 items for SS scored from 1 to 5 and 5 items for PF scored from 1 to 4. Higher scores indicate more severe LSS.
The ODI contains 10 items, each scored on a 6-point scale (0–5). Item 8 (sexual life) was omitted from the ODI in this study. The ODI score is calculated by the following formula: (total score/(5 × number of questions answered)) × 100. A higher score indicates more functional limitations due to low back problems19. The Chinese version of the ODI was previously confirmed to be reliable and valid16.
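The ODI formula above can be illustrated with a minimal Python sketch. The function name `odi_percent`, the use of `None` for an unanswered item and the patient scores are hypothetical choices made for illustration only; they are not taken from the study data.

```python
def odi_percent(item_scores):
    """Return the ODI percentage from item scores (0-5); None marks an unanswered item."""
    answered = [s for s in item_scores if s is not None]
    if not answered:
        raise ValueError("at least one ODI item must be answered")
    if any(not 0 <= s <= 5 for s in answered):
        raise ValueError("each ODI item is scored from 0 to 5")
    # (total score / (5 x number of questions answered)) x 100
    return sum(answered) / (5 * len(answered)) * 100


# Hypothetical patient: item 8 (sexual life) left out, nine items answered.
scores = [3, 2, 4, 1, 2, 3, 2, None, 3, 2]
print(round(odi_percent(scores), 1))
```

Because the denominator uses only the answered items, omitting item 8 rescales the maximum from 50 to 45 points rather than lowering the percentage.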
The SF-12v2 consists of 12 items and evaluates 8 dimensions of health-related quality of life: General Health (GH), Physical Functioning (PF), Role-Physical (RP), Bodily Pain (BP), Vitality (VT), Social Functioning (SF), Role-Emotional (RE) and Mental Health (MH). The Physical Component Summary (PCS) was calculated from GH, PF, RP and BP, and the Mental Component Summary (MCS) was calculated from SF, RE, MH and VT. This study used a Chinese version of the SF-12v2, which has shown good reliability and validity20.
The change in score on SF-12v2 item 1 (“Overall, what do you think of your current health?”), which reflects the change in overall health status before and after treatment, was used to evaluate responsiveness. Responses were “excellent,” “very good,” “good,” “moderate” and “poor”.
Measurement properties and statistical analysis
The analysis of the metric properties included reliability (internal consistency and reproducibility), content validity, construct validity, discriminant validity, and structural validity, as well as the responsiveness and interpretability of the scale, following the COSMIN guidelines21. Additionally, the quality of the ZCQ was evaluated using the current updated criteria for good measurement properties22.
Descriptive statistics were performed on all baseline scores and follow-up scores. The domain scores, total scores, and mean values of ZCQ, SF-12v2 and ODI scores were calculated.
For test-retest reliability, the level of agreement between two time points was evaluated using the intra-class correlation coefficient (ICC). Patients completed the questionnaires before admission and 1.5–3 months after treatment in the hospital. For the calculations, we chose patients whose scores on item 1 of the SF-12v2 were unchanged following therapy. The coefficient ranges from 0 to 1, and a coefficient greater than or equal to 0.7 was considered sufficient for test-retest reliability23.
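As an illustration of the test-retest calculation, the sketch below computes one common form of the ICC for two measurement occasions. The article does not state which ICC model was used, so the choice of ICC(3,1) (two-way mixed effects, consistency, single measurement) is an assumption, as are the function name `icc_consistency` and the example domain scores.

```python
def icc_consistency(test, retest):
    """ICC(3,1) for paired scores: two-way mixed effects, consistency, single measurement."""
    rows = list(zip(test, retest))          # one (test, retest) pair per patient
    n, k = len(rows), 2
    grand = sum(map(sum, rows)) / (n * k)
    row_means = [sum(r) / k for r in rows]
    col_means = [sum(c) / n for c in zip(*rows)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between patients
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between occasions
    ss_total = sum((x - grand) ** 2 for r in rows for x in r)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


# Hypothetical SS domain scores for five stable patients (assumed data).
baseline = [2.1, 3.0, 2.6, 3.4, 1.9]
follow_up = [2.3, 2.9, 2.4, 3.5, 2.0]
icc = icc_consistency(baseline, follow_up)
print(f"ICC = {icc:.3f}, sufficient: {icc >= 0.7}")
```

A consistency ICC ignores a uniform shift between the two occasions; an absolute-agreement form would penalize such a shift and could give a lower value on the same data.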
For internal consistency, the homogeneity of the items within the domain was evaluated by calculating Cronbach’s alpha coefficients. A Cronbach’s alpha of 0.7 or higher was considered acceptable for internal consistency, while a score above 0.8 was considered good and above 0.9 was excellent22.
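For readers unfamiliar with the statistic, Cronbach’s alpha for a domain with k items is k/(k − 1) × (1 − sum of item variances / variance of the total score). A minimal sketch follows; the function name `cronbach_alpha` and the three-item example responses are hypothetical and are not drawn from the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of item scores per respondent."""
    k = len(rows[0])
    items = list(zip(*rows))                       # transpose to per-item columns
    item_var = sum(variance(col) for col in items)
    total_var = variance([sum(r) for r in rows])   # variance of the summed score
    return k / (k - 1) * (1 - item_var / total_var)


# Hypothetical responses of four patients to three items (assumed data).
responses = [[1, 2, 2], [2, 3, 3], [3, 3, 4], [4, 5, 5]]
alpha = cronbach_alpha(responses)
label = "excellent" if alpha > 0.9 else "good" if alpha > 0.8 else "acceptable" if alpha >= 0.7 else "low"
print(f"alpha = {alpha:.3f} ({label})")
```

The labels in the last lines mirror the cut-offs stated above (0.7, 0.8 and 0.9).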
The item-level content validity index was calculated as the number of experts giving a score of 7–10 divided by the total number of experts participating in the evaluation. The experts were asked to rate the importance of each item for LSS patients when the item is evaluated as an outcome following therapy, taking into account its comprehensiveness, relevance, and comprehensibility. The content validity index at the scale and domain level was calculated as the average across all items within the scale or domain. A content validity index of 0.7 or higher was considered to indicate high-quality content validity24.
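The content validity index calculation described above can be sketched as follows. The function names `item_cvi` and `scale_cvi` and the expert ratings are hypothetical; only the 7–10 scoring band and the averaging rule come from the text.

```python
def item_cvi(ratings, low=7, high=10):
    """Share of experts whose importance rating falls in the 7-10 band."""
    return sum(low <= r <= high for r in ratings) / len(ratings)


def scale_cvi(ratings_by_item):
    """Scale- or domain-level index: the average of the item-level indexes."""
    cvis = [item_cvi(r) for r in ratings_by_item]
    return sum(cvis) / len(cvis)


# Hypothetical ratings from five experts for two items (assumed data).
ratings = [
    [8, 9, 7, 6, 10],   # four of five experts rate 7-10
    [5, 6, 7, 8, 4],    # two of five experts rate 7-10
]
for number, item in enumerate(ratings, start=1):
    print(f"item {number}: I-CVI = {item_cvi(item):.2f}")
print(f"scale CVI = {scale_cvi(ratings):.2f}")
```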
For construct validity, it is critical to establish evidence of the construct’s validity through correlations with external criteria25. In this study, the Pearson correlation coefficients of pretreatment and posttreatment assessments for SS and PF were used to evaluate the degree of correlation with external criteria (ODI and SF-12v2). It was anticipated that scales assessing related ideas would have a moderate to strong correlation whereas scales measuring dissimilar concepts would exhibit a poor correlation. We hypothesized that SF-12v2 would have a weak to moderate correlation with ZCQ and that ODI would have a strong association with ZCQ.
The absolute value of the correlation coefficient was classified into five levels: very strong (r = 0.80 to 1.00), strong (r = 0.60 to 0.79), moderate (r = 0.40 to 0.59), weak (r = 0.20 to 0.39), and very weak (r = 0.00 to 0.19).
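The five-level classification can be written as a small lookup, applied here to coefficients reported later in this article. The function name `correlation_strength` is hypothetical, and treating each band as starting at its lower bound (so that, for example, 0.795 counts as strong) is an assumption about how values between the printed bands are handled.

```python
def correlation_strength(r):
    """Classify a correlation coefficient by its absolute value."""
    size = abs(r)
    if size > 1:
        raise ValueError("a correlation coefficient lies between -1 and 1")
    if size >= 0.80:
        return "very strong"
    if size >= 0.60:
        return "strong"
    if size >= 0.40:
        return "moderate"
    if size >= 0.20:
        return "weak"
    return "very weak"


# Coefficients quoted in the Discussion of this article.
for r in (0.646, 0.817, -0.527, -0.353):
    print(r, correlation_strength(r))
```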
For structural validity, confirmatory factor analysis (CFA) was used to examine the underlying structure. CFA models are a preferred standard for testing many aspects of scale construction because they allow model hypotheses to be tested26. A model was considered well fitting when the Comparative Fit Index (CFI) and Goodness of Fit Index (GFI) exceeded 0.9, the Root Mean Square Error of Approximation (RMSEA) was below 0.08, the Root Mean Square Residual (RMR) was below 0.05, and the Normed Fit Index (NFI) and Non-normed Fit Index (NNFI) exceeded 0.9 (ref. 27). When comparing models, a lower χ² value indicates a better fit, assuming an equal number of degrees of freedom (df).
For discriminant validity, patients were grouped by disease severity at baseline, as measured by the ODI. According to the ODI score, the degree of disability was divided into mild (< 56 points) and moderate to severe (> 56 points)28. ZCQ domain scores were compared between groups using an independent-samples t-test.
Responsiveness was assessed using the Effect Size (ES), Standardized Response Mean (SRM) and Change Rate (CR). ES and SRM values of approximately 0.2, 0.5 and 0.8 indicate low, medium and high magnitudes of change over time, respectively29. For CR, values of 5%, 10%, 15% or 20% are generally used as thresholds (ref. 30). The higher the ES or SRM, the greater the sensitivity to detect change.
For interpretability, the commonly used methods for estimating the MCID include the criterion method, the distribution method, the expert opinion method, and the literature analysis method31,32,33. Because the SF-12v2 correlated with the ZCQ, we used the criterion method to evaluate the MCID.
Floor and ceiling effects can affect the evaluation of reliability and validity. If more than 15% of respondents obtain the minimum or maximum score, a floor or ceiling effect is considered to be present22. Floor and ceiling effects were analyzed using ZCQ scores by calculating the frequency of the lowest and highest possible scores.
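The article does not spell out the formulas for the responsiveness indices, so the sketch below uses the standard definitions as an assumption: ES is the mean change divided by the standard deviation of the baseline scores, and SRM is the mean change divided by the standard deviation of the change scores. CR is left out because its exact definition is not given here. The function names and the example scores are hypothetical.

```python
from statistics import mean, stdev


def effect_size(baseline, follow_up):
    """ES: mean change divided by the SD of the baseline scores."""
    change = [b - f for b, f in zip(baseline, follow_up)]  # positive = improvement
    return mean(change) / stdev(baseline)


def standardized_response_mean(baseline, follow_up):
    """SRM: mean change divided by the SD of the change scores."""
    change = [b - f for b, f in zip(baseline, follow_up)]
    return mean(change) / stdev(change)


# Hypothetical domain scores for five patients (assumed data; higher = worse).
before = [3.0, 2.5, 3.5, 2.0, 3.0]
after = [2.5, 2.5, 3.0, 2.0, 2.0]
print(f"ES  = {effect_size(before, after):.2f}")
print(f"SRM = {standardized_response_mean(before, after):.2f}")
```

Change is taken as baseline minus follow-up so that improvement on a scale where higher scores mean worse symptoms yields a positive index.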
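The 15% rule for floor and ceiling effects can be checked per item as in the sketch below. The function name `floor_ceiling` and the example responses are hypothetical; the threshold is the one stated above.

```python
def floor_ceiling(responses, min_score, max_score, threshold=0.15):
    """Share of respondents at the lowest and highest possible score, with 15% flags."""
    n = len(responses)
    floor = sum(r == min_score for r in responses) / n
    ceiling = sum(r == max_score for r in responses) / n
    return {
        "floor": floor,
        "ceiling": ceiling,
        "floor_effect": floor > threshold,
        "ceiling_effect": ceiling > threshold,
    }


# Hypothetical responses of ten patients to one PF item scored 1-4 (assumed data).
item_responses = [1, 1, 2, 3, 4, 2, 2, 3, 1, 2]
print(floor_ceiling(item_responses, min_score=1, max_score=4))
```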
All statistical tests were two-sided with a significance level of 5%. All analyses were performed by using SPSS 26 and Scientific Platform Serving for Statistics Professional, SPSSPRO (spsspro.com).
Results
Of the 105 patients enrolled at baseline, 79 (75%) completed the questionnaire sets after 1.5–3 months of treatment with no missing data. The demographics and scale measurement values are shown in Table 1.
Reliability
Internal consistency was good. The total Cronbach’s alpha coefficient of the ZCQ baseline data was 0.874, and the Cronbach’s alpha coefficient after deleting a single item was between 0.855 and 0.884 (details in appendix 1). The Cronbach’s alpha coefficient was 0.793 for SS and 0.870 for PF.
To analyze test-retest reliability, the study included 59 participants who had undergone treatment and showed no change in their response to SF-12v2 item 1 between baseline and the 1.5–3 month follow-up. Because patients received treatment within the measurement interval, reliability was not assessed under strictly stable conditions, and an unchanged response to item 1 may not capture every change in a patient’s condition. Nevertheless, restricting the analysis to patients whose item 1 response matched baseline provides a reasonable approximation of a stable group for the retest evaluation. The ICCs for the SS and PF domains of the ZCQ were 0.836 (95% CI 0.724–0.903) and 0.741 (95% CI 0.583–0.841), respectively (time interval, mean ± SD = 63.23 ± 22.23 days).
Validity
Content validity was assessed by 30 experts who scored the importance of the items in the Ch-ZCQ. The scale-level content validity index in this study was 0.764, and the item-level content validity indexes ranged from 0.500 to 0.933. The content validity of the scale was considered good, except for items 10, 11, and 12 (details in appendix 2).
The structural validity of the ZCQ is presented in Table 2. To avoid the effects of treatment or time-related changes, we assessed structural validity using only baseline data. The baseline data model showed a good level of fit.
To evaluate construct validity, the correlation coefficients between the ZCQ and the ODI and SF-12v2 domains were calculated (see Table 3). The SS and PF domains of the ZCQ were strongly correlated with each other. The ZCQ and ODI were strongly positively correlated, and the SF-12v2 showed moderate negative correlations with the ZCQ.
Discriminative validity
The results of the independent-samples t-test are presented in Table 4. ZCQ scores differed significantly between the ODI severity groups, indicating that the ZCQ scale has good discriminative validity.
Responsiveness
To assess responsiveness, the ES, SRM and CR were calculated for the ZCQ and the external criteria (ODI and SF-12v2). The ES was moderate for ZCQ SS and PF (0.46 and 0.35) and low for the ODI (0.21), while the SRMs were low, with values of 0.34, 0.25 and 0.12, respectively. The average CRs were 10%, 10% and 5%. The assessment did not address patients’ subjective reports of overall improvement. The differences in the SF-12v2 PCS and SF-12v2 MCS were not statistically significant (P > 0.05).
Interpretability
MCID was calculated using the criterion method. Changes in 79 patients before and after treatment were analyzed using SF-12v2 item 1 as an anchor. There were 16 patients in the improved group, 4 patients in the deteriorated group, and 59 patients in the unchanged group. The mean value was used to estimate the MCID. The MCID was −0.21 (95% CI −0.36 to −0.05) for SS and −0.16 (95% CI −0.36 to −0.03) for PF.
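A criterion (anchor-based) estimate of this kind can be sketched as below. The article does not specify which group’s mean was used or how the confidence interval was derived, so this sketch assumes the MCID is the mean change score of the patients classified as improved on the anchor, with a normal-approximation confidence interval. The function name `anchor_mcid`, the group labels and the change scores are all hypothetical.

```python
from statistics import NormalDist, mean, stdev


def anchor_mcid(changes, anchor, level=0.95):
    """Mean change in the anchor-improved group with a normal-approximation CI."""
    improved = [c for c, a in zip(changes, anchor) if a == "improved"]
    estimate = mean(improved)
    se = stdev(improved) / len(improved) ** 0.5
    z = NormalDist().inv_cdf(0.5 + level / 2)   # about 1.96 for a 95% interval
    return estimate, (estimate - z * se, estimate + z * se)


# Hypothetical change scores (follow-up minus baseline; negative = improvement).
changes = [-0.4, -0.2, 0.0, 0.1, -0.3, 0.2]
anchor = ["improved", "improved", "unchanged", "unchanged", "improved", "deteriorated"]
mcid, (lower, upper) = anchor_mcid(changes, anchor)
print(f"MCID = {mcid:.2f} (95% CI {lower:.2f} to {upper:.2f})")
```

With a group as small as the 16 improved patients reported here, a t-based interval would be somewhat wider than this normal approximation.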
Floor and ceiling effects
There were ceiling and floor effects observed in the two domains of ZCQ. There were 6 items with ceiling/floor effect in the baseline data, and 8 items in the follow-up data (See Appendix 3).
Discussion
The Ch-ZCQ was translated and linguistically validated prior to this study6. Here, the psychometric properties of the ZCQ were assessed using data collected from Chinese LSS patients. The Ch-ZCQ demonstrated good validity and reliability, and its internal consistency was good. Construct validity assesses a scale’s accuracy in measuring a theoretical construct by comparing it with other scales, while structural validity evaluates the reasonableness of a scale’s internal structure using CFA. Our study demonstrated that the ZCQ scale correlated well with the SF-12v2 and the ODI, and that the CFA model fit was good. The SS domain showed a moderate level of responsiveness, while the PF domain showed poor responsiveness. These results indicate that the ZCQ scale is meaningful for evaluating the therapeutic effect in patients with degenerative lumbar spinal stenosis. The MCIDs of ZCQ SS and PF were −0.21 and −0.16, respectively. The MCID is a patient-derived score that reflects meaningful change following clinical interventions. Understanding the MCID of the ZCQ score will help clinicians clarify the treatment effect.
In our study, the total Cronbach’s α of the ZCQ was 0.874 (range after deleting a single item, 0.855–0.884), and the Cronbach’s α coefficients of SS and PF were 0.793 and 0.870. Previous studies support our findings, reporting Cronbach’s α coefficients for SS of 0.78 (ref. 7), 0.89 (ref. 6) and 0.94 (ref. 8), and for PF of 0.84 (ref. 7), 0.86 (ref. 6) and 0.96 (ref. 8).
Test-retest reliability reflects the consistency of the scale and is primarily assessed using the intra-class correlation coefficient. In this study, test-retest reliability was good, with ICCs of 0.836 and 0.741 for SS and PF. Previous studies have reported ICCs for SS and PF of 0.81 and 0.89 (time interval of 3 months)7, 0.93 and 0.91 (3–5 days)6, and 0.89 and 0.92 (one week)8.
In terms of construct validity, the study showed strong correlations of SS and PF with the ODI (0.646 ~ 0.817), and moderate correlations with the PCS (−0.527 ~ −0.416) and MCS (−0.506 ~ −0.353) of the SF-12v2. Nobuhiro Hara et al.7 found that the SS and PF domains were strongly correlated with the ODI (r = 0.63 ~ 0.75) and correlated with SF-36 physical function (r = −0.65 ~ −0.28). Honglei Yi et al.6 reported a strong correlation between the ZCQ and the SF-36 (r = −0.685 ~ −0.700).
In terms of discriminative validity, this study used the ODI to classify the severity of the condition. The results indicated that the ZCQ can discriminate between patients with mild and moderate/severe conditions. This is beneficial for doctors, as it allows them to quickly assess the severity of a patient’s condition in the clinic.
In terms of responsiveness, the study showed that the ZCQ could respond to changes in patients who underwent non-surgical treatment. Previous studies demonstrated good responsiveness, with ES values for SS and PF of 1.73 (ref. 7), and 2.63 and 2.35 (ref. 9), and SRM values of 1.54 and 1.38 (refs. 7, 9). The lower responsiveness in this study may be attributed to the different treatment methods: the baseline data were similar to those of the two studies that used surgical treatment8, but this study employed non-surgical treatment. It may also be due to the different follow-up times: in the referenced studies the follow-up ranged from 6 months to 1 year, whereas the follow-up period in this study was only 1.5–3 months. Finally, the low ES and SRM values may be attributed to the floor/ceiling effects in the items.
In terms of MCID, the study showed that the MCID values for SS and PF corresponded to decreases of 0.21 and 0.16. Cleland et al.34 suggested that clinicians consider MCIDs of 0.36 for SS and 0.10 for PF, which are of similar magnitude to the findings of this paper.
The content and quality of the items have a direct impact on the content validity of the scale. Since the Ch-ZCQ is a translation, the contents of the items are identical to those of the original. Consequently, the experts were asked to rate the importance of each item when assessed as an outcome after therapy, considering its comprehensiveness, relevance, and comprehensibility, particularly for the conditions of the Chinese LSS population. The importance scores of items 10, 11, and 12 were below 7.0. Experts noted that most of these items concentrated on walking distance and contained repetitious information that was inappropriate for Chinese patients. The low content validity index of these items may also be explained by the ceiling and floor effects that items 10, 11, and 12 displayed.
Structural validity reflects the correspondence between the theoretical structure of the scale and the measured values. Although half of the items in this study exhibited floor and ceiling effects, the confirmatory factor analysis (CFA) model fit was good. Although CFA is thought to be the most efficient method for assessing structural validity15, few studies have applied it to the ZCQ. The exploratory factor analysis of the Thai version of the ZCQ, which identified four factors, partly supports our findings. The first and second factors were related to the PF and patient satisfaction domains of the original ZCQ, respectively, while the third and fourth factors dealt with pain symptoms and neurological disability, which are related to the SS domain of the original ZCQ36. Therefore, according to our evidence of structural validity and content validity, which are crucial for questionnaire development and psychometric assessment, particularly in cross-cultural adaptation, the Ch-ZCQ may be an effective instrument for assessing the outcomes of LSS patients following non-surgical treatment.
Strengths and weaknesses
Previous ZCQ measurement performance studies in China6, Japan7, Iran8, and Korea9 focused on surgically treated patients and lacked MCID evaluation, whereas our study has the following advantages. The psychometric qualities of the ZCQ were thoroughly evaluated in LSS patients receiving non-surgical therapies, offering important new information on the validity and reliability of the instrument in the Chinese patient population. Additionally, this study fills a major gap in the literature by being the first to assess content validity, construct validity, responsiveness, and MCID in the Chinese LSS population. It also provides vital information for treatment evaluation and clinical decision-making. Furthermore, using the COSMIN checklist and criteria to examine the attributes of the ZCQ supports scientific rigor and keeps the study in line with global best practices for the assessment of outcome measurement instruments. Overall, these strengths enhance the credibility and applicability of the study’s findings, contributing meaningfully to the field of LSS research and patient care.
There are also some limitations. Firstly, this study was conducted in Beijing, which may result in an over-representation of the urban population. Therefore, populations from suburban and other areas of China need to be considered. Secondly, we assessed known-group validity based only on the ODI score. However, the ODI is a recognized scale for evaluating low back and leg symptoms, encompassing various domains, and it is both feasible and sufficient as a criterion for discriminative validity35. Thirdly, the test-retest reliability assessment was restricted to patients whose overall health rating did not change during therapy, identified as those whose SF-12v2 item 1 response remained unchanged. This criterion might not accurately capture all patient changes, because the patients received treatment between the two measurements; however, an unchanged item 1 response compared with baseline offers a reasonable basis for interpreting the retest results.
Data availability
All data analysed during this study are included in this published article and supplementary information files.
References
Lurie, J. & Tomkins-Lane, C. Management of lumbar spinal stenosis. BMJ (Clinical Res. ed). 352, h6234 (2016).
Özüdoğru, A. et al. Five times Sit-to-Stand test in people with non-specific chronic low back pain-a cross-sectional test-retest reliability study. Ir. J. Med. Sci. (1971-). 192 (4), 1903–1908 (2023).
Yildiz, N. T., Canlı, M., Kocaman, H. & Alkan, H. The effectiveness of Kinesio taping in addition to conventional treatment in patients with chronic low back pain: A randomised controlled trial. J. Health Sci. Med. 7 (5), 564–570 (2024).
Stucki, G. et al. Measurement properties of a self-administered outcome measure in lumbar spinal stenosis. Spine (Phila Pa. 1976). 21, 796–803 (1996).
Brøgger, H. A., Maribo, T., Christensen, R. & Schiøttz-Christensen, B. Comparative effectiveness and prognostic factors for outcome of surgical and non-surgical management of lumbar spinal stenosis in an elderly population: protocol for an observational study. BMJ Open. 8 (12), e024949. https://doi.org/10.1136/bmjopen-2018-024949 (2018). PMID: 30573489; PMCID: PMC6303618.
Yi, H. et al. Reliability and validity of simplified Chinese version of Swiss spinal stenosis questionnaire for patients with degenerative lumbar spinal stenosis. Spine (Phila Pa 1976). 39 (10), 820–825. https://doi.org/10.1097/BRS.0000000000000273 (2014).
Hara, N., Matsudaira, K., Masuda, K. et al. Psychometric assessment of the Japanese version of the Zurich Claudication Questionnaire (ZCQ): Reliability and validity. PLoS One. 11 (7) (2016).
Heshmati, A. A. & Mirzaee, M. Reliability and validity of the Swiss spinal stenosis questionnaire for Iranian patients with lumbar spinal stenosis. Arch. Bone Jt. Surg. 6, 119–123 (2018).
Kim, H.-J., Lee, Y.-K., Kim, D. O. et al. Validation and cross-cultural adaptation of the Korean version of the Zurich Claudication Questionnaire in patients with lumbar spinal stenosis. Spine (Phila Pa 1976). 43, E105–E110 (2018).
China Association of Chinese Medicine, Guidelines for prevention on intervention of diseases in degenerative lumbar spinal stenosis. (In Press). (2024).
Sun, Y. N. et al. Development of CORE-CM core outcome domain sets for trials of Chinese medicine for lumbar spinal stenosis. BMJ Open. 13 (10), e075856. https://doi.org/10.1136/bmjopen-2023-075856 (2023). PMID: 37844991; PMCID: PMC10582923.
Wertli, M. M. et al. Validity of outcome measures used in randomized clinical trials and observational studies in degenerative lumbar spinal stenosis. Sci. Rep. 13, 1068 (2023).
Mokkink, L. B. et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: An international Delphi study. Qual. Life Res. 19 (4), 539–549. https://doi.org/10.1007/s11136-010-9606-8 (2010).
Sekiguchi, M. The essence of clinical practice guidelines for lumbar spinal stenosis, 2021: 2. Diagnosis and evaluation. Spine Surg. Relat. Res. 7 (4), 300–305. https://doi.org/10.22603/ssrr.2022-0080 (2023). PMID: 37636148; PMCID: PMC10447202.
Mokkink, L. B. et al. COSMIN risk of Bias checklist for systematic reviews of Patient-Reported outcome measures. Qual. Life Res. 27 (5), 1171–1179. https://doi.org/10.1007/s11136-017-1765-4 (2018). Epub 2017 Dec 19. PMID: 29260445; PMCID: PMC5891552.
Liu, H., Tao, H. & Luo, Z. Validation of the simplified Chinese version of the Oswestry Disability Index. Spine (Phila Pa 1976). 34, 1211–1216 (2009).
Lin, Y. et al. Comparing the reliability and validity of the SF-36 and SF-12 in measuring quality of life among adolescents in China: A large sample cross-sectional study. Health Qual. Life Outcomes. 18 (1), 360. https://doi.org/10.1186/s12955-020-01605-8 (2020).
Stucki, G. et al. Measurement properties of a self-administered outcome measure in lumbar spinal stenosis. Spine (Phila Pa 1976). 21, 796–803 (1996).
Song, C. Y. et al. Validation of the Brief Pain Inventory in patients with low back pain. Spine (Phila Pa 1976). 41 (15), E937–E942. https://doi.org/10.1097/BRS.0000000000001478 (2016).
Lam, E. T. P. et al. Is the SF-12 version 2 health survey a valid and equivalent substitute for the SF-36 version 2 health survey for the Chinese? J. Eval. Clin. Pract. 19 (1), 200–208. https://doi.org/10.1111/j.1365-2753.2011.01800.x (2013).
Mokkink, L. B. et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J. Clin. Epidemiol. 63, 737–745 (2010).
Prinsen, C. A. C. et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Qual. Life Res. 27, 1147–1157 (2018).
Terwee, C. B. et al. Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol. 60 (1), 34–42. https://doi.org/10.1016/j.jclinepi.2006.03.012 (2007).
Yusoff, M. S. B. ABC of content validation and content validity index calculation. Educ. Med. J. 11 (2), 49–54. https://doi.org/10.21315/eimj2019.11.2.6 (2019).
Smith, G. T. On construct validity: issues of method and measurement. Psychol. Assess. 17 (4), 396–408. https://doi.org/10.1037/1040-3590.17.4.396 (2005). PMID: 16393005.
Cahill, P. T., Missiuna, C. A., DeCola, C., Dix, L. & Campbell, W. N. Structural validity and internal consistency of an outcome measure to assess self-reported educator capacity to support children with motor difficulties. Front. Educ. 8, 1174097. https://doi.org/10.3389/feduc.2023.1174097 (2023).
Cheng, Z. et al. Chinesization and reliability and validity test of intensive care oral care frequency assessment scale. China Nurs. Manage. 23 (5), 703–707 (2023).
Pearson, A. et al. Who should have surgery for spinal stenosis? Treatment effect predictors in SPORT. Spine (Phila Pa 1976). 37 (21), 1791–1802. https://doi.org/10.1097/BRS.0b013e3182634b04 (2012).
Husted, J. A. et al. Methods for assessing responsiveness: A critical review and recommendations. J. Clin. Epidemiol. 53 (5), 459–468 (2000).
Wan, C. et al. Development of a common module for the quality of life scale system for cancer patients (III): Analysis of responsiveness. Cancer 26 (4), 4 (in Chinese) (2007).
Wright, A. et al. Clinimetrics corner: a closer look at the minimal clinically important difference (MCID). J. Man. Manip. Ther. 20 (3), 160–166 (2012).
Revicki, D. et al. Recommended methods for determining responsiveness and minimally important differences for patient-reported outcomes. J. Clin. Epidemiol. 61 (2), 102–109 (2008).
Brozek, J. L., Guyatt, G. H. & Schünemann, H. J. How a well-grounded minimal important difference can enhance transparency of labelling claims and improve interpretation of a patient reported outcome measure. Health Qual. Life Outcomes. 4 (1), 69 (2006).
Cleland, J. A. et al. Psychometric properties of selected tests in patients with lumbar spinal stenosis. Spine J. 12, 921–931 (2012).
Yao, M. et al. A comparison between the low back pain scales for patients with lumbar disc herniation: Validity, reliability, and responsiveness. Health Qual. Life Outcomes. 18, 175. https://doi.org/10.1186/s12955-020-01403-2 (2020).
Wilartratsami, S., Santipas, B., Jerapornprapa, R., Vamvanij, V. & Luksanapruksa, P. Reliability and validity of the Thai version of the Swiss Spinal Stenosis Questionnaire. Spine (Phila Pa 1976). 46 (5), E338–E343. https://doi.org/10.1097/BRS.0000000000003781 (2021). PMID: 33534443.
Acknowledgements
We thank all researchers for their contributions.
Funding
This study was funded by the Beijing Tongzhou District Science and Technology Project; the National Natural Science Foundation of China (Grant No. 81803956, 82374617); Capital Health Development Research Project (No. 2020-4-4195); Seed Funding of Golden Bridge Project of Beijing Municipal Science and Technology Commission (No. ZZ21053).
Author information
Contributions
Y.G.: article writing; X.W. and Y.S.: guidance and support; Z.W., H.S., H.B., L.L. and L.Z.: article content, quality evaluation and proofreading; Z.C., M.S., J.L., Y.W. and M.Z.: methodological quality evaluation; C.Y.: methodological design and article quality control, with overall responsibility for the paper.
Ethics declarations
Ethics approval and consent to participate
This study was approved by the Ethics Committee of Dongzhimen Hospital, Beijing University of Traditional Chinese Medicine (DZMEC-KY2017-128). All methods were performed in accordance with the Declaration of Helsinki. All patients who participated in the study gave their written informed consent.
Competing interests
The authors declare no competing interests.
Consent to publish
Not applicable.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Gao, Yx., Weng, Zw., Shao, H. et al. Reliability, validity and minimum clinical importance difference of the Chinese version of the Zurich claudication questionnaire. Sci Rep 15, 16654 (2025). https://doi.org/10.1038/s41598-025-01380-w
DOI: https://doi.org/10.1038/s41598-025-01380-w