Introduction

Post-traumatic stress disorder (PTSD), described as the complex somatic, cognitive, affective, and behavioral effects of psychological trauma1, is a significant mental health concern for pregnant and postpartum women. PTSD can affect both the mother and the newborn2, and it is linked with other mental health issues3.

Several tools exist to screen PTSD and, to our knowledge, there is only one tool specific for the perinatal period. The Perinatal PTSD Questionnaire (PPQ) was initially designed by Hynan in 19984. It was composed of 14 items with dichotomous responses. When tested in american population it demonstrated good psychometric properties5. In 2006, Callahan et al. changed the dichotomous answers to a Likert scale and re-validated the questionnaire, giving guidance on a cut-off score for clinical6. This tool has been examined for its psychometric properties in many languages and cultures including English7, Korean8, Chinese9, and French (in its original dichotomous version)10. For a Spanish version, we searched PubMed using kewords, text terms and word variants for the concept ‘Spanish and validation and posttraumatic stress disorder”. There were 50 citation. We found articles describing use of PPQ in the Spanish setting11, 12, but there was not a single citation of a validation study in this population.

Spanish is one of the most commonly spoken languages worldwide13. In the USA in 2018, it was estimated that 13.5% (41.5 million people) of the population over five years of age speaks Spanish14. As no validation study of Spanish PPQ exists and Spanish is an international language, there is a need to examine the performance of Spanish PPQ. This background led us to conduct a study to validate PPQ in Spanish.

Methodology

An observational cross-sectional validation study was conducted using the modified PPQ questionnaire6 in the Spanish language with a sample of postpartum Spanish women who gave birth in Spain with the approval of the Clinical Research Ethics Committee of Universidad de Jaen (reference number TD-VCDEPP-2019/1417-N-19). All participants received written information on the study, including the fact that participation was entirely voluntary with anonymity guaranteed. Before starting the questionnaire, the women had to read an information sheet about the study and its objectives, and check a box in which they showed their consent to participate in the study: that is, they signed an ad hoc digital informed consent.

The study was conducted in accordance with the principles of the Declaration of Helsinki and was conducted in accordance with Strobe cross sectional guidelines.

Participants

The sample size was estimated in accordance with the criteria for carrying out a factor analysis. These criteria envisage 10 subjects for each item15, and therefore we needed a sample of at least 140 participants. This minimum sample size was exceeded. The exclusion criteria for participation in the study were refusal to participate, being less than 18 years of age, and longer than 6 months after childbirth.

Data collection

An online questionnaire was developed and distributed from November to December 2019 with the collaboration of Spanish midwives' associations. This questionnaire included items collecting sociodemographic and clinical variables, the PPQ and the Edinburgh Postnatal Depression Scale (EPDS).

The following sociodemographic and clinical data were recorded: maternal age, education level, whether or not the pregnancy was desired, live newborn, parity, induction of labor, using natural analgesia, using epidural analgesia, using general anesthesia, type of birth, episiotomy, perineal tear, skin to skin, admission of the newborn to a care unit, degree of partner support, feeling respected by healthcare staff, type of feeding after discharge, surgical intervention, and postnatal hospital readmission. Some of the variables were used to describe the population, and some used in convergence validity.

The second component of the survey consisted of a series of questionnaires. First was the modified PPQ (Appendix S1: Spanish version), a 14-item measure assessing post-traumatic symptoms related to the childbirth experience, including intrusiveness or re-experiencing, avoidance behaviors, and hyperarousal or numbing of responsiveness. The PPQ also contains one item pertaining to feelings of guilt. Response options were modified from the original dichotomous scale to a five-level Likert-type scale (scored 0 to 4). Mothers were instructed to provide responses that reflected their experience during the targeted time frame (1 to 18 months postpartum). The total possible score on the modified PPQ ranged from 0 to 56. In the current investigation, internal consistency was superior to previous investigations using the dichotomous scaling, with an α = 0.906.

The EPDS is a 10-items self-reported scale designed as a specific instrument to detect postnatal depression and has been validated in the Spanish population during pregnancy16 and postnatally17. With a cut-off point of ≥ 10, the sensitivity was 79% and specificity 95.5%. The positive predictive value was 63.2% and negative predictive value 97.7%17. Moreover, it is a simple and widely accepted tool by clinical practitioners18. The EPDS was included to establish the criteria validity.

Data analysis

For sociodemographic and clinical data, the absolute and relative frequencies were used to describe the qualitative variables, and the mean and standard deviation (SD) used to describe the quantitative variables.

First, to determine the validity of the scale used, we analyzed three of the most common validity types: construct validity, convergent validity, and criterion validity.

For construct validity, we opted to carry out an exploratory factor analysis (EFA) to determine the underlying factors through a principal component analysis (PCA). Before carrying out the EFA, we analyzed the Kaiser–Meyer–Olkin (KMO) tests and Bartlett's sphericity tests, to determine whether it was appropriate to apply this analysis. For this to be the case, the KMO should be above 0.6 and as close as possible to 1, and Bartlett's sphericity, which consists of statistical hypothesis testing, should be less than 0.05 to reject the null hypothesis of sphericity and ensure that the factor model is adequate to explain the data. In the EFA, we used Varimax rotation to help clarify the assignation of items to different factors. To determine the number of factors to retain, we used the Kaiser criterion, which is one of the most used criteria. It retains factors with eigenvalues greater than the unit value19.

Within the construct validity, we also analyzed convergent validity, in order to establish the relationship between the PPQ and factors which are believed to be associated with PTSD risk, such as type of birth, admission of the newborn to an intensive care unit (NICU), type of feeding, hospital length of stay, among others. Hence, a bivariate analysis was performed using Pearson's chi-squared or Fisher's t-student tests, depending on whether the variable data were qualitative or quantitative. The results were considered statistically significant when p < 0.05.

To study criteria validity, the Edinburgh scale was exerted with a ≥ 10 cut-off point. To do this, we carried out a sensitivity and specificity study with an analysis of the area under the received operating characteristic curve (AUC) obtained using Swets' criteria20. We also carried out a bivariate analysis between the scores obtained on the PPQ scale and the Edinburgh scale. We again used non-parametrical statistical tests and considered significant associations with p < 0.05.

The reliability analysis was done by studying the Cronbach's (α) to evaluate the internal consistency (IC). The IC indicates to what extent the items in the questionnaire are correlated with each other, and how they fit together and measure the same concept. The α is one of the most widely used measures to assess the reliability of a scale21. Its values range from 0 to 1. One of the most accepted rules is to consider α > 0.9 as excellent, α > 0.8 as good, α > 0.7 as acceptable, α > 0.6 as questionable, α > 0.5 as poor, and α < 0.5 as unacceptable22.

Version 24.0 of the SPSS statistics package was used for analyses.

Ethical approval

This study was approved by the the approval of the Clinical Research Ethics Committee of Universidad de Jaen (reference number TD-VCDEPP-2019/1417-N-19). Before starting the questionnaire, the participants read a fact sheet about the study, its objectives, etc., and marked a box by which they showed their consent to participate in it, i.e., they signed an online informed consent (ticking the option if they wanted to participate or not doing so when refusing to take part in the study). We followed the protocols established to carry out this type of research with the purpose of publication/disclosure to the scientific community. The study was conducted according to the strobe guidelines set in the Declaration of Helsinki and all procedures involving human subjects were approved by the Ethics Committee. All women involved in this study filled out informed consent and data treatment forms to enter the study, in accordance with the ethical standards of the Ethics Committee. All participants received written information on the study, including the fact that participation was entirely voluntary with anonymity guaranteed.

Results

Characteristics of participants

Four hundred thirty-two women agreed to participate in the study and completed the PPQ with a PTSD risk (score ≥ 19) of 11.1% (48). The mean age was 35.4 years (SD = 4.22), and 65.0% (281) were primiparous. Labor was induced in 38.7%, 57.6% (259) had a normal vaginal delivery, and 72.9% (315) needed regional analgesia. Neonatal data showed that 7.9% (34) were admitted to NICU, and 78.0% (337) were exclusively breastfed at the moment of hospital discharge. The remaining descriptive data are detailed in Table 1.

Table 1 Characteristics of the sample included in a validation study of the PPQ Spanish questionnaire for PTSD risk.

Psychometric properties

Factor construct validity

The KMO test gave a value of 0.902, and Bartlett's sphericity test was < 0.01. Therefore, we proceeded to carry out the EFA. Three components explained 63.3% of the variance. The first component, "Arousal," consisted of items 7, 8, 9, 10, 11, 12, and 13 and accounted for 43.0% of variance. The second component, "Avoidance," consisted of items 2, 4, 5, 6, and 14 explained 13.8% of variance, while the third component "Intrusion" was formed by two items, 1 and 3, accounted for 6.3% of total variance. Furthermore, all the anti-image diagonal correlations showed figures higher than 0.86. Table 2 presents the scale items together with their respective factor weights.

Table 2 Rotated component matrix.

Convergent validity

Next, the convergent validity was analyzed using bivariate analysis of the scores from the PPQ questionnaire and various sociodemographic and clinical factors. A statistically significant relationship was observed between PTSD risk with the following variables: Birth plan, type of birth, hospital length of stay, hospital readmission, skin to skin, admission of the newborn to NICU, degree of partner support, feeling respected by healthcare staff, and type of feeding on discharge.

Criterion validity

Using the EPDS as a comparative instrument, it was found that the PPQ, translated and transculturally adapted into Spanish, presented an AUC of 0.86 (95% CI 0.82–0.91), with a good capacity to classify the subjects according to Swets' criteria. The ROC curve can be seen in Fig. 1. The bivariate analysis between the scores of the PPQ and Edinburgh scales shows a significant positive relationship (r = 0.69, p < 0.001).

Figure 1
figure 1

ROC curve. Predictive capacity of the score in the PPQ for PD risk using the Edinburgh questionnaire.

Internal consistency

To evaluate internal consistency, the α of the total of the questionnaire was used, as well as that of each of the dimensions found with the EFA. For the total scale, α was 0.896. All the alfa values scored higher than 0.880 when removing an item, and the general α did not increase by more than 0.01; therefore, we decided to keep them. The α values for each factor are shown in Table 3.

Table 3 Internal consistency of the PPQ.

Discussion

Our analyses demonstrate the internal consistency, and construct and criterion validity of the Spanish PPQ. This allows for confidence in the use of the PPQ tool in a Spanish setting, something that had not been asured prior to our study.

Another important aspect to consider is the detected prevalence of PTSD risk, which stood at 11.1% in our sample. Apparently, it can be high, however, the Prevalence of PTSD is variable depending on the established cut-off point and the study population type. In 2017, a systematic review and meta-analysis of PTSD reported prevalence rates of 4.0% (95% CI 2.77–5.71) in the general population, with 18.5% (95% CI 10.6–30.38) of women at risk23. In addition, it should be clarified that the PPQ tool has a screening and not a diagnostic purpose, therefore it is normal that it presents a higher prevalence than the diagnosed cases.

With regard to factor construct validity the values obtained in KMO tests and Bartlett's sphericity test were adequate; thus, we conducted the EFA. Three components accounted for 63.3% of variance. The English, Korean, and Chinese versions explained 65%, 67%, and 51%, respectively, of variance7,8,9. Regarding components distribution, none of them match among the published versions, as can be seen in Table 2. Moreover, the Korean, Chinese, and Spanish versions coincide in the first component items.

The questionnaire presents an adequate convergent validity as is associated with variables linked previously with PTSD risk, such as type of birth11, 12, 23,24,25,26,27,28,29,30, prematurity5, 31, neonatal admission to care unit25, skin-to-skin11, and type of feeding23. Furthermore, other associations with PTSD risk were observed, including the degree of partner support and feeling respected by healthcare staff. We also used the same cut-off point (≥ 19) used by Callahan et al.6 to consider the risk of PTSD, being the same as that considered by the authors for clinical application; thus bringing the validation closer to its true clinical setting application.

The criterion validity was then evaluated using the EPDS. I opted for this tool because other authors have observed a strong correlation between PTSD and postpartum depression (33), it is a very well-known instrument used by professionals in clinical practice18. Specifically, we used EPD scores of ≥ 10 for determining the predictive capacity, finding almost excellent ROC AUC values.

Finally, internal consistency was evaluated and we found values very close to those found in the English version (Cronbach's α = 0.90)7 and for the Korean version (Cronbach's α = 0.91)8. The lowest internal consistency values were found in the Chinese version (Cronbach's α = 0.837)9, and especially in the French version (Cronbach's α = 0.77)10. This large difference could be explained because all the versions except the French used Likert-type scale while the French version used dichotomous responses.

With the validation of this questionnaire, practitioners can count on a new tool to identify those women who are at risk of developing PTSD after delivery. The tool is simple and easy to apply, in such a way that it could be included as another assessment tool during the postpartum period, just as the EPDS is used almost systematically for PD screening18. Health professionals with this type of tool can direct efforts towards the early detection and prevention of the consequences of a prevalent problem with an increasing trend and that has important consequences for the health of women and their offspring32, 33.

The validation of this instrument has a special relevance in the field of PTSD research, since to date there is no specific instrument for assessing the risk of perinatal PTSD in the Spanish-speaking population. Validations, as recommended by scientific societies34, they are essential so that researchers can use the assessment instruments in future research and can obtain valid results, establish comparisons, and measure the impact on women's health.

The strengths of our study include the opportunity to evaluate the Spanish PPQ across a diverse sample of a sociodemographically and clinically varied group of puerperal women. We had ample sample size for our evaluation with precisions. We also decided to only include women who had given birth 6 months ago to reduce memory bias as much as possible. There are various limitations of our study. Once the women who declined to take part in the study had been considered, there was no reason to believe there had been any selection bias as the number of non-participants was small, and the sample was consecutively selected. Regarding information bias, using an online questionnaire to collect data could be a limitation due to the lack of access that some women may experience, however, Callahan et al. already used this system for validation previously6.

Conclusions

In conclusion, the validation of the Spanihs PTSD Questionnaire, PPQ in postpartum Spanish women, showed adequate psychometric characteristics, which makes it appropriate for clinical practice in Spanish setting.