Introduction

It has been globally accepted that spinal anesthesia (SA) is the first-line anesthesia for deliveries by cesarean section (CS), since SA provides a better neonatal outcome as well as maternal safety1,2. General anesthesia (GA) is thus used in a very limited number of CS cases and is typically indicated for emergent/urgent cases or for patients with contraindications for neuraxial anesthesia.

At our institution, almost 100% of the cesarean cases performed since the 1960s were done with GA due to our institutional policy, but in 2013 we adopted SA as the first-line anesthesia in cesarean cases in accordance with globally accepted obstetric practice guidelines. We have thus given priority to SA for CS cases since 2013. However, we speculated that the actual rate of SA usage at our institution after 2013 was around 60% to 70%. In contrast, an analysis of recent obstetric anesthetic practice in the U.S. using National Anesthesia Clinical Outcomes Registry data (n = 218,285 CS cases) showed that the rate of SA usage was 94.2% of all CS cases and 85.4% of emergent CS cases3. In Japan, a nationwide population-based study using obstetric data from 2010 to 2013 revealed that the proportion of SA usage in elective CS cases was 88.7%1. We thus realized that the rate of SA usage for CS at our institution was still “moderate,” considering that SA usage at other institutions is approximately 90%.

This led us to ask the following question: "How often should anesthesiologists perform SA in order to contribute to better neonatal outcomes when managing CS delivery?” Put another way: "Did this moderate rate of SA implementation at our institution improve neonatal outcomes?".

To answer these questions, we conducted the present retrospective case–control study using the obstetric data and compared the neonatal outcomes before to those after the CS anesthetic protocol was modified.

Results and discussion

A total of 1350 patients treated during the period from 1994 to 2017 were included in the analyses. Among them, 24 patients were excluded due to incomplete data; thus, 1326 cases were included in the final analysis.

Review of the 2013–2017 obstetric data

After the anesthetic protocol was modified, 413 CS cases were performed. Among the 413 patients, 287 (69.5%) received SA and 126 (30.5%) received GA. The rates of SA use for each type of surgery are shown in Table 1: these rates were 78% in the elective cases, 61.4% in the emergent cases, and 30.4% in the urgent cases.

Table 1 GA and SA after the modified protocol.

Among the patients who received GA for elective CS, there were no cases with a fetal cardiotocography grade over 3. Even in the emergent surgery group, cases with a fetal cardiotocography grade over 3 accounted for only 35.7% of cases. In contrast, almost 70% of the urgent cases had fetal bradycardia, and > 90% had obstetric complications.

The effect of the modified anesthetic protocol on the neonatal outcomes

Patient characteristics: all CS cases in 1994–2017

The characteristics of all 1326 cesarean sections performed during the study period (i.e., 1994–2017) are summarized in Table 2. Compared to the patients after 2013 (i.e., when the anesthesia protocol was modified), those who underwent a CS before 2013 were significantly older and were at earlier gestational ages (i.e., non-full term). The proportions of emergent surgeries and of neonates with birthweights ≤ 1500 g were significantly higher before 2013. In contrast, the proportion of patients with obstetric complications was significantly lower before 2013. The incidence of neonatal asphyxia after 2013 was significantly lower at 1 min but not at 5 min.

Table 2 The characteristics of all cesarean section cases between 1994 and 2017.

A case–control study

Logistic regression analysis of the primary outcome

No linearity was observed for any of the quantitative variables, but among the four possible confounders (the anesthetic protocol, maternal obstetric complications, type of surgery, and GA usage), the GA usage had significant multicollinearity with the modified anesthesia protocol (Spearman's rank correlation coefficient rs = 0.78). This result indicated that the modified anesthetic protocol was significantly associated with the rate of GA usage, since the rate of GA usage was almost 100% before the protocol was modified at our institution but it clearly decreased thereafter. Therefore, inclusion of the GA usage in this regression analysis would have interfered with the inverse matrix calculations, resulting in low accuracy. We thus excluded general anesthesia as a variable in the subsequent analyses.

The results of the logistic regression analysis are given in Table 3. The p-values of six variables (modified protocol, maternal age, gestational age, emergent surgery, neonatal weight, and cord blood pH) were < 0.25 in the univariate analysis. We thus used these six variables in the multivariate analysis. The Hosmer–Lemeshow test showed that the fit for the logistic model was good (p = 0.403). The percentage of correct classifications was also good (93.1%). No outliers (defined as a prediction value ≥ the measured value ± 3 SD) were identified. When the six variables in the prediction model were allowed to vary independently, 82.7% of the cross-validated grouped cases were correctly classified.

Table 3 Univariate and multivariate analysis of risk for APS at 5 min < 7.

The modified protocol variable was not a significant independent factor for neonatal asphyxia (p = 0.145). Emergent surgery was also not significant. In contrast, the other four variables (maternal age, gestational age, neonatal birthweight, and cord blood pH) were significant independent factors.

Logistic regression analysis of the secondary outcome

We performed a multivariate analysis using the seven variables in the logistic regression analysis of the secondary outcome (i.e., the incidence of neonatal asphyxia at 1 min after birth). The Hosmer–Lemeshow test showed that the fit for the logistic model was good (p = 0.780). The percentage of correct classifications was also good (84.0%). No outliers (defined as a prediction value ≥ the measured value ± 3 SD) were detected. When the seven variables in the prediction model were allowed to vary independently, 78.6% of cross-validated grouped cases was correctly classified. The modified anesthesia protocol variable was not a significant independent factor for neonatal asphyxia (p = 0.999) (Table 4).

Table 4 Univariate and multivariate analysis of risk for APS at 1 min < 7.

We conducted a retrospective case–control study to clarify the impact of the modified anesthetic protocol for CS at our institution on neonatal outcomes. This protocol gives priority to SA. Our findings demonstrated that the rate of SA usage for CS at our institution was approximately 70%. Given that the rate of SA usage in other institutions is approximately 90%1,3, we considered that the observed rate of implementation of SA at our institution was “moderate”. The results of our analyses also revealed that this moderate prioritization of SA did not improve the neonatal outcomes compared to a strict prioritization of GA (SA 0%), as the modified protocol was not associated with the incidence of neonatal asphyxia at either 1 or 5 min after birth (1-min APS < 7, OR: 1.00, 95%CI: 0.64–1.57, p = 0.999; 5-min APS < 7, OR: 1.56, 95%CI: 0.86–2.85, p = 0.145).

Before 2013, almost none of the patients who underwent a cesarean section at our institution received SA, but after 2013, the rate of SA usage was dramatically increased from 0 to 70% of all CS deliveries. We assumed that such a dramatically increased rate of SA usage would be associated with an improvement in neonatal outcomes, since SA has been reported to be ranked best for APS in CS, among the available anesthetic techniques1,2. However, the results of the present analyses showed that there was no significant difference in the incidence of neonatal asphyxia between a 70% SA-adoption rate for CS deliveries and a 0% SA-adoption rate for CS deliveries. The rate of SA use has been reported to be strict in the U.S. (i.e., 94.2% of all CS cases)3 as well as in Japan (i.e., 88.7% of elective CS cases)1. The perinatal registration database of the Japan Society of Obstetrics and Gynecology showed that the rate of neonatal asphyxia with 5-min APS of < 7 in 2018 was 2.4% of all deliveries. A retrospective analysis of institutional data from Finland showed that the rate of neonatal asphyxia with 5-min APS of < 7 was 0.4–0.6% in CS cases4. A population-based study in Australia also showed that the incidence of neonatal asphyxia in CS (5-min APS < 7) was low at 1% with 83.2% of SA usage5. In contrast, the rate of neonatal asphyxia at our institution was higher at 7% with lower SA usage at 70%. These findings indicate that “strict” (i.e., > 90% or > 80%) SA performance may be needed to improve neonatal outcomes.

The lower rate of SA usage at our institution may have been due to a high rate of maternal indications for GA in the elective as well as emergent cases. Among the present group of elective GA cases, we observed no cases with a CTG grade over 3, and only 35.7% of the emergent GA cases had a CTG grade over 3. We therefore speculate that the use of a stricter set of SA/GA indications can significantly decrease the incidence of neonatal asphyxia.

SA is advantageous for neonatal well-being. However, we should consider the limits of SA that cannot be prolonged during surgery. In the case of bladder injuries during the CS, GA should be considered. Franchi et al. reported that among 28,822 deliveries, 7616 were CS cases (26.4%) and 3 cases of unintentional transvesical (UTV)-CS, defined as any extraction of the fetus through a double full thickness bladder wall cystotomy, were identified (incidence rate: 0.039%)6. Considering this previous report on the incidence of UTV-CS6, and extrapolating the perinatal registration data base of Japan, we also had estimated 3 UTV-CS cases in 2018. We did not collect the data on UTV-CS in the present study. The effect of this uncollected data on the results is unknown.

Some limitations of this study should be acknowledged. First, this study had a single-center, retrospective design. Thus, the external validity of the present prediction model was not assessed. Second, we conducted a multivariate logistic regression analysis for risk adjustment. However, unknown or unmeasured confounders might exist and could lead to a residual bias. We could not collect the detailed obstetric data such as the CTG grade, IUGR, congenital anomaly, and obstetric complications before 2013, while those were obtained after 2013. Such detailed maternal and fetal data may have been identified additional confounders in this study. In addition, maternal background pathologies can also be possible confounders, since these factors can affect the neonatal outcomes as well as the choice of anesthesia for CS. We collected obstetric complications such as preeclampsia, pregnancy induced hypertension, gestational diabetes mellitus7, placenta previa, and placental abruption during the study period. However, we did not collect the data on endometriosis. This must be considered an unmeasured confounder in the present study, since endometriosis is an estrogen-dependent disease that affects women in reproductive age8,9 and is correlated with a risk of hypertension, placenta previa, and low newborn weight. We should consider investigating this point in a future paper. Third, almost 100% of the CS patients at our institution received GA before 2013. Such completely homogenous data might have influenced the results. However, we were able to use the data to compare the neonatal outcomes of strict GA usage (0% SA) with that of “moderate” SA usage (70% SA), and this could be considered a strong point of this study. Fourth, we evaluated the neonatal outcomes using the 1-min and 5-min APS values, but the predictive value of a low APS for poor neonatal outcome can be controversial. Low APS has been reported to be no evidence of asphyxia2. In contrast, it has been reported that the “traditional” low APS defined as < 7 remains relevant for the prediction of neonatal adverse outcomes10,11. Some significant studies showed that APS at 5 min was a prognostic factor of neonatal adverse outcomes2,10,12 and indeed chose APS at 5 min as a measure to evaluate neonatal outcomes5,13. In a joint statement issued by the Japan Society of Obstetrics and Gynecology and the Japan Association of Obstetricians and Gynecologists, neonatal asphyxia was defined as a case with 5-min APS of < 7. However, asphyxia was defined as a cord umbilical arterial pH value < 7.20 as well14. We thus should have evaluated the risk of neonatal asphyxia defined as a cord umbilical arterial pH value < 7.20 as well; however, it was not possible to evaluate this due to the underpowered number of neonates with a cord pH value of 7.2 in this study (n = 32). Finally, we tried to answer the question “How often should anesthesiologists perform SA for better neonatal outcomes?” In answer, we found that a rate of more than 80–90% SA use is needed. However, this answer assumes that the incidence of neonatal asphyxia is low at institutions with a high rate of SA implementation, and also assumes that SA is advantageous for neonatal well-being. The effect of a high rate of SA use on neonatal outcomes must be investigated in the future.

In conclusion, SA is considered the first-line anesthesia for cesarean sections due to the better neonatal outcomes it provides. However, the results of this retrospective case–control study demonstrated that an approx. 70% rate of SA implementation did not improve the neonatal outcomes compared to a 0% rate of SA implementation. Because other institutions with higher rates of SA implementation have reported better neonatal outcomes than ours, an 80–90% rate of SA use may be needed to contribute to better neonatal outcomes of cesarean sections.

Methods

Patient data collection

We obtained the patients' data from the obstetric and anesthetic charts of cases of cesarean sections performed at our institution between 1994 and 2017.

Review of obstetric characteristics after 2013

We first reviewed the obstetric and anesthetic characteristics of the CS cases managed after the anesthetic protocol was modified in 2013 in order to determine how often SA was administered. The collected data were as follows: maternal age, body mass index (BMI), gestational age, type of surgery (elective, emergent, urgent) and anesthesia (spinal or general anesthesia), the number of patients who received tocolytic therapy, premature rupture of the membrane, and obstetric complications as well as the neonatal birthweight, umbilical cord blood pH, Apgar score (APS) at 1 and 5 min, cardiotocography (CTG) grade, the number of congenital anomalies, intrauterine growth retardation, malpresentation, and meconium staining. An APS < 7 was defined as neonatal asphyxia15, and abnormal umbilical cord blood pH was defined as < 7.22.

A case–control study

The definition of “cases” and “controls”

In accordance with the STROBE statement for observational study16, we considered the present retrospective study as a case–control study. We defined cases as CS deliveries performed after 2013, when the CS anesthetic protocol was modified at our institution. In contrast, controls were the CS deliveries performed before 2013. The primary neonatal outcome was the infant's well-being evaluated by the incidence of neonatal asphyxia at 5 min after birth, defined as a 5-min APS of < 7. The secondary outcome was the incidence of neonatal asphyxia at 1 min after birth, defined as a 1-min APS of < 7. We conducted the present case–control study using a logistic regression analysis. The dependent variable was the primary and secondary outcomes described above (i.e., neonatal asphyxia defined as a case with APS at 1 and 5 min < 7). The details of the analysis are described below.

The definition of neonatal asphyxia

In the present study, neonatal asphyxia was defined as a case with 5-min APS of < 7. A low APS has been reported to be defined as < 7, and has been associated with adverse clinical outcomes2,10. In an analysis of the data for 113,300 preterm infants from the Swedish Medical Birth Register, APS at 5 and 10 min provided prognostic information about neonatal survival12. In a prospective study investigating the effect of anesthetic technique (GA vs. SA) on neonatal morbidity in emergency CS for fetal distress, the neonatal morbidity was defined as a 5-min APS < 7, need for mechanical ventilation, admittance to a neonatal intensive care unit, or respiratory insufficiency symptoms13.

The logistic regression analysis

We investigated whether the modified protocol (i.e., using SA as the first-line anesthesia) affected the fetal outcomes at our institution by performing a logistic regression analysis of the clinical data obtained in the CS cases at our institution from 1994 to 2017. The collected data were as follows: maternal age, BMI, gestational age, obstetric complications, type of surgery (elective, emergent), anesthesia (spinal or general anesthesia), neonatal birthweight, umbilical cord blood pH, and APS at 1 and 5 min.

We prepared logistic regression models to estimate the risk of neonatal asphyxia associated with potential confounders, including clinical variables such as the modified anesthetic protocol, maternal age, BMI, gestational age, obstetric complications, type of surgery (elective or emergent), use of general anesthesia, neonatal birthweight, and umbilical cord blood pH. The inclusion of variables in the models was based on the current clinical knowledge of risk factors for neonatal asphyxia. We converted the quantitative variables (e.g., maternal and gestational age, BMI, neonatal body weight, and umbilical cord pH) into categorical variables as follows: maternal age (years old: ≤ 19 = 1, 20–34 = 0, 35–39 = 2, ≥ 40 = 3), gestational age (weeks: 37–41 = 0 as full-term, others = 1 as non-full-term), BMI (≤ 24 = 0, 25–29 = 1, ≥ 30 = 2), neonatal birthweight (> 1500 g = 0, ≤ 1500 g = 1), umbilical cord pH (> 7.2 = 0, ≤ 7.2 = 1).

Other categorical variables such as the anesthetic protocol, type of surgery, maternal obstetric complications, and use of GA were coded as follows: anesthetic protocol (patients before = 0, patients after the protocol was modified = 1), type of surgery (elective = 0, emergency = 1), maternal complications (without = 0, with = 1), GA usage (no = 0, yes = 1). The linearity of the dependent variable (i.e., the APS at 5 min) for neonatal asphyxia in the logit of the independent variable was assessed using locally weighted least squares smoothing curves. This analysis was conducted to confirm that binomial logistic regression was the appropriate method of analysis.

As a first step, we performed a univariate logistic regression analysis between the neonatal outcomes and each potential independent variable. Pearson's product-moment correlation coefficient or Spearman's rank correlation coefficient was calculated to check for multicollinearity of categorical variables. The scatter plot was also used to check for multicollinearity of quantitative variables. The probability (p)-levels for the model used a variable entry criterion of 0.25. The variables with p-values < 0.25 were thus analyzed in the next step (i.e., the multivariate analysis).

We used multivariate logistic regression models with the direct method. We estimated a logistic model which contained all covariates as possible confounders. None of the interaction terms were included in the multivariate model. The goodness of fit for the logistic model was assessed with the Hosmer–Lemeshow test. Data with missing values were excluded from the logistic regression analysis with commonly used statistical software. To examine the degree of overfitting of the prediction model to the development sample, we performed a cross validation procedure.

Statistical analysis

For continuous variables with a normal distribution, the mean (± standard deviation [SD] or standard error [SE]) is reported. For variables not normally distributed, the median and interquartile ranges are reported. Categorical variables are presented as numbers and percentages. P-values < 0.05 were considered significant. The χ2 test was used for the analysis of categorical data, and Student's t-test was used for continuous variables with normal distributions. The Mann–Whitney rank-sum test was used for continuous variables without a normal distribution. The 95% confidence interval (CI) was computed around the mean difference in two values or the odds ratio (OR).

In the logistic regression analysis of the obstetric data obtained in the period 1994–2017, we followed standard methods to estimate the sample size that is necessary for a multivariate logistic regression, with at least ten outcomes needed for each included independent variable. With an expected neonatal asphyxia at 5 min incidence rate of 7%, we required 1000 patients to appropriately perform the multivariate logistic regression with seven variables. All statistical analyses were conducted with IBM SPSS statistics ver. 22.0 software (IBM, Tokyo, https://www.ibm.com/support/pages/downloading-ibm-spss-statistics-22).

Ethical approval and informed consent

This single-center, retrospective study was carried out at Hirosaki University Hospital, Hirosaki, Japan. The study was approved by the Hirosaki University Graduate School of Medicine Institutional Review Board (rinri@hirosaki-u.ac.jp, approval no. 2014-079, data of approval: June 11th, 2014) before it began, and written informed consent was obtained from all patients participating in the study. If subjects were under 18, written informed consent was provided from a parent or legal guardian. All experiments were performed in accordance with relevant guidelines and regulations.