External recordings of the electrohysterogram (EHG) can provide new knowledge on uterine electrical activity associated with contractions. Better understanding of the mechanisms underlying labor can contribute to preventing preterm birth which is the main cause of mortality and morbidity in newborns. Promising results using the EHG for labor prediction and other uses in obstetric care are the drivers of this work. This paper presents a database of 122 4-by-4 electrode EHG recordings performed on 45 pregnant women using a standardized recording protocol and a placement guide system. The recordings were performed in Iceland between 2008 and 2010. Of the 45 participants, 32 were measured repeatedly during the same pregnancy and participated in two to seven recordings. Recordings were performed in the third trimester (112 recordings) and during labor (10 recordings). The database includes simultaneously recorded tocographs, annotations of events and obstetric information on participants. The publication of this database enables independent and novel analysis of multi-electrode EHG by the researchers in the field and hopefully development towards new life-saving technology.
Background & Summary
Preterm birth is defined as birth before 37 completed weeks of gestation. On average, 12% of babies are born preterm in lower-income countries and 9% in higher-income countries1. Babies who are born preterm often require special care and face greater risks of serious health problems, including cerebral palsy, intellectual impairment, chronic lung disease, and vision and hearing loss. Preterm birth is the leading cause of newborn deaths (babies in the first four weeks of life) and the second leading cause of death in children under five years (after pneumonia). Preterm birth rates are increasing in almost all countries1.
Currently, there is no effective way of preventing preterm birth. The main reason is that no good objective method currently exists to evaluate the stepwise progression of pregnancy through to labor, neither at term nor preterm. Various techniques have been adopted to monitor and/or diagnose labor, but they are either subjective or indirect and do not provide an accurate prediction of when labor will take place2,
Parts of the data have already been used for developing and analyzing various signal processing methods and have led to several publications8,10,
We provide open access to the database so that the international scientific community can freely generate greater understanding of the mechanics of the uterus and develop applications that improve obstetric care and hopefully accurately predict preterm labor. This paper describes the recording methods used and gives a detailed description of the Icelandic 16-electrode electrohysterogram database.
The recordings were performed between 2008 and 2010 in Iceland. Pregnancy recordings, defined as recordings performed at antenatal care clinics on participants in the third trimester and not suspected to be in labor, were performed at Akureyri Primary Health Care Centre and Landspitali University Hospital. Labor recordings, defined as measurements performed on participants suspected to be in labor, present in the labor wards and who delivered within 24 h, were performed at Landspitali University Hospital and Akureyri Hospital. Participants were invited to take part in the recordings during antenatal care visits or at the labor wards and had normal singleton pregnancies and no known risk factors for preterm birth. Informed consent was obtained from every participant and the protocol was approved by the National Bioethics Committee in Iceland (VSN 02-006-V4). After each pregnancy recording, the participant was invited to take part in another recording one to two weeks later. All data can be found in PhysioNet (Data Citation 1: PhysioNet http://dx.doi.org/10.13026/C2159J).
Reusable Ag/AgCl electrodes with a 13.0 mm outer diameter and an 8.0 mm inner diameter were used for the recordings. An alignment frame, a double sided hypoallergenic adhesive sheet and a silicone backing were designed and manufactured to enable a standardized electrode setup with a 17.5 mm distance between adjacent electrode centers. The alignment frame was used to align and attach an uncovered side of the double sided adhesive sheet to the silicone backing. The dimensions of the double sided adhesive sheet and silicone backing can be seen in Fig. 1 along with the back view of the silicone backing attached to the double sided adhesive sheet.
The electrodes were then placed into the holes in the silicone backing and attached to the adhesive sheet. The abdominal skin of the participant was carefully prepared using an abrasive paste and alcohol solution. After filling the electrode holes with electrode gel, wiping the excess gel away with a straight edged card and uncovering the other side of the double sided adhesive sheet, the electrode-adhesive-silicone matrix was attached to the abdomen. The electrode numbering scheme, as seen when looking at the abdomen of the participant, can be seen in Fig. 2.
The desired position on the abdomen was with the third vertical line of electrodes (electrodes 9 to 12) placed on the median axis of the uterus and the 10th–11th pair of electrodes half way between the uterine fundus and pubic symphysis. The navel was avoided by displacing the matrix up or down whilst staying as close as possible to the desired position. The skin over the iliac crests on both sides was prepared in the same way as the abdomen and a patient ground electrode and reference electrode with electrode gel were then attached on each side using adhesive washers, with inner diameters corresponding to the inner diameter of the electrodes. The locations of the reference and patient ground electrodes were not standardized to certain sides for the recordings. The electrode positions can be seen in Fig. 3.
A tocodynamometer was also attached to the abdomen during recordings. For pregnancy recordings, the participants were seated in recliner chairs and a support, such as a small pillow, was positioned under the right side of the participants to prevent potential aortocaval compression syndrome. For labor recordings, the participants were lying on their beds in the maternity wards and the researcher did not try to affect their positioning. A photo of the setup during a recording can be seen in Fig. 4.
The intended duration of a pregnancy recording was one hour and the intended duration of a labor recording was at least half an hour, but the participant could stop the measurement at any time.
The measurements were performed using a sixteen channel multi-purpose physiological signal recorder (Embla A10), most commonly used for investigating sleep disorders. An anti-aliasing filter with a high cut-off frequency of 100 Hz was used but no high pass filter was used. The signal sampling rate was 200 Hz and the signal was digitized to 16 bits. The sixteen monopolar electrode signals were originally stored in the EDF (European Data Format) format by the Somnologica software used to control the Embla A10.
All the recordings were performed by the same person (A.A.). Each participant was assigned an ID number and for each recording, information on participant age, body mass index (BMI) gestational age, placental position, gravidity, parity, history of cesarean section, eventual mode of delivery and gestational age at delivery was noted. During a recording, the researcher recorded participant movements, equipment manipulation, participant-reported possible contractions and fetal movements, and any other unusual events. The participant and researcher conversed freely during the recordings and no restrictions were placed on the participant in terms of changing position if needed.
No processing of data was performed beyond converting between file formats. The aim is that those that use the data can do so with a fresh start. Future additions to this database may include pre-treated signals and segmented contractions.
The EDF files obtained during the recordings were converted into WFDB (WaveForm DataBase—www.physionet.org/physiotools/wfdb.shtml) compatible signal (.dat) and header (.hea) files using the edf2mit WFDB application (www.physionet.org/physiotools/wag/edf2mi-1.htm) and Cygwin software (cygwin.com). Annotation (.atr) files containing events during recordings were created manually by using the WFDB-compatible signal and header files and WAVE, an X Window System client application (www.physionet.org/physiotools/wfdb.shtml#WAVE). The wfdb2mat WFDB application (www.physionet.org/physiotools/wag/wfdb2m-1.htm) was used to convert the WFDB-compatible signal and header files into MATLAB (.mat) and corresponding header (.hea) files. The rdann function from the WFDB Toolbox for MATLAB (www.physionet.org/physiotools/matlab/wfdb-app-matlab) was used to create MATLAB (.mat) files containing the information from the annotation files. The tocograph paper traces were scanned to JPEG images and a time axis corresponding to the recording was inserted onto the scanned images. All information that could possibly lead to the identification of the participant, such as personal data and dates, was manually removed from these images using a graphics editor.
A total of 122 recordings were performed on 45 participants. Of the 45 participants, 32 were measured more than once during the same pregnancy and the highest number of recordings for a participant was seven recordings. Ten recordings were performed during labor and five participants took part in measurements during both pregnancy and labor. The lowest gestational age was twenty nine weeks and five days (29w5d—pregnancy recording) and the highest gestational age was forty one weeks and five days (41w5d—labor recording). The average recording duration for pregnancy recordings was 61 min (range 19–86 min) and the average recording duration for labor recordings was 36 min (range 8–64 min).
File names in the database are of the form ice###_*type*_*record number*, where ice### is the ID of the participant (e.g., ice001), *type* refers to the type of recording: p (pregnancy) or l (labor), and *record number* is the number of recording for that particular participant (e.g., 1of3).
Each recording has three associated files:
-A scanned tocograph with a manually inserted recording time axis (.jgp file). Each small square represents 30 s.
-A binary signal (.dat) file containing the data from the 16 monopolar electrodes.
-A header (.hea) file:
The fields in the header files are according to the WFDB convention and are listed in detail on the PhysioNet website (www.physionet.org/physiotools/wag/header-5.htm). Lines 2–17 of the header files are signal specification lines and the strings at the end of these lines correspond to the signal labels. The signals are labelled with ‘EHGn’ where ‘n’ refers to the relevant electrode number (electrode numbers are not in ascending order).
Information from each recording is at the end of the header files and includes:
-Record type (labor, pregnancy)
-Age of participant (years)
-BMI (body mass index) of participant before pregnancy
-BMI of participant at time of recording
-Gravidity (number of times participant has been pregnant, including current pregnancy)
-Parity (previous births after 22 weeks gestation)
-Previous cesarean (Yes, No)
-Gestational age at recording (weeks/days), according to a first trimester ultrasound
-Gestational age at delivery (weeks/days),
-Mode of delivery (Vaginal, Vaginal/Induction, Elective cesarean, Emergency cesarean due to slow progress, Emergency cesarean due to other than slow progress). Vaginal delivery indicates spontaneous onset unless appended with /Induction.
-Synthetic oxytocin use in labor (Yes, No)
-Epidural during labor (Yes, No)
-Comments for recording
-Comments for delivery
For 111 of the recordings, there is also an annotation (.atr) file containing the type and timing of events.
The types of event are:
- C—Contraction. Used when the participant feels a contraction or there is a very likely contraction on the tocograph (not always used when there is an obvious contraction on the tocograph).
- (c)—Possible contraction. Used when there is not a very likely contraction but the participant has pressure sensation or a contraction is suspected on the tocograph.
- pm—Participant movement.
- pos—Participant change of position.
- fm—Fetal movement. Used when the participant feels fetal movement.
- em—Equipment manipulation. Used when electrodes are pressed more firmly onto the abdomen if otherwise not explained in the comments.
The database also includes:
-The zip file icelandic16ehgmat.zip that includes MATLAB (.mat) versions of all the signal files along with header files (file names of the form ice###_*type*_*record number*m) and MATLAB (.mat) versions of the annotations (file names of the form ice###_*type*_*record number*m_ann). This is provided for the convenience of users that want to analyze the data in MATLAB.
-RECORDS.txt containing a list of the recordings by record name, with one record name per line.
Table 1 (available online only) contains the clinical information from each recording (information from the header files excluding comments) along with the recording duration and whether or not the recording has a corresponding annotation file. This information can also be found in info.txt in the database.
The EHG signal has been shown to be representative of uterine contractions6 and EHG is, in general, a well-proven technique5. The proof of concept and technical validation for the recording method used for the database was made in a preliminary study in 2007 (ref. 9).
The preliminary study applied recognized EHG techniques to a new recording setup involving a 4-by-4 grid. This was mainly to better observe and analyze the spatial characteristics and propagation of the electrical activity during contractions rather than just the activity in a single location of the uterus as had been done before2,6. The results from the preliminary study showed a very acceptable SNR (signal to noise ratio) of bipolar signals (difference between adjacent monopolar electrodes)29. The use of the monopolar signals singly was however considered problematic, even with adaptive filtering methods due to external common mode noise (maternal ECG, respiration movements etc.). Efforts to use recent techniques such as empirical mode decomposition (EMD) and canonical component analysis (CCA) to clean up the signal have since met with some success30. The preliminary study data was also used to present a moving picture of the electrical activity during contractions. The activity was clearly observed and correlated well with the simultaneous tocograph trace9.
In the preliminary study, electrodes were placed one at a time for the 4-by-4 grid, which was a time-consuming task and achieving the desired inter-electrode distance required great operator precision. To address these issues, a placement guide system was specifically developed (described in Methods). The system has a standardized setup ensuring a consistent distance between electrodes. The database therefore contains recordings made in a very similar way to the preliminary study, with the same electrode configuration, the same electrodes and same recording device but with a slightly smaller inter-electrode distance (17.5 versus 21 mm). The system has a shorter electrode attachment time than for the one by one electrode attachment method from the preliminary study and the placement of the electrodes into the silicone backing can be done before a recording, shortening the setup time for an actual recording to around five minutes. This enables recordings to be performed when there is little time and reduces any inconveniences for participants and health professionals. The data from the preliminary study is not included in the database presented here.
All recordings were performed by the same researcher (A.A.) using the same protocol. The researcher (A.A.) stayed with the participants during recordings, recorded events first hand and monitored the equipment and electrode readouts continually. The tocodynamometer was recalibrated to 20 if readouts were zero. In the Embla A10 machine, an electrode that is unconnected or floating will give a signal which very quickly goes to saturation and is therefore easily recognized. If during a recording, an electrode gave a trace that was visually very unlike the traces of other electrodes or displayed values notably different from other values, it was pressed more firmly onto the abdomen to ensure connection to the skin. If all electrode traces seemed suboptimal (e.g., very noisy visually), the patient ground and reference electrode connections were inspected and the electrodes pressed more firmly onto the skin. In a few cases when suboptimal traces did not improve, the electrodes were reconnected and the recording then resumed. We therefore assume that although there may be parts of the data where the contact is faulty between the skin and the electrode, they are few and moreover they are easily recognizable.
Parts of the data have already been used for developing and analyzing various signal processing methods and have led to several publications8,10,
Although EHG has become, in general, a well-proven technique, the EHG signals are known to be problematic in the sense that they are very low frequency and very low amplitude.
The individual monopolar signals contain the measurement of the electric potential at each site in the matrix as referenced to the patient ground, with the reference electrode as a reference for the patient ground circuit. The patient ground and reference electrodes were always more than 20 cm away from each electrode in the matrix (varying between participants and gestational ages) and positioned over the iliac crest where little electrical activity is suspected. No use was made of the other electrodes as reference, average or otherwise. The raw signals therefore contain everything that creates a potential difference between the monopolar electrode and the patient ground. This includes the maternal and fetal ECG and some EMG from striated muscles. The signal also contains artifacts related to movements of the participant, fetal movement, and even fetal respiration has been observed. Fetal hiccups can give periodic spikes that are clearly visible. Our system proved very robust to 50 Hz line noise and this was never observed during measurements. Neither active shielding nor active grounding was used in the recordings.
The setup resulted in a high level of electrode offset varying very slowly over time. There were considerable differences between electrodes during the same measurement sessions, with a typical value of the offset being several millivolts while the EHG signal is at the level of several tens of microvolts. Any processing of the signals could include high pass filtering with a very low cut-off frequency and/or a creation of a new reference using adjacent electrodes (i.e., bipolar or Laplacian).
During pregnancy recordings, the fetal or maternal heart rates were not recorded. During labor recordings, fetal heart rate was usually monitored by the clinical team, but was not collected as a part of the study.
The annotations and the tocograph complement each other. Some contractions that are present in the annotations are not obvious on the tocograph and some obvious contractions on the tocograph are not in the annotations. This explains in part why some recordings are without annotations. Noted fetal movement could last for differing amounts of time and participants did not always notify if they felt fetal movement. Sometimes fetal movement can be seen on the tocograph.
The first sample of a signal file is indexed with 1 in the .mat files but with 0 in the .dat files and so there is an index number discrepancy of one between the two file formats (.mat and .dat).
The MATLAB files contain the absolute raw units. Division by 131.068 gives the physical units in mV.
Even though some pregnancies ended in cesarean, the participant was on occasion already in spontaneous labor. These incidences are explained in the comments sections of the header files.
If a recording was close to birth, then the timing of the recording with regard to the birth is in the comments of the header file.
Our aim is to publish this database as is, without giving the user any detailed directions and thus encouraging open-minded exploitation of the data. Making raw data available is the best way to do this. There are however some pointers about how to make sense of the data which we would like to communicate to the users.
The data is sampled at 200 Hz. The EHG signals are generally assumed to be of very low frequency, from almost dc up to 3 Hz maximum. Decimating the raw signal (after low pass filtering) is advised, before or after creating a bipolar/multipolar signal or other de-noising. This will create more manageable files and it is often better to work with signals when the frequency of the signal to be observed is not as far away from the sampling frequency as in this case. This will also cut down calculation time in any complex analysis. Results have shown that decimation of this signal has little or even a positive effect on the performance of analysis methods20.
Please be aware that there are inherent imprecisions in the synchronization between the EHG recordings and the annotations and tocographs. There can be differences in when participants start to feel a contraction or fetal movements and so differences in when participants notify about events. Events were therefore occasionally approximated to the nearest whole minute. There are also delays internal to the tocodynamometers. Due to factors such as these, the inserted recording times on the tocographs and the annotation times may be up to±30 s from the actual recording or event times.
A user doing intensive work using these signals will have to develop an effective methodology to keep track of the signals, the way they have been pre-treated and the associated clinical parameters. Inspiration on how to organize such work can be sought in the structure of the SQL database framework which we developed31.
WAVE can also be used for viewing and analyzing the signals (www.physionet.org/physiotools/wfdb.shtml#WAVE). Annotations can be opened by using the ‘–a’ option with the wave WFDB application (www.physionet.org/physiotools/wag/wave-1.htm).
The WFDB Software Package can be used to work with the recordings (www.physionet.org/physiotools/wfdb.shtml). For example, the command ‘rdann -r *record name* -a atr -f 0’ can be used to read the annotation files (www.physionet.org/physiotools/wag/rdann-1.htm). The WFDB Toolbox for MATLAB and Octave can also be used to work with the recordings (www.physionet.org/physiotools/matlab/wfdb-app-matlab).
Table 1 is only available in the online version of this paper.
How to cite this article: Alexandersson, A. et al. The Icelandic 16-electrode electrohysterogram database. Sci. Data 2:150017 doi: 10.1038/sdata.2015.17 (2015).
Alexandersson, A., Steingrimsdottir, T., Terrien, J., Marque, C., & Karlsson, B. PhysioNet http://dx.doi.org/10.13026/C2159J (2014)
The Icelandic Centre for Research (RANNIS) supported this work. Óskar Þór Lárusson and Snorri Már Snorrason assisted with the development of the placement guide system. Shirin Najdi and Arnaldo Batista modified the rdann function from the WFDB Toolbox for MATLAB, which enabled the reading of annotation information from this database. The authors of this manuscript had full access to all the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0 Metadata associated with this Data Descriptor is available at http://www.nature.com/sdata/ and is released under the CC0 waiver to maximize reuse.