Cholec80-CVS: An open dataset with an evaluation of Strasberg’s critical view of safety for AI

Ríos, Manuel Sebastián; Molina-Rodriguez, María Alejandra; Londoño, Daniella; Guillén, Camilo Andrés; Sierra, Sebastián; Zapata, Felipe; Giraldo, Luis Felipe

doi:10.1038/s41597-023-02073-7

Download PDF

Data Descriptor
Open access
Published: 08 April 2023

Cholec80-CVS: An open dataset with an evaluation of Strasberg’s critical view of safety for AI

Manuel Sebastián Ríos¹,
María Alejandra Molina-Rodriguez²,
Daniella Londoño³,
Camilo Andrés Guillén¹,
Sebastián Sierra³,
Felipe Zapata³ &
…
Luis Felipe Giraldo ORCID: orcid.org/0000-0002-2492-4422⁴

Scientific Data volume 10, Article number: 194 (2023) Cite this article

8190 Accesses
3 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Strasberg’s criteria to detect a critical view of safety is a widely known strategy to reduce bile duct injuries during laparoscopic cholecystectomy. In spite of its popularity and efficiency, recent studies have shown that human miss-identification errors have led to important bile duct injuries occurrence rates. Developing tools based on artificial intelligence that facilitate the identification of a critical view of safety in cholecystectomy surgeries can potentially minimize the risk of such injuries. With this goal in mind, we present Cholec80-CVS, the first open dataset with video annotations of Strasberg’s Critical View of Safety (CVS) criteria. Our dataset contains CVS criteria annotations provided by skilled surgeons for all videos in the well-known Cholec80 open video dataset. We consider that Cholec80-CVS is the first step towards the creation of intelligent systems that can assist humans during laparoscopic cholecystectomy.

TacticAI: an AI assistant for football tactics

Article Open access 19 March 2024

Segment anything in medical images

Article Open access 22 January 2024

Transparent medical image AI via an image–text foundation model grounded in medical literature

Article 16 April 2024

Background & Summary

Laparoscopic cholecystectomy is one of the most common surgeries performed in the world. For example, in the United States, around 20 million people suffer from gallbladder diseases that imply nearly 300,000 cholecystectomy surgeries per year. Despite being considered a low-risk surgery, bile duct injuries (BDIs) have occurred at a constant rate in the last 30 years¹, leaving devastating consequences on the affected patients. Strasberg and colleagues² showed that most BDIs occur due to miss-identification of the bile duct and the cystic duct. This fact encouraged Strasberg to propose a method known as Critical View of Safety (CVS), which provides criteria to correctly identify cystic structures before the complete removal of the gallbladder. However, even though this identification method is commonly used and accepted for surgeons as an important attempt to reduce bile duct miss-identifications³, Way et al. showed that around 97% of the BDI occurrences were due to visual perceptual illusions and, in most of the cases of BDIs, surgeons did not even know what the problem was^4,5. These findings encourage the development of technologies that aid surgeons to accurately identify the cystic structures during surgery and therefore to reduce BDI occurrence rates.

This type of technologies for automatic detection and identification are not new in the context of laparoscopic surgeries. For example, researchers have studied the problems of surgical instrument segmentation and tracking^6,7,8, surgery phase identification⁹ and anatomical structure localization¹⁰. Most of the current work leverage complex pre-trained convolutional neural network architectures and handcrafted databases annotated by experts in the medical field¹¹. In particular, the automatic detection of CVS for BDI prevention has gained recent interest from several researchers. The work of Mascagni and colleagues¹² aimed to detect critical views of safety on still images carefully annotated and selected by skilled surgeons. They used binary annotations of the occurrence of CVS criteria and anatomical segmentation annotations, and trained a deep neural network using both types of annotations to predict CVS occurrence. Tokuyasu et al.¹³ tackled the problem in an object detection setting by detecting four important anatomical landmarks: the common bile duct, the cystic duct, the lower edge of the left medial liver segment, and the Rouviere’s sulcus. The correct identification of these four landmarks has proven to help surgeons to avoid bile duct injuries. In this case, the authors used the fully-convolutional architecture YOLO and bounding box-like annotations. On the other hand, Madani et al.¹⁴ addressed the problem of identifying safe and dangerous zones of dissection as well as anatomical landmarks during laparoscopic cholecystectomy as a segmentation problem providing real-time intra-operative guidance.

Despite the above mentioned studies and the effort of Mascagni and colleagues in proposing a reproducible method for objective video reporting of CVS in laparoscopic videos¹⁵, there are not publicly available datasets with annotations of the CVS criteria that can be used by researchers interested in addressing the problem of automatic CVS identification. Annotating videos based on CVS criteria on videos of laparoscopic cholecysectomy procedures is a challenging task that is time consuming and must be done only by skilled surgeons. In this data descriptor, we introduce Cholec80-CVS, the first open dataset with annotations based on CVS criteria on all the videos in the well-known Cholec80 video dataset¹¹. Cholec80 is a popular open dataset that contains 80 video recordings of cholecystectomies performed by 13 surgeons and that has been widely used to study artificial intelligence-based models for real time analysis in surgeries.

Methods

Strasberg’s criteria in the critical view of safety

Strasberg originally proposed three main criteria over a given still image that allow the surgeon to determine if it is safe to proceed with the clipping and cutting process for gallbladder removal, and thus avoid bile duct injuries¹⁶. Strasberg’s method, known as Critical View of Safety, is based on three criteria: first, only two structures can be clearly seen connected to the gallbladder; second, the lower one third of the gallbladder is separated from the liver to expose the cystic plate; third, the hepatocystic triangle must be completely clear of tissue allowing proper visibility of all cystic structures.

Cholec80 dataset

Cholec80 dataset contains 80 high-quality videos of gallbladder laparoscopic surgeries. It was constructed by researchers from the research group CAMMA at University of Strasbourg, France. Videos in Cholec80 were recorded at 25 frames per second, and all videos were recorded for different patients, surgeons, and light conditions, facilitating the application of learning methods for surgery analysis. Additionally, each video was labeled frame by frame according to the surgical phase and the surgical instruments in the scene. Cholec80 has been widely used in recent research^8,17,18,19, establishing it as one of the most popular benchmarks for research in laparoscopic cholecystectomy.

Cholec80-CVS: annotations of CVS criteria

Our annotators strictly followed a scoring system to ensure the creation of a high quality dataset. This system was originally proposed by Sanford’s and Strasberg’s¹⁶ and latter used by Mascagni and colleagues¹⁵ to annotate Strasberg’s CVS on still images. However, we extended it by adding a new set of complementary rules to reduce the uncertainty during the annotation process.

For each video in Cholec80, we analyzed the preparation and Calot’s triangle dissection phases. Our annotators carefully selected video segments where at least one of the non-zero Strasberg criteria was satisfied. Then, a score of one or two was assigned to the video segment following the scoring system proposed by Sanford¹⁶ and a complementary set of rules for each of the criteria proposed by our annotators as described in Tables 1–3. The annotators only considered video segments where the presence of the criteria was consistent for at least three seconds, allowing for the presence noise induced by occlusions or abrupt camera movements during this time lapse. We consider that all frames contained in the annotated video segment share the same annotation. Therefore, it is possible to get annotations for each frame. Those frames with a score of 0 are denoted as negative examples, while the other ones are considered as positive examples. We compute the total score of an image by adding up the individual scores of each criterion. A view of safety is achieved when the total score is equal to or greater than 5.

Table 1 Set of rules used to produce consistent and reliable annotations for the two structures criteria.

Full size table

Table 2 Set of rules used to produce consistent and reliable annotations for the cystic plate criteria.

Full size table

Table 3 Set of rules used to produce consistent and reliable annotations for the Hepatocystic triangle criteria.

Full size table

We are aware that annotating time-lapses may induce noise, given that low quality frames can occur within an annotated video segment. For instance, brief occlusions and blurred or over exposed images may occur due to the abrupt camera movements that are common in laparoscopic cholecystectomy. However, we consider that Cholec80-CSV is the first step towards the creation of intelligent systems that can assist humans during surgeries. Therefore, it is crucial to model noisy input data to build robust systems in real scenarios.

Data Records

Cholec80 Videos can be obtained from the University of Strasbourg’s research group CAMMA official web site http://camma.u-strasbg.fr/datasets. On the other hand, our annotations are publicly available as a XLSX file²⁰. Each row in this file details the initial and final frame of a video segment where our annotators detected any of the Strasberg’s criteria as well as the score assigned to it. All video segments that are not in the provided dataset are considered to have a 0 score for all criteria. Table 4 describes each data column in our database.

Table 4 Description of each data column present on Cholec80-CVS database.

Full size table

Technical Validation

Experience of the annotators

Our annotators were highly skilled surgeons: one of them with a record of approximately 700 successful cholecystectomies performed, while the second has performed about 470 successful cholecystectomies. Each of the surgeons was paired with one surgical resident. Even though residents had prior experience in laparoscopic cholecystectomy, they also received additional training from the experienced surgeons. It consisted of weekly meetings before starting the annotation process. During these meetings, the annotators carefully studied the original set of rules proposed by Sanford and Strasberg to detect CVS. Additionally, the annotators proposed a complementary set of rules shown in Tables 1–3 seeking consistency in the annotations. Each team evaluated 40 videos independently. When one of the teams considered their annotation unreliable, the final score was defined after a discussion between the four annotators.

Analysis of the annotations

In contrast to the methodology proposed by Mascagni¹⁵ where only the last 60 seconds before clipping and cutting phase were annotated, we analyzed the entire portion of the video prior the clipping and cutting phase. Figure 2 shows the relative occurrence of the Strasberg’s criteria across all 80 videos. The horizontal axis corresponds to a normalized time line, where 0 is the beginning of the surgery and 100 is the instant when clipping and cutting is performed. The vertical axis indicates the total number of frames across all 80 videos that contain each Strasberg’s criterion as explained in the methods section. Note that all criteria are heavily concentrated in the last quarter of the process, particularly for a score of two. However, a considerable large amount of frames annotated with a score of one occur in the second half of the analyzed phases. The latter suggest that analyzing a small video segment of 60 seconds before clipping and cutting can potentially lead to a considerable loss of important data samples.

It is evident the high imbalance between scores in the dataset. Given that most of the time a view of safety is achieved just for a couple of seconds before the gallbladder dissection, frames tagged with scores of two are very scarce. This behavior is consistent with the dynamics of a typical laparoscopic cholecystectomy and the methodology followed in the work of Mascagni and colleagues¹⁵.

Figure 3 shows the low occurrence rates of these frames for each of the Strasberg criteria, indicating a strong data imbalance between the frames annotated with score of zero and all the other frames. It is also evident an imbalance between classes, being the cystic plate criteria a clear example of an under represented class. Given that Cholec80-CVS provides annotations of the starting and ending seconds where non-zero scores are present instead of frame-wise annotations, abrupt camera movements or partial camera occlusion may occur briefly during the annotated time frame, inducing noise to the database since they can be annotated as positive samples.

Usage Notes

To ease the use of Cholec80-CVS we provide public access to both the database and all the code related to this work. The database is publicly available in XLSX format²⁰. Cholec80-CVS can be used for researchers to assess novel image classification architectures to automatically detect CVS criteria. We also encourage researchers to contribute to enriching the current dataset either by refining the current version or by adding new samples.

Limitations

Cholec80-CVS dataset has some important limitations. First, abrupt camera movements are very common, creating blurry low quality frames. Some of them may occur within the time-lapse annotated by the surgeons, producing noisy samples. Additionally, due to the nature of the surgery, in some frames a criteria may be briefly occluded by actions performed by the surgeon, but the frame will still be considered as a positive sample since it occurs within the annotated time-lapse. This behavior suggests possible research directions, for instance, implementing video analysis techniques. Analyzing multiple subsequent frames may reduce the negative impact of these noisy samples as well as reduce the probability of miss-classify images due to temporal occlusions.

The second limitation of Cholec80-CVS is the high data imbalance due to the nature of the surgery. As it is shown in Fig. 3, there is an significant imbalance between different criteria. Moreover, there is also an important imbalance between the annotations of the same criterion, as in the case of the hepatocystic triangle. In general, positive samples have very low occurrence rates, and even in some Cholec80 videos the CVS is never achieved.

Code availability

We provide scripts to transform our annotations to the frame-wise labels and also the source code of some baseline models that use standard deep learning techniques to detect CVS criteria using our database for interested users. All these scripts were coded using Python 3.8.11 and Pytorch as the machine learning framework. All scripts were tested on Linux Machines. The repository README file contains detailed instructions to ease the use of the repository and brief descriptions of all files. The code is publicly available at https://github.com/ManuelRios18/CHOLEC80-CVS-PUBLIC, licensed under MIT OpenSource license. Therefore, permission is granted free of charge to copy and use this software and its associated files.

References

Björn, T., Cecilia, S., Gunnar, P. & Magnus, N. Effect of intended intraoperative cholangiography and early detection of bile duct injury on survival after cholecystectomy: Population based cohort study. BMJ (Clinical research ed.) 345, e6457, https://doi.org/10.1136/bmj.e6457 (2012).
Article Google Scholar
Strasberg, S. M., Hertl, M. & Soper, N. J. An analysis of the problem of biliary injury during laparoscopic cholecystectomy. Journal of the American College of Surgeons (1995).
Strasberg, S. M. & Brunt, L. M. Rationale and use of the critical view of safety in laparoscopic cholecystectomy. Journal of the American College of Surgeons 211, 132–138, https://doi.org/10.1016/j.jamcollsurg.2010.02.053 (2010).
Article PubMed Google Scholar
Way, L. W. et al. Causes and prevention of laparoscopic bile duct injuries: analysis of 252 cases from a human factors and cognitive psychology perspective. Annals of surgery https://doi.org/10.1097/01.SLA.0000060680.92690.E9 (2003).
Article PubMed PubMed Central Google Scholar
Sierra, S., Zapata, F., Méndez, M., Portillo, S. & Restrepo, C. Colecistectomía subtotal: una alternativa en el manejo de la colecistectomía difícil. Revista Colombiana de Cirugia 35, 593–600, https://doi.org/10.30944/20117582.565 (2020).
Article Google Scholar
González, C., Bravo-Sánchez, L. & Arbelaez, P. Isinet: An instance-based approach for surgical instrument segmentation. In:, et al. Medical Image Computing and Computer Assisted Intervention – MICCAI 2020. MICCAI 2020. Lecture Notes in Computer Science, vol 12263. Springer, Cham. https://doi.org/10.1007/978-3-030-59716-0_57 (2020).
Allan, M. et al. 2017 robotic instrument segmentation challenge Preprint at https://arxiv.org/abs/1902.06426 (2019).
Nwoye, C. I., Mutter, D., Marescaux, J. & Padoy, N. Weakly supervised convolutional lstm approach for tool tracking in laparoscopic videos. Nwoye, C.I., Mutter, D., Marescaux, J. et al. Weakly supervised convolutional LSTM approach for tool tracking in laparoscopic videos. Int J CARS 14, 1059–1067. https://doi.org/10.1007/s11548-019-01958-6 (2019).
Hashimoto, D. A. et al. Computer vision analysis of intraoperative video: Automated recognition of operative steps in laparoscopic sleeve gastrectomy. Ann Surg https://doi.org/10.1097/SLA.0000000000003460 (2019).
Article PubMed Google Scholar
Madad Zadeh, S. et al. Surgai: deep learning for computerized laparoscopic image understanding in gynaecology. Surgical Endoscopy 34, 5377–5383, https://doi.org/10.1007/s00464-019-07330-8 (2020).
Article PubMed Google Scholar
Twinanda, A. P. et al. Endonet: A deep architecture for recognition tasks on laparoscopic videos. IEEE Transactions on Medical Imaging, vol. 36, no. 1, pp. 86-97, Jan. 2017, https://doi.org/10.1109/TMI.2016.2593957. (2016).
Mascagni, P. et al. Artificial Intelligence for Surgical Safety: Automatic Assessment of the Critical View of Safety in Laparoscopic Cholecystectomy Using Deep Learning. Ann Surg https://doi.org/10.1097/SLA.0000000000004351 (2020).
Article PubMed Google Scholar
Tokuyasu, T. et al. Development of an artificial intelligence system using deep learning to indicate anatomical landmarks during laparoscopic cholecystectomy. Surg Endosc 35, 1651–1658, https://doi.org/10.1007/s00464-020-07548-x (2021).
Article PubMed Google Scholar
Madani, A. et al. Artificial intelligence for intraoperative guidance: Using semantic segmentation to identify surgical anatomy during laparoscopic cholecystectomy. Annals of Surgery https://doi.org/10.1097/SLA.0000000000004594 (2022).
Article PubMed Google Scholar
Mascagni, P. et al. Formalizing video documentation of the Critical View of Safety in laparoscopic cholecystectomy: a step towards artificial intelligence assistance to improve surgical safety. Surg Endosc 34, 2709–2714, https://doi.org/10.1007/s00464-019-07149-3 (2020).
Article PubMed Google Scholar
Sanford, D. & Strasberg, S. A simple effective method for generation of a permanent record of the critical view of safety during laparoscopic cholecystectomy by intraoperative “doublet” photography. Journal of the American College of Surgeons 218, 170–8, https://doi.org/10.1016/j.jamcollsurg.2013.11.003 (2014).
Article PubMed Google Scholar
Chen, W., Feng, J., Lu, J. & Zhou, J. Endo3d: Online workflow analysis for endoscopic surgeries based on 3d cnn and lstm. In Stoyanov, D. et al. (eds.) OR 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis, 97–107, https://doi.org/10.1007/978-3-030-01201-4_12 (Springer International Publishing, Cham, 2018).
Hong, W. et al. Cholecseg8k: A semantic segmentation dataset for laparoscopic cholecystectomy based on cholec80. Preprint at https://arxiv.org/abs/2012.12453 (2020).
Shi, P., Zhao, Z., Hu, S. & Chang, F. Real-time surgical tool detection in minimally invasive surgery based on attention-guided convolutional neural network. IEEE Access 8, 228853–228862, https://doi.org/10.1109/access.2020.3046258 (2020).
Article Google Scholar
Rios, M. et al. Cholec80-cvs: An open dataset with an evaluation of strasberg’s critical view of safety for AI, Figshare, https://doi.org/10.6084/m9.figshare.c.5880458.v1 (2023).

Download references

Acknowledgements

We thank Google DeepMind for partially funding this project through the scholarship programme. We also thank Universidad de los Andes, Universidad CES, and Proelium for joining efforts to make this research possible, and Professor Pablo Arbelaez for his useful discussions.

Author information

Authors and Affiliations

Department of Electric and Electronic Engineering, Universidad de Los Andes, Bogotá D.C., Colombia
Manuel Sebastián Ríos & Camilo Andrés Guillén
Proelium SAS, Bogotá D.C., Colombia
María Alejandra Molina-Rodriguez
Department of General Surgery, Universidad CES, Medellín, Colombia
Daniella Londoño, Sebastián Sierra & Felipe Zapata
Department of Biomedical Engineering, Universidad de Los Andes, Bogotá D.C., Colombia
Luis Felipe Giraldo

Authors

Manuel Sebastián Ríos
View author publications
You can also search for this author in PubMed Google Scholar
María Alejandra Molina-Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Daniella Londoño
View author publications
You can also search for this author in PubMed Google Scholar
Camilo Andrés Guillén
View author publications
You can also search for this author in PubMed Google Scholar
Sebastián Sierra
View author publications
You can also search for this author in PubMed Google Scholar
Felipe Zapata
View author publications
You can also search for this author in PubMed Google Scholar
Luis Felipe Giraldo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M. Molina, D. Londoño, S. Sierra, and F. Zapata provided the dataset annotations and guided the research with their medical expertise. M. Ríos, C. Guillén, and L.F. Giraldo helped the surgeons to design the annotation process, evaluated the potential applications for AI, and wrote the manuscript. All authors reviewed the manuscript before submission.

Corresponding author

Correspondence to Luis Felipe Giraldo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ríos, M.S., Molina-Rodriguez, M.A., Londoño, D. et al. Cholec80-CVS: An open dataset with an evaluation of Strasberg’s critical view of safety for AI. Sci Data 10, 194 (2023). https://doi.org/10.1038/s41597-023-02073-7

Download citation

Received: 09 March 2022
Accepted: 15 March 2023
Published: 08 April 2023
DOI: https://doi.org/10.1038/s41597-023-02073-7