Towards Better Laparoscopic Video Database Organization by Automatic Surgery Classification

Twinanda, Andru P.; Marescaux, Jacques; De Mathelin, Michel; Padoy, Nicolas

doi:10.1007/978-3-319-07521-1_20

Towards Better Laparoscopic Video Database Organization by Automatic Surgery Classification

Andru P. Twinanda²⁰,
Jacques Marescaux²¹,
Michel De Mathelin²⁰ &
…
Nicolas Padoy²⁰

Conference paper

1713 Accesses
8 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8498))

Abstract

Minimally invasive surgery is an important breakthrough in the domain of medicine. Not only does it improve the quality of surgery, but the underlying digitization also provides invaluable information that opens up many possibilities for teaching, assistance during difficult cases, and quality evaluation. For instance, with a well-organized database, professors are one click away from showing and comparing various surgical procedures in their classes; surgeons can also retrieve and observe a video segment of a specific surgical task performed by another surgeon in varying conditions. However, to the best of our knowledge, database organization is done manually by experts. Considering the large number of surgical videos recorded, manual annotation is a tedious task. In this paper, we take the first step towards automatic surgical database organization by introducing the laparoscopic video classification problem, which consists of automatically identifying the type of abdominal surgery performed in a video. In spite of the visual challenges of such videos, such as blank frames, rapid movement, and sometimes incomplete recording, we show that we can rely on visual features alone to classify the videos with high accuracy. We use kernel Support Vector Machines (SVMs) for this classification task and compare their performance on different types of visual features. We also show that the result can be improved by combining the visual features using Multiple Kernel Learning approach. The classification pipeline demonstrates a classification accuracy of 91.39% on a database of 151 abdominal videos totaling over 200 hours of 8 different kinds of surgeries performed by 10 surgeons.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

WebSurg: the e-surgical reference, http://www.websurg.com/ (last access: November 21, 2013)
Atasoy, S., Mateus, D., Meining, A., Yang, G.Z., Navab, N.: Endoscopic video manifolds for targeted optical biopsy. IEEE Trans. Med. Imaging 31(3), 637–653 (2012)
Article Google Scholar
Munzer, B., Schoeffmann, K., Boszormenyi, L.: Relevance segmentation of laparoscopic videos. In: IEEE International Symposium on Multimedia, pp. 84–91 (2013)
Google Scholar
Zappella, L., Bejar, B., Hager, G., Vidal, R.: Surgical gesture classification from video and kinematic data. Medical Image Analysis 17(7), 732–745 (2013)
Article Google Scholar
Padoy, N., Blum, T., Ahmadi, S.A., Feussner, H., Berger, M.O., Navab, N.: Statistical modeling and recognition of surgical workflow. Medical Image Analysis 16(3), 632–641 (2012)
Article Google Scholar
Blum, T., Feußner, H., Navab, N.: Modeling and segmentation of surgical workflow from laparoscopic video. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part III. LNCS, vol. 6363, pp. 400–407. Springer, Heidelberg (2010)
Chapter Google Scholar
Lalys, F., Riffaud, L., Bouget, D., Jannin, P.: A framework for the recognition of high-level surgical tasks from video images for cataract surgeries. IEEE Trans. Biomed. Engineering 59(4), 966–976 (2012)
Article Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)
Article Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: International Conference on Computer Vision & Pattern Recognition, pp. 886–893 (2005)
Google Scholar
Aharon, M., Elad, M., Bruckstein, A.: K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation. IEEE Transactions on Signal Processing 54(11), 4311–4322 (2006)
Article Google Scholar
Gonen, M., Alpaydin, E.: Multiple kernel learning algorithms. Journal of Machine Learning Research 12, 2211–2268 (2011)
MathSciNet Google Scholar
Vedaldi, A., Fulkerson, B.: Vlfeat: An open and portable library of computer vision algorithms. In: Proceedings of the International Conference on Multimedia, pp. 1469–1472. ACM (2010)
Google Scholar
Varma, M., Babu, R.B.: More generality in efficient multiple kernel learning. In: Proceedings of the International Conference on Machine Learning, pp. 1065–1072. ACM (2009)
Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: International Conference on Computer Vision & Pattern Recognition, pp. 1794–1801. IEEE (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

ICube, University of Strasbourg, CNRS, France
Andru P. Twinanda, Michel De Mathelin & Nicolas Padoy
IRCAD & University Hospital of Strasbourg, France
Jacques Marescaux

Authors

Andru P. Twinanda
View author publications
You can also search for this author in PubMed Google Scholar
Jacques Marescaux
View author publications
You can also search for this author in PubMed Google Scholar
Michel De Mathelin
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Padoy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Centre for Medical Image Computing and Department of Computer Science, University College London, United Kingdom
Danail Stoyanov
Montreal Neurological Institute, McGill University, Montreal, Canada
D. Louis Collins
Graduate School of Engineering, The University of Tokyo, Japan
Ichiro Sakuma
Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, BC, Canada
Purang Abolmaesumi
Inserm/Université de Rennes 1, 2, Avenue du Pr. Léon Bernard, CS 34317, 35043, Rennes, France
Pierre Jannin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Twinanda, A.P., Marescaux, J., De Mathelin, M., Padoy, N. (2014). Towards Better Laparoscopic Video Database Organization by Automatic Surgery Classification. In: Stoyanov, D., Collins, D.L., Sakuma, I., Abolmaesumi, P., Jannin, P. (eds) Information Processing in Computer-Assisted Interventions. IPCAI 2014. Lecture Notes in Computer Science, vol 8498. Springer, Cham. https://doi.org/10.1007/978-3-319-07521-1_20

Download citation

DOI: https://doi.org/10.1007/978-3-319-07521-1_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07520-4
Online ISBN: 978-3-319-07521-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics