Skip to main content

Towards Better Laparoscopic Video Database Organization by Automatic Surgery Classification

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8498))

Abstract

Minimally invasive surgery is an important breakthrough in the domain of medicine. Not only does it improve the quality of surgery, but the underlying digitization also provides invaluable information that opens up many possibilities for teaching, assistance during difficult cases, and quality evaluation. For instance, with a well-organized database, professors are one click away from showing and comparing various surgical procedures in their classes; surgeons can also retrieve and observe a video segment of a specific surgical task performed by another surgeon in varying conditions. However, to the best of our knowledge, database organization is done manually by experts. Considering the large number of surgical videos recorded, manual annotation is a tedious task. In this paper, we take the first step towards automatic surgical database organization by introducing the laparoscopic video classification problem, which consists of automatically identifying the type of abdominal surgery performed in a video. In spite of the visual challenges of such videos, such as blank frames, rapid movement, and sometimes incomplete recording, we show that we can rely on visual features alone to classify the videos with high accuracy. We use kernel Support Vector Machines (SVMs) for this classification task and compare their performance on different types of visual features. We also show that the result can be improved by combining the visual features using Multiple Kernel Learning approach. The classification pipeline demonstrates a classification accuracy of 91.39% on a database of 151 abdominal videos totaling over 200 hours of 8 different kinds of surgeries performed by 10 surgeons.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. WebSurg: the e-surgical reference, http://www.websurg.com/ (last access: November 21, 2013)

  2. Atasoy, S., Mateus, D., Meining, A., Yang, G.Z., Navab, N.: Endoscopic video manifolds for targeted optical biopsy. IEEE Trans. Med. Imaging 31(3), 637–653 (2012)

    Article  Google Scholar 

  3. Munzer, B., Schoeffmann, K., Boszormenyi, L.: Relevance segmentation of laparoscopic videos. In: IEEE International Symposium on Multimedia, pp. 84–91 (2013)

    Google Scholar 

  4. Zappella, L., Bejar, B., Hager, G., Vidal, R.: Surgical gesture classification from video and kinematic data. Medical Image Analysis 17(7), 732–745 (2013)

    Article  Google Scholar 

  5. Padoy, N., Blum, T., Ahmadi, S.A., Feussner, H., Berger, M.O., Navab, N.: Statistical modeling and recognition of surgical workflow. Medical Image Analysis 16(3), 632–641 (2012)

    Article  Google Scholar 

  6. Blum, T., Feußner, H., Navab, N.: Modeling and segmentation of surgical workflow from laparoscopic video. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part III. LNCS, vol. 6363, pp. 400–407. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  7. Lalys, F., Riffaud, L., Bouget, D., Jannin, P.: A framework for the recognition of high-level surgical tasks from video images for cataract surgeries. IEEE Trans. Biomed. Engineering 59(4), 966–976 (2012)

    Article  Google Scholar 

  8. Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60, 91–110 (2004)

    Article  Google Scholar 

  9. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: International Conference on Computer Vision & Pattern Recognition, pp. 886–893 (2005)

    Google Scholar 

  10. Aharon, M., Elad, M., Bruckstein, A.: K-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation. IEEE Transactions on Signal Processing 54(11), 4311–4322 (2006)

    Article  Google Scholar 

  11. Gonen, M., Alpaydin, E.: Multiple kernel learning algorithms. Journal of Machine Learning Research 12, 2211–2268 (2011)

    MathSciNet  Google Scholar 

  12. Vedaldi, A., Fulkerson, B.: Vlfeat: An open and portable library of computer vision algorithms. In: Proceedings of the International Conference on Multimedia, pp. 1469–1472. ACM (2010)

    Google Scholar 

  13. Varma, M., Babu, R.B.: More generality in efficient multiple kernel learning. In: Proceedings of the International Conference on Machine Learning, pp. 1065–1072. ACM (2009)

    Google Scholar 

  14. Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: International Conference on Computer Vision & Pattern Recognition, pp. 1794–1801. IEEE (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Twinanda, A.P., Marescaux, J., De Mathelin, M., Padoy, N. (2014). Towards Better Laparoscopic Video Database Organization by Automatic Surgery Classification. In: Stoyanov, D., Collins, D.L., Sakuma, I., Abolmaesumi, P., Jannin, P. (eds) Information Processing in Computer-Assisted Interventions. IPCAI 2014. Lecture Notes in Computer Science, vol 8498. Springer, Cham. https://doi.org/10.1007/978-3-319-07521-1_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07521-1_20

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07520-4

  • Online ISBN: 978-3-319-07521-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics