
Graph Based Multimedia Analysis
- 1st Edition - August 7, 2024
- Imprint: Morgan Kaufmann
- Authors: Ananda S Chowdhury, Abhimanyu Sahu
- Language: English
- Paperback ISBN:9 7 8 - 0 - 4 4 3 - 2 1 4 9 5 - 0
- eBook ISBN:9 7 8 - 0 - 4 4 3 - 2 1 4 8 6 - 8
Graph Based Multimedia Analysis applies concepts from graph theory to the problems of analyzing overabundant video data. Video data can be quite diverse: exocentric (captured by a… Read more

Purchase options

Institutional subscription on ScienceDirect
Request a sales quoteGraph Based Multimedia Analysis applies concepts from graph theory to the problems of analyzing overabundant video data. Video data can be quite diverse: exocentric (captured by a standard camera) or egocentric (captured by a wearable device like Google Glass); of various durations (ranging from a few seconds to several hours); and could be from a single source or multiple sources. Efficient extraction of important information from such a large class of diverse video data can be overwhelming. The book, with its rich repertoire of theoretically elegant solutions, from graph theory in conjunction with deep learning, constrained optimization, and game theory, empowers the audience to achieve tasks like obtaining concise yet useful summaries and precisely recognizing single as well as multiple actions in a computationally efficient manner. The book provides a unique treatise on topics like egocentric video analysis and scalable video processing.
- Addresses a number of challenging state-of-the-art problems in multimedia analysis like summarization, co-summarization, and action recognition
- Handles a wide class of video with different genres, durations, and numbers
- Applies a class of theoretically rich algorithms from the discipline of graph theory, in conjunction with deep learning, constrained optimization and game theory
- Includes thorough complexity analyses of the proposed solutions, and an appendix containing implementable source codes
Researchers and Graduate Students in:- Computer Science/Computer Engineering- Electronics Engineering- Electrical Engineering- Informatics- Multimedia Engineering/Multimedia Systems. Practitioners working in industries focusing on multimedia and video processing
- Cover image
- Title page
- Table of Contents
- Copyright
- Dedication
- List of figures
- List of tables
- Biography
- Ananda S. Chowdhury
- Abhimanyu Sahu
- Foreword
- Preface
- Chapter One: Introduction
- Abstract
- 1.1. Motivation
- 1.2. Chapter organization
- 1.3. Basics of multimedia
- 1.4. Preliminaries of a video
- 1.5. Multimedia problems
- 1.6. Graph based solutions
- 1.7. Other solution models
- 1.8. Organization of the book
- References
- Chapter Two: Theoretical foundations
- Abstract
- 2.1. Motivation
- 2.2. Organization
- 2.3. Graph basics
- 2.4. Delaunay graph
- 2.5. Bipartite graph
- 2.6. Minimum spanning tree
- 2.7. Optimum path forest
- 2.8. Random walks on a graph
- 2.9. Knapsack problems
- 2.10. Elementary game theory
- References
- Chapter Three: Exocentric video summarization
- Abstract
- 3.1. Motivation
- 3.2. Chapter organization
- 3.3. Related works
- 3.4. Method I: Delaunay graph based solutions for exocentric video summarization
- 3.5. Method II: A graph modularity based clustering for exocentric video summarization
- 3.6. Scalable exocentric video summarization with skeleton graph and random walk
- 3.7. Time-complexity analysis
- 3.8. Experimental test bed
- 3.9. Results of Delaunay graph based exocentric video summarization methods
- 3.10. Results of graph modularity based solution
- 3.11. Results of scalable exocentric video summarization
- 3.12. Summary
- References
- Chapter Four: Multi-view exocentric video summarization
- Abstract
- 4.1. Motivation
- 4.2. Chapter organization
- 4.3. Related work
- 4.4. Proposed method
- 4.5. Time-complexity analysis
- 4.6. Experimental results
- 4.7. Summary
- References
- Chapter Five: Egocentric video summarization
- Abstract
- 5.1. Motivation
- 5.2. Chapter organization
- 5.3. Related work
- 5.4. Proposed methods
- 5.5. Time-complexity analysis
- 5.6. Experimental results
- 5.7. Summary
- References
- Chapter Six: Egocentric video cosummarization
- Abstract
- 6.1. Motivation
- 6.2. Chapter organization
- 6.3. Related work
- 6.4. Proposed methods
- 6.5. Time-complexity analysis
- 6.6. Experimental results
- 6.7. Summary
- References
- Chapter Seven: Action recognition in egocentric video
- Abstract
- 7.1. Motivation
- 7.2. Chapter organization
- 7.3. Related work
- 7.4. Proposed method
- 7.5. Time-complexity analysis
- 7.6. Experimental results
- 7.7. Summary
- References
- Chapter Eight: Conclusions
- Abstract
- 8.1. Concluding remarks
- 8.2. Future research directions
- References
- Appendix A: Source codes
- A.1. Organization
- A.2. Source codes – constrained Delaunay graph clustering for exocentric video summarization
- A.3. Source codes – optimum-path forest clustering for multi-view exocentric video summarization
- A.4. Source codes – different graph representations for egocentric video summarization
- A.5. Source codes – deep feature and integer knapsack for egocentric video summarization
- A.6. Source codes – game theoretic approach for egocentric video cosummarization
- A.7. Source codes – random walker on video similarity graph
- Index
- Edition: 1
- Published: August 7, 2024
- Imprint: Morgan Kaufmann
- No. of pages: 370
- Language: English
- Paperback ISBN: 9780443214950
- eBook ISBN: 9780443214868
AS
Ananda S Chowdhury
Ananda S. Chowdhury is a Professor and former Head in the Department of Electronics and Telecommunication Engineering at Jadavpur University, Kolkata, India, where he leads the Imaging, Vision and Pattern Recognition group. He received his Ph.D. degree in Computer Science from The University of Georgia, Athens, GA, USA, and was a Postdoctoral Fellow at National Institutes of Health, Bethesda, MD, USA. His research interests include computer vision, pattern recognition, biomedical image/ signal processing, and multimedia analysis. He is a Senior Member of IEEE, a Member of the International Association for Pattern Recognition Technical Committee (IAPR TC) on Graph based Representations (GbR), and a life member of the Indian Unit for Pattern Recognition and Artificial Intelligence (IUPRAI). He has held invited academic visits to different universities across France, Germany, Norway, Italy, The Netherlands, Singapore and Brazil. Dr. Chowdhury serves/has served on the editorial boards of IEEE Transactions on Image Processing, Pattern Recognition Letters, IEEE Signal Processing Letters, and Springer Nature Computer Science. His Erdös Number is two.
AS
Abhimanyu Sahu
Abhimanyu Sahu is an Assistant Professor in the Department of Computer Science & Engineering at Motilal Nehru National Institute of Technology Allahabad, Prayagraj, India. He received his Ph.D. degree in Computer Engineering from Jadavpur University, Kolkata, India in 2021. He is a life member of ISTE. His current research interests include computer vision, multimedia analysis, and pattern recognition problems. Specifically, he is also interested in exploring different machine learning techniques (self-supervised/unsupervised learning, deep Learning) to solve several challenging problems in multimedia analysis such as summarization and action/object/activity recognition particularly in first-person (Egocentric) videos. He also worked on theoretical aspects of Graph-based modeling of the above fields.