Please use this identifier to cite or link to this item: http://localhost:8081/jspui/handle/123456789/18188
Full metadata record
DC FieldValueLanguage
dc.contributor.authorV. K., VIVEKRAJ-
dc.date.accessioned2025-08-27T11:21:39Z-
dc.date.available2025-08-27T11:21:39Z-
dc.date.issued2021-02-
dc.identifier.urihttp://localhost:8081/jspui/handle/123456789/18188-
dc.guideSen, Debashis and Raman, Balasubramanianen_US
dc.description.abstractVideo skimming is the process of generating an abridged version (video summary) of a given user video by including only the significant contents of the video. Such a video summary helps in conveying the intended message of the video in a significantly shorter time. This thesis contributes in the development of various components of an autonomous video skimming system, focusing on the taxonomy to organize the existing literature on video skimming; different novel frameworks to perform video skimming through importance ranking; and new quantitative measures to evaluate and carry out comparative analysis. Firstly, a comprehensive literature survey is performed using a novel taxonomy for categorizing the existing literature on user video skimming. User videos are unstructured videos that capture interesting events in day-to-day activities. The review elicits the research area’s evolution demonstrating the key developments, and gives insight into possible future directions. Further, it brings out the typical video skimming framework generally preferred by various video skimming algorithms, processing paradigms, the variety of evaluation strategies, and the datasets in vogue. Secondly, a video segment ranking algorithm is proposed for user video skimming that works by considering the relationship between a representative video segment and all other video segments, where the concept of R-ordering of vectors is employed to find the representative video segment. The video segments are grouped based on the number of components in their feature vector greater /smaller than the representative feature vector. This feature component based comparison and grouping preserves their individual contributions during the ranking process. This ranking process is further enhanced by designing the provision to allocate different priorities to the feature components using linear and Gaussian process regression. The priorities are learned using a set of frames and their corresponding importance scores as indicated in the ground truth summaries. Thirdly, a fusion framework is provided for generating user video summaries utilizing different modalities like audio and video. Our ranking algorithm is used to order the video segments according to their importance derived from the chosen audio and visual features separately. A strategy is then proposed to combine these rankings corresponding to all the modalities. Finally, a novel measure to evaluate a dynamic video summary against multiple reference summaries is proposed. To this end, concepts of granular computing are leveraged to theoretically deduce the measure, which captures the inherent (dis)agreement between the multiple references and the resulting clustering tendency. Along with a few attributes of the proposed evaluation measure, it is theoretically shown that the popularly used average F-measure is a special case of the novel measure, which is referred to as the granule-aware F-measure.en_US
dc.language.isoenen_US
dc.publisherIIT, Roorkeeen_US
dc.subjectVideo Skimming /Dynamic video summarization, User videos, R-ordering, Regression, Multi-modal Rank Fusion, Multi-reference evaluation, F-measure, Clustering, Segmentation, Feature components.en_US
dc.titleAUTONOMOUS VIDEO SKIMMING: TAXONOMY, FRAMEWORKS AND QUANTITATIVE EVALUATIONen_US
dc.typeThesisen_US
Appears in Collections:DOCTORAL THESES (CSE)

Files in This Item:
File Description SizeFormat 
VIVEKRAJ V K 14911025.pdf8.13 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.