中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/95277
English  |  正體中文  |  简体中文  |  Items with full text/Total items : 80990/80990 (100%)
Visitors : 41992880      Online Users : 1529
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
Scope Tips:
  • please add "double quotation mark" for query phrases to get precise results
  • please goto advance search for comprehansive author search
  • Adv. Search
    HomeLoginUploadHelpAboutAdminister Goto mobile version


    Please use this identifier to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/95277


    Title: Combining uncertainty modeling and temporal-channel network with CLIP model for weakly supervised video anomaly detection
    Authors: 江毓晴;Jiang, Yu-Qing
    Contributors: 軟體工程研究所
    Keywords: 弱監督式學習;影片異常檢測;weakly supervised learning;video anomaly detection
    Date: 2024-07-23
    Issue Date: 2024-10-09 16:37:10 (UTC+8)
    Publisher: 國立中央大學
    Abstract: 為了確保公共安全和保護個人財產,監視攝影機被廣泛設置在各種公共場所、公司以及住宅,用於記錄違法和異常活動。然而,異常事件通常只佔整部監視影片的一小部分。因此,影片異常檢測至關重要,因為它的目的是區分異常事件和正常事件,並找到這些異常發生的確切時間。近年來,視覺語言模型(VLM)在各種影像相關任務中取得了巨大成功。許多研究已將 VLM 的應用擴展到各種影片任務中,包括弱監督式影片異常檢測。我們將視覺語言模型與多尺度時序Transformer、通道注意力機制和不確定性建模策略結合,以捕捉更多判別性特徵並更有效地分離異常事件與正常事件。實驗結果表明,對於 UCF-Crime 和 XD-Violence 資料集中的大多數類別,我們的方法在弱監督式影片異常檢測方面優於目前最先進的模型。;To ensure public safety and protect private property, surveillance cameras are widely deployed in various public spaces, companies, and residences to record illegal and anomalous activities. However, abnormal events typically account for only a small fraction of the total surveillance footage. Therefore, video anomaly detection is crucial, as it aims to distinguish abnormal events from normal events and find the exact time of these anomalies. In recent years, Vision-Language Models (VLMs) have achieved significant success in various image-related tasks. Many studies have extended the application of VLMs to video-level tasks, including weakly supervised video anomaly detection. We integrate VLM with multi-scale temporal transformer, channel attention mechanism, and uncertainty modeling strategy to capture more discriminative features and more effectively distinguish abnormal events from normal events. Experimental results show that our method outperforms current state-of-the-art models in weakly supervised video anomaly detection for most of the categories in the UCF-Crime and XD-Violence datasets.
    Appears in Collections:[Software Engineer] Electronic Thesis & Dissertation

    Files in This Item:

    File Description SizeFormat
    index.html0KbHTML33View/Open


    All items in NCUIR are protected by copyright, with all rights reserved.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明