• English
    • العربية
  • العربية
  • Login
  • QU
  • QU Library
  •  Home
  • Communities & Collections
  • Copyrights
View Item 
  •   Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Faculty Contributions
  • College of Engineering
  • Computer Science & Engineering
  • View Item
  • Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Faculty Contributions
  • College of Engineering
  • Computer Science & Engineering
  • View Item
  •      
  •  
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Context-Aware Offensive Meme Detection: A Multi-Modal Zero-Shot Approach with Caption-Enhanced Classification

    View/Open
    Context-Aware_Offensive_Meme_Detection_A_Multi-Modal_Zero-Shot_Approach_with_Caption-Enhanced_Classification.pdf (1015.Kb)
    Date
    2024
    Author
    Abdullakutty, Faseela
    Al-Maadeed, Somaya
    Naseem, Usman
    Metadata
    Show full item record
    Abstract
    Detecting offensive content in memes is a pressing issue, particularly as harmful and toxic materials proliferate on social media platforms. Conventional approaches to offensive meme detection typically focus on analyzing either the visual or textual components in isolation, often missing the nuanced context that arises from the interaction between different modalities. This paper presents an advanced multi-modal zero-shot classification method for offensive meme detection, utilizing large language models (LLMs) alongside image captions generated by the BLIP model. These captions provide crucial contextual information, improving the detection of offensive content, especially in cases where the meme's text or image alone may be insufficient to convey the full meaning. By integrating these captions into the classification prompt, the proposed method offers a more detailed and accurate analysis of meme content. Additionally, the use of Chain-of-Thought (CoT) prompting enhances the reasoning capabilities of the LLMs, enabling a deeper understanding of the relationship between text, images, and captions. Experimental evaluations on the GOAT-Bench and Memotion 2 datasets demonstrate that this approach consistently surpasses traditional methods that omit image captions, highlighting its efficacy in improving the precision and robustness of offensive meme classification across multiple modalities.
    DOI/handle
    http://dx.doi.org/10.1109/ICDMW65004.2024.00025
    http://hdl.handle.net/10576/68969
    Collections
    • Computer Science & Engineering [‎2518‎ items ]

    entitlement


    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us
    Contact Us | QU

     

     

    Home

    Submit your QU affiliated work

    Browse

    All of Digital Hub
      Communities & Collections Publication Date Author Title Subject Type Language Publisher
    This Collection
      Publication Date Author Title Subject Type Language Publisher

    My Account

    Login

    Statistics

    View Usage Statistics

    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us
    Contact Us | QU

     

     

    Video