• English
    • العربية
  • العربية
  • Login
  • QU
  • QU Library
  •  Home
  • Communities & Collections
  • Help
    • Item Submission
    • Publisher policies
    • User guides
    • FAQs
  • About QSpace
    • Vision & Mission
View Item 
  •   Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Faculty Contributions
  • College of Engineering
  • Electrical Engineering
  • View Item
  • Qatar University Digital Hub
  • Qatar University Institutional Repository
  • Academic
  • Faculty Contributions
  • College of Engineering
  • Electrical Engineering
  • View Item
  •      
  •  
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-Organized Operational Layer

    Thumbnail
    Date
    2021
    Author
    Soltanian M.
    Malik J.
    Raitoharju J.
    Iosifidis A.
    Kiranyaz, Mustafa Serkan
    Gabbouj M.
    ...show more authors ...show less authors
    Metadata
    Show full item record
    Abstract
    Automatic classification of speech commands has revolutionized human computer interactions in robotic applications. However, employed recognition models usually follow the methodology of deep learning with complicated networks which are memory and energy hungry. So, there is a need to either squeeze these complicated models or use more efficient lightweight models in order to be able to implement the resulting classifiers on embedded devices. In this paper, we pick the second approach and propose a network layer to enhance the speech command recognition capability of a lightweight network and demonstrate the result via experiments. The employed method borrows the ideas of Taylor expansion and quadratic forms to construct a better representation of features in both input and hidden layers. This richer representation results in recognition accuracy improvement as shown by extensive experiments on Google speech commands (GSC) and synthetic speech commands (SSC) datasets.
    URI
    https://www.scopus.com/inward/record.uri?eid=2-s2.0-85116504348&doi=10.1109%2fIJCNN52387.2021.9534232&partnerID=40&md5=4fa7c2bfcd2d285a8af2fff44bdc753a
    DOI/handle
    http://dx.doi.org/10.1109/IJCNN52387.2021.9534232
    http://hdl.handle.net/10576/30587
    Collections
    • Electrical Engineering [‎2821‎ items ]

    entitlement

    Related items

    Showing items related by title, author, creator and subject.

    • Thumbnail

      Generative emotional AI for speech emotion recognition: The case for synthetic emotional speech augmentation 

      Latif, Siddique; Shahid, Abdullah; Qadir, Junaid ( Elsevier , 2023 , Article)
      Despite advances in deep learning, current state-of-the-art speech emotion recognition (SER) systems still have poor performance due to a lack of speech emotion datasets. This paper proposes augmenting SER systems with ...
    • Thumbnail

      Distinct neuropsychological correlates in positive and negative formal thought disorder syndromes: The thought and language disorder scale in endogenous psychoses 

      Nagels A.a Fahrmann; Stratmann M.a; Ghazi S.a; Schales C.a; Frauenheim M.a; Turner L.a; Hornig T.b; Katzev M.b; Muller-Isberner R.c; Grosvald M.d; Krug A.a; Kircher T.a; Kircher, Tilo... more authors ... less authors ( S. Karger AG , 2016 , Article Review)
      The correlation of formal thought disorder (FTD) symptoms and subsyndromes with neuropsychological dimensions is as yet unclear. Evidence for a dysexecutive syndrome and semantic access impairments has been discussed in ...
    • Thumbnail

      Decoding silent speech: a machine learning perspective on data, methods, and frameworks 

      Chowdhury, Adiba Tabassum; Newaz, Mehrin; Saha, Purnata; AbuHaweeleh, Mohannad Natheef; Mohsen, Sara; Bushnaq, Diala; Chabbouh, Malek; Aljindi, Raghad; Pedersen, Shona; Chowdhury, Muhammad E. H.... more authors ... less authors ( Springer Science and Business Media Deutschland GmbH , 2025 , Article Review)
      At the nexus of signal processing and machine learning (ML), silent speech recognition (SSR) has evolved as a game-changing technology that allows for communication without audible voice. This study offers a thorough ...

    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us | Send Feedback
    Contact Us | Send Feedback | QU

     

     

    Home

    Submit your QU affiliated work

    Browse

    All of Digital Hub
      Communities & Collections Publication Date Author Title Subject Type Language Publisher
    This Collection
      Publication Date Author Title Subject Type Language Publisher

    My Account

    Login

    Statistics

    View Usage Statistics

    About QSpace

    Vision & Mission

    Help

    Item Submission Publisher policiesUser guides FAQs

    Qatar University Digital Hub is a digital collection operated and maintained by the Qatar University Library and supported by the ITS department

    Contact Us | Send Feedback
    Contact Us | Send Feedback | QU

     

     

    Video