Showing 1–1 of 1 results for author: Tonmoy, M A

  1. arXiv:2408.13201  [pdf, other]

    cs.SD cs.IR cs.LG eess.AS

    EAViT: External Attention Vision Transformer for Audio Classification

    Authors: Aquib Iqbal, Abid Hasan Zim, Md Asaduzzaman Tonmoy, Limengnan Zhou, Asad Malik, Minoru Kuribayashi

    Abstract: This paper presents the External Attention Vision Transformer (EAViT) model, a novel approach designed to enhance audio classification accuracy. As digital audio resources proliferate, the demand for precise and efficient audio classification systems has intensified, driven by the need for improved recommendation systems and user personalization in various applications, including music streaming p…

    Submitted 23 August, 2024; originally announced August 2024.