Speech and Multimodal AI Researcher and Educator

Speech AI researcher and educator at NAIST. I do research, teaching, tutorials, and demos in speech processing — from speech classification (e.g., emotion recognition) to ASR and TTS — and multimodal information fusion.

mindmap
  root((BTA))
    Research
      Speech Emotion Recognition
      Multimodal Fusion
      Multitask Learning
      Speech/Audio Classification  
    Tools
      Nkululeko
      Speechain
      PaperRAG
      GitHub
    Tutorials
      Shell and Linux
      Python and DSP
      Speech and Audio
      Git and LaTeX
    Publications
      ICASSP
      O-COCOSDA  
      APSIPA
      All Publications
    Contact
      Email
      GitHub Profile
      Google Scholar
      CV
    More
      Courses
      Theses
      Japanese Learning
      Islamic Studies
      Blogs