Speech and Multimodal AI Researcher and Educator

Speech AI researcher and educator at NAIST. I do research, teaching, tutorials, and demos in speech processing — from speech classification (e.g., emotion recognition) to ASR and TTS — and multimodal information fusion.

mindmap
  root((BTA))
    Research
      Speech AI
      Multimodal Fusion
      Multitask Learning
      Speech/Audio Classification  
    Tools
      Nkululeko
      Speechain
      PaperRAG
      GitHub
    Tutorials
      Shell and Linux
      Python and DSP
      Speech and Audio
      Git and LaTeX
    Publications
      ICASSP
      O-COCOSDA  
      APSIPA
      All Publications
    Japanese
      Ayo Belajar Bahasa Jepang
      Minna no Nihongo
      Japanese for work
    Islam
      Kisah Nabi 
      Arbain Nawawi