Practical Speech AI Research and Education

Speech AI researcher and educator at NAIST. I build open-source tools, tutorials, and demos in speech processing — from speech classification (e.g., emotion recognition) to ASR and TTS — and multimodal information fusion.

mindmap
  root((BTA))
    Research
      Speech Emotion Recognition
      Multimodal Fusion
      Multitask Learning
      Pathological Voice
    Tools
      Nkululeko
      Speechain
      PaperRAG
      GitHub
    Tutorials
      Shell and Linux
      Python and DSP
      Speech and Audio
      Git and LaTeX
    Publications
      Nkululeko ICASSP 2026
      COVID-19 Transfer 2025
      Dementia APSIPA 2025
      All Publications
    Contact
      Email
      GitHub Profile
      Google Scholar
      CV
    More
      Courses
      Theses
      Japanese Learning
      Islamic Studies
      Blogs