Practical Speech AI Research and Education
Speech AI researcher and educator at NAIST. I build open-source tools, tutorials, and demos for speech processing, from speech classification (e.g., emotion recognition) to ASR and TTS, and for multimodal information fusion.
mindmap
  root((BTA))
    Research
      Speech Emotion Recognition
      Multimodal Fusion
      Multitask Learning
      Pathological Voice
    Tools
      Nkululeko
      Speechain
      PaperRAG
      GitHub
    Tutorials
      Shell and Linux
      Python and DSP
      Speech and Audio
      Git and LaTeX
    Publications
      Nkululeko ICASSP 2026
      COVID-19 Transfer 2025
      Dementia APSIPA 2025
      All Publications
    Contact
      Email
      GitHub Profile
      Google Scholar
      CV
    More
      Courses
      Theses
      Japanese Learning
      Islamic Studies
      Blogs