Speech and Multimodal AI Researcher and Educator
Speech AI researcher and educator at NAIST. I do research, teaching, tutorials, and demos in speech processing — from speech classification (e.g., emotion recognition) to ASR and TTS — and multimodal information fusion.
mindmap
root((BTA))
Research
Speech Emotion Recognition
Multimodal Fusion
Multitask Learning
Speech/Audio Classification
Tools
Nkululeko
Speechain
PaperRAG
GitHub
Tutorials
Shell and Linux
Python and DSP
Speech and Audio
Git and LaTeX
Publications
ICASSP
O-COCOSDA
APSIPA
All Publications
Contact
Email
GitHub Profile
Google Scholar
CV
More
Courses
Theses
Japanese Learning
Islamic Studies
Blogs