Publications
You can also find my articles on my Google Scholar profile.
2025
Atmaja, B. T., Zanjabila, Suyanto, Asmoro, W. A., & Sasou, A. (2025). Cross-dataset COVID-19 transfer learning with data augmentation. International Journal of Information Technology. https://doi.org/10.1007/s41870-025-02433-z
Atmaja, B. T., & Sasou, A. (2025). Pathological Voice Detection From Sustained Vowels : Handcrafted vs. Self-supervised Learning. 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), 1–5. https://doi.org/10.1109/ICASSPW65056.2025.11011272
Atmaja, B. T., Sasou, A., & Burkhardt, F. (2025). Performance-Weighted Ensemble Learning for Speech Classification. 2025 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), 0044–0048. https://doi.org/10.1109/ICAIIC64266.2025.10920862
Atmaja, B. T., & Sakti, S. (2025). Dementia Prediction From Speech Signal Using Optimized Prosodic Features. APSIPA Annual Summit and Conference.
Burkhardt, F., & Atmaja, B. T. (2025). Nkululeko 1 . 0 : A Python package to predict speaker characteristics with a high-level interface How Does It Work ? Statement of Need Usage in Existing Research. Journal of Open Source Software, 0, 1–4. https://doi.org/10.21105/joss.08049
2024
Atmaja, B. T., Zanjabila, Suyanto, & Sasou, A. (2024). Comparing hysteresis comparator and RMS threshold methods for automatic single cough segmentations. International Journal of Information Technology (Singapore), 16(1), 5–12. https://doi.org/10.1007/s41870-023-01626-8
Suyanto, Zanjabila, Atmaja, B. T., & Asmoro, W. A. (2024). Performance Improvement of Covid-19 Cough Detection Based on Deep Learning with Segmentation Methods. Journal of Applied Data Science, 5(2), 520–531.
Burkhardt, F., Atmaja, B. T., Derington, A., & Eyben, F. (2024). Check Your Audio Data : Nkululeko for Bias Detection. Oriental COCOSDA, 1–6. https://doi.org/10.1109/O-COCOSDA64382.2024.10800580
Atmaja, B. T. (2024). Evaluating Hyperparameter Optimization for Machinery Anomalous Sound Detection. 2024 IEEE REGION 10 CONFERENCE (TENCON).
Atmaja, B. T., & Sasou, A. (2024). Multi-label Emotion Share Regression From Speech Using Pre-Trained Self-Supervised Learning Models. 2024 IEEE REGION 10 CONFERENCE (TENCON).
Atmaja, B. T., Sasou, A., & Burkhardt, F. (2024). Uncertainty-Based Ensemble Learning for Speech Classification. 2024 27th Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques (O-COCOSDA), 1–6. https://doi.org/10.1109/O-COCOSDA64382.2024.10800111
2023
Atmaja, B. T., Zanjabila, Suyanto, & Sasou, A. (2024). Comparing hysteresis comparator and RMS threshold methods for automatic single cough segmentations. International Journal of Information Technology (Singapore), 16(1), 5–12. https://doi.org/10.1007/s41870-023-01626-8
Atmaja, B. T., & Sasou, A. (2023). Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech. 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 1026–1029. https://doi.org/10.1109/APSIPAASC58517.2023.10317109
Atmaja, B. T., & Sasou, A. (2023). Multilingual, Cross-lingual, and Monolingual Speech Emotion Recognition on EmoFilm Dataset. 2023 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 1019–1025. https://doi.org/10.1109/APSIPAASC58517.2023.10317223
2022
Atmaja, B. T., Sasou, A., & Akagi, M. (2022). Survey on bimodal speech emotion recognition from acoustic and linguistic information fusion. Speech Communication, 140, 11–28. https://doi.org/10.1016/j.specom.2022.03.002
Atmaja, B. T., Sasou, A., & Akagi, M. (2022). Speech Emotion and Naturalness Recognitions With Multitask and Single-Task Learnings. IEEE Access, 10, 72381–72387. https://doi.org/10.1109/ACCESS.2022.3189481
Atmaja, B. T., Zanjabila, & Sasou, A. (2022). Jointly Predicting Emotion, Age, and Country Using Pre-Trained Acoustic Embedding. 2022 10th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos (ACIIW), 1–6. https://doi.org/10.1109/ACIIW57231.2022.10085991
Atmaja, B. T., & Sasou, A. (2022). Effects of Data Augmentations on Speech Emotion Recognition. Sensors, 22(16), 5941. https://doi.org/10.3390/s22165941
Atmaja, B. T., & Sasou, A. (2022). Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations. Sensors, 22(17), 6369. https://doi.org/10.3390/s22176369
Atmaja, B. T., & Sasou, A. (2022). Evaluating Self-Supervised Speech Representations for Speech Emotion Recognition. IEEE Access, 10, 124396–124407. https://doi.org/10.1109/ACCESS.2022.3225198
Atmaja, B. T., & Sasou, A. (2022). Leveraging Pre-Trained Acoustic Feature Extractor For Affective Vocal Bursts Tasks. 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), November, 1409–1414. https://doi.org/10.23919/APSIPAASC55919.2022.9980083
Atmaja, B. T., Zanjabila, & Sasou, A. (2022). On The Optimal Classifier For Affective Vocal Bursts And Stuttering Predictions Based On Pre-Trained Acoustic Embedding. 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), November, 1690–1695. https://doi.org/10.23919/APSIPAASC55919.2022.9980310
2021
Atmaja, B. T., & Akagi, M. (2021). Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM. Speech Communication, 126, 9–21. https://doi.org/10.1016/j.specom.2020.11.003
Atmaja, B. T., & Arifianto, D. (2021). A comparative study of sound sources separation by independent component analysis and binaural model. Journal of Physics: Conference Series, 1896(1), 012002. https://doi.org/10.1088/1742-6596/1896/1/012002
Atmaja, B. T. (2021). Dimensional Speech Emotion Recognition by Fusing Acoustic and Linguistic Information. Japan Advanced Institute of Science and Technology.
Atmaja, B. T., & Sasou, A. (2021). Effect of different splitting criteria on the performance of speech emotion recognition. TENCON 2021 - 2021 IEEE Region 10 Conference (TENCON), 760–764. https://doi.org/10.1109/TENCON54134.2021.9707265
Atmaja, B. T., Sasou, A., & Akagi, M. (2021). Automatic Naturalness Recognition from Acted Speech Using Neural Networks. APSIPA Annual Summit and Conference, 731–736.
Atmaja, B. T., & Akagi, M. (2021). Evaluation of error- And correlation-based loss functions for multitask learning dimensional speech emotion recognition. Journal of Physics: Conference Series, 1896(1), 012004. https://doi.org/10.1088/1742-6596/1896/1/012004
2020
Atmaja, B. T., & Akagi, M. (2020). Multitask Learning and Multistage Fusion for Dimensional Audiovisual Emotion Recognition. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2020-May, 4482–4486. https://doi.org/10.1109/ICASSP40776.2020.9052916
Atmaja, B. T., & Akagi, M. (2020). Dimensional speech emotion recognition from speech features and word embeddings by using multitask learning. APSIPA Transactions on Signal and Information Processing, 9(May), e17. https://doi.org/10.1017/ATSIP.2020.14
Atmaja, B. T., Hamada, Y., & Akagi, M. (2020). Predicting Valence and Arousal by Aggregating Acoustic Features for Acoustic-Linguistic Information Fusion. 2020 IEEE REGION 10 CONFERENCE (TENCON), 1081–1085. https://doi.org/10.1109/TENCON50793.2020.9293899
Atmaja, B. T., & Akagi, M. (2020). On The Differences Between Song and Speech Emotion Recognition: Effect of Feature Sets, Feature Types, and Classifiers. 2020 IEEE REGION 10 CONFERENCE (TENCON), 968–972. https://doi.org/10.1109/TENCON50793.2020.9293852
Atmaja, B. T., & Akagi, M. (2020). Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition. 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings, 325–331.
Atmaja, B. T., & Akagi, M. (2020). Improving Valence Prediction in Dimensional Speech Emotion Recognition Using Linguistic Information. Proceedings of 2020 23rd Conference of the Oriental COCOSDA International Committee for the Co-Ordination and Standardisation of Speech Databases and Assessment Techniques, O-COCOSDA 2020, 166–171. https://doi.org/10.1109/O-COCOSDA50338.2020.9295032
Atmaja, B. T., & Akagi, M. (2020). The Effect of Silence Feature in Dimensional Speech Emotion Recognition. 10th International Conference on Speech Prosody 2020, May, 26–30. https://doi.org/10.21437/SpeechProsody.2020-6
Elbarougy, R., Atmaja, B. T., & Akagi, M. (2020). Continuous Audiovisual Emotion Recognition Using Feature Selection and LSTM. Journal of Signal Processing, 24(6), 229–235. https://doi.org/10.2299/jsp.24.229
2019
Atmaja, B. T., Arifianto, D., & Akagi, M. (2019). Speech recognition on Indonesian language by using time delay neural network. ASJ Spring Meeting, 1291–1294.
Atmaja, B. T., & Akagi, M. (2019). Speech Emotion Recognition Based on Speech Segment Using LSTM with Attention Model. 2019 IEEE International Conference on Signals and Systems (ICSigSys), 40–44. https://doi.org/10.1109/ICSIGSYS.2019.8811080
Atmaja, B. T., Shirai, K., & Akagi, M. (2019). Speech Emotion Recognition Using Speech Feature and Word Embedding. 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 519–523. https://doi.org/10.1109/APSIPAASC47483.2019.9023098
Atmaja, B. T., Shirai, K., & Akagi, M. (2019). Deep Learning-based Categorical and Dimensional Emotion Recognition for Written and Spoken Text. IPTEK Journal of Proceedings Series.
Atmaja, B. T., Elbarougy, R., & Akagi, M. (2019). RNN-based Dimensional Speech Emotion Recognition. ASJ Autumn Meeting, 743–744.
2018
Lestari, R. Y., Harsono, D., Cahyana, B. T., Atmaja, B. T., Asmoro, W. A., Perindustrian, K., Selatan, K., Fisika, D. T., & Timur, J. (2018). Tingkat redaman suara papan komposit dari tandan kosong kelapa sawit dan serbuk kayu akasia. Prosiding Seminar Nasional Teknologi Dan Inovasi Industri, 31–38.
Elbarougy, R., Atmaja, B. T., & Akagi, M. (2018). Continuous Tracking of Emotional State from Speech Based on Emotion Unit. ASJ Autumn Meeting, 1(1), 1231–1234.
2017
- Arifianto, D., Wirawan, W., Atmaja, B. T., Dhanardhono, T., & Rahman, S. A. (2017). Azimuth Tracking of Underwater Moving Sound Source Based on Time Delay Estimation Using Hydrophone Array. Procedia Engineering, 170. https://doi.org/10.1016/j.proeng.2017.03.039
2016
Atmaja, B. T., Farid, M. N., & Arifianto, D. (2016). Speech enhancement on smartphone voice recording. Journal of Physics: Conference Series, 776(1). https://doi.org/10.1088/1742-6596/776/1/012072
Atmaja, B. T., & Arifianto, D. (2016). Signal Enhancement by Single Channel Source Separation. IPTEK Journal of Proceedings Serie, 1(2), 2–3.
Atmaja, B. T., Puabdillah, M. F., Farid, M. N., & Asmoro, W. A. (2016). Prediction and simulation of internal train noise resulted by different speed and air conditioning unit. Journal of Physics: Conference Series, 776(1). https://doi.org/10.1088/1742-6596/776/1/012072
2014
- Tris Atmaja, B., & Arifianto, D. (2014). Pemisahan Sumber Suara Tercampur Berdasarkan Penelusuran Frekuensi Dasar Pada Sinyal Wicara dan Musik. Seminar Nasional Getaran Dan Akustik. https://drive.google.com/file/d/0B2cAmw9oV5cuTVkwa18yQ194Nnc/view?usp=sharing
2012
Atmaja, B. T. (2012). On Source Signal Segregation Based On Binaural Inputs. Institut Teknologi Sepuluh Nopember.
Atmaja, B. T., Arifianto, D., Chisaki, Y., & Usagawa, T. (2012). Signal Enhancement by Using Sound Separation Methods Based On Binaural Inputs. Basic Science, 1(3).
Putra, B., Atmaja, B. T., & Hidayat, S. (2012). Fusion of artificial neural network and fuzzy system for short term weather forecasting. International Journal of Information and Communication Technology, 4(2–4), 210–226. https://doi.org/10.1504/IJICT.2012.048765
2011
Atmaja, B. T., Arifianto, D., Usagawa, T., Chisaki, Y., & Usagawa, T. (2011). On Performance of Two-Sensor Sound Separation Methods Including Binaural Processors. ASJ Kyushu Meeting, 8555(1), 2–5.
Putra, B., Atmaja, B. T., & Prananto, D. (2011). Prototyping of Quranic Verse Recitation Learning Software Using Speech Recognition Techniques Based on Cepstral Feature. International Conference on Informatics for Development, 2011(Icid), 82–87.
Putra, B., Atmaja, B. T., & Hidayat, S. (2011). Short Term Weather Forecasting Using Fusion of Fuzzy-Artificial Neural Network. International Conference on Informatics for Development, 2011(Icid), 48–53.
Atmaja, B. T. (2011). Kebangkitan Sains Islam, Kebangkitan Peradaban Islam.
2010
Atmaja, B. T., Putra, B., & Prananto, D. (2010). Developing Quranic Verse Recitation Learning Software Based On Speech Recognition Techniques. Prosiding Seminar Nasional Teknik Fisika (SNTF).
Atmaja, B. T. (2010). Rekonstruksi pendidikan pesantren dengan membangun budaya ilmiah dan islamisasi sains.
2009
Atmaja, B. T., & Arifianto, D. (2009). Blind Sound Separation Using Frequency-Domain And Time-Domain Independent Component Analysis For Machines Fault Detection. Proceeding of The International Conference on Advanced Computing and Information System (ICACSIS), 259–263.
Putra, B., & Atmaja, B. T. (2009). Integrasi Sistem Fuzzy-JST Untuk Prakiraan Cuaca Jangka Pendek (Studi Kasus di Surabaya).
Atmaja, B. T. (2009). Pemisahan Banyak Sumber Suara Mesin Dari Microphone Array Dengan Metode Independent Component Analysis (ICA) Untuk Deteksi Kerusakan. Institut Teknologi Sepuluh Nopember.
Putra, B., & Atmaja, B. T. (2009). Implementasi Sistem Fuzzy Untuk Pengaturan Lampu Lalu Lintas.
Atmaja, B. T. (2009). Integrasi Al-Quran dan Sains Untuk Memodelkan Ulang Konsep Perputaran Bumi dan Matahari.
Atmaja, B. T., & Arifianto, D. (2009). Machinery Fault Diagnosis Using Independent Component Analyis and Instantaneous Frequency. Proceeding of International Conference on Instrumentation, Communications, Information Technology and Biomedical Engineering (ICICI-BME). https://doi.org/10.1109/ICICI-BME.2009.5417257
2007
- Atmaja, B. T., & Putra, B. (2007). Kajian Al-Quran Terhadap Absolutisme Kecepatan Cahaya Dalam Teori Fisika Relativistik.
