専門分野・研究テーマ
- 音声情報処理,音声言語インタフェース,パターン情報処理
- 音声認識・音声言語処理技術の開発、音声インタフェースやパターン情報処理などへの応用
- 最近までの研究開発例:音声認識システム(SPOJUS)、音声検索・ダイアライゼーションシステム、音声入力インタフェース、音声対話システム、運転行動信号処理、遠隔発話音声認識・話者認識(共同研究)、音楽情報処理(共同研究)、脳波による想起音声認識(共同研究)
発表論文等のリスト
※ 研究会論文、全国大会論文は含まれません。
論文誌
- Bo Ren, Longbiao Wang, Liang Lu, Yuma Ueda, and Atsuhiko Kai,
"Combination of bottleneck feature extraction and dereverberation for distant-talking speech recognition",
Multimedia Tools and Applications, Volume 75, Issue 9, pp 5093-5108, May 2016.
(DOI: 10.1007/s11042-015-2849-1)
- Yuma Ueda, Longbiao Wang, Atsuhiko Kai and Bo Ren,
"Environment-dependent denoising autoencoder for distant-talking speech recognition",
EURASIP Journal on Advances in Signal Processing,
2015:92, November 2015.
(DOI: 10.1186/s13634-015-0278-y)
- Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Eng Siong Chng and Haizhou Li,
"Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization",
Journal of Signal Processing Systems,
May 2015.
(DOI: 10.1007/s11265-015-1007-3)
- Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai, Takanori Yamada, Weifeng Li and Masahiro Iwahashi,
"Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification",
EURASIP Journal on Audio, Speech, and Music Processing,
2015:12, May 2015.
(DOI: 10.1186/s13636-015-0056-7)
- Zhaofeng Zhang, Longbiao Wang and Atsuhiko Kai,
``Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation,''
EURASIP Journal on Audio, Speech, and Music Processing, 2014:15, April 2014.
(DOI: 10.1186/1687-4722-2014-15)
- Longbiao Wang, Atsuhiko Kai, Junki Ema and Toshihiko Itoh, ``Statistical
Model-based Driving Situation Recognition,'' Journal of the Computer Technology
and Application, Vol.3, No.8, pp.544--549, August 2012.
- 安藤和敏,伊藤公人,甲斐充彦,前田恭伸,関谷和之,
“最短距離DEAによるプログラムコンテスト「敢闘賞」の決定”,
オペレーションズ・リサーチ, Vol.57, No.6, pp.315--321, June 2012.
- Kazutoshi Ando, Atsuhiko Kai, Yasunobu Maeda, Kazuyuki Sekitani,
“LEAST DISTANCE BASED INEFFICIENCY MEASURES ON THE PARETO-EFFICIENT
FRONTIER IN DEA,” Journal of the Operations Research Society of Japan, Vol.55,
No.1, pp.73--91, March 2012.
- Longbiao Wang, Kyohei Odani and Atsuhiko Kai, “Dereverberation and denoising
based on generalized spectral subtraction by multi-channel LMS algorithm using a
small-scale microphone array, ” EURASIP Journal on Advances in Signal
Processing, 2012:12, January 2012.(doi:10.1186/1687-6180-2012-12)
- N. Fujiwara, T. Itoh, K. Araki, A. Kai, T. Konishi and Y. Itoh:
``Spoken Language Understanding Method Using Confidence Measure and
Dialogue History'',
Systems and Computers in Japan, Vol.38, No.9, pp.21--31, 2007.
(下記論文の英訳版)
- 藤原敬記, 伊藤敏彦, 荒木健治, 甲斐充彦, 小西達裕, 伊東幸宏:
``認識信頼度と対話履歴を用いた音声言語理解手法'',
電子情報通信学会論文誌, Vol.J89-D, No.7, pp.1493--1503, 2006.
- 甲斐充彦, 盛 浩和, 中野 崇広, 中川聖一:
``フォーム型Web情報検索サービスのための音声ユーザインタフェースシステムと操作性の評価'',
情報処理学会論文誌, Vol.46, No.5, pp.1318--1329, 2005.
- 川本,下平,新田,西本,中村,伊藤,森島,四倉,甲斐,李,山下,小林,徳田,広瀬,峯松,山田,伝,宇津呂,嵯峨山:
``カスタマイズ性を考慮した擬人化音声対話ソフトウェアツールキットの設計'',
情報処理学会論文誌, Vol.43, No.7, pp.2249--2263, 2002.
- 伊藤敏彦, 甲斐充彦, 岩本善行, 水谷 誠, 由浅裕規, 小西達裕, 伊東幸宏:
``目的地設定タスクにおける対話状況の違いによる言語・音響的特徴の比較'',
情報処理学会論文誌, Vol.43, No.7, pp.2118--2129, 2002.
- 甲斐 充彦, 廣瀬 良文, 中川 聖一:
``単語N-gram言語モデルを用いた音声認識システムにおける未知語・冗長語の処理'',
情報処理学会論文誌, Vol.40, No.4, pp.1383--1394, 1999.
- 中川 聖一, 鳥居美和子, 甲斐 充彦, 中西 宏文:
``任意語彙の追加登録可能な単語音声認識システム'',
電気学会論文誌C, Vol.118-C, No.6, pp.865--872, 1998.
- Atsuhiko Kai and Seiichi Nakagawa:
``Comparison of Continuous Speech Recognition Systems with Unknown-Word Processing for Speech Disfluencies'',
Systems and Computers in Japan, Vol.29, No.9, pp.43--53, 1998.
(下記論文の英訳版)
- 甲斐 充彦, 中川 聖一:
``冗長語・言い直し等を含む発話のための未知語処理を用いた音声認識システムの比較評価'',
電子情報通信学会論文誌, Vol.J80-D-II, No.10, pp.2615--2625, 1997.
- Atsuhiko Kai and Seiichi Nakagawa:
``Relationship among Recognition Rate, Rejection Rate and False Alarm
Rate in a Spoken Word Recognition System'',
IEICE Trans. on Information and Systems, Vol.E78-D, No.6, pp.698--704,
1995.
- Seiichi Nakagawa and Atsuhiko Kai:
``A Context-Free Grammar-Driven, One-Pass HMM-Based Continuous
Speech Recognition Method'',
Systems and Computers in Japan, Vol.25, No.4, pp.92--102, 1994.
(下記論文の英訳版)
- 中川 聖一, 甲斐 充彦:
``文脈自由文法制御による One Pass 型 HMM 連続音声認識法'',
電子情報通信学会論文誌, Vol.J76-D-II, No.7, pp.1337--1345, 1993.
- 中川 聖一, 甲斐 充彦:
``ワードスポッティング法を用いた文脈自由文法制御フレーム同期型
HMM連続音声認識法'',
電子情報通信学会論文誌, Vol.J76-D-II, No.7, pp.1329--1336, 1993.
- Jun-ichi Takami, Atsuhiko Kai, and Shigeki Sagayama:
``A pairwise discriminant approach using artificial neural
networks for continuous speech recognition'',
The Journal of the Acoustical Society of Japan (E),
Vol.13, No.6, pp.411--418, 1992.
(DOI: 10.1250/ast.13.411)
国際会議&シンポジウム
- Takumi Kurokawa and Atsuhiko Kai,
"Retrieval-oriented E2E ASR Modeling for Improved Query-by-example Spoken Term Detection",
Proc. APSIPA Annual Summit and Conference (APSIPA ASC 2021), pp.1037-1042, December 2021.
(IEEE Xplore link)
- Takumi Kurokawa and Atsuhiko Kai, "Robust Query-by-example Spoken Term Detection
for Unknown Words Using Speech Retrieval-oriented E2E ASR Modeling", Proc. IEEE
10th Global Conference on Consumer Electronics (GCCE2021), pp.342-343, October 2021.
(DOI: 10.1109/GCCE53005.2021.9621804)
- Ryota Sakai, Atsuhiko Kai, Seiichi Nakagawa,
"Classification of Imagined and Heard Speech Using Amplitude Spectrum and Relative Phase of EEG”,
Proc. IEEE 3rd Global Conference on Life Sciences and Technologies (LifeTech 2021), pp.377-379, March 2021.
(DOI: 10.1109/LifeTech52111.2021.9391883)
- Takumi Kurokawa, Atsuhiko Kai, Hiroki Kondo,
"Effects of End-to-end ASR and Score Fusion Model Learning for Improved Query-by-example Spoken Term Detection",
Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC 2020), pp.654-661, New Zealand (virtual), December 2020.
(IEEE Xplore link)
[PDF]
- Raufun Nahar, Atsuhiko Kai,
"Effect of Data Augmentation on DNN-Based VAD for Automatic Speech Recognition in Noisy Environment",
Proc. IEEE 9th Global Conference on Consumer Electronics (GCCE 2020), pp. 477-481, October 2020.
(DOI: 10.1109/GCCE50665.2020.9292034)
- Raufun Nahar, Takashi Kawai, Atsuhiko Kai,
"Multi-Condition Training of Denoising Autoencoder by Augmenting Simulated Reverberant Speech Data",
Proc. 2018 IEEE 7th Global Conference on Consumer Electronics (GCCE 2018), pp.301-305, October 2018.
(DOI: 10.1109/GCCE.2018.8574776)
- Tomonori Kawamura, Atsuhiko Kai, and Seiichi Nakagawa,
"Noise robust fundamental frequency estimation of speech using CNN-based discriminative modeling",
Proc. 5th. International Conference on Advanced Informatics, Concepts, Theory, and Applications (ICAICTA),
Paper ID: 43542-040, 6 pages, August 2018.
(DOI: 10.1109/ICAICTA.2018.8541328")
- Yuji Terada, Kenta Tamiya, and Atsuhiko Kai,
"Investigation of Efficient Semi-automatic Correction Method Using STD for Automatic Captioning",
Proc. IEEE 6th Global Conference on Consumer Electronics (GCCE 2017), pp.418-419, Aichi, October 2017.
(DOI: 10.1109/GCCE.2017.8229451)
- Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai,
"Combining state-level spotting and posterior-based acoustic match
for improved query-by-example spoken term detection",
Proc. INTERSPEECH 2016, pp.740-744, San Francisco, September 2016.
(DOI: 10.21437/Interspeech.2016-1259)
- S M Raufun Nahar, Atsuhiko Kai,
"Robust Voice Activity Detector by Combining Sequentially Trained Deep Neural Networks",
Proc. ICAICTA 2016, Penang, August 2016.
(DOI: 10.1109/ICAICTA.2016.7803133)
- Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai,
"Combining State-level and DNN-based Acoustic Matches for Efficient Spoken Term Detection in NTCIR-12 SpokenQuery&Doc-2 Task",
Proc. of the 12th NTCIR Conference on Evaluation of Information Access Technologies,
Tokyo, June 2016.
[PDF]
(ISBN 978-4-86049-071-3)
- Bo Ren, Longbiao Wang, Atsuhiko Kai and Zhaofeng Zhang,
"Speech selection and environmental adaptation for asynchronous speech recognition",
Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC 2015),
pp.119-124, Hong Kong, December 2015.
(DOI: 10.1109/APSIPA.2015.7415485)
- Longbiao Wang, Bo Ren, Yuma Ueda, Atsuhiko Kai, Shunta Teraoka and Taku Fukushima:
Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording,
Proc. Asia-Pacific Signal Information Processing Association Annual Summit and Conference (APSIPA ASC), pp.1-5(5 pages), December 2014.
- Mitsuaki Makino and Atsuhiko Kai:
Combining Subword and State-level Dissimilarity Measures for Improved Spoken Term Detection in NTCIR-11 SpokenQuery&Doc Task,
Proc. NTCIR-11, pp.1-6(6 pages), December 2014.
[PDF]
- Ikuya Hirano, Kong Aik Lee, Zhaofeng Zhang, Longbiao Wang and Atsuhiko Kai:
Single-sided Approach to Discriminative PLDA Training for Text-Independent Speaker Verification without Using Expanded I-vector,
Proc. ISCSLP 2014, Singapore, September 2014.
- Satoshi Shiota, Longbiao Wang, Kyohei Odani, Atsuhiko Kai and Weifeng Li:
Distant-talking speech recognition using multi-channel LMS and multiple-step linear prediction,
Proc. ISCSLP 2014, Singapore, September 2014.
- Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, EngSiong Chng and Haizhou Li:
Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization,
Proc. ISCSLP 2014, Singapore, September 2014.
- Mitsuaki Makino, Naoki Yamamoto, Atsuhiko Kai:
Utilizing State-level Distance Vector Representation for Improved Spoken
Term Detection by Text and Spoken Queries,
Proc. INTERSPEECH 2014, pp.1732-1736, Singapore, September 2014.
- Yuta Kawakami, Longbiao Wang, Atsuhiko Kai, and Seiichi Nakagawa:
Speaker Identification by Combining Various Vocal Tract and Vocal Source Features,
Proc. of the 17th International Conference on Text, Speech and Dialogue (TSD 2014), pp.382-389, September 2014.
- Naoki Yamamoto, Atsuhiko Kai:
Using Acoustic Dissimilarity Measures Based on State-Level Distance Vector Representation for
Improved Spoken Term Detection,
Proc. of APSIPA Annual Summit and Conference 2013, October 2013.
http://dx.doi.org/10.1109/APSIPA.2013.6694151
- Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Weifeng Li:
Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech
and Music,
Proc. of APSIPA Annual Summit and Conference 2013, October 2013.
http://dx.doi.org/10.1109/APSIPA.2013.6694159
- Takanori Yamada, Longbiao Wang and Atsuhiko Kai:
Improvement of distant-talking speaker identification using bottleneck features of DNN,
Proc. of INTERSPEECH 2013, pp.3661-3664, August 2013.
- Naoki Yamamoto and Atsuhiko Kai:
Spoken Term Detection Using Distance-Vector based Dissimilarity Measures and Its Evaluation on the NTCIR-10 SpokenDoc-2 Task,
Proc. of the 10th NTCIR Conference, pp.648-653, June 2013.
[PDF]
- Longbiao Wang, Zhaofeng Zhang and Atsuhiko Kai:
HANDS-FREE SPEAKER IDENTIFICATION BASED ON SPECTRAL SUBTRACTION USING A MULTI-CHANNEL LEAST MEAN SQUARE APPROACH,
Proc. of ICASSP2013, pp.7224-7228, May 2013.
- Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Dereverberantion based on Generalized Spectral Subtraction for Distant-talking Speaker Recognition,
Proc. of APSIPA Annual Summit and Conference 2012, OS.41-SLA.15-3(4 pages), December 2012.
[PDF]
- Longbiao Wang, Zhaofeng Zhang, Atsuhiko Kai, Yoshiki Kishi:
Distant-talking speaker identification using a reverberation model with various artificial room impulse responses,
Proc. of APSIPA Annual Summit and Conference 2012, OS.45-SLA.17-3(4 pages), December 2012.
[PDF]
- Ikuya Hirano, Longbiao Wang, Atsuhiko Kai, Seiichi Nakagawa:
On the Use of Phase Information-based Joint Factor Analysis for Speaker Verification under Channel Mismatch Condition,
Proc. of APSIPA Annual Summit and Conference 2012, PS.5-SLA.18-3(4 pages), December 2012.
[PDF]
- Kyohei Odani, Longbiao Wang, Atsuhiko Kai:
``Speech Recognition by Denoising and Dereverberation Based on Spectral
Subtraction in a Real Noisy Reverberant Environment,''
Proc. of the 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), pp.Tue.P5c.06(4 pages), September 2012.
- Kyohei Odani, Longbiao Wang and Atsuhiko Kai:
``Blind Dereverberation Based on Generalized Spectral Subtraction by Multi-channel LMS Algorithm'',
Proc. of APSIPA ASC 2011, Thu-PM.RS4-2(4 pages), October 2011.
[PDF]
- Longbiao Wang, Kyohei Odani, and Atsuhiko Kai:
``Evaluation of hands-free large vocabulary continuous speech recognition by
blind dereverberation based on spectral subtraction by multi-channel LMS algorithm'',
Proc. of the 14th International Conference on Text, Speech and Dialogue (TSD 2011),
pp.131--138, September 2011.
- Junki Ema, Longbiao Wang, Atsuhiko Kai and Toshihiko Itoh:
``Investigation of Driving-Behavior Modeling for Recognition of a Driving Situation'',
Proc. of the 2nd. APSIPA Annual Summit and Conference, pp.161--164, 2010.
[PDF]
- Yonggee Jang, Atsuhiko Kai and Longbiao Wang:
``Multimodal Interface with N-best Display Including Candidates of Spoken Word Fragments'',
Proc. of the 2nd. APSIPA Annual Summit and Conference, pp.478--481, 2010.
[PDF]
- Longbiao Wang, Yoshiki Kishi, Atsuhiko Kai:
``Distant Speaker Recognition Based on the Automatic Selection of Reverberant Environments Using GMMs'',
Proc. CJKPR2009, pp.954--958, 2009.
- Yonggee Jang, Atsuhiko Kai, Longbiao Wang:
``Speech Interface for Isolated Words Based on Combination of Search
Candidates from the Common Word Parts'',
Proc. of the 10th Western Pacific Acoustics Conference (WESPAC X 2009), pp.0261
(7 pages), 2009.
- Toshihiko Itoh, Atsuhiko Kai, Yukihiro Itoh and Tatsuhiro Konishi:
``An understanding strategy based on plausibility score in recognition history using CSR confidence measure'', Proc. of International Conference on Spoken Language Processing (INTERSPEECH 2004 - ICSLP), pp.2133--2136, 2004.
- Atsuhiko Kai and Toshihiko Itoh:
``Prosodic feature and its application for detecting spoken corrections'', Proc. for 2002 2nd Plenary Meeting and Symposium on Prosody and Speech Processing (Organized by Scientific Research of Priority Areas, Ministry of Education, Culture, Sports, Science and Technology, Japan), pp.115--120, 2003.
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, and Shigeki Sagayama:
``Open-source software for developing anthropomorphic spoken dialog agents'',
In Proceedings of the International Workshop on LIFELIKE ANIMATED AGENTS:
Tools, Affective Functions, and Applications, August 2002.
- Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi and Yukihiro Itoh:
``Linguistic and acoustic changes of user's utterances caused by different dialogue situations'',
Proc. of International Conference on Spoken Language Processing (ICSLP 2002),
Denver, Colorado USA, pp.545--548, 2002.
- Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi and Yukihiro Itoh:
``Influence of different dialogue situations on user's behavior in spoken corrections'',
Proc. of International Conference on Spoken Language Processing (ICSLP 2002),
Denver, Colorado USA, pp.1189--1192, 2002.
- Atsuhiko Kai and Seiichi Nakagawa:
``Analysis of prosodic features on key-phrases and corrections in spoken dialogue'', Proc. for 2001 2nd Plenary Meeting and Symposium on Prosody and Speech Processing (Organized by Scientific Research of Priority Areas(B), Ministry of Science, Culture, Sports, Education, Japan), pp.179--184, 2002.
- Atsuhiko Kai, Takahiro Nakano and Seiichi Nakagawa:
``Usability of Browser-Based Pen-Touch/Speech User Interfaces for
Form-Based Applications in Mobile Environment'', Lecture Notes in Computer Science 1948: Advances in Multimodal Interfaces - ICMI2000, pp.549--556, 2000.
- Atsuhiko Kai, Takahiro Nakano and Seiichi Nakagawa:
``A speech interface system for information retrieval tasks on the WWW'', Proc. of International Workshop Speech and Computer (SPECOM'99),
Moscow, Russia, pp.141--144, 1999.
- Atsuhiko Kai, Yoshifumi Hirose and Seiichi Nakagawa:
``Dealing with out-of-vocabulary words and speech disfluencies in an
N-gram based speech understanding system'', Proc. of
International Conference on Spoken Language Processing (ICSLP 98),
Sydney, Australia, pp.2427--2430, 1998.
- Atsuhiko Kai and Seiichi Nakagawa:
``An acoustic look-ahead method for efficient frame-synchronous search
in a large vocabulary speech recognition system'', Proc. of
International Conference on Speech Processing (ICSP'97), Seoul,
Korea, pp.513--518, 1997.
- Seiichi Nakagawa, Atsuhiko Kai, Toshihiko Itoh and Masaki Ida:
``An Isolated/Continuous Speech Recognition System on a Personal
Computer'', Proc. of the first China-Japan Workshop on Spoken Language
Processing (CJSLP'97), Huang Shan, P.R. China, pp.216--223, 1997.
- Atsuhiko Kai and Seiichi Nakagawa:
``A Continuous Speech Recognition System Using Loosely Constrained
Linguistic Knowledge for Spontaneous Speech'', Proc. of the first
China-Japan Workshop on Spoken Language Processing
(CJSLP'97), Huang Shan, P.R. China, pp.240--245, 1997.
- Atsuhiko Kai and Seiichi Nakagawa:
``Investigation on Unknown Word Processing and Strategies for
Spontaneous Speech Understanding'', Proc. of EUROSPEECH'95, Madrid,
Spain, pp.2095--2098, 1995.
- Atsuhiko Kai and Seiichi Nakagawa:
``Evaluation of Unknown Word Processing in a Spoken Word Recognition System'',
Proc. of International Conference on Spoken Language Processing (ICSLP 94),
Yokohama, Japan, pp.2151--2154, 1994.
- Atsuhiko Kai and Seiichi Nakagawa:
``A Frame-Synchronous Continuous Speech Recognition Algorithm
Using a Top-Down Parsing of Context-Free Grammar'',
Proc. of International Conference on Spoken Language Processing (ICSLP 92),
Alberta, Canada, pp.257--260, 1992.
- Jun-ichi Takami, Atsuhiko Kai and Shigeki Sagayama:
Speech recognition by combining pairwise discriminant time-delay neural networks and predictive LR-parser,
Proc. IEEE Workshop Neural Networks for Signal Processing, pp.327--336, 1991.
doi: 10.1109/NNSP.1991.239509
著書
- 中川聖一編著, 小林 聡, 峯松信明, 宇津呂武仁, 秋葉友良, 北岡教英, 山本幹雄, 甲斐充彦, 山本一公, 土屋雅稔共著,
``音声言語処理と自然言語処理(増補)'', コロナ社, 2018.9
[出版社link]
- Yuta Kawakami, Longbiao Wang, Atsuhiko Kai, and Seiichi Nakagawa,
``Speaker Identification by Combining Various Vocal Tract and Vocal Source Features,''
Petr Sojka, Aleš Horák, Ivan Kopeˇcek, Karel Pala (Eds.),
Lecture Notes in Artificial Intelligence 8655 (LNAI 8655),
pp.382-389, September 2014.
- 中川聖一編著, 小林 聡, 峯松信明, 宇津呂武仁, 秋葉友良, 北岡教英, 山本幹雄, 甲斐充彦, 山本一公, 土屋雅稔共著,
``音声言語処理と自然言語処理'', コロナ社, 2013.3
- Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Norihide Kitaoka and Seiichi Nakagawa , “Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm for Hands-free Speech Recognition,” Chapter 7 in “Modern Speech Recognition Approaches with Case Studies”, Dr. S Ramakrishnan (Ed.), ISBN: 978-953-51-0831-3, InTech, November 2012.
- Longbiao Wang, Kyohei Odani, and Atsuhiko Kai,“Evaluation of hands-free large
vocabulary continuous speech recognition by blind dereverberation based on
spectral subtraction by multi-channel LMS algorithm, ” I. Habernal and V.
Matousek (Eds.), Lecture Notes in Artificial Intelligence (LNAI 6836),
pp.131--138, September 2011.
- S. Nakagawa, A. Kai and T. Itoh, ``The Spoken Dialogue System of TUT'' in S. Nakagawa, M. Okada and T. Kawahara (Eds.), Spoken Language Systems,
pp.129-142, Ohmsha/IOS Press, 2005.
- Shin-ichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta,
Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima,
Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita,
Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu,
Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama,
``Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents,''
Life-Like Characters. Tools, Affective Functions, and Applications.
Helmut Prendinger et al. (Eds.) Springer, pp.187-212, November 2003.
- 松下温,屋代智之編著, ``ITSと情報通信技術''(共著), 裳華房, 2003.
博士学位論文
Last updated: 2022/3
甲斐のトップページへ戻る