Dr. Đỗ Văn Hải
Lecturer
Department of Computer Engineering and Networks, 3rd floor, Building C5


haidv@tlu.edu.vn

- Electronic circuits

- Logic circuit design

 

- Speech processing

- Pattern recognition

- Deep learning


Full name

Dr. Đỗ Văn Hải (currently a postdoctoral researcher in Singapore)

Work address:

Department of Computer Engineering and Networks, 3rd floor, Building C5

Courses taught:

- Digital electronic circuits, Introduction to Informatics

Research interests:

- Speech recognition

- Speech enhancement

- Speech search

Publications

1. Van Hai Do, Nancy F. Chen, Boon Pang Lim, and Mark Hasegawa-Johnson, “Analysis of Mismatched Transcriptions Generated by Humans and Machines for Under-Resourced Languages,” in Proc. of the Annual Conference of the International Speech Communication Association (Interspeech 2016), Sept. 8-12, 2016, USA.

2. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-dependent Phone Mapping for Acoustic Modeling of Under-resourced Languages,” International Journal of Asian Language Processing, vol. 23, no. 1, pp. 21-33, 2015.

3. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Cross-lingual phone mapping for large vocabulary speech recognition of under-resourced languages,” IEICE Transactions on Information and Systems, vol. E97-D, no. 2, pp. 285-295, 2014.

4. Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, and Haizhou Li, “Approximate search of audio queries by using DTW with phone time boundary and data augmentation,” in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016.

5. Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, and Haizhou Li, “Exemplar-inspired strategies for low-resource spoken keyword search in Swahili,” in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016.

6. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Distance Metric Learning for Kernel Density-Based Acoustic Model Under Limited Training Data Conditions,” in Proc. of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015).

7. Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, and Haizhou Li, “Multilingual Exemplar-Based Acoustic Model for the NIST Open KWS 2015 Evaluation,” in Proc. of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015).

8. Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “On the study of very low-resource language keyword search,” in Proc. of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2015 (APSIPA ASC 2015).

9. Haihua Xu, Van Hai Do, Xiong Xiao, and Eng Siong Chng, “A Comparative Study of BNF and DNN Multilingual Training on Cross-lingual Low-resource Speech Recognition,” in Proc. of the Annual Conference of the International Speech Communication Association (Interspeech 2015), Sept. 6-10, 2015, Germany.

10. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Kernel Density based Acoustic Model with Cross-lingual Bottleneck Features for Resource Limited LVCSR,” in Proc. of the Annual Conference of the International Speech Communication Association (Interspeech 2014), Sept. 14-18, 2014, Singapore.

11. Mirco Ravanelli, Van Hai Do, and Adam Janin, “TANDEM-Bottleneck Feature Combination using Hierarchical Deep Neural Networks,” in Proc. of the International Symposium on Chinese Spoken Language Processing (ISCSLP-2014), Sept. 12-14, 2014, Singapore.

12. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context-dependent phone mapping for LVCSR of under-resourced languages,” in Proc. of the Annual Conference of the International Speech Communication Association (Interspeech 2013), Aug. 25-29, 2013, Lyon, France.

13. Korbinian Riedhammer, Van Hai Do, and Jim Hieronymus, “A Study on LVCSR and Keyword Search for Tagalog,” in Proc. of the Annual Conference of the International Speech Communication Association (Interspeech 2013), Aug. 25-29, 2013, Lyon, France.

14. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “Context Dependant Phone Mapping for Cross-Lingual Acoustic Modeling,” in Proc. of the International Symposium on Chinese Spoken Language Processing (ISCSLP-2012), Dec. 5-8, 2012, Hong Kong, China.

15. Van Hai Do, Xiong Xiao, Eng Siong Chng, and Haizhou Li, “A Phone Mapping Technique for Acoustic Modeling of Under-resourced Languages,” in Proc. of the International Conference on Asian Language Processing (IALP-2012), Nov. 13-15, 2012, Hanoi, Vietnam.

16. Van Hai Do, Xiong Xiao, and Eng Siong Chng, “Comparison and Combination of Multilayer Perceptrons and Deep Belief Networks in Hybrid Automatic Speech Recognition Systems,” in Proc. of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011 (APSIPA ASC 2011), Oct. 18-21, 2011, Xi'an, China.

17. Van Hai Do, Xiong Xiao, Ville Hautamaki, and Eng Siong Chng, “Speech Attribute Recognition using Context-Dependent Modeling,” in Proc. of Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2011 (APSIPA ASC 2011), Oct. 18-21, 2011, Xi'an, China.

Research projects

 

Number of master's students supervised