
Doctoral dissertation of computer science: Audio source separation exploiting NMF-based generic source spectral model

Pages: 129      File type: pdf      File size: 1.84 MB

Document information:

Aiming to tackle real-world recordings with the challenging settings mentioned earlier, we have proposed novel separation algorithms for both single-channel and multi-channel cases. The achieved results have been described in seven publications. The results of our algorithms were also submitted to the international source separation campaign SiSEC 2016 [81] and obtained the best performance in terms of energy-based criteria.
Content extracted from the document:
MINISTRY OF EDUCATION AND TRAINING
HANOI UNIVERSITY OF SCIENCE AND TECHNOLOGY

DUONG THI HIEN THANH

AUDIO SOURCE SEPARATION EXPLOITING NMF-BASED GENERIC SOURCE SPECTRAL MODEL

Major: Computer Science
Code: 9480101

DOCTORAL DISSERTATION OF COMPUTER SCIENCE

SUPERVISORS:
1. ASSOC. PROF. DR. NGUYEN QUOC CUONG
2. DR. NGUYEN CONG PHUONG

Hanoi - 2019

DECLARATION OF AUTHORSHIP

I, Duong Thi Hien Thanh, hereby declare that this thesis is my original work and it has been written by me in its entirety. I confirm that:
• This work was done wholly during candidature for a Ph.D. research degree at Hanoi University of Science and Technology.
• Where any part of this thesis has previously been submitted for a degree or any other qualification at Hanoi University of Science and Technology or any other institution, this has been clearly stated.
• Where I have consulted the published work of others, this is always clearly attributed.
• Where I have quoted from the work of others, the source is always given. With the exception of such quotations, this thesis is entirely my own work.
• I have acknowledged all main sources of help.
• Where the thesis is based on work done by myself jointly with others, I have made clear exactly what was done by others and what I have contributed myself.

Hanoi, February 2019
Ph.D. Student
Duong Thi Hien Thanh

SUPERVISORS
Assoc. Prof. Dr. Nguyen Quoc Cuong
Dr. Nguyen Cong Phuong

ACKNOWLEDGEMENT

This thesis has been written during my doctoral study at the International Research Institute Multimedia, Information, Communication, and Applications (MICA), Hanoi University of Science and Technology (HUST).
It is my great pleasure to thank the numerous people who have contributed towards shaping this thesis.

First and foremost, I would like to express my most sincere gratitude to my supervisors, Assoc. Prof. Nguyen Quoc Cuong and Dr. Nguyen Cong Phuong, for their great guidance and support throughout my Ph.D. study. I am grateful to them for devoting their precious time to discussing research ideas, proofreading, and explaining how to write good research papers. I would like to thank them for encouraging my research and empowering me to grow as a research scientist. I could not have imagined having better advisors and mentors for my Ph.D. study.

I would like to express my appreciation to my supervisor in the Master's course, Prof. Nguyen Thanh Thuy of the School of Information and Communication Technology, HUST, and to Dr. Nguyen Vu Quoc Hung, my supervisor in the Bachelor's course at Hanoi National University of Education. They shaped my knowledge and helped me excel in my studies.

In the process of carrying out and completing my research, I have received much support from the board of MICA directors and my colleagues at the Speech Communication department. In particular, I am very thankful to Prof. Pham Thi Ngoc Yen, Prof. Eric Castelli, Dr. Nguyen Viet Son and Dr. Dao Trung Kien, who provided me with an opportunity to join the research work at the MICA institute and to have access to the laboratory and research facilities. Without their precious support, it would have been impossible to conduct this research. My warm thanks go to my colleagues at the Speech Communication department of the MICA institute for their useful comments on my study and their unconditional support over four years, both at work and outside of work.

I am very grateful to my internship supervisor, Prof. Nobutaka Ono, and the members of Ono's Lab at the National Institute of Informatics, Japan, for warmly welcoming me into their lab and for the helpful research collaboration they offered.
I greatly appreciate his help in funding my conference trip and introducing me to the signal processing research communities. I would also like to thank Dr. Toshiya Ohshima, MSc. Yasutaka Nakajima, MSc. Chiho Haruta and other researchers at Rion Co., Ltd., Japan, for welcoming me to their company and providing me with data for experiments.

I would also like to sincerely thank Dr. Nguyen Quang Khanh, dean of the Information Technology Faculty, and Assoc. Prof. Le Thanh Hue, dean of the Economic Informatics Department, at Hanoi University of Mining and Geology (HUMG), where I am working. I have received financial support and time from my institution and its leaders to complete my doctoral thesis. Grateful thanks also go to my wonderful colleagues and friends Nguyen Thu Hang, Pham Thi Nguyet, Vu Thi Kim Lien, Vo Thi Thu Trang, Pham Quang Hien, Nguyen The Binh, Nguyen Thuy Duong, Nong Thi Oanh and Nguyen Thi Hai Yen, who have given me unconditional support and help over a long time. Special thanks go to Dr. Le Hong Anh for his encouragement and precious advice.

Last but not least, I would like to express my deepest gratitude to my family. I am very grateful to my mother-in-law and father-in-law for their support in times of need and for always allowing me to focus on my work. I dedicate this thesis to my mother and father with special love, they have been bein ...