智能影音信號處理實驗室


智能影音信號處理實驗室創立於2002年8月,現位於格致大樓四樓409C室,佔地約十二坪大小。 指導老師為胡懷祖教授,胡老師於1985年畢業於國立成功大學電機系,1990、1993年獲美國佛羅里達大學電機工程碩士及博士學位,學畢歸國後於本校電子工程學系任教迄今。

本實驗室成立之目的旨在發展信號處理相關技術及其應用,一般來說,常見的信號種類不外乎影像及語音, 信號經過取樣與數位化之後便可經由電腦科技的強大運算能力進行資料分析與處理。不論是那種信號的研究,大抵上都可細分為信號的表示、變換、運算、 分析、特徵擷取、雜訊抑制、信號強化、編碼、辨識等不同應用,一旦選定研究課題,便可以數位信號處理的學理為基礎,佐以系統化探討技術改良與效能提昇的可行性。 在過去幾年,由於電腦運算能力大幅增長、經由網路連結與蒐集所得的資料迅速累積,從而加快人工智慧、機器學習與深度學習之類科技的進展,其應用範疇亦益趨多元。 為促進相關技術之整合與實用,本實驗室亦已導入之「智能信號處理」之研發工作,逐步將人工智慧演算法理論應用數位訊號處理與分析,期許相關學理探討與技術應用更能相得益彰。

Find Out More

胡懷祖


 

職稱: 教授

研究專長: 訊號(語音、音訊、影像)處理

電子郵件: hthu@niu.edu.tw

聯絡電話: (03)9357400 ext.7343

 

 

研究計劃


結合生成對抗網絡與基於 FFT 架構之雙模高酬載盲音訊浮水印技術來實現「以聲傳畫」(國家科學及技術委員會)

2023-08-01~2024-07-31

一種用於RGB色層之彩色影像浮水印的補償措施以抵擋JPEG壓縮造成之損害(科技部)

2022-08-01~2023-10-31

結合基於FFT架構的感知向量範數調變與自動邊解碼器的降噪技術以達成強韌之盲音訊浮水印(科技部)

2021-08-01~2022-10-31

於輪廓波與離散小波轉波共構之複合場域遂行高效能之彩色影像盲浮水印(科技部)

2020-08-01~2021-07-31

作為版權保護、內容驗證與自我復原之用的雙重語音盲浮水印(科技部)

2019-08-01~2020-07-31

結合提升式小波轉換之低頻帶係數序列重組與音框同步技術以達成抵抗時間伸縮修改之音訊盲浮水印技術(科技部)

2018-08-01~2019-07-31

植基於同步資料封包與可適性均值調變之語音盲浮水印技術(科技部)

2017-08-01~2018-07-31

術(科技部)

2016-08-01~2017-07-31

以「激源-濾波」複合嵌入策略達成高效能之語音盲浮水印(科技部)

2015-08-01~2016-07-31

以可變酬載之全方位架構實現盲音訊浮水印(科技部)

2014-08-01~2015-07-31

以定態小波與諧波重疊之技術從語音信號中測定聲門關閉瞬間(行政院國家科學委員會)

2013-08-01~2014-07-31

具對抗時間縮放攻擊之離散小波域自調適音訊浮水印技術(行政院國家科學委員會)

2012-08-01~2013-10-31

依循心理聲響特性之自調性複域音訊浮水印技術(行政院國家科學委員會)

2011-08-01~2012-10-31

一種兼具強韌及高容量數位音訊浮水印之多場域架構(行政院國家科學委員會 )

2009-08-01~2010-10-31

提升產業技術及人才培育研究計畫-窄頻語音頻寬擴展技術之研發及其晶片設計(國科會 格瑪數位股份有限公司)

2007-05-01~2008-04-30

植基於HNM之變聲技術及其於字轉音之應用(行政院國家科學委員會)

2007-08-01~2009-10-31

期刊論文 (Journal Papers)


Ready to start your next project with us? Send us a messages and we will get back to you as soon as possible!

  1. Hwai-Tsu Hu, Tung-Tsun Lee, (2022) "Robust complementary dual image watermarking in subbands derived from the Laplacian pyramid, discrete wavelet transform, and directional filter bank, Circuits, Systems, and Signal Processing, Volume(41), p4090~4116.
  2. Hwai-Tsu Hu, , (2022) "Blind image watermarking via psychovisual-based relative modulation in DCT domain with performance optimized by GWO, Multimedia Tools & Applications, Volume(81), p21675~21675.
  3. Hwai-Tsu Hu, , (2022) "All-round improvement in DCT-based blind image watermarking with visual enhancement via denoising autoencoder, Computers & Electrical Engineering, Volume(100), p107845~0.
  4. Ling-Yuan Hsu, Ling-Yuan Hsu, and Hsien-Hsin Chou, (2022) "A high-capacity QRD-based blind color image watermarking algorithm incorporated with AI technologies, Expert Systems with Applications, Volume(199), p117134~0.
  5. Hwai-Tsu Hu, Ling-Yuan Hsu, (2022) "Blind color image watermarking incorporating a residual network for watermark denoising and super-resolution reconstruction, Soft Computing, Volume(27), p917~934.
  6. Hwai-Tsu Hu, Ling-Yuang Hsu and Shyi-Tsong Wu, (2022) "Blind Watermarking for Hiding Color Images in Color Images with Super-Resolution Enhancement, Sensors, Volume(23), p370~0.
  7. Hwai-Tsu Hu, Hsien-Hsin Chou, Tung-Tsun Lee, (2021) "Robust blind speech watermarking via FFT-Based perceptual vector norm modulation with frame self-synchronization, IEEE Access, Volume(vol. 9), p9916~9925.
  8. Ling-Yuan Hsu, Hwai-Tsu Hu, (2021) "QDCT-based blind color image watermarking with aid of GWO and DnCNN for performance improvement, IEEE Access, Volume(vol. 9), p155138~155152.
  9. Hwai-Tsu Hu, Ying-Hsiang Lu, (2020) "Frame-synchronous Blind Audio Watermarking for Tamper Proofing and Self-Recovery, Advances in Technology Innovation, Volume(vol. 5, no. 1), p18~32.
  10. Hwai-Tsu Hu, Ling-Yuan Hsu and Hsien-Hsin Chou, (2020) "An improved SVD-based blind color image watermarking algorithm with mixed modulation incorporated, Information Science, Volume(vol. 519), p161~182.
  11. Ling-Yuan Hsu, Hwai-Tsu Hu, (2020) "Blind watermarking for color images using EMMQ based on QDFT, Blind watermarking for color images using EMMQ based on QDFT, Volume(vol. 149), p113225~0.
  12. Hwai-Tsu Hu, Tung-Tsun Lee, (2019) "Hybrid Blind Audio Watermarking for Proprietary Protection, Tamper Proofing, and Self-Recovery, IEEE Access, Volume(vol. 7), p107438~107452.
  13. Ling-Yuan Hsu, , (2019) "A Reinforced Blind Color Image Watermarking Scheme Based on Schur Decomposition, IEEE Access, Volume(vol. 7), p107438~107452.
  14. Hwai-Tsu Hu, Tung-Tsun Lee, (2019) "High-performance self-synchronous blind audio watermarking in a unified FFT framework, IEEE Access, Volume(vol. 7), p19063~19076.
  15. Hwai-Tsu Hu, , (2019) "Frame-Synchronized Blind Speech Watermarking via Improved Adaptive Mean Modulation and Perceptual-based Additive Modulation in DWT domain, Digital Signal Processing, Volume(vol. 87), p75~85.
  16. Hwai-Tsu Hu, Jieh-Ren Chang and Shiow-Jyu Lin, (2018) "Synchronous blind audio watermarking via shape configuration of sorted LWT coefficient magnitudes, Signal Processing, Volume(vol. 147), p190~202.
  17. Hwai-Tsu Hu, Jieh-Ren Chang, (2018) "Dual image watermarking by exploiting the properties of selected DCT coefficients with JND modeling, Multimedia Tools Applications, Volume(vol. 77, no.20), p26965~26990.
  18. Hwai-Tsu Hu, , (2017) "Improving DWT-DCT-Based Blind Audio Watermarking Using Perceptually Energy-Compensated QIM, Journal of Computers, Volume(vol. 28, no. 4), p63~73.
  19. Hwai-Tsu Hu, , (2017) "Efficient and robust frame-synchronized blind audio watermarking by featuring multilevel DWT and DCT, Cluster Computing, Volume(vol. 20, no.1 ), p805~816.
  20. Hwai-Tsu Hu, , (2017) "Effective Blind Speech Watermarking via Adaptive Mean Modulation and Package Synchronization in DWT domain, EURASIP Journal on Audio, Speech, and Music Processing, Volume(2017:10), p1~10.
  21. Hwai-Tsu Hu, , (2017) "Supplementary schemes to enhance the performance of DWT-RDM-based blind audio watermarking, Circuits Syst. Signal Process, Volume(vol. 36, no. 5), p1890~1911.
  22. Hwai-Tsu Hu, , (2017) "Incorporating Spectral Shaping Filtering into DWT-Based Vector Modulation to Improve Blind Audio Watermarking, Wireless Personal Communication, Volume(vol. 94, no. 2), p221~240.
  23. Hwai-Tsu Hu, , (2017) "Collective blind image watermarking in DWT-DCT domain with adaptive embedding strength governed by quality metrics, Multimedia Tools & Applications, Volume(vol. 76, no. 5), p6575~6594.
  24. Ling-Yuan Hsu, Ling-Yuan Hsu, (2017) "Robust blind image watermarking using crisscross inter-block prediction in the DCT domain, Journal of Visual Communication and Image Representation, Volume(vol. 46), p33~47.
  25. Jieh-Ren Chang, Jieh-Ren Chang, You-Shyang Chen, Hong-Wun Lin and Hwai-Tsu Hu, (2017) "An advanced computing in fuzzy rule-based preprocessing design of image filters’ system for removing impulse noises, The Journal of Supercomputing, Volume(vol. 73, no. 7), p3212~3228.
  26. Hwai-Tsu Hu, Jieh-Ren Chang and Ling-Yuan Hsu, (2016) "Windowed and distortion-compensated vector modulation for blind audio watermarking in DWT domain, Multimedia Tools & Applications, Volume(Vol. 76, Issue 24), p26723~26743.
  27. Hwai-Tsu Hu, Jieh-Ren Chang and Ling-Yuan Hsu, (2016) "Robust blind image watermarking by modulating the mean of partly sign-altered DCT coefficients guided by human visual perception, AEU - International Journal of Electronics and Communications, Volume(Vol. 70, No. 10), p1374~1381.
  28. Hwai-Tsu Hu , Ling-Yuan Hsu, (2016) "A mixed modulation scheme for blind image watermarking, AEU - International Journal of Electronics and Communications, Volume(Vol. 70, No. 2), p172~178.
  29. Hwai-Tsu Hu, Ling-Yuan Hsu, (2016) "A DWT-Based Rational Dither Modulation Scheme for Effective Blind Audio Watermarking, Circuits, Systems, and Signal Processing, Volume(Vol. 35, No. 2), p553~572.
  30. Hwai-Tsu Hu, Ling-Yuan Hsu, (2015) "Robust glottal closure instant detection by jointly exploiting stationary wavelet transform and harmonic superposition, International Journal of Speech Technology, Volume(Vol. 18, No. 4), p685~695.
  31. Ling-Yuan Hsu, , (2015) "Blind image watermarking via exploitation of inter-block prediction and visibility threshold in DCT domain, Journal of Visual Communication and Image Representation, Volume(Vol. 32), p130~143.
  32. Hsien-Hsin Chou, Ling-Yuan Hsu and Hwai-Tsu Hu, (2015) "Multi-level adaptive switching filters for highly corrupted images, Journal of Visual Communication and Image Representation, Volume(Vol. 30), p226~235.
  33. Hwai-Tsu, Ling-Yuan Hsu, (2015) "Exploring DWT-SVD-DCT feature parameters for robust multiple watermarking against JPEG and JPEG2000 compression, Computers & Electrical Engineering, Volume(Vol. 41), p52~63.
  34. 21. Hwai-Tsu Hu, Ling-Yuan Hsu, (2015) "Robust, transparent and high-capacity audio watermarking in DCT domain, Signal Processing, Volume(Vol. 109), p226~235.
  35. Hwai-Tsu Hu, Hsien-Hsin Chou and Ling-Yuan Hsu, (2014) "Perceptual-based DWPT-DCT framework for selective blind audio watermarking, Signal Processing, Volume(Vol. 105), p316~327.
  36. Hwai-Tsu Hu, Hsien-Hsin Chou and Ling-Yuan Hsu, (2014) "The Use of Highpass Filtered Time-Spread Echo for Pitch Scaling Detection, IEICE Trans. Fundamentals, Volume(Vol. E97–A, No. 7), p1623~1626.
  37. Ching-Hsuan Ku, Ching-Hsuan Ku, Hwai-Tsu Hu* and Ling-Yuan Hsu, (2014) "An image watermarking technique developed on the DWT-SVD-DCT domain, International Journal of Advanced Information Technology, Volume(Vol. 8, No. 1), p86~91.
  38. Hwai-Tsu Hu, Ling-Yuan Hsu and Hsien-Hsin Chou, (2014) "Variable-dimensional vector modulation for perceptual-based DWT blind audio watermarking with adjustable payload capacity, Digital Signal Processing, Volume(Vol. 31), p115~123.
  39. Hwai-Tsu Hu, Hsien-Hsin Chou, Chu Yu and Ling-Yuan Hsu, (2014) "Incorporation of perceptually adaptive QIM with singular value decomposition for blind audio watermarking, EURASIP Journal on Advances in Signal Processing, Volume(Vol. 2014, No. 12), p1~12.
  40. Hsien-Hsin Chou, Ling-Yuan Hsu, (2013) "Turbulent-PSO Based Fuzzy Image Filter With No-Reference Measures for High-Density Impulse Noise, IEEE Trans. Systems, Man, and Cybernetics—Part B: Cybernetics, Volume(Vol. 43, No. 1), p296~307.
  41. Hwai-Tsu Hu, Chu Yu, (2012) "A Perceptually Adaptive QIM Scheme for Efficient Watermark Synchronization, IEICE Trans. Inf. and Syst., Volume(Vol. E95- D, No. 12), p3097~3100.
  42. Hwai-Tsu Hu, Wei-Hsi Chen, (2012) "A dual cepstrum-based watermarking scheme with self-synchronization, Signal Processing, Volume(Vol. 92), p1109~1116.
  43. Hwai-Tsu Hu, Chu Yu, (2012) "A HMM-WDLT framework for HNM-based voice conversion with parametric adjustment in formant bandwidth, duration and excitation, International Journal of Speech Technology, Volume(Vol. 15, No. 2), p215~225.
  44. H. T. Hu, Yu Chu, (2010) "Narrowband-to-wideband expansion of telephony speech using piecewise deviation linear transformation, International Journal of Electrical Engineering, Volume(Vol. 17, No. 1), p7~17.
  45. 胡懷祖, Chu Yu, (2009) "Combining HMM and Weighted Deviation Linear Transformation for Highband Speech Parameter Estimation, IEICE Transactions on Information and Systems , Volume(第E92-D卷7期), p~.
  46. 胡懷祖, Hsin-Min Wang, (2009) "Integrating coding techniques into LP-based Mandarin text-to-speech synthesis(2007年的文章), International Journal of Speech Technology, Volume(第10卷), p~.
  47. 胡懷祖, , (2007) "Robust pitch estimation based on a modified comb filtering approach, Electron. Lett., Volume(第43卷25期), p~.
  48. 胡懷祖, Yu, C., (2007) "Adaptive noise spectral estimation for spectral subtraction speech enhancement, IET Signal Processing, Volume(第1卷3期), p~.
  49. Hwai-Tsu Hu, Chen, Y. N., (2007) "Structural design for a 1.6 Kbps GELP speech coder, Bulletin of the college of Engineer, National I-Lan University, Volume(Vol. 2), p69~86.
  50. 胡懷祖, Yu, C, (2006) "Combination of switched predictive network and multi-stage VQ to efficiently encode LSF parameters, 國立宜蘭大學工程學報, Volume(第2卷), p~.
  51. 胡懷祖, 許俊達、游竹, (2003) "Determination of glottal closure instants by harmonic superposition, Signal Processing, Volume(第83卷), p~.
  52. 胡懷祖, , (2002) "An Efficient Algorithm for Hardware Implementation of Low-Bit-Rate Speech Coding, 宜蘭技術學報電機資訊專輯, Volume(第9卷), p~.
  53. 胡懷祖, 郭芳璋、王信仁, (2002) "Supplementary schemes to spectral subtraction for speech enhancement, Speech Communication, Volume(第36卷), p~.
  54. 胡懷祖, Chu Yu, (2002) "Design and implementation of a 1.4kbps glottal excitation linear prediction (GELP) vocoder, 宜蘭技術學報, Volume(第9卷), p~.
  55. Hwai-Tsu Hu, , (2002) "An efficient algorithm for hardware Implementation of low-bit-rate speech coding, Journal of I-Lan Institute of Technology, 9, Special issue for electrical engineering and computer science, Volume(), p39~49.
  56. 胡懷祖, 郭芳璋、Hsin-Jen Wang, (2000) "A glottal-excited linear prediction (GELP) Model for low-bit-rate speech coding, Speech Communication, Volume(第24卷2期), p~.
  57. 胡懷祖, 郭芳璋、Hsin-Jen Wang, (2000) "A Pseudo Glottal Excitation Model for the Linear Prediction Vocoder with Speech Signals Coded at 1.6 Kbps, Institute of Electronics, Information and Communication Engineers Trans. Information. & Systems, Volume(第83卷8期), p~.
  58. 胡懷祖, , (1999) "A filtering method for extracting glottal closure instants from linear prediction residual of speech signal, 宜蘭技術學報, Volume(第3卷), p~.
  59. 胡懷祖, 吳錫聰, (1999) "A glottal-excited linear prediction (GELP) Model for low-bit-rate speech coding, Proceeding National Science Council ROC(A), Volume(第24卷2期), p~.
  60. 胡懷祖, 周賢興, (1998) "將時間延遲類神經網路與動態規劃結合以用於國語數字之辨認, 技術學刊, Volume(第13卷1期), p~.
  61. 胡懷祖, , (1998) "Linear prediction analysis of speech signals in the presence of white Gaussian noise with unknown variance, Institute of Electrical Engineers Proceeding, Vision. Image and Signal Processing, Volume(第145卷4期), p~.
  62. 胡懷祖, , (1998) "Robust linear prediction of speech signals based on orthogonal framework, Electronics Letter, Volume(第34卷14期), p~.
  63. 胡懷祖, , (1998) "Orthogonal framework for linear prediction analysis, 宜蘭技術學報, Volume(第16卷), p~.
  64. 胡懷祖, , (1998) "A real-time implementation of constrained estimate-maximize algorithm for single-microphone speech enhancement, Institute of Electrical and Electronics Engineers Trans on Consumer Electronics, Volume(第44卷2期), p~.
  65. 胡懷祖, , (1998) "Comb filtering of noisy speech using overlap-and-add approach, Electronics Letter, Volume(第34卷1期), p~.
  66. 胡懷祖, , (1998) "Spectral compensation for linear prediction of speech signals in coloured noise, Electronics Letter, Volume(第34卷11期), p~.
  67. 胡懷祖, Chou, H. H., (1997) "Determining the retardation of glottal return phase using LP residue, 宜蘭農工學報, Volume(第14卷), p~.
  68. 胡懷祖, Chou, H. H., (1997) "Spectral compensation for AR process degraded by white noise with unknown variance, 宜蘭農工學報, Volume(第15卷), p~.
  69. 胡懷祖, Chou, H. H., (1996) "A scaling approach to studying voice conversion between different genders, Computer Processing of Oriental Language, Volume(第10卷1期), p~.
  70. 胡懷祖, , (1996) "Noise compensation for linear prediction via orthogonal transformation, Electronics Letter, Volume(第32卷16期), p~.
  71. 胡懷祖, , (1995) "Method for extracting epochal information from noisy LP residual, Electronics Letter, Volume(第31卷25期), p~.
  72. 胡懷祖, , (1995) "Linear prediction using norm in orthogonal vector space, Electronics Letter, Volume(第31卷6期), p~.
  73. Childers, D. G., , (1994) "Speech synthesis by glottal excited linear prediction, J. Acoust. Soc. Am., Volume(Vol. 96, No. 4), p2026~2036.

會議論文 (Conference Papers)


Ready to start your next project with us? Send us a messages and we will get back to you as soon as possible!

  1. Hwai-Tsu Hu, Ying-Hsiang Lu, "Self-synchronous Blind Audio Watermarking for Tamper Proofing and Self-Recovery", International Conference on Advanced Technology Innovation 2019 (ICATI2019), 2019.
  2. Hwai-Tsu Hu, Ling-Yuan Hsu, and Yun-Hsiang Chang, "An Effective Correlation Formula for Enhancing the Detectability of Spread Spectrum-based Watermarking", 41st International Conference on Telecommunications and Signal Processing (TSP 2018), 2018.
  3. Ling-Yuan Hsu, Hwai-Tsu Hu and Yun-Hsiang Chang, "An improvement of embedding color watermarks in color images based on Schur decomposition", 41st International Conference on Telecommunications and Signal Processing (TSP 2018), 2018.
  4. 陳俊錡, 陳俊錡、胡懷祖, "基於DWT與DCT轉換結合自適性嵌入強度調整之QR code數位浮水印技術", TANET2017–臺灣網際網路研討會, 2017.
  5. Hwai-Tsu Hu, Yun-Hsiang Chang, "Blind Audio Watermarking by Configuring the Shape of Sorted LWT Coefficient Magnitudes in Synchronous Frames", 7th International Conference on Information Communication and Management (ICICM 2017), 2017.
  6. 賴聖瑜, 賴聖瑜、胡懷祖、陳俊錡, "結合時間擴展回聲與有理抖動調變以強化音訊盲浮水印技術", WCE2016 民生電子研討會, 2016.
  7. Hwai-Tsu Hu, Jieh-Ren Chang, Chun-Chi Chen and Sheng-Yu Lai, "Synchronized blind audio watermarking via multilevel DWT and windowed vector modulation ", International Conference on IT Convergence and Security, 2016.
  8. Ling-Yuan Hsu, Hwai-Tsu Hu and Hsien-Hsin Chou, "An effective blind image watermarking based on inter-blocks estimation and quantization index modulation", 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD 2016), 2016.
  9. 張育榮, 張育榮、胡懷祖, "結合類神經網路之跨區塊預測能力以及人眼視覺特性改進影像盲浮水印效能", WCE2015 民生電子研討會, 2015.
  10. Hwai-Tsu Hu, Ling-Yuan Hsu, Sheng-Yu Lai and Yu-Jung Chang, "The use of spectral shaping to extend the capacity for DWT-based blind audio watermarking", 5th Int. Conf. on IT Convergence and Security (ICITCS), 2015.
  11. 徐鈴淵, 胡懷祖、張育榮, "利用類神經網路之跨區塊預測能力改進影像盲浮水印效能", WCE2014 民生電子研討會, 2014.
  12. Hwai-Tsu Hu, Szu-Hong Chen and Ling-Yuan Hsu, "Incorporation of Perceptually Energy-Compensated QIM into DWT-DCT Based Blind Audio Watermarking", 10th Int. Conf. on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP), 2014.
  13. Hwai-Tsu Hu, Yu-Jung Chang, Szu-Hong Chen, "A Progressive QIM to Cope with SVD-based Blind Image Watermarking in DWT Domain", 2nd IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP), 2014.
  14. 胡懷祖, , "Usefulness of the Comb Filtering Output for Voiced/Unvoiced Classification and Pitch Detection", International Conference on Signal Processing Systems, 2009.
  15. 胡懷祖, , "Usefulness of the Comb Filtering Output for Voiced/Unvoiced Classification and Pitch Detection", International Conference on Signal Processing Systems, 2009.
  16. 胡懷祖, , "MELP語音編解碼器之錯誤遮隱技術", 追求卓越研討會, 2002.