Session Program Day 2
Asia Pacific Signal and Information Processing Association Annual Summit and Conference 2022
Session | Room | Chair | |
WedAM1-1 (SS11: Transfer Learning for Real World) | Chiang Mai 1 | Xiaoxu Li / Dome Potikanond | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | Semantics-Guided Knowledge Integration for Domain Adaptation Few-Shot Relation Extraction | Zeyuan Wang; Yifan Du; Guangwei Zhang; Ruifan Li; Yongping Xiong; Chuang Zhang |
9.20-9.40 | PVGCRA: Prediction Variance Guided Cross Region Domain Adaptation | Ran Xu; Yixiang Huang; Chuang Zhang | |
9.40-10.00 | Multi-Branch Network for Few-Shot Learning | Kai Ren; Zijie Guo; Zhimin Zhang; Rui Zhu; Xiaoxu Li | |
10.00-10.20 | Few-Shot Classification With Feature Reconstruction Bias | Zhen Li; Lang Wang; Shuo Ding; Xiaochen Yang; Xiaoxu Li | |
10.20-10.40 | Dual Prototypical Network for Robust Few-Shot Image Classification | Qi Song; Zebin Peng; Luchen Ji; Xiaochen Yang; Xiaoxu Li | |
10.40-11.00 | Graph Evolving and Embedding in Transformer | Jen-Tzung Chien; Chia-Wei Tsao | |
Session | Room | Chair | |
WedAM1-2 (Speech, Language, and Audio 1) | Chiang Mai 2 | Xiaofen Xing | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin | Abhinav Rao; Ho Thi-Nga; Chng Eng Siong |
9.20-9.40 | C-CycleTransGAN: A Non-Parallel Controllable Cross-Gender Voice Conversion Model With CycleGAN and Transformer | Changzeng Fu; Chaoran Liu; Carlos Toshinori Ishi; Hiroshi Ishiguro | |
9.40-10.00 | The Realization and Perception of Narrow Focus in English Sentences by Cantonese EFL Learners | Chong Cao; Aijun Li | |
10.00-10.20 | Cross-Lingual Dysarthria Severity Classification for English, Korean, and Tamil | Eun Jung Yeo; Kwanghee Choi; Sunhee Kim; Minhwa Chung | |
10.20-10.40 | 3M: An Effective Multi-View, Multi-Granularity, and Multi-Aspect Modeling Approach to English Pronunciation Assessment | Fu-An Chao; Tien-Hong Lo; Tzu-I Wu; Yao-Ting Sung; Berlin Chen | |
10.40-11.00 | I Feel Stressed Out: A Mandarin Speech Stress Dataset With New Paradigm | Shuaiqi Chen; Xiaofen Xing; Guodong Liang; Xiangmin Xu | |
Session | Room | Chair | |
WedAM1-3 (Deep Learning: Algorithm, Implementations, and Applications) | Chiang Mai 3 | Hiroyoshi Ito | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | End-To-End Reinforcement Learning of Robotic Manipulation With Robust Keypoints Representation | Tianying Wang; En Yen Puang; Marcus Lee; Wei Jing; Yan Wu |
9.20-9.40 | BEAM - an Algorithm for Detecting Phishing Link | Sea Ran Cleon Liew; Ngai Fong Law | |
9.40-10.00 | I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization | Dianwen Ng; Jia Qi Yip; Tanmay Surana; Zhao Yang; Chong Zhang; Yukun Ma; Chongjia Ni; Eng Siong Chng; Bin Ma | |
10.00-10.20 | Human-In-The-Loop Chord Progression Generator With Generative Adversarial Network | Yoshiteru Matsumoto; Hiroyoshi Ito; Hiroko Terasawa; Yuya Yamamoto; Yuzuru Hiraga; Masaki Matsubara | |
10.20-10.40 | A Resource-Limited FPGA-Based MobileNetV3 Accelerator | Yutana Jewajinda; Thanapol Thongkum | |
10.40-11.00 | CG-Net: A Compound Gaussian Prior Based Unrolled Imaging Network | Carter A Lyons; Raghu G. Raj; Margaret Cheney | |
Session | Room | Chair | |
WedAM1-4 (Signal Image and Information Processing Theory and Methods) | Board Room 2 | Mingyi He | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | A Policy-Based Approach to the SpecAugment Method for Low Resource E2E ASR | Rui Li; Guodong Ma; Dexin Zhao; Ranran Zeng; Xiaoyu Li; Hao Huang |
9.20-9.40 | Manifold Rewiring for Unlabeled Imaging | Valentin Debarnot; Vinith Kishore; Cheng Shi; Ivan Dokmanic | |
9.40-10.00 | CRDet: An Object-Context-Aware Detection Network for Oriented Object in Aerial Images | Lele Liang; Linghan Li; Qi Liu; Yuchao Dai; Mingyi He | |
10.00-10.20 | Effects of Incorporating a Deep-Unfolding Framework Into a Deep Neural Network: Implications for Image Restoration | Tatsuki Itasaka; Masahiro Okuda | |
10.20-10.40 | Cross-Modal Knowledge Distillation With Dropout-Based Confidence | Won Ik Cho; Jeunghun Kim; Nam Soo Kim | |
10.40-11.00 | A Multi-Objective Perceptual Aware Loss Function for End-To-End Target Speaker Separation | Zhan Jin; Bang Zeng; Fan Zhang | |
Session | Room | Chair | |
WedAM1-5 (Research Review) | Board Room 3 | Ying-Hui Lai | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | EEG-Based Anomaly Detection Model by One-Class Support Vector Machine for Dream Enactment Behavior in REM Sleep Behavior Disorder | Shumpei Date; Koichi Fujiwara; Yukiyoshi Sumi; Hiroshi Kadotani; Makoto Imai; Keiko Ogawa |
9.20-9.40 | Development of Heat Stroke Detection Model Based on Heart Rate Variability Using LSTM-AutoEncoder | Shota Saeda; Koshi Ota; Koichi Fujiwara; Takatomi Kubo; Toshitaka Yamakawa; Aozora Yamamoto; Yuki Maruno; Manabu Kano | |
9.40-10.00 | Driving Fitness Evaluation Model for Patients With Schizophrenia Based on Driving Data of Healthy Participants and Random Forest | Shuji Tsunoda; Koichi Fujiwara; Seiko Miyata; Akiko Yamaguchi; Shogo Kitagawa; Yuki Konishi; Reiji Yoshimura; Isao Taguchi; Yutaka Sawa; Kunihiro Iwamoto; Norio Ozaki | |
10.00-10.20 | Method for Estimating Test Contrast Peak Time in Computed Tomography Angiography | Toshihide Otsuki; Kazuto Sakamoto; Homare Saisho; Hiroyoshi Yokoi; Toshitaka Yamakawa | |
10.20-10.40 | Development of an Epileptic Seizure Prediction Algorithm Based on R-R Intervals With Temporal Convolutional Networks | Rikumo Ode; Koichi Fujiwara; Miho Miyajima; Toshitaka Yamakawa; Manabu Kano; Taketoshi Maehara | |
Session | Room | Chair | |
WedAM1-6 (SS17: Emerging Diseases and Smart Image Processing) | Chiang Mai 4 | Krisana Chinnasarn | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | Pre-Processing SARS-CoV-2 Sequence Data for Application of Machine Learning Techniques for Visualization and Clustering of Virus Characteristics | Juhyeon Kim; Insung Ahn |
9.20-9.40 | Educational Multi-Purpose Kit for Coding and Robotic Design | Atikhun Thongpool; Daranee Hormdee; Raksit Chutipakdeevong; Wasan Tansakul | |
9.40-10.00 | Forecasting Dengue Fever in France and Thailand Using XGBoost | Thanin Methiyothin; Insung Ahn | |
10.00-10.20 | Fine-Tuning BERT for Question and Answering Using PubMed Abstract Dataset | Saeyeon Cheon; Insung Ahn | |
10.20-10.40 | Coarse X-Ray Lumbar Vertebrae Pose Localization Using Triangulation Correspondence | Watcharaphong Yookwan; Jiranun Sangrueng; Krisana Chinnasarn | |
10.40-11.00 | 4G Signal RSSI Recommendation System for ISP Quality of Service Improvement | Tanatpon Duangta; Watcharaphong Yookwan; Krisana Chinnasarn; Anuparp Boonsongsrikul | |
Session | Room | Chair | |
WedAM1-7 (Speech, Language, and Audio 2) | Chiang Mai 5 | Wei-Ping Zhu | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | SE-DPTUNet: Dual-Path Transformer Based U-Net for Speech Enhancement | Bengbeng He; Kai Wang; Wei-Ping Zhu |
9.20-9.40 | Encoder Re-Training With Mixture Signals on FastMVAE Method | Shuhei Yamaji; Taishi Nakashima; Nobutaka Ono; Li Li; Hirokazu Kameoka | |
9.40-10.00 | Unsupervised Disentanglement of Timbral, Pitch, and Variation Features From Musical Instrument Sounds With Random Perturbation | Keitaro Tanaka; Yoshiaki Bando; Kazuyoshi Yoshii; Shigeo Morishima | |
10.00-10.20 | Estimation of Transfer Coefficients and Signals of Sound-To-Light Conversion Device Blinky Under Saturation | Kosuke Nishida; Natsuki Ueno; Yuma Kinoshita; Nobutaka Ono | |
10.20-10.40 | Design and Evaluation of Instrument Sound Identification Difficulty for the Deaf and Hard-of-Hearing | Shiho Akaki; Rumi Hiraga; Keiichi Yasu; Keiji Tabuchi; Hiroko Terasawa | |
10.40-11.00 | Correcting, Rescoring and Matching: An N-Best List Selection Framework for Speech Recognition | Chin-Hung Kuo; Kuan-Yu Chen | |
Session | Room | Chair | |
WedAM1-8 (SS04: Advanced Signal Processing and Machine Learning for Audio and Speech Applications) | Board Room 4 | Shoji Makino | |
Date | Time | Title | Authors |
9 November 2022 | 9.00-9.20 | Hyperbolic Timbre Embedding for Musical Instrument Sound Synthesis Based on Variational Autoencoders | Futa Nakashima; Tomohiko Nakamura; Norihiro Takamune; Satoru Fukayama; Hiroshi Saruwatari |
9.20-9.40 | Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-To-Speech | Yusuke Nakai; Yuki Saito; Kenta Udagawa; Hiroshi Saruwatari | |
9.40-10.00 | Inverse-Free Online Independent Vector Analysis With Flexible Iterative Source Steering | Taishi Nakashima; Nobutaka Ono | |
10.00-10.20 | Accelerating Online Algorithm Using Geometrically Constrained Independent Vector Analysis With Iterative Source Steering | Kana Goto; Tetsuya Ueda; Li Li; Takeshi Yamada; Shoji Makino | |
10.20-10.40 | A Dilated Inception Convolutional Neural Network for Gridless DOA Estimation Under Low SNR Scenarios | Zhi-Wei Tan; Yuan Liu; Andy W. H. Khong | |
10.40-11.00 | Efficient Low-Latency Convolution With Uniform Filter Partition and Its Evaluation on Real-Time Blind Source Separation | Yui Kuriki; Taishi Nakashima; Kouei Yamaoka; Natsuki Ueno; Yukoh Wakabayashi; Nobutaka Ono; Ryo Sato | |
Session | Room | Chair | |
WedPM1-1 (SS05: Advanced Image and Video Processing using Deep Learning) | Chiang Mai 1 | Chul Lee | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | Object Segmentation Using Parametric Representation | Hochang Rhee; Hyung Il Koo; Nam Ik Cho |
14.20-14.40 | Deep Color Constancy Using Multi-Band NIR | Jeong-Won Ha; Dong-keun Han; Min-Je Park; Jong-Ok Kim | |
14.40-15.00 | Smooth Panoramic Walkthrough for Adjacent Panoramic Viewpoints With Dense Spherical Matching Points | Kyungjune Lee; Mingyu Jang; Sanghoon Lee; Kim Taewan | |
15.00-15.20 | Region Adaptive Self-Attention for an Accurate Facial Emotion Recognition | Seongmin Lee; Jeonghaeng Lee; Minsik Kim; Sanghoon Lee | |
15.20-15.40 | Quality Enhancement of Screen Content Video Using Dual-Input CNN | Ziyin Huang; Yue Cao; Sik-Ho Tsang; Yui-Lam Chan; Kin-Man Lam | |
15.40-16.00 | Underwater Image Enhancement Using Realistic Dataset With Turbidity and Color Distortion | Eunpil Park; Eunsung Jo; Jae-Young Sim | |
Session | Room | Chair | |
WedPM1-2 (Speech, Language, and Audio 1) | Chiang Mai 2 | Ashish Panda | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | Neural Vocoder Feature Estimation for Dry Singing Voice Separation | Jaekwon Im; Soonbeom Choi; Sangeon Yong; Juhan Nam |
14.20-14.40 | Adapting GCC-PHAT to Co-Prime Circular Microphone Arrays for Speech Direction of Arrival Estimation Using Neural Networks | Jiahong Zhao; Christian Ritz | |
14.40-15.00 | A Novel Approach to Structured Pruning of Neural Network for Designing Compact Audio-Visual Wake Word Spotting System | Haotian Wang; Jun Du; Hengshun Zhou; Heng Lu; Yuhang Cao | |
15.00-15.20 | Hierarchic Temporal Convolutional Network With Attention Fusion for Target Speaker Extraction | Zihao Chen; Wenbo Qiu; Haitao Xu; Ying Hu | |
15.20-15.40 | Acoustic Model Adaption Using x-Vectors for Improved Automatic Speech Recognition | Meet Soni; Aditya Raikar; Ashish Panda; Sunil Kumar Kopparapu | |
15.40-16.00 | Acoustic Pornography Recognition Using Convolutional Neural Networks and Bag of Refinements | Lifeng Zhou; Kaifeng Wei; Yuke Li; Yiya Hao; Weiqiang Yang; Haoqi Zhu | |
Session | Room | Chair | |
WedPM1-3 (Deep Learning: Algorithm, Implementations, and Applications) | Chiang Mai 3 | Jen-Tzung Chien | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | An Optimal Vehicle Counting Framework for Non-Canonical CCTV Placements | Ng Chin Hooi; Edwin Tan Chee Pin; Chiew Yeong Shiong; Lim Mei Kuan |
14.20-14.40 | Response Sentence Modification Using a Sentence Vector for a Flexible Response Generation of Retrieval-Based Dialogue Systems | Ryota Yahagi; Akinori Ito; Takashi Nose; Yuya Chiba | |
14.40-15.00 | End-To-End Stereo Audio Coding Using Deep Neural Networks | Wootaek Lim; Inseon Jang; Seungkwon Beack; Jongmo Sung; Taejin Lee | |
15.00-15.20 | Neural Beamformer With Automatic Detection of Notable Sounds for Acoustic Scene Classification | Sota Ichikawa; Takeshi Yamada; Shoji Makino | |
15.20-15.40 | DNN-Based Frequency-Domain Permutation Solver for Multichannel Audio Source Separation | Fumiya Hasuike; Daichi Kitamura; Rui Watanabe | |
15.40-16.00 | Detection Method From 4K Images Using SSD300 Without Retraining | Kei Irie; Kiyoshi Nishikawa | |
Session | Room | Chair | |
WedPM1-4 (Signal Image and Information Processing Theory and Methods) | Board Room 2 | Zhang Ke | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | PAformer: Visually Indistinguishable Bolt Defect Recognition Based on Bolt Position and Attributes | Wenshuo Lou; Ke Zhang; Yangjie Xiao; Xiwang Guo; Jiacun Wang |
14.20-14.40 | Adapted Spectrogram Transformer for Unsupervised Cross-Domain Acoustic Anomaly Detection | Gilles Van De Vyver; Zhaoyi Liu; Koustabh Dolui; Danny Hughes; Sam Michiels | |
14.40-15.00 | A Two-Stage Cascading Method Based on Finetuning in Semi-Supervised Domain Adaptation Semantic Segmentation | Huiying Chang; Kaixin Chen; Ming Wu | |
15.00-15.20 | Landmark Management in the Application of Radar SLAM | Shuai Sun; Beth Jelfs; Kamran Ghorbani; Glenn I. Matthews; Chris Gilliam | |
15.20-15.40 | Parameterization of Dominant Spectral Peak Trajectory for Whisper Speech Recognition | Chang Feng; Xiaolong Wu; Mingxing Xu; Thomas Fang Zheng | |
15.40-16.00 | Specific Emitter Identification at Different Time Based on Multi-Domain Migration | Jiaxu Liu; Jianqing Li; Jiao Wang; Hao Huang | |
Session | Room | Chair | |
WedPM1-5 (Research Review) | Board Room 3 | Koichi Fujiwara | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | Long-Term Prognostic Prediction of West Syndrome Based on Scalp EEG Using Convolution Neural Network Autoencoder | Tatsuki Saito; Koichi Fujiwara; Jun Natsume; Ryosuke Suzui |
14.20-14.40 | Modification of RRI Data by NBEATS Model | Hongtao Chen; Koichi Fujiwara; Manabu Kano | |
14.40-15.00 | Transformer With Noise Divider | Mun-Hyung Lee; Seon-Woo Lee; Jung-Mu Choi; Jang-Woo Kwon | |
15.00-15.20 | Schizophrenia Classification Based on the Natural Language Processing Technology - A Pilot Study | Ying Hsuan Chen; Pei-Yun Lin; Tsung-Tse Ho; Yuh-Jer Chang; Ying-Hui Lai | |
15.20-15.40 | Signed Graph Balancing Based on Spectral Clustering | Haruki Yokota; Junya Hara; Yuichi Tanaka | |
15.40-16.00 | Graph Signal Sampling for Multiple Generator Functions | Junya Hara; Yuichi Tanaka | |
Session | Room | Chair | |
WedPM1-6 (Signal Processing for Audio and Speech Applications) | Chiang Mai 4 | Tomoyosi Akiba | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | Semi-Supervised ASR Based on Iterative Joint Training With Discrete Speech Synthesis | Keiya Takagi; Tomoyosi Akiba; Hajime Tsukada |
14.20-14.40 | Analysis of Amplitude and Frequency Perturbation in the Voice for Fake Audio Detection | Kai Li; Yao Wang; Minh Le Nguyen; Masato Akagi; Masashi Unoki | |
14.40-15.00 | Deep Hashing for Speaker Identification and Retrieval Based on Auditory Sparse Representation | Dung Kim Tran; Masato Akagi; Masashi Unoki | |
15.00-15.20 | Divide and Conquer: A Low-Complexity Neural Network for Monophonic Speech Enhancement | Bingxiao Fang; Liang Liu | |
15.20-15.40 | Domain Adaptation and Language Conditioning to Improve Phonetic Posteriorgram Based Cross-Lingual Voice Conversion | Pin-Chieh Hsu; Nobuaki Minematsu; Daisuke Saito | |
15.40-16.00 | Von Mises Mixture Model-Based DNN for Sign Indetermination Problem in Phase Reconstruction | Nguyen Binh Thien; Yukoh Wakabayashi; Geng Yuting; Kenta Iwai; Takanobu Nishiura | |
Session | Room | Chair | |
WedPM1-7 (Speech, Language, and Audio 2) | Chiang Mai 5 | Daranee Hormdee | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | Speaker Representation Learning via Contrastive Loss With Maximal Speaker Separability | Zhe Li; Man Wai Mak |
14.20-14.40 | Design of Discriminators in GAN-Based Unsupervised Learning of Neural Post-Processors for Suppressing Localized Spectral Distortion | Riku Ogino; Kohei Saijo; Tetsuji Ogawa | |
14.40-15.00 | Simultaneous Frequency Estimation for Three or More Sinusoids Based on Sinusoidal Constraint Differential Equation | Kenta Yamada; Yoshiki Masuyama; Yukoh Wakabayashi; Nobutaka Ono | |
15.00-15.20 | Do You Know How Humans Sound? Exploring a Qualification Test Design for Crowdsourced Evaluation of Voice Synthesis Quality | Moe Yaegashi; Susumu Saito; Teppei Nakano; Tetsuji Ogawa | |
15.20-15.40 | Exploring the Gender Difference on Mandarin Tone Realization in Lombard Speech | Weizhong Zhang; Jian Gong; Kai Sheng; Yuhong Sun; William Bellamy; Xiaoli Ji | |
Session | Room | Chair | |
WedPM1-8 (Data Analytics and Machine Learning) | Board Room 4 | Chern Hong Lim | |
Date | Time | Title | Authors |
9 November 2022 | 14.00-14.20 | Improving Co-SVD for Cold-Start Recommendations Using Sparsity Reduction | Low Jia Ming; Chern Hong Lim; Ian K. T. Tan |
14.20-14.40 | Epoch-Wise Double Descent Triggered by Learning a Single Sample | Aoshi Kawaguchi; Hiroshi Kera; Toshihiko Yamasaki | |
14.40-15.00 | Current Source Localization Using Deep Prior With Depth Weighting | Hajime Yano; Rio Yamana; Ryoichi Takashima; Tetsuya Takiguchi; Seiji Nakagawa | |
15.00-15.20 | A Proposal for Emotion-Expressive Editor: EmoEditor by Font Changing | Yuki Shimamura; Michiharu Niimi | |
15.20-15.40 | Traceback Memory Reduction for Three-Sequence Alignment Algorithm With Affine Gap Models | Rui-Ting Chien; Mao-Jan Lin; Yang-Ming Yeh; Yi-Chang Lu | |
15.40-16.00 | Acceleration of Subspace Learning Machine via Particle Swarm Optimization and Parallel Processing | Hongyu Fu; Yijing Yang; Yuhuai Liu; Joseph Lin; Ethan Harrison; Vinod K. Mishra; C.-C. Jay Kuo | |
Session | Room | Chair | |
WedPM2-1 (SS05: Advanced Image and Video Processing using Deep Learning) | Chiang Mai 1 | Chul Lee | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Enhanced Bidirectional Motion Estimation Using Feature Refinement for HDR Imaging | An Gia Vien; Truong Thanh Nhat Mai; Seonghyun Park; Gahyeon Kim; Chul Lee |
16.40-17.00 | Fast Asymmetric Bilateral Motion Estimation for Video Frame Interpolation | Jintae Kim; Junheum Park; Chang-Su Kim | |
17.00-17.20 | Future Object Localization in Autonomous Driving Using Ego-Centric Images and Motions | Seoyoung Jo; Jung-Kyung Lee; Je-won Kang | |
17.20-17.40 | Restoration of High-Frequency Components in Under Display Camera Images | Youngjin Oh; Gu Yong Park; Nam Ik Cho | |
17.40-18.00 | Non-Intrusive Speech Intelligibility Estimation Using Deep Learning With Speech Enhancement and Convolutional Layers | Kazushi Nakazawa; Kazuhiro Kondo | |
18.00-18.20 | Unified Angle Adjustment Network for Image Composition Enhancement | Jinwon Ko; Nyeong-Ho Shin; Seonho Lee; Chang-Su Kim | |
Session | Room | Chair | |
WedPM2-2 (Speech, Language, and Audio 1) | Chiang Mai 2 | Kasemsit Teeyapan | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Automated Audio Captioning With Epochal Difficult Captions for Curriculum Learning | Andrew Koh; Soham Tiwari; Chng Eng Siong |
16.40-17.00 | Application of Deep Learning-Based Single-Channel Speech Enhancement for Frequency-Modulation Transmitted Speech | Ying Ma; Xueliang Zhang | |
17.00-17.20 | An Empirical Study of Training Mixture Generation Strategies on Speech Separation: Dynamic Mixing and Augmentation | Shukjae Choi; Younglo Lee; Jihwan Park; Hyung Yong Kim; Byeong-Yeol Kim; Zhong-Qiu Wang; Shinji Watanabe | |
17.20-17.40 | Speech Intelligibility Prediction for Hearing Aids Using an Auditory Model and Acoustic Parameters | Benita Angela Titalim; Candy Olivia Mawalim; Shogo Okada; Masashi Unoki | |
17.40-18.00 | Predicting Speech Fluency in Children Using Automatic Acoustic Features | Lionel Fontan; Shinyoung Kim; Verdiana De Fino; Sylvain Detey | |
18.00-18.20 | TC-SKNet With GridMask for Low-Complexity Classification of Acoustic Scene | Luyuan Xie; Yan Zhong; Lin Yang; Zhaoyu Yan; Zhonghai Wu; Junjie Wang | |
Session | Room | Chair | |
WedPM2-3 (Deep Learning: Algorithm, Implementations, and Applications) | Chiang Mai 3 | Masaomi Kimura | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Design and Control of a Muscle-Skeleton Robot Elbow Based on Reinforcement Learning | Jianyin Fan; Haoran Xu; Yuwei Du; Jing Jin; Qiang Wang |
16.40-17.00 | Non-Autoregressive Speech Recognition With Error Correction Module | Yukun Qian; Xuyi Zhuang; Zehua Zhang; Lianyu Zhou; Xu Lin; Mingjiang Wan | |
17.00-17.20 | A Method for Adversarial Example Generation by Perturbing Selected Pixels | Tomoki Kamegawa; Masaomi Kimura | |
17.20-17.40 | A Title Generation Method With Transformer for Journal Articles | Riku Matsumoto; Masaomi Kimura | |
17.40-18.00 | Catastrophic Forgetting Avoidance Method for a Classification Model by Model Synthesis and Introduction of Background Data | Akari Hirayama; Masaomi Kimura | |
18.00-18.20 | Consistency Regularization for GAN-Based Neural Vocoders | Kotaro Onishi; Toru Nakashika | |
18.20-18.40 | Parallel Training of TN and ITN Models Through CycleGAN for Improved Sequence to Sequence Learning Performance | Md. Mizanur Rahaman Nayan; Mohammad Ariful Haque | |
Session | Room | Chair | |
WedPM2-4 (SS14: Emerging Signal Processing Technology for Medical Applications / Biomedical Signal Processing and Systems) | Board Room 2 | Yuttapong Jiraraksopakun | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Laparoscope Manipulating Robot (LMR) Navigation Using Deep Learning-Based Surgical Instruments Detection | Nyi Nyi Myo; Apiwat Boonkong; Daranee Hormdee; Suphachoke Sonsilphong; Amornthep Sonsilphong; Kovit Khampitak |
16.40-17.00 | Human-Machine Interface Device Using Piezoelectric Sensors Based on Facial Muscle Movements for Wheelchair Control | Charoenporn Bouyam; Theerat Saichoo; Nannaphat Siribunyaphat; Yunyong Punsawad | |
17.00-17.20 | Obstructive Sleep Apnea Classification Using Snore Sounds Based on Deep Learning | Apichada Sillaparaya; Apichai Bhatranand; Chudanat Sudthongkhong; Kosin Chamnongthai; Yuttapong Jiraraksopakun | |
17.20-17.40 | Heart Rate Estimation of Car Driver Using Radar Sensors and Blind Source Separation | Keito Murata; Daichi Kitamura; Ryo Saito; Daichi Ueki | |
17.40-18.00 | Total Variation Algorithms for PAT Image Reconstruction | Mary Anjaley Josy John; Imad Barhumi | |
18.00-18.20 | Visual Function and Emotional Regulation in Achromatic Color and Chromatic Color Using Low Resolution Brain Electromagnetic Tomography Analysis (LORETA) | Watchara Sroykham; Yodchanan Wongsawat | |
18.20-18.40 | Effect of Electrooculography on Electroencephalography Classifying Accuracy in Deep Learning and Reducing Number of Channels in Motor-Imagery Brain-Computer Interface | Musashi Ino; Yoshihiro Kono; Nobuaki Kobayashi | |
Session | Room | Chair | |
WedPM2-5 (SS16: Emerging Techniques in Multimedia Data Analytics and Codings) | Board Room 3 | Patiwet Wuttisarnwattana / Kampol Woradit | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Optimal Deep Multi-Route Self-Attention for Single Image Super-Resolution | Nisawan Ngambenjavichaikul; Sovann Chen; Supavadee Aramvith |
16.40-17.00 | Object Detection in Aerial Images With Attention-Based Regression Loss | Chandler Timm C. Doloriel; Rhandley D. Cajote | |
17.00-17.20 | Performance Analysis of JPEG XR With Deep Learning-Based Image Super-Resolution | Taingliv Min; Supavadee Aramvith | |
17.20-17.40 | MCSNet: Multi-Channel Sharing Network for Single Image Super-Resolution | Wazir Muhammad; Supavadee Aramvith; Watchara Ruangsang | |
17.40-18.00 | DCAN: Deep Consecutive Attention Network for Video Super Resolution | Talha Saleem; Sovann Chen; Supavadee Aramvith | |
18.00-18.20 | Wiener Filter-Based Color Attribute Quality Enhancement for Geometry-Based Point Cloud Compression | Jinrui Xing; Hui Yuan; Chen Chen; Wei Gao | |
18.20-18.40 | Mixed Context Techniques in the Adaptive Arithmetic Coding Process for DC Term and Lossless Image Encoding | Evan Shih; Jian-Jiun Ding | |
Session | Room | Chair | |
WedPM2-6 (Signal Processing for Audio and Speech Applications) | Chiang Mai 4 | Sunao Hara / Sutasinee Thovuttikul | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Prediction Method of Soundscape Impressions Using Environmental Sounds and Aerial Photographs | Yusuke Ono; Sunao Hara; Masanobu Abe |
16.40-17.00 | Robust Speech Dereverberation Based on Adaptive Weighted Prediction Error Algorithm With Eigenvector Extraction | Yitong Chen; Wen Zhang | |
17.00-17.20 | Multi-Task Learning for Speech Emotion and Emotion Intensity Recognition | Pengcheng Yue; Leyuan Qu; Shukai Zheng; Taihao Li | |
17.20-17.40 | Karaoke Generation From Songs: Recent Trends and Opportunities | Preet Patel; Ansh Ray; Khushboo Thakkar; Kahan Sheth; Sapan H Mankad | |
17.40-18.00 | Multi-Branch Learning for Noisy and Reverberant Monaural Speech Separation | Chao Ma; Dongmei Li | |
18.00-18.20 | Significance of Quadrature and In-Phase Components for Synthetic Spoofed Speech Detection | Priyanka Gupta; Piyushkumar K. Chodingala; Hemant A. Patil | |
Session | Room | Chair | |
WedPM2-7 (SS20: High Performance Intelligent Technologies for Image and Video Applications) | Chiang Mai 5 | Jing-Ming Guo | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Mammography Quality Evaluation and Model Interpretation Based on CNN-Based Inframammary Fold Classification | Yi-Chong Zeng; Yu-Cheng Wu; Chen-Yen Yeh; Shu-Chi Li; Tzu-Han Chou; Yi-Wen Huang; Giu-Cheng Hsu; Hsian-He Hsu |
16.40-17.00 | Hybrid Image Compression Framework Based on Single Image Training | Tien-Ying Kuo; Yu-Jen Wei; Kuan-Yu Su | |
17.00-17.20 | Highly Robust Action Retrieval Using View-Invariant Pose Feature and Simple Yet Effective Query Expansion Method | Noboru Yoshida; Jianquan Liu | |
17.20-17.40 | A Unified Compression and Watermarking Scheme for MT-BTC Images | Jing-Ming Guo; Sankarasrinivasan Seshathiri | |
17.40-18.00 | Fusion With Hierarchical Graphs for Multimodal Emotion Recognition | Shuyun Tang; Zhaojie Luo; Guoshun Nan; Jun Baba; Yuichiro Yoshikawa; Hiroshi Ishiguro | |
18.00-18.20 | Multi-Stage Superpixel-Based Segmentation Algorithm Using Fully Convolutional Networks and Discriminative Features | Pei-Chi Huang; Jian-Jiun Ding | |
18.20-18.40 | Deep Learning Acceleration Design Based on Low-Rank Approximation | Yi-Hsiang Chang; Gwo Giun (Chris) Lee; Shiu-Yu Chen | |
Session | Room | Chair | |
WedPM2-8 (Data Analytics and Machine Learning) | Board Room 4 | Wanus Srimaharaj | |
Date | Time | Title | Authors |
9 November 2022 | 16.20-16.40 | Internet of Behavior and Brain Response Identification for Cognitive Performance Analysis | Wanus Srimaharaj; Roungsan Chaisricharoen |
16.40-17.00 | Refinement of Utterance Fluency Feature Extraction and Automated Scoring of L2 Oral Fluency With Dialogic Features | Ryuki Matsuura; Shungo Suzuki; Mao Saeki; Tetsuji Ogawa; Yoichi Matsuyama | |
17.00-17.20 | A Vision Transformer-Based Approach to Bearing Fault Classification via Vibration Signals | Abid Hasan Zim; Aeyan Ashraf; Aquib Iqbal; Asad Malik; Minoru Kuribayashi | |
17.20-17.40 | Analysis Method for Motion Factors Related to Joint Contact Forces at the Knee During Walking Using Grad-CAM | Satoshi Suwa; Koh Inoue; Ryo Matsuoka | |
17.40-18.00 | A Dataset and a Lightweight Object Detection Network for Thermal Image-Based Home Surveillance | Zhengqiang Shao; Longbin Yan; Jie Chen; Jingdong Chen | |
18.00-18.20 | SCQ: Self-Supervised Cross-Modal Quantization for Unsupervised Large-Scale Retrieval | Fuga Nakamura; Ryosuke Harakawa; Masahiro Iwahashi | |
CONFERENCE FORMAT
The conference is planned as an in-person event. However, authors who face travel restrictions at the time of the conference will be allowed to upload a video of their oral presentation. In that case, the presenter must still attend the session online for Q&A. Note that, unlike in a hybrid conference, the presentations will not be live-streamed. For more information, please contact: apsipa2022@gmail.com