APSIPA 2021

Session Index


Wednesday, December 15, 11:20 - 13:20
Wednesday, December 15, 14:40 - 16:40
Thursday, December 16, 09:00 - 11:00
Thursday, December 16, 14:00 - 16:00
Thursday, December 16, 16:20 - 18:20
Friday, December 17, 10:20 - 12:20
Friday, December 17, 13:00 - 15:00
Friday, December 17, 15:20 - 17:00

Wednesday, December 15, 11:20 - 13:20 ∘ Live Session A (Hall)
LS-A-WE1 - Advanced Topics in Low Precision Image Processing

LS-A-WE1.1: AN EFFICIENT IMAGE PROCESSING AND MACHINE LEARNING BASED TECHNIQUE FOR SKIN LESION SEGMENTATION AND CLASSIFICATION

Imtiaz, Izbaila, Institute of Management Sciences, Pakistan Ahmed, Imran, Institute of Management Sciences, Pakistan Jeon, Gwanggil, Incheon National University, Korea, Republic of Muramatsu, Shogo, Niigata University, Japan , , ,

LS-A-WE1.2: DISTRIBUTED ARITHMETIC CODING FOR SOURCES WITH HIDDEN MARKOV CORRELATION

Zhang, Yan, Chang'an University, China Yang, Nan, Chang'an University, China Fang, Yong, Chang'an University, China

LS-A-WE1.3: MULTI-RESIDUAL FEATURE FUSION NETWORK FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION

Qin, Jiayi, Sichuan University, China He, Zheng, Sichuan University, China Yan, Binyu, Sichuan University, China Jeon, Gwanggil, Incheon National University, Korea, Republic of Yang, Xiaomin, Sichuan University, China

LS-A-WE1.4: AUTOMOTIVE ENGINE CYLINDER HEAD CRACK DETECTION: CANNY EDGE DETECTION WITH MORPHOLOGICAL DILATION

Berwo, Michael Abebe, Chang`an University, China Fang, Yong, Chang`an University, China Mahmood, Jabar, Chang`an University, China Retta, Ephrem Afele, Northwest University, China , , ,

LS-A-WE1.5: ACCELERATION OF PDS–BASED HIGH–DIMENSIONAL SIGNAL RESTORATION

YAMAMOTO, Gai, Niigata University, Japan KODAMA, Yuya, Niigata University, Japan MURAMATSU, Shogo, Niigata University, Japan CHOI, Samuel, Niigata University, Japan JEON, Gwanggil, Incheon National University, Japan

LS-A-WE1.6: PRODUCT QUANTIZATION TO REDUCE ENTROPY OF LABELS FOR FAST AND ACCURATE IMAGE RETRIEVAL

Nakamura, Fuga, Nagaoka University of Technology, Japan Harakawa, Ryosuke, Nagaoka University of Technology, Japan Iwahashi, Masahiro, Nagaoka University of Technology, Japan

Wednesday, December 15, 11:20 - 13:20 ∘ Live Session B (Annex)
LS-B-WE1 - Digital Convergence of 5G/B5G, AIoT and Security (1)

LS-B-WE1.1: DEEP REINFORCEMENT LEARNING FOR NPDCCH PERIOD ADJUSTMENT IN NB-IOT NETWORKS

Yu, Ya-Ju, National University of Kaohsiung, Taiwan Chuang, Ching-Chih, National Pingtung University, Taiwan Cheng, Yu-Wei, National University of Kaohsiung, Taiwan

LS-B-WE1.2: A THRESHOLD-BASED SCHEDULING AND POWER CONTROL DESIGN ON IMT-2020 EVALUATION

Yeh, Ting-Yu, Industrial Technology Research Institute, Taiwan Pao, Wei-Chen, Industrial Technology Research Institute, Taiwan Chou, Wei-Hung, Industrial Technology Research Institute, Taiwan Tsai, Chun-Chia, Industrial Technology Research Institute, Taiwan Pan, Jen-Yi, National Chung Cheng University, Taiwan

LS-B-WE1.3: IMPLEMENTATION OF A FAST FAILURE RECOVERY METHOD CONSIDERING LOAD DISTRIBUTION FOR NETWORK SLICING

Misugi, Takeru, Kansai University, Japan Hirata, Kouji, Kansai University, Japan Tachibana, Takuji, University of Fukui, Japan

LS-B-WE1.4: MULTI-ARMED BANDIT-BASED ROUTING METHOD FOR IN-NETWORK CACHING

Tabei, Gen, Kansai University, Japan Ito, Yusuke, Tokyo University of Science, Japan Kimura, Tomotaka, Doshisha University, Japan Hirata, Kouji, Kansai University, Japan

LS-B-WE1.5: GENERALIZED CLASSIFICATION OF DNS OVER HTTPS TRAFFIC WITH DEEP LEARNING

Casanova, Lionel F. Gonzalez, Yuan Ze University, Taiwan Lin, Po-Chiang, Yuan Ze University, Taiwan

LS-B-WE1.6: INHIBITION MODELING OF FUTURE MALWARE DIFFUSION WITH AN EVOLUTIONARY GAME THEORY

Miura, Hideyoshi, Kansai University, Japan Kimura, Tomotaka, Doshisha University, Japan Hirata, Kouji, Kansai University, Japan

Wednesday, December 15, 11:20 - 13:20 ∘ Live Session C (10A)
LS-C-WE1 - Deep Learning for Biomedical Signal Processing and Systems

LS-C-WE1.1: MICROPHONE ARRAY SPEECH SEPARATION ALGORITHM BASED ON DNN

Wu, Chaoyan, Southeast University, China Zhou, Lin, Southeast University, China Chen, Xijin, Southeast University, China Chen, Liyuan, Southeast University, China

LS-C-WE1.2: EXPLORING ARTIFACT REJECTION FOR HIGH-PULSE RATE ELECTRICALLY EVOKED AUDITORY STEADY STATE RESPONSES IN COCHLEAR IMPLANTS USERS

Hu, Hongmei, Universitat Oldenburg, Germany Ewert, Stephan, Universitat Oldenburg, Germany

LS-C-WE1.3: DEPRESSION SEVERITY LEVEL CLASSIFICATION USING MULTITASK LEARNING OF GENDER RECOGNITION

Liu, Yang, Northwest Normal University, China Lu, Xiaoyong, Northwest Normal University, China Shi, Daimin, Northwest Normal University, China Yuan, Jingyi, Northwest Normal University, China

LS-C-WE1.4: MULTI-FEATURE FUSION FOR EPILEPTIC FOCUS LOCALIZATION BASED ON TENSOR REPRESENTATION

Zhao, Xuyang, Tokyo University of Agriculture and Technology, Japan Sole-Casals, Jordi, University of Vic–Central University of Catalonia, Spain Zhao, Qibin, RIKEN Center for Advanced Intelligence Project, Japan Cao, Jianting, Saitama Institute of Technology, Japan Tanaka, Toshihisa, Tokyo University of Agriculture and Technology, Japan

LS-C-WE1.5: ADHD CLASSIFICATION VIA AUTO-ENCODING NETWORK WITH NON-IMAGING DATA FUSION

Tang, Yibin, Hohai University, China Jiang, Junping, Hohai University, China Li, Min, Hohai University, China Chen, Ying, Changzhou University, China Meng, Xiaojin Meng, Xuzhou Medical University, China

LS-C-WE1.6: ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON SPARSE AUTOENCODER

Liang, Mengnan, Hohai University, China Jiang, Aimin, Hohai University, China Liu, Xiaofeng, Hohai University, China Kwan, Hon Keung, University of Windsor, Canada Zhu, Yanping, Changzhou University, China

Wednesday, December 15, 11:20 - 13:20 ∘ Live Session D (Virtual)
LS-D-WE1 - Robust Speaker Recognition with Microphone Arrays

LS-D-WE1.1: ATTENTION-BASED MULTI-CHANNEL SPEAKER VERIFICATION WITH AD-HOC MICROPHONE ARRAYS

Liang, Chengdong, Northwestern Polytechnical University, China Chen, Junqi, Northwestern Polytechnical University, China Guan, Shanzheng, Northwestern Polytechnical University, China Zhang, Xiao-Lei, Northwestern Polytechnical University, China

LS-D-WE1.2: LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS

Guan, Shanzheng, Northwestern Polytechnical University, China Liu, Shupei, Northwestern Polytechnical University, China Chen, Junqi, Northwestern Polytechnical University, China Zhu, Wenbo, Northwestern Polytechnical University, China Li, Shengqiang, Northwestern Polytechnical University, China Tan, Xu, Northwestern Polytechnical University, China Yang, Ziye, Northwestern Polytechnical University, China Xu, Menglong, Northwestern Polytechnical University, China Chen, Yijiang, Northwestern Polytechnical University, China Liang, Chengdong, Northwestern Polytechnical University, China Wang, Jianyu, Northwestern Polytechnical University, China Zhang, Xiao-Lei, Northwestern Polytechnical University, China

LS-D-WE1.3: AN MAP ESTIMATION FOR BETWEEN-CLASS VARIANCE

Han, Jiao, Northwest Minzu University, China Cai, Yunqi, Tsinghua University, China Li, Lantian, Tsinghua University, China Li, Guanyu, Northwest Minzu University, China Wang, Dong, Tsinghua University, China

LS-D-WE1.4: MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION

Zhang, Yuxin, Communication University of China, China Xiao, Yatong, Tsinghua University, China Zhang, Wei-Qiang, Tsinghua University, China Tan, Xu, Microsoft Research Asia, China Lei, Ling, Communication University of China, China Wang, Shengjin, Tsinghua University, China

LS-D-WE1.5: A UNIFIED DEEP SPEAKER EMBEDDING FRAMEWORK FOR MIXED-BANDWIDTH SPEECH DATA

Cai, Weicheng, Duke Kunshan University, China Li, Ming, Duke Kunshan University, China

Wednesday, December 15, 11:20 - 13:20 ∘ On-Demand A
OD-A-WE1 - Speech Recognition

OD-A-WE1.1: ON THE USE OF SPEAKER INFORMATION FOR AUTOMATIC SPEECH RECOGNITION IN SPEAKER-IMBALANCED CORPORA

Soky, Kak, Kyoto University, Japan Li, Sheng, National Institute of Information and Communications Technology, Japan Mimura, Masato, Kyoto University, Japan Chu, Chenhui, Kyoto University, Japan Kawahara, Tatsuya, Kyoto University, Japan

OD-A-WE1.2: SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION

Shi, Hao, Graduate School of Informatics, Kyoto University, Japan Wang, Longbiao, Tianjin University, China Li, Sheng, National Institute of Information and Communications Technology (NICT), Japan Fan, Cunhang, Anhui Province Key Laboratory of Multimodal Cognitive Computation, School of Computer Science and Technology, Anhui University, China Dang, Jianwu, Japan Advanced Institute of Science and Technology, Ishikawa, Japan Kawahara, Tatsuya, Graduate School of Informatics, Kyoto University, Japan

OD-A-WE1.3: CONFORMER-BASED END-TO-END SPEECH RECOGNITION WITH ROTARY POSITION EMBEDDING

Li, Shengqiang, Northwestern Polytechnical University, China Xu, Menglong, Northwestern Polytechnical University, China Zhang, Xiao-Lei, Northwestern Polytechnical University, China

OD-A-WE1.4: EFFICIENT CONFORMER-BASED SPEECH RECOGNITION WITH LINEAR ATTENTION

Li, Shengqiang, Northwestern Polytechnical University, China Xu, Menglong, Northwestern Polytechnical University, China Zhang, Xiao-Lei, Northwestern Polytechnical University, China

OD-A-WE1.5: ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION

Tian, Zhengkun, Institute of Automation, Chinese Academy of Sciences, China Yi, Jiangyan, 1,2, China Bai, Ye, 1,2, China Tao, Jianhua, Institute of Automation, Chinese Academy of Sciences, China Zhang, Shuai, Institute of Automation, Chinese Academy of Sciences, China Wen, Zhengqi, Institute of Automation, Chinese Academy of Sciences, China

OD-A-WE1.6: LARGE-CONTEXT AUTOMATIC SPEECH RECOGNITION BASED ON RNN TRANSDUCER

Kojima, Atsushi, Advanced Media, Inc., Japan

OD-A-WE1.7: AN END-TO-END MODEL FROM SPEECH TO CLEAN TRANSCRIPT FOR PARLIAMENTARY MEETINGS

Mimura, Masato, Kyoto University, Japan Sakai, Shinsuke, Kyoto University, Japan Kawahara, Tatsuya, Kyoto University, Japan

OD-A-WE1.8: DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH

Fujiwara, Kento, Graduate School of System Informatics, Kobe University, Japan Takashima, Ryoichi, Graduate School of System Informatics, Kobe University, Japan Sugiyama, Chihiro, Graduate School of Dentistry, Osaka University, Japan Tanaka, Nobukazu, Graduate School of Dentistry, Osaka University, Japan Nohara, Kanji, Graduate School of Dentistry, Osaka University, Japan Nozaki, Kazunori, Graduate School of Dentistry, Osaka University, Japan Takiguchi, Tetsuya, Graduate School of System Informatics, Kobe University, Japan

OD-A-WE1.9: AN INVESTIGATION OF ENHANCING CTC MODEL FOR TRIGGERED ATTENTION-BASED STREAMING ASR

Zhao, Huaibo, Waseda University, Japan Higuchi, Yosuke, Waseda University, Japan Ogawa, Tetsuji, Waseda University, Japan Kobayashi, Tetsunori, Waseda University, Japan

OD-A-WE1.10: SIGNIFICANCE OF DATA AUGMENTATION FOR IMPROVING CLEFT LIP AND PALATE SPEECH RECOGNITION

Nomo Sudro, Protima, Indian Institute of Technology Guwahati, India Das, Rohan Kumar, Fortemedia Singapore, Singapore Sinha, Rohit, Indian Institute of Technology Guwahati, India Prasanna, S R Mahadeva, Indian Institute of Technology Dharwad, India

OD-A-WE1.11: TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION

Kamble, Madhu, EURECOM, France, France Nayak, Shekhar, SRI-B, India Shaik, M. Ali Basha, SRI-B, India Rath, Shakti P., SRI-B, India Vij, Vikram, SRI-B, India Patil, Hemant, DA-IICT, Gandhinagar, Gujarat, India

OD-A-WE1.12: MULTITASK-BASED JOINT LEARNING APPROACH TO ROBUST ASR FOR RADIO COMMUNICATION SPEECH

Ma, Duo, National University of Singapore, Singapore Hou, Nana, Nanyang Technological University, Singapore Pham, Van Tung, Nanyang Technological University, Singapore Xu, Haihua, Nanyang Technological University, Singapore Chng, Eng Siong, Nanyang Technological University, Singapore

OD-A-WE1.13: ADVANCED LANGUAGE MODEL FUSION METHOD FOR ENCODER-DECODER MODEL IN JAPANESE SPEECH RECOGNITION

Mori, Daiki, Toyohashi University of Technology, Japan Ohta, Kengo, Anan National College of Technology, Japan Nishimura, Ryota, Tokushima University, Japan Ogawa, Atsunori, Nippon Telegraph and Telephone Corporation, Japan Kitaoka, Norihide, Toyohashi University of Technology, Japan

OD-A-WE1.14: CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION

Mirishkar, Ganesh S, IIIT Hyderabad, India V, Vishnu Vidyadhara Raju, IIIT Hyderabad, India Naroju, Meher Dinesh, Pacteraedge, India Maity, Sudhamay, Ozonetel, India Yalla, Prakash, IIIT Hyderabad, India Vuppala, Anil Kumar, IIIT Hyderabad, India

OD-A-WE1.15: AN EMPIRICAL STUDY ON TRANSFORMER-BASED END-TO-END SPEECH RECOGNITION WITH NOVEL DECODER MASKING

Weng, Shi-Yan, National Taiwan Normal University, Taiwan Chiu, Hsuan-Sheng, Chunghwa Telecom Laboratories, Taiwan Chen, Berlin, National Taiwan Normal University, Taiwan

Wednesday, December 15, 11:20 - 13:20 ∘ On-Demand B
OD-B-WE1 - Signal Processing Systems

OD-B-WE1.1: FAST-PARALLEL SINGULAR VALUE THRESHOLDING FOR MANY SMALL MATRICES BASED ON GEOMETRIC FEATURE OF SINGULAR VALUES

Sasaki, Takayuki, NTT Corporation, Japan Tanida, Ryuichi, NTT Corporation, Japan Kitahara, Masaki, NTT Corporation, Japan Kimata, Hideaki, Kogakuin University, Japan

OD-B-WE1.2: ADAPTIVE FEEDBACK CANCELLATION BASED ON PREDICTION ERROR METHOD USING INTERAURAL LEVEL DIFFERENCES IN HEARING DEVICE

Ueda, Yuto, National Institute of Technology, Kumamoto College, Japan Nakashima, Hidetoshi, National Institute of Technology, Kumamoto College, Japan Yuno, Yuuki, Rion Co Ltd., Japan Hiruma, Nobuhiko, Rion Co Ltd., Japan

OD-B-WE1.3: DUAL-CHANNEL DRUM SEPARATION FOR LOW-COST DRUM RECORDING USING NON-NEGATIVE MATRIX FACTORIZATION

Cai, Cheng-Yu, Research Center of Music, Technology and Health, National Tsing Hua University, Hsinchu, Taiwan, Taiwan Su, Yu-Hui, Research Center of Music, Technology and Health, National Tsing Hua University, Hsinchu, Taiwan, Taiwan Su, Li, Institute of Information Science, Academia Sinica, Taipei, Taiwan, Taiwan

OD-B-WE1.4: MASK-BASED BEAMFORMING USING COMPLEX-VALUED NEURAL NETWORK FOR RECOGNITION OF SPATIAL TARGET SPEECH

Hayakawa, Daichi, Corporate Research & Development Center, Toshiba Corporation, Japan Kagoshima, Takehiko, Corporate Research & Development Center, Toshiba Corporation, Japan Fujimura, Hiroshi, Corporate Research & Development Center, Toshiba Corporation, Japan

OD-B-WE1.5: MOVING SOUND SOURCE TRACKING IN WIDE SPACE BY MULTIPLE MICROPHONE ARRAYS

Takahashi, Toru, Osaka Sangyo University, Japan Ekawa, Takuma, Osaka Sangyo University, Japan Nakayama, Masato, Osaka Sangyo University, Japan

OD-B-WE1.6: STUDY ON SIMULTANEOUS ESTIMATION OF GLOTTAL SOURCE AND VOCAL TRACT PARAMETERS BY ARMAX-LF MODEL FOR SPEECH ANALYSIS/SYNTHESIS

Li, Kai, Japan Advanced Institute of Science and Technology, Japan Unoki, Masashi, Japan Advanced Institute of Science and Technology, Japan Li, Yongwei, Chinese Academy of Sciences, China Dang, Jianwu, Japan Advanced Institute of Science and Technology, Japan Akagi, Masato, Japan Advanced Institute of Science and Technology, Japan

OD-B-WE1.7: LOW-POWER BOOTH MULTIPLICATION WITHOUT DYNAMIC RANGE DETECTION IN FFTS FOR FMCW RADAR SIGNAL PROCESSING

Meteer, Oğuz, University of Twente, Netherlands Bekooij, Marco, University of Twente, Netherlands

OD-B-WE1.8: KRONECKER PRODUCT ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS

Wang, Xuehan, Northwestern Polytechnical University, China Huang, Gongping, Technion-Israel Institute of Technology, Israel Cohen, Israel, Technion-Israel Institute of Technology, Israel Benesty, Jacob, University of Quebec, Canada Chen, Jingdong, Northwestern Polytechnical University, China

OD-B-WE1.9: AN OPTIMAL VARIABLE-LATENCY ARCHITECTURE FOR DETERMINISTIC APPROACHES TO STOCHASTIC COMPUTING WITH UNARY BIT STREAM PRESERVING PROPERTIES

Meteer, Oğuz, University of Twente, Netherlands Bekooij, Marco, NXP Semiconductors, Netherlands

Wednesday, December 15, 14:40 - 16:40 ∘ Live Session A (Hall)
LS-A-WE2 - High-Performance Image Processing

LS-A-WE2.1: DOMAIN SPECIFIC DESCRIPTION IN HALIDE FOR RANDOMIZED IMAGE CONVOLUTION

Takagi, Hiroyasu, Nagoya Institute of Technology, Japan Fukushima, Norishige, Nagoya Institute of Technology, Japan

LS-A-WE2.2: FAST STILL PICTURE CODING FOR VVC

Kawamura, Kei, KDDI Research, Japan Unno, Kyohei, KDDI Research, Japan Kidani, Yoshitaka, KDDI Research, Japan

LS-A-WE2.3: ACCELERATING FINITE IMPULSE RESPONSE FILTERING USING TENSOR CORES

Kondo, Takumi, Nagoya Institute of Technology, Japan Maeda, Yoshihiro, Tokyo University of Science, Japan Fukushima, Norishige, Nagoya Institute of Technology, Japan

LS-A-WE2.4: HISUI: AN IMAGE AND VIDEO PROCESSING FRAMEWORK WITH AUTO-OPTIMIZER

Okuda, Ippei, Nagoya Institute of Technology, Japan Takaoka, Masahiro, Nagoya Institute of Technology, Japan Tsumura, Tomoaki, Nagoya Institute of Technology, Japan

LS-A-WE2.5: COLOR TRANSFORMATION FOR COMPRESSIVE COMPUTING IN IMAGE FILTERING

Maeda, Yoshihiro, Tokyo University of Science, Japan Fukushima, Norishige, Nagoya institute of technology, Japan Hamamoto, Takayuki, Tokyo University of Science, Japan

LS-A-WE2.6: IMBALANCED SAMPLE FEATURE ENHANCEMENT OF HYPERSPECTRAL IMAGERY CLASSIFICATION

Yu, Xumin, Northwestern Polytechnical University, China Feng, Yan, Northwestern Polytechnical University, China Gao, Yanlong, Northwestern Polytechnical University, China

Wednesday, December 15, 14:40 - 16:40 ∘ Live Session B (Annex)
LS-B-WE2 - The Future of Biometrics beyond Recognition and Security

LS-B-WE2.1: A COMPREHENSIVE STUDY OF FACE RECOGNITION USING DEEP LEARNING

Ito, Koichi, Tohoku University, Japan Kawai, Hiroya, Tohoku University, Japan Aoki, Takafumi, Tohoku University, Japan

LS-B-WE2.2: CONTINUOUS BIOMETRIC AUTHENTICATION FOR SMARTPHONES CONSIDERING USAGE ENVIRONMENTS

Watanabe, Yuka, The University of Kitakyushu, Japan Yamazaki, Yasushi, The University of Kitakyushu, Japan

LS-B-WE2.3: EXAMINING OF SHALLOW AUTOENCODER ON BLACK-BOX ATTACK AGAINST FACE RECOGNITION

Vo, Ngoc Khoi Nguyen, Shizuoka University, Japan Terada, Takamichi, Shizuoka University, Japan Nishigaki, Masakatsu, Shizuoka University, Japan Ohki, Tetsushi, Shizuoka University, Japan

LS-B-WE2.4: COMPARATIVE STUDY OF FEATURE EXTRACTION METHOD FOR EMOTIONAL CLASSIFICATION BY MICRO-EXPRESSIONS

Kato, Koki, Toyama Prefectural University, Japan Takano, Hironobu, Toyama Prefectural University, Japan Saiko, Masahiro, NEC Biometrics Laboratory, Japan Kubo, Masahiro, NEC Biometrics Laboratory, Japan Imaoka, Hitoshi, NEC, Japan

Wednesday, December 15, 14:40 - 16:40 ∘ Live Session C (10A)
LS-C-WE2 - Advances in Human Behavior Sensing and Understanding

LS-C-WE2.1: DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION

WU, JUN, Hubei University of Technology, China Zhu, Tianliang, Hubei University of Technology, China YU, CHENGTIAN, Hubei University of Technology, China WANG, CHUNZHI, Hubei University of Technology, China Zhou, Xianjing, Wuhan Zall Information Technology Co. , Ltd., China Liu, Hu, Wuhan Zall Information Technology Co. , Ltd., China

LS-C-WE2.2: INFANT POSTURE ASSESSMENT BASED ON ROTATIONAL KEYPOINT DETECTION

Zhao, Xuyang, Tokyo University of Agriculture and Technology, Japan Takata, Shogo, Tokyo University of Agriculture and Technology, Japan Fukumori, Kosuke, Tokyo University of Agriculture and Technology, Japan Tanaka, Toshihisa, Tokyo University of Agriculture and Technology, Japan

LS-C-WE2.3: TEXT DESCRIPTION GENERATION FROM VIDEOS VIA DEEP SEMANTIC MODELS

Li, Lin, Wuhan University of Technology, China Hu, Kaixi, Wuhan University of Technology, China

LS-C-WE2.4: VIEW-INVARIANT FEATURE USING POSE INFORMATION AND FLEXIBLE MATCHING ALGORITHM FOR ACTION RETRIEVAL

Yoshida, Noboru, NEC Corporation, Japan Liu, Jianquan, NEC Corporation, Japan

LS-C-WE2.5: VIDEO-BASED SPORTS ACTIVITY RECOGNITION FOR CHILDREN

Olalere, Feyisayo, Utrecht University, Netherlands Brouwers, Vincent, Utrecht University, Netherlands Doyran, Metehan, Utrecht University, Netherlands Poppe, Ronald, Utrecht University, Netherlands Salah, Albert Ali, Utrecht University, Netherlands

Wednesday, December 15, 14:40 - 16:40 ∘ Live Session D (Virtual)
LS-D-WE2 - Digital Convergence of 5G/B6G, AIoT and Security (2)

LS-D-WE2.1: AN ADAPTIVE RANK SELECTION METHOD IN 3GPP 5G NR SYSTEMS

Chou, Wei-Hung, Industrial Technology Research Institute, Taiwan Pao, Wei-Chen, Industrial Technology Research Institute, Taiwan Tsai, Chun-Chia, Industrial Technology Research Institute, Taiwan Yeh, Ting-Yu, Industrial Technology Research Institute, Taiwan Pan, Jen-Yi, National Chung Cheng University, Taiwan

LS-D-WE2.2: A LOW COMPLEXITY PMI SELECTION SCHEME FOR 3GPP 5G NR FDD SYSTEMS

Tsai, Chun-Chia, Industrial Technology Research Institute, Taiwan Yeh, Ting-Yu, Industrial Technology Research Institute, Taiwan Chou, Wei-Hung, Industrial Technology Research Institute, Taiwan Pao, Wei-Chen, Industrial Technology Research Institute, Taiwan Pan, Jen-Yi, National Chung Cheng University, Taiwan

LS-D-WE2.3: REALIZING 5G NETWORK SLICING PROVISIONING WITH OPEN SOURCE SOFTWARE

Lee, Kuan-Lin, National Sun Yat-sen University, Taiwan Lee, Chung-Nan, National Sun Yat-sen University, Taiwan Lee, Ming-Feng, National Sun Yat-sen University, Taiwan

LS-D-WE2.4: A PARKING MONITORING SYSTEM USING FMCW RADARS

Kan, Yao-Chiang, YuanZe University, Taiwan Chen, Kuan-Tzu, YuanZe University, Taiwan Lin, Hsueh-Chun, China Medical University Hospital and College of Medicine, Taiwan Lee, Junghsi, YuanZe University, Taiwan

LS-D-WE2.5: A SEMI-EMPIRICAL DATA-RATE ESTIMATION METHOD OF 5G RAN SLICING

Lai, Wen-Ping, Yuan Ze University, Taiwan Lai, Ming-Jay, National Central University, Taiwan Lai, Hong-Lun, Yuan Ze University, Taiwan

LS-D-WE2.6: AN ENTROPY-BASED DDOS ATTACK DETECTION AND CLASSIFICATION WITH HIERARCHICAL TEMPORAL MEMORY

Nguyen, Manh Hung, CYCU, Taiwan Lai, Yu-Kuen, CYCU, Taiwan Chang, Kai-Po, CYCU, Taiwan

Wednesday, December 15, 14:40 - 16:40 ∘ On-Demand A
OD-A-WE2 - Speech Enhancement

OD-A-WE2.1: CYCLEGAN-BASED NON-PARALLEL SPEECH ENHANCEMENT WITH AN ADAPTIVE ATTENTION-IN-ATTENTION MECHANISM

Yu, Guochen, Communication University of China, China Wang, Yutian, Communication University of China, China Zheng, Chengshi, Institute of Acoustics, Chinese Academy of Sciences, China Wang, Hui, Communication University of China, China Zhang, Qin, Communication University of China, China

OD-A-WE2.2: A ROBUST MAXIMUM LIKELIHOOD DISTORTIONLESS RESPONSE BEAMFORMER BASED ON A COMPLEX GENERALIZED GAUSSIAN DISTRIBUTION

Meng, Weixin, Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Science, Beijing, 100190, China;University of Chinese Academy of Sciences, Beijing, China, China Zheng, Chengshi, Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Science, Beijing, 100190, China;University of Chinese Academy of Sciences, Beijing, China, China Li, Xiaodong, Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Science, Beijing, 100190, China;University of Chinese Academy of Sciences, Beijing, China, China

OD-A-WE2.3: SPEECH ENHANCEMENT BASED ON MASKING APPROACH CONSIDERING SPEECH QUALITY AND ACOUSTIC CONFIDENCE FOR NOISY SPEECH RECOGNITION

Chu, Shih-Chuan, National Cheng Kung University, Taiwan Wu, Chung-Hsien, National Cheng Kung University, Taiwan Lin, Yun-Wen, National Cheng Kung University, Taiwan

OD-A-WE2.4: DNN-BASED LINEAR PREDICTION RESIDUAL ENHANCEMENT FOR SPEECH DEREVERBERATION

Feng, Xinyang, Beijing Institute of Technology, China Li, Nuo, Beijing Institute of Technology, China He, Zunwen, Beijing Institute of Technology, China Zhang, Yan, Beijing Institute of Technology, China Zhang, Wancheng, Beijing Institute of Technology, China

OD-A-WE2.5: MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL

Qian, Zhaopeng, Beihang University, China Niu, Haijun, Beihang University, China Wang, Li, Beijing Research Center of Urban System Engineering, China Kobayashi, Kazuhiro, Nagoya University, Japan Zhang, Shaochuan, Beihang University, China Toda, Tomoki, Nagoya University, Japan

OD-A-WE2.6: INCORPORATING MULTI-TARGET IN MULTI-STAGE SPEECH ENHANCEMENT MODEL FOR BETTER GENERALIZATION

Zhang, Lu, Harbin Institute of Technology, Shenzhen, China Wang, Mingjiang, Harbin Institute of Technology, Shenzhen, China Li, Andong, Institute of Acoustics, Chinese Academy of Sciences, China Zhang, Zehua, Harbin Institute of Technology, Shenzhen, China Zhuang, Xuyi, Harbin Institute of Technology, Shenzhen, China

OD-A-WE2.7: LOW-POWER CONVOLUTIONAL RECURRENT NEURAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT

Gao, Fei, Unisound AI Technology Co. Ltd, China Guan, Haixin, Unisound AI Technology Co. Ltd, China , , ,

OD-A-WE2.8: MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL

Wang, Quandong, Xiaomi Corporation, China Wu, Junnan, Xiaomi Corporation, China Yan, Zhao, Xiaomi Corporation, China Qian, Sichong, Xiaomi Corporation, China Guo, Liyong, Xiaomi Corporation, China Fan, Lichun, Xiaomi Corporation, China Zhuang, Weiji, Xiaomi Corporation, China Gao, Peng, Xiaomi Corporation, China Wang, Yujun, Xiaomi Corporation, China

OD-A-WE2.9: PROCESSING PHONEME SPECIFIC SEGMENTS FOR CLEFT LIP AND PALATE SPEECH ENHANCEMENT

Nomo Sudro, Protima, Indian Institute of Technology Guwahati, India Sinha, Rohit, Indian Institute of Technology Guwahati, India Prasanna, S R Mahadeva, Indian Institute of Technology Dharwad, India

OD-A-WE2.10: SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS

Misawa, Sota, The University of Tokyo, Japan Takamune, Norihiro, The University of Tokyo, Japan Nakamura, Tomohiko, The University of Tokyo, Japan Kitamura, Daichi, National Institute of Technology, Japan Saruwatari, Hiroshi, The University of Tokyo, Japan Une, Masakazu, University of Tsukuba, Japan Makino, Shoji, Waseda University, Japan

OD-A-WE2.11: CAUSAL DISTORTIONLESS RESPONSE BEAMFORMING BY ALTERNATING DIRECTION METHOD OF MULTIPLIERS

Masuyama, Yoshiki, Tokyo Metropolitan University​, Japan Yamaoka, Kouei, Tokyo Metropolitan University​, Japan Kinoshita, Yuma, Tokyo Metropolitan University​, Japan Ono, Nobutaka, Tokyo Metropolitan University​, Japan

OD-A-WE2.12: STACKED U-NET WITH HIGH-LEVEL FEATURE TRANSFER FOR PARAMETER EFFICIENT SPEECH ENHANCEMENT

Lee, Jinyoung, Yonsei University, Korea, Republic of Kang, Hong-Goo, Yonsei University, Korea, Republic of

OD-A-WE2.13: EXTENSION OF VIRTUAL MICROPHONE TECHNIQUE TO MULTIPLE REAL MICROPHONES AND INVESTIGATION OF THE IMPACT OF PHASE AND AMPLITUDE INTERPOLATION ON SPEECH ENHANCEMENT

Segawa, Hanako, University of Tsukuba, Japan Li, Li, NTT Corporation, Japan Makino, Shoji, University of Tsukuba, Waseda University, Japan Yamada, Takeshi, University of Tsukuba, Japan

OD-A-WE2.14: COMPARATIVE STUDY ON DNN-BASED MINIMUM VARIANCE BEAMFORMING ROBUST TO SMALL MOVEMENTS OF SOUND SOURCES

Saijo, Kohei, Waseda University, Japan Katagiri, Kazuhiro, OKI Electric Industry Corporation, Japan Fujieda, Masaru, OKI Electric Industry Corporation, Japan Kobayashi, Tetsunori, Waseda University, Japan Ogawa, Tetsuji, Waseda University, Japan

OD-A-WE2.15: IMPROVEMENTS TO NON-INTRUSIVE INTELLIGIBILITY PREDICTION FOR REVERBERANT SPEECH

Nakazawa, Kazushi, Yamagata University, Japan Kondo, Kazuhiro, Yamagata University, Japan

Wednesday, December 15, 14:40 - 16:40 ∘ On-Demand B
OD-B-WE2 - Theory and Methods

OD-B-WE2.1: ON SPARSE GRAPH ESTIMATION UNDER STATISTICAL AND LAPLACIAN CONSTRAINTS

Tugnait, Jitendra, Auburn University, United States of America

OD-B-WE2.2: ORDERING PRINCIPAL COMPONENTS OF MULTIVARIATE FRACTIONAL BROWNIAN MOTION FOR SOLVING INVERSE PROBLEMS

Mohr, Marisa, University of Luebeck, Germany Möller, Ralf, University of Luebeck, Germany

OD-B-WE2.3: SPATIAL NORMALIZATION TO REDUCE POSITIONAL COMPLEXITY IN DIRECTION-AIDED SUPERVISED BINAURAL SOUND SOURCE SEPARATION

Takeda, Ryu, Osaka University, Japan Nakadai, Kazuhiro, Honda Research Institute Japan, Co, Ltd., Japan Komatani, Kazunori, Osaka University, Japan

OD-B-WE2.4: PHASE-AWARE AUDIO INPAINTING BASED ON INSTANTANEOUS FREQUENCY

Tanaka, Tomoro, Waseda University, Japan Yatabe, Kohei, Waseda University, Japan Oikawa, Yasuhiro, Waseda University, Japan

OD-B-WE2.5: STATISTICAL-MECHANICAL ANALYSIS OF ADAPTIVE VOLTERRA FILTER FOR TIME-VARYING UNKNOWN SYSTEM

Kugiyama, Koyo, Kansai University, Japan Motonaka, Kimiko, Kansai University, Japan Kajikawa, Yoshinobu, Kansai University, Japan Miyoshi, Seiji, Kansai University, Japan

OD-B-WE2.6: HIGH-ACCURACY RECONSTRUCTION OF PERIODIC SIGNALS BASED ON COMPRESSIVE SENSING

Arronde Pérez, Dailys, University of Klagenfurt, Austria Zangl, Hubert, University of Klagenfurt/ AAU SAL USE LAB, Austria

OD-B-WE2.7: SEMI-SUPERVISED SOUND EVENT DETECTION USING SELF-ATTENTION AND MULTIPLE TECHNIQUES OF CONSISTENCY TRAINING

Wang, Yih-Wen, National Sun Yat-Sen University, Taiwan Chen, Chia-Ping, National Sun Yat-Sen University, Taiwan Lu, Chung-Li, Chunghwa Telecom Laboratories, Taiwan Chan, Bo-Cheng, Chunghwa Telecom Laboratories, Taiwan

OD-B-WE2.8: NONLINEAR SVM-TYPE AUTOMATIC DICISION ALGORITHM IN NOISY ENVIRONMENT FOR HAMMERING TEST SYSTEM

Hori, Kouki, Tokyo University of Science, Japan Tanabe, Nari, Suwa University of Science, Japan Fujisawa, Masaya, Tokyo University of Science, Japan

OD-B-WE2.9: NEARBY-PERSON OCCLUSION DATA AUGMENTATION FOR HUMAN POSE ESTIMATION WITH NON-EXTRA ANNOTATIONS

Chen, Yucheng, Northwestern Polytechnical University, China He, Mingyi, Northwestern Polytechnical University, China Dai, Yuchao, Northwestern Polytechnical University, China

OD-B-WE2.10: DENSE DEPTHMAP PREDICTION FROM ULTRASONIC SENSORS

Yasui, Koki, Nagoya Institute of Technology, Japan Sakaue, Fumihiko, Nagoya Institute of Technology, Japan Sato, Jun, Nagoya Institute of Technology, Japan Koyama, Yu, SOKEN INC., Japan Matsuura, Mitsuyasu, SOKEN INC., Japan

OD-B-WE2.11: FEEDBACK QUANTIZATION AND BIT ALLOCATION FOR NETWORKED CONTROL SYSTEMS WITH RATE LIMITED CHANNELS

Hanamoto, Kazuya, J-QuAD DYNAMICS Inc., Japan Ohno, Shuichi, Osaka City University, Japan

OD-B-WE2.12: ENHANCED LOOP-WEAKENED BELIEF PROPAGATION ALGORITHM FOR PERFORMANCE ENHANCED POLAR CODE DECODERS

van den Brink, Arvid, University of Twente, Netherlands Bekooij, Marco, NXP Semiconductors, Netherlands

OD-B-WE2.13: POSITIONAL-SPECTRAL-TEMPORAL ATTENTION IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR EEG EMOTION RECOGNITION

Liu, Jiyao, Northwestern Polytechnical University, China Zhao, Yanxi, Northwestern Polytechnical University, China Wu, Hao, Northwestern Polytechnical University, China Jiang, Dongmei, Northwestern Polytechnical University, China

OD-B-WE2.14: INTEGRATED SPECTRAL KURTOSIS ANALYSIS

Trapp, Arvid, Munich University of Applied Sciences, Germany Wolfsteiner, Peter, Munich University of Applied Sciences, Germany

OD-B-WE2.15: COMPUTATIONAL COMPLEXITY REDUCED BELIEF PROPAGATION ALGORITHM FOR POLAR CODE DECODERS

van den Brink, Arvid, University of Twente, Netherlands Bekooij, Marco, NXP Semiconductors, Netherlands

Thursday, December 16, 09:00 - 11:00 ∘ Live Session A (Hall)
LS-A-TH1 - Signal Processing in Behavior Analysis

LS-A-TH1.1: REAL-TIME MONITORING SYSTEM TO EVALUATE EXERCISE LOAD, HYPOXIC LOAD, AND SAFETY IN A NORMOBARIC HYPOXIC ROOM

Hisatsune, Kazuki, Kumamoto University, Japan Noguchi, Aoi, Kumamoto University, Japan Yamakawa, Toshitaka, Kumamoto University, Japan

LS-A-TH1.2: PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE

Wakuya, Manami, Kumamoto University, Japan Inoue, Takao, Yamaguchi University, Japan Imoto, Hirochika, Yamaguchi University, Japan Nomura, Sadahiro, Yamaguchi University, Japan Suzuki, Michiyasu, Yamaguchi University, Japan Yamakawa, Toshitaka, Kumamoto University, Japan

LS-A-TH1.3: PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE

Inatsu, Nao, Kumamoto University, Japan Noguchi, Aoi, Kumamoto University, Japan Ota, Koshi, Nagoya University, Japan Fujiwara, Koichi, Nagoya University, Japan Kubo, Takatomi, Nara Institute of Science and Technology, Japan Yamakawa, Toshitaka, Kumamoto University, Japan

LS-A-TH1.4: MATHEMATICAL MODEL OF A HORSE AND THE RIDER DURING A JUMP

Tsuruo, Asahi, Nara Institute of Science and Technology, Japan Ringhofer, Monamie, Kyoto University, Japan Yamamoto, Shinya, Kyoto University, Japan Ikeda, Kazushi, Nara Institute of Science and Technology, Japan

LS-A-TH1.5: EVALUATION OF THE EFFECT OF TRANSFER LEARNING TO MULTI-INSTANCE DETECTION OF MONKEYS

Pineda, Riza Rae, Nara Institute of Science and Technology, Japan Kubo, Takatomi, Nara Institute of Science and Technology, Japan Shimada, Masaki, Teikyo University of Science, Japan Ikeda, Kazushi, Nara Institute of Science and Technology, Japan

LS-A-TH1.6: SEMI-SUPERVISED ESTIMATION OF DRIVING BEHAVIORS USING ROBUST TIME-CONTRASTIVE LEARNING

Kuroki, Takuma, Nara Institute of Science and Technology, Japan Shouno, Osamu, Otemon Gakuin University, Japan Yoshimoto, Junichiro, Nara Institute of Science and Technology, Japan

Thursday, December 16, 09:00 - 11:00 ∘ Live Session B (Annex)
LS-B-TH1 - Deep Generative Models for Media Clones and Its Detection

LS-B-TH1.1: DETECTING DEEPFAKE VIDEOS USING DIGITAL WATERMARKING

Qureshi, Amna, Universitat Oberta de Catalunya; University of Bradford, Spain Megías, David, Universitat Oberta de Catalunya, Spain Kuribayashi, Minoru, Okayama University, Japan

LS-B-TH1.2: A FLEXIBLE REVERSIBLE DATA HIDING METHOD IN COMPRESSIBLE ENCRYPTED IMAGES

Motomura, Ryota, Chiba University, Japan Imaizumi, Shoko, Chiba University, Japan Kiya, Hitoshi, Tokyo Metropolitan University, Japan

LS-B-TH1.3: MODEL INVERSION ATTACK AGAINST A FACE RECOGNITION SYSTEM IN A BLACK-BOX SETTING

Yoshimura, Shunsuke, Osaka University, Japan Nakamura, Kazuaki, Osaka University, Japan Nitta, Naoko, Osaka University, Japan Babaguchi, Noboru, Osaka University, Japan

LS-B-TH1.4: FEATURE EXTRACTION SUITABLE FOR DOUBLE JPEG COMPRESSION ANALYSIS BASED ON STATISTICAL BIAS OBSERVATION OF DCT COEFFICIENTS

Takeshita, Daichi, Okayama University, Japan Kuribayashi, Minoru, Okayama University, Japan Funabiki, Nobuo, Okayama University, Japan

LS-B-TH1.5: FEATURE EXTRACTION BASED ON DENOISING AUTO ENCODER FOR CLASSIFICATION OF ADVERSARIAL EXAMPLES

Yamasaki, Yuma, Okayama University, Japan Kuribayashi, Minoru, Okayama University, Japan Funabiki, Nobuo, Okayama University, Japan Hong Nguyen, Huy, National Institute of Informatics, Japan Echizen, Isao, National Institute of Informatics, Japan

Thursday, December 16, 09:00 - 11:00 ∘ Live Session C (10A)
LS-C-TH1 - Online and Distributed Kernel Learning Algorithms

LS-C-TH1.1: GRAPH KERNEL RECURSIVE LEAST-SQUARES ALGORITHMS

gogineni, Vinay Chakravarthi, Simula Metropolitan Center for Digital Engineering, Norway Naumova, Valeriya, Simula Metropolitan Center for Digital Engineering, Norway Werner, Stefan, Norwegian University of Science and Technology, Norway Huang, Yih-Fang, University of Notre Dame, United States of America

LS-C-TH1.2: A HILBERTIAN PROJECTION APPROACH WITH DICTIONARY DIVIDING STRATEGY: ACCELERATING NONLINEAR ESTIMATION ALGORITHM WITH MULTISCALE GAUSSIANS

Takizawa, Masaaki, National Institute of Technology, Toyama College, Japan Yukawa, Masahiro, Keio University, Japan

LS-C-TH1.3: PERSONALIZED LEARNING USING MULTIPLE KERNEL MODELS

Kuh, Anthony, University of Hawaii, United States of America Huang, Shuai, University of Washington, United States of America Chen, Cynthia, University of Washington, United States of America

LS-C-TH1.4: REAL TIME KERNEL LEARNING FOR SENSOR NETWORKS USING PRINCIPLES OF FEDERATED LEARNING

Kuh, Anthony, University of Hawaii, United States of America

Thursday, December 16, 09:00 - 11:00 ∘ Live Session D (Virtual)
LS-D-TH1 - Reconfigurable Computing and Performance Evaluation

LS-D-TH1.1: IMPROVED FRUIT FLY OPTIMIZATION ALGORITHM BASED ON SIMULATED ANNEALING IN NEURAL NETWORK

Wu, Jin, Xi’an University of Posts and Telecommunications, China Dai, Wei, Xi’an University of Posts and Telecommunications, China Wang, Yu, Xi’an University of Posts and Telecommunications, China Zhao, Bo, Xi’an University of Posts and Telecommunications, China

LS-D-TH1.2: AN IMPLEMENTATION METHOD OF HEVC DATAFLOW GRAPH BASED ON RECONFIGURABLE PROCESSER

Zhu, Yun, Xi’an University of Posts and Telecommunications, China Hu, Chuanzhan, Xi’an University of Posts and Telecommunications, China Jiang, Lin, Xi’an University of Science and Technology, China Shen, Xubang, Xi'an Microelectronic Technology Research Institute, China

LS-D-TH1.3: AN IMPROVED NAIVE BAYES MODEL FOR AIR TEMPERATURE PREDICTION

JIANG, BINGHONG, Xi’an Jiaotong University, China

LS-D-TH1.4: AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR

YANG, Rong, Xi'an University of Posts and Telecommunications, China XIE, Xiaoyan, Xi'an University of Posts and Telecommunications, China CHAI, Miaomiao, Xi'an University of Posts and Telecommunications, China FANG, Lin, Xi'an University of Posts and Telecommunications, China HE, Wanqi, Xi'an University of Posts and Telecommunications, China SUN, Jingtao, Xi'an University of Posts and Telecommunications, China

LS-D-TH1.5: A RECONFIGURABLE PARALLELIZATION OF GENERATIVE ADVERSARIAL NETWORKS BASED ON ARRAY PROCESSOR

Xie, Xiaoyan, Xi'an University of Posts and Telecommunications, China Chai, Miaomiao, Xi'an University of Posts and Telecommunications, China Du, Zhuolin, Xi'an University of Posts and Telecommunications, China Yang, Kun, Xi'an University of Posts and Telecommunications, China Yin, Shaorun, Xi'an University of Posts and Telecommunications, China

LS-D-TH1.6: PERFORMANCE CHARACTERIZATION OF RASTERIZATION ALGORITHMS FOR RECONFIGURABLE GRAPHICS PROCESSOR

Deng, Junyong, Xi'an University of Posts and Telecommunications, China Ma, Qingqing, Xi'an University of Posts and Telecommunications, China Ye, Zekun, Xi'an University of Posts and Telecommunications, China

Thursday, December 16, 09:00 - 11:00 ∘ On-Demand A
OD-A-TH1 - Speech Enhancement and Separation

OD-A-TH1.1: A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING

Yang, Wenjing, Beijing Institute of Technology, China Wang, Jing, Beijing Institute of Technology, China Li, Hongfeng, Xiaomi, China Xu, Na, Xiaomi, China Xiang, Fei, Xiaomi, China Qian, Kai, Beijing Institute of Technology, China Hu, Shenghua, Beijing Institute of Technology, China

OD-A-TH1.2: IMPROVEMENT OF SPATIAL AMBIGUITY IN MULTI-CHANNEL SPEECH SEPARATION USING CHANNEL ATTENTION

Hong, Qian-Bei, National Cheng Kung University and Academia Sinica, Taiwan Wu, Chung-Hsien, National Cheng Kung University and Academia Sinica, Taiwan Nguyen, Thanh Binh, National Cheng Kung University, Viet Nam Wang, Hsin-Min, National Cheng Kung University and Academia Sinica, Taiwan

OD-A-TH1.3: NOISE-TOLERANT TIME-DOMAIN SPEECH SEPARATION WITH NOISE BASES

Ozamoto, Kohei, Tokyo Institute of Technology, Japan Uto, Kuniaki, Tokyo Institute of Technology, Japan Iwano, Koji, Tokyo City University, Japan Shinoda, Koichi, Tokyo Institute of Technology, Japan

OD-A-TH1.4: MINIMUM-VOLUME REGULARIZED ILRMA FOR BLIND AUDIO SOURCE SEPARATION

Wang, Jianyu, Northwestern Polytechnical University, China Guan, Shanzheng, Northwestern Polytechnical University, China Zhang, Xiao-Lei, Northwestern Polytechnical University, China

OD-A-TH1.5: A COMPARISON OF HANDCRAFTED, PARAMETERIZED, AND LEARNABLE FEATURES FOR SPEECH SEPARATION

Zhu, Wenbo, Northwestern Polytechnical University, China Wang, Mou, Northwestern Polytechnical University, China Zhang, Xiao-Lei, Northwestern Polytechnical University, China Rahardja, Susanto, Northwestern Polytechnical University, China

OD-A-TH1.6: OVER-DETERMINED SEMI-BLIND SPEECH SOURCE SEPARATION

Togami, Masahito, Line corporation, Japan Scheibler, Robin, Line corporation, Japan

OD-A-TH1.7: GROUP MULTI-SCALE CONVOLUTIONAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT IN TIME-DOMAIN

Yu, Juntao, Beijing University of Posts and Telecommunications, China Ting, Jiang, Beijing University of Posts and Telecommunications, China Yu, Jiacheng, Beijing University of Posts and Telecommunications, China

OD-A-TH1.8: PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION

Mizobuchi, Yusaku, National Institute of Technology, Kagawa College, Japan Kitamura, Daichi, National Institute of Technology, Kagawa College, Japan Nakamura, Tomohiko, The University of Tokyo, Japan Saruwatari, Hiroshi, The University of Tokyo, Japan Takahashi, Yu, Yamaha Corporation, Japan Kondo, Kazunobu, Yamaha Corporation, Japan

OD-A-TH1.9: A STUDY ON SPEECH ENHANCEMENT BASED ON DIFFUSION PROBABILISTIC MODEL

Lu, Yen-Ju, Academia Sinica, Taiwan Tsao, Yu, Academia Sinica, Taiwan Watanabe, Shinji, Carnegie Mellon University, United States of America

OD-A-TH1.10: A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS

Fang, Xin, University of Science and Technology of China, China Ling, Zhen-hua, University of Science and Technology of China, China Sun, Lei, iFlytek Research, China Niu, Shu-Tong, University of Science and Technology of China, China Du, Jun, University of Science and Technology of China, China Liu, Cong, iFlytek Research, China Sheng, Zhi-chao, iFlytek Research, China

OD-A-TH1.11: TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING

Shao, Qijie, Northwestern Polytechnical University, China Hou, Jingyong, Northwestern Polytechnical University, China Hu, Yanxin, Northwestern Polytechnical University, China Wang, Qing, Northwestern Polytechnical University, China Xie, Lei, Northwestern Polytechnical University, China Lei, Xin, Mobvoi, United States of America

OD-A-TH1.12: TIME DOMAIN SPEECH ENHANCEMENT WITH ATTENTIVE MULTI-SCALE APPROACH

Chen, Chen, Nanyang Technological University, Singapore Hou, Nana, Nanyang Technological University, Singapore Ma, Duo, National University of Singapore, Singapore Chng, Eng Siong, Nanyang Technological University, Singapore

OD-A-TH1.13: ON SPEECH SPARSITY FOR COMPUTATIONAL EFFICIENCY AND NOISE REDUCTION IN HEARING AIDS

Llave, Adrien, CentraleSupélec, IETR, France Leglaive, Simon, CentraleSupélec, IETR, France

OD-A-TH1.14: SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS

Lin, Qingjian, Lenovo, China Yang, Lin, Lenovo, China Wang, Xuyang, Lenovo, China Xie, Luyuan, Lenovo, China Jia, Chen, Lenovo, China Wang, Junjie, Lenovo, China

Thursday, December 16, 09:00 - 11:00 ∘ On-Demand B
OD-B-TH1 - Image and Video Processing

OD-B-TH1.1: COMPUTATION REDUCTION FOR HEVC INTER PREDICTION

Tsai, Yi-Ta, National Central University, Taiwan Lin, Yinyi, National Central University, Taiwan

OD-B-TH1.2: SNAPSHOT MULTISPECTRAL IMAGE COMPLETION AND UNMIXING WITH TOTAL VARIATION REGULARIZATION ON ABUNDANCE MAPS

Ozawa, Keisuke, DENSO IT Laboratory, Japan Sumiyoshi, Shinichi, DENSO IT Laboratory, Japan Tachioka, Yuki, DENSO IT Laboratory, Japan

OD-B-TH1.3: UNDERWATER IMAGE DEHAZING BASED ON DISPARITY ESTIMATION AND COLOR CONSTRAINT

Liu, Yan, Hohai University, China Li, Qingwu, Hohai University, China Huo, Guanying, Hohai University, China Zhou, Yan, Hohai University, China Yu, Dabin, Hohai University, China

OD-B-TH1.4: HIGH REFLECTION REMOVAL USING CNN WITH DETECTION AND ESTIMATION

Funahashi, Isana, University of Electro-Communications, Japan Yamashita, Naoki, University of Electro-Communications, Japan Yoshida, Taichi, University of Electro-Communications, Japan Ikehara, Masaaki, Keio University, Japan

OD-B-TH1.5: INTRA CODING TOOL PRUNING FOR REDUCING COMPLEXITY OF VVC SCREEN CONTENT CODING

Tang, Tong, Chongqing University of Posts and Telecommunications, China Hu, Shun, Chongqing University of Posts and Telecommunications, China Cui, Linfeng, Chongqing University of Posts and Telecommunications, China Yin, Zhiyang, Chongqing University of Posts and Telecommunications, China

OD-B-TH1.6: IMAGE COMPRESSION ARCHITECTURE WITH BUILT-IN LIGHTWEIGHT MODEL

Kuo, Tien-Ying, National Taipei University of Technology, Taiwan Wei, Yu-Jen, National Taipei University of Technology, Taiwan Lin, Jhih-Jhou, National Taipei University of Technology, Taiwan

OD-B-TH1.7: DENOISING HYPERSPECTRAL IMAGES USING INTERBAND CORRELATION

Takehisa, Shuhei, Doshisha University, Japan Okuda, Masahiro, Doshisha University, Japan

OD-B-TH1.8: A CONSENSUS FRAMEWORK FOR CONVOLUTIONAL DICTIONARY LEARNING BASED ON L1 NORM ERROR

Takanashi, Mizuki, National Institute of Technology (KOSEN), Kurume College, Japan Kuroki, Yoshimitsu, National Institute of Technology (KOSEN), Kurume College, Japan

OD-B-TH1.9: NOISE REMOVAL FOR DYNAMIC MODE DECOMPOSITION BASED ON PLUG-AND-PLAY ADMM

Anami, Shunki, The University of Kitakyushu, Japan Matsuoka, Ryo, The University of Kitakyushu, Japan

OD-B-TH1.10: NEW END-TO-END NETWORK FOR STEREO HIGH DYNAMIC RANGE IMAGING

Zhong, Lifei, University of Macau, Macao Zhou, Jiantao, University of Macau, Macao

OD-B-TH1.11: MOVING OBJECT DETECTION IN HEVC VIDEO

Pang, LieLin, University of Malaya, Malaysia Wong, KokSheik, Monash University Malaysia, Malaysia

OD-B-TH1.12: SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING

Zou, Chengyi, Northwestern Polytechnical University, China Wan, Shuai, Northwestern Polytechnical University, China Ji, Tiannan, Northwestern Polytechnical University, China Mrak, Marta, British Broadcasting Corporation, United Kingdom of Great Britain and Northern Ireland Blanch, Marc Gorriz, British Broadcasting Corporation, United Kingdom of Great Britain and Northern Ireland Herranz, Luis, Computer Vision Center, Spain

Thursday, December 16, 14:00 - 16:00 ∘ Live Session A (Hall)
LS-A-TH2 - Advanced Topics on Sound Event and Scene Analysis

LS-A-TH2.1: COMPARISON OF LOW COMPLEXITY SELF-ATTENTION MECHANISMS FOR ACOUSTIC EVENT DETECTION

Komatsu, Tatsuya, LINE Corporation, Japan Scheibler, Robin, LINE Corporation, Japan

LS-A-TH2.2: DUAL-PATH TRANSFORMER FOR MACHINE CONDITION MONITORING

Bai, Jisheng, School of Marine Science and Technology, Northwestern Polytechnical University, China Wang, Mou, School of Marine Science and Technology, Northwestern Polytechnical University, China Chen, Jianfeng, School of Marine Science and Technology, Northwestern Polytechnical University, China

LS-A-TH2.3: SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION

Thi-Hien Duong, Thanh, Hanoi University of Mining and Geology, Viet Nam Nguyen, Phi-Le, Hanoi University of Science and Technology, Viet Nam Nguyen, Duc-Chien, Aimenext Join Stock Company, Viet Nam Nguyen, Hong-Son, Aimenext Join Stock Company, Viet Nam Phan, Huy, Queen Mary University of London, United Kingdom of Great Britain and Northern Ireland Q. K. Duong, Ngoc, InterDigital, France

LS-A-TH2.4: MULTITASK LEARNING OF ACOUSTIC SCENES AND EVENTS USING DYNAMIC WEIGHT ADAPTATION BASED ON MULTI-FOCAL LOSS

Nada, Kayo, Doshisha University, Japan Imoto, Keisuke, Doshisha University, Japan Iwamae, Reina, Doshisha University, Japan Tsuchiya, Takao, Doshisha University, Japan

LS-A-TH2.5: INVESTIGATION ON SPATIAL AND FREQUENCY-BASED FEATURES FOR ASYNCHRONOUS ACOUSTIC SCENE ANALYSIS

Shiroma, Yuki, Tokyo Metropolitan University, Japan Imoto, Keisuke, Doshisha University, Japan Shiota, Sayaka, Tokyo Metropolitan University, Japan Ono, Nobutaka, Tokyo Metropolitan University, Japan Kiya, Hitoshi, Tokyo Metropolitan University, Japan

LS-A-TH2.6: ANALYSIS ON ROLES OF DNNS IN END-TO-END ACOUSTIC SCENE ANALYSIS FRAMEWORK WITH DISTRIBUTED SOUND-TO-LIGHT CONVERSION DEVICES

Kinoshita, Yuma, Tokyo Metropolitan University, Japan Ono, Nobutaka, Tokyo Metropolitan University, Japan

Thursday, December 16, 14:00 - 16:00 ∘ Live Session B (Annex)
LS-B-TH2 - Signal Processing and Machine Learning over Graphs

LS-B-TH2.1: NODE CLUSTERING OF TIME-VARYING GRAPHS BASED ON TEMPORAL LABEL SMOOTHNESS

Fukumoto, Katsuki, Tokyo University of Agriculture and Technology, Japan Yamada, Koki, Tokyo University of Agriculture and Technology, Japan Tanaka, Yuichi, Tokyo University of Agriculture and Technology, Japan

LS-B-TH2.2: RECOVERY OF TIME SERIES OF GRAPH SIGNALS OVER DYNAMIC TOPOLOGY

Yamagata, Eisuke, Tokyo Institute of Technology, Japan Ono, Shunsuke, Tokyo Institute of Technology, Japan

LS-B-TH2.3: AN EMPIRICAL STUDY ON COMPRESSED DECENTRALIZED STOCHASTIC GRADIENT ALGORITHMS WITH OVERPARAMETERIZED MODELS

Rao, Arjun Ashok, The Chinese University of Hong Kong, Hong Kong Wai, Hoi-To, The Chinese University of Hong Kong, Hong Kong

LS-B-TH2.4: MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING

Yang, Cheng, Shanghai Jiao Tong University, China Wang, Fen, Zhejiang Lab, China Ye, Minxiang, Zhejiang Lab, China Zhai, Guangtao, Shanghai Jiao Tong University, China Zhang, Xiao-Ping, Ryerson University, Canada Stankovic, Vladimir, University of Strathclyde, United Kingdom of Great Britain and Northern Ireland Stankovic, Lina, University of Strathclyde, United Kingdom of Great Britain and Northern Ireland

LS-B-TH2.5: CHANNEL-WISE EARLY STOPPING WITHOUT A VALIDATION SET VIA NNK POLYTOPE INTERPOLATION

Bonet, David, Universitat Politècnica de Catalunya, Spain Ortega, Antonio, University of Southern California, United States of America Ruiz-Hidalgo, Javier, Universitat Politècnica de Catalunya, Spain Shekkizhar, Sarath, University of Southern California, United States of America

Thursday, December 16, 14:00 - 16:00 ∘ Live Session C (10A)
LS-C-TH2 - High Performance Image and Video Processing and Applications

LS-C-TH2.1: SPATIALLY VARYING WHITE BALANCING FOR MIXED AND NON-UNIFORM ILLUMINANTS

Akazawa, Teruaki, Tokyo Metropolitan University, Japan Kinoshita, Yuma, Tokyo Metropolitan University, Japan Kiya, Hitoshi, Tokyo Metropolitan University, Japan

LS-C-TH2.2: SEMANTICALLY RELEVANT SCENE DETECTION USING DEEP LEARNING

Chakraborty, Dipanita, King Mongkut's University of Technology Thonburi, Thailand Chiracharit, Werapon, King Mongkut's University of Technology Thonburi, Thailand Chamnongthai, Kosin, King Mongkut's University of Technology Thonburi, Thailand

LS-C-TH2.3: DIGITAL HALFTONE CLASSIFICATION USING SIMPLIFIED CNN AND STOCHASTIC STATISTICS

Guo, Jing-Ming, National Taiwan University of Science and Technology, Taiwan Seshathiri, Sankarasrinivasan, National Taiwan University of Science and Technology, Taiwan

LS-C-TH2.4: IMPLEMENTATION OF AVS3 MULTICAST SYSTEM BASED ON EMBMS

Fang, Lingfeng, Beijing University of Posts and Telecommunications, China Li, Chunhao, Beijing University of Posts and Telecommunications, China Sun, Songlin, Beijing University of Posts and Telecommunications, China

Thursday, December 16, 14:00 - 16:00 ∘ Live Session D (Virtual)
LS-D-TH2 - Advanced Image, Video and Multimedia Processing using Deep Learning(1)

LS-D-TH2.1: NON-PARALLEL VOICE CONVERSION WITH GENERATIVE ATTENTIONAL NETWORKS

Chiu, Tse Wei, National Central University, Taiwan Guo, You Sheng, National Central University, Taiwan Chang, Pao-Chi, National Central University, Taiwan

LS-D-TH2.2: UNPAIRED IMAGE DEMOIRÉING BASED ON CYCLIC MOIRÉ LEARNING

Park, Hyunkook, Dongguk University, Korea, Republic of Vien, An Gia, Dongguk University, Korea, Republic of Koh, Yeong Jun, Chungam National University, Korea, Republic of Lee, Chul, Dongguk University, Korea, Republic of

LS-D-TH2.3: RESIDUAL DILATED U-NET WITH SPATIALLY ADAPTIVE NORMALIZATION FOR THE RESTORATION OF UNDER DISPLAY CAMERA IMAGES

Oh, Youngjin, Seoul National University, Korea, Republic of Park, Gu Yong, Seoul National University, Korea, Republic of Chung, Haesoo, Seoul National University, Korea, Republic of Cho, Sunwoo, Seoul National University, Korea, Republic of Cho, Nam Ik, Seoul National University, Korea, Republic of

LS-D-TH2.4: LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS

Shim, Jae Hoon, Institute of New Media and Communications, Korea, Republic of Rhee, Hochang, Institute of New Media and Communications, Korea, Republic of Jang, Yeong Il, Institute of New Media and Communications, Korea, Republic of Lee, Geonsu, Institute of New Media and Communications, Korea, Republic of Kim, Seyun, Institute of New Media and Communications, Korea, Republic of Cho, Nam Ik, Institute of New Media and Communications, Korea, Republic of

LS-D-TH2.5: FACIAL VIDEO FRAME INTERPOLATION COMBINING SYMMETRIC AND ASYMMETRIC MOTIONS

Kim, Jintae, Korea University, Korea, Republic of Park, Junheum, Korea University, Korea, Republic of Choi, Whan, Korea University, Korea, Republic of Kim, Chang-Su, Korea University, Korea, Republic of

LS-D-TH2.6: FACE ANTI-SPOOFING USING MULTI-BRANCH CNN

Nguyen, Tin Cong, National Central University, Viet Nam Pham, Bach-Tung, National Central University, Taiwan Le, Thi Phuong, National Central University, Taiwan Tai, Tzu-Chiang, Providence University, Taiwan Wang, Jia-Ching, National Central University, Taiwan

Thursday, December 16, 14:00 - 16:00 ∘ On-Demand A
OD-A-TH2 - Emotion, Paralinguistic, and Speaker Analysis

OD-A-TH2.1: INTEGRATION OF ANNOTATOR-WISE ESTIMATIONS FOR EMOTION RECOGNITION BY USING GROUP SOFTMAX

Tachioka, Yuuki, Denso IT Laboratory, Japan

OD-A-TH2.2: HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION

LI, Xingfeng, Hithink RoyalFlush AI Research Institute, China Guo, Taiyang, Japan Advanced Institute of Science and Technology, Japan Hu, Xinhui, Hithink RoyalFlush AI Research Institute, China Xu, Xinkang, Hithink RoyalFlush AI Research Institute, China Dang, Jianwu, Japan Advanced Institute of Science and Technology;Tianjin University, Japan Akagi, Masato, Japan Advanced Institute of Science and Technology, Japan

OD-A-TH2.3: A STUDY OF SALIENT MODULATION DOMAIN FEATURES FOR SPEAKER IDENTIFICATION

McKnight, Simon, Imperial College London, United Kingdom of Great Britain and Northern Ireland Hogg, Aidan, Imperial College London, United Kingdom of Great Britain and Northern Ireland Neo, Vincent, Imperial College London, United Kingdom of Great Britain and Northern Ireland Naylor, Patrick, Imperial College London, United Kingdom of Great Britain and Northern Ireland

OD-A-TH2.4: A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS

Wang, Di, Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, China Li, Lantian, Center for Speech and Language Technologies, BNRist, Tsinghua University, China Yu, Hongzhi, Key Laboratory of China’s Ethnic Languages and Information Technology of Ministry of Education, Northwest Minzu University, China Wang, Dong, Center for Speech and Language Technologies, BNRist, Tsinghua University, China

OD-A-TH2.5: GENERATION OF SPEAKER REPRESENTATIONS USING HETEROGENEOUS TRAINING BATCH ASSEMBLY

Peng, Yu-Huai, Academia Sinica, Taiwan Lee, Hung-Shin, Academia Sinica, Taiwan Huang, Pin-Tuan, Academia Sinica, Taiwan Wang, Hsin-Min, Academia Sinica, Taiwan

OD-A-TH2.6: SPEECH EMOTION RECOGNITION WITH FUSION OF ACOUSTIC- AND LINGUISTIC-FEATURE-BASED DECISIONS

Nagase, Ryotaro, Ritsumeikan University, Japan Fukumori, Takahiro, Ritsumeikan University, Japan Yamashita, Yoichi, Ritsumeikan University, Japan

OD-A-TH2.7: AUTOMATIC NATURALNESS RECOGNITION FROM ACTED SPEECH USING NEURAL NETWORKS

Atmaja, Bagus Tris, National Institute of Advanced Industrial Science and Technology, Japan Sasou, Akira, National Institute of Advanced Industrial Science and Technology, Japan Akagi, Masato, Japan Advanced Institute of Science and Technology, Japan

OD-A-TH2.8: COMPARATIVE STUDY OF FILTER BANKS TO IMPROVE THE PERFORMANCE OF VOICE DISORDER ASSESSMENT SYSTEMS USING LTAS FEATURES

Barche, Purva, International Institute of Information Technology, India Gurugubelli, Krishna, International Institute of Information Technology, India Vuppala, Anil Kumar, International Institute of Information Technology, India

OD-A-TH2.9: DUAL DROPOUT RANKING OF LINGUISTIC FEATURES FOR ALZHEIMER’S DISEASE RECOGNITION

Ke, Xiaoquan, The Hong Kong Polytechnic University, Hong Kong Mak, Man-Wai, The Hong Kong Polytechnic University, Hong Kong Li, Jinchao, The Chinese University of Hong Kong, Hong Kong Meng, Helen M., The Chinese University of Hong Kong, Hong Kong

OD-A-TH2.10: A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION

Zhang, Zhaohang, Beihang University, Beijing, China, China Zhang, Xiaohui, Beijing Jiaotong University, Beijing, China, China Guo, Min, Tsinghua University, Beijing, China, China Zhang, Wei-Qiang, Tsinghua University, Beijing, China, China Li, Ke, Beijing Haitian Ruisheng Science Technology Ltd., Beijing 100083, China, China Huang, Yukai, Beijing Haitian Ruisheng Science Technology Ltd., Beijing 100083, China, China

OD-A-TH2.11: FILTERS KNOW HOW YOU FEEL: EXPLAINING INTERMEDIATE SPEECH EMOTION CLASSIFICATION REPRESENTATIONS

Anand, Anubhav, Wipro Limited, India Negi, Shubham, Wipro Limited, India N, Narendra, Wipro Limited, India

OD-A-TH2.12: DETECTING MULTIPLE DISFLUENCIES FROM SPEECH USING PRE-LINGUISTIC AUTOMATIC SYLLABIFICATION WITH ACOUSTIC AND PROSODY FEATURES

Mehrotra, Utkarsh, IIIT Hyderabad, India Garg, Sparsh, IIIT Hyderabad, India Krishna, Gurugubelli, IIIT Hyderabad, India Vuppala, Anil Kumar, IIIT Hyderabad, India

OD-A-TH2.13: SIAMESE NEURAL NETWORK WITH JOINT BAYESIAN MODEL STRUCTURE FOR SPEAKER VERIFICATION

Lu, Xugang, National Institute of Information and Communications Technology, Japan Shen, Peng, National Institute of Information and Communications Technology, Japan Tsao, Yu, Research Center for Information Technology Innovation, Taiwan Kawai, Hisashi, National Institute of Information and Communications Technology, Japan

OD-A-TH2.14: DEEP CONVOLUTIONAL NEURAL NETWORK FOR VOICE LIVENESS DETECTION

Gupta, Siddhant, Dhirubhai Ambani Institute of Information and Communication Technology, India Khoria, Kuldeep, Dhirubhai Ambani Institute of Information and Communication Technology, India Patil, Ankur T., Dhirubhai Ambani Institute of Information and Communication Technology, India Patil, Hemant A., Dhirubhai Ambani Institute of Information and Communication Technology, India

OD-A-TH2.15: HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION

Sun, Haoran, Tsinghua University, China Li, Lantian, Tsinghua University, China Zheng, Thomas Fang, Tsinghua University, China Wang, Dong, Tsinghua University, China

OD-A-TH2.16: END-TO-END SPEAKER AGE AND HEIGHT ESTIMATION USING ATTENTION MECHANISM AND TRIPLET LOSS

Kaushik, Manav, Birla Institute of Technology and Science Pilani, India Pham, Van Tung, Nanyang Technological University, Singapore The Anh, Tran, Nanyang Technological University, Singapore Chng, Eng Siong, Nanyang Technological University, Singapore

Thursday, December 16, 14:00 - 16:00 ∘ On-Demand B
OD-B-TH2 - Security and Communications

OD-B-TH2.1: CROSS-DOMAIN RECAPTURED DOCUMENT DETECTION WITH TEXTURE AND REFLECTANCE CHARACTERISTICS

Yan, Jiabin, Shenzhen University, China Chen, Changsheng, Shenzhen University, China

OD-B-TH2.2: JOINT ESTIMATION OF IMAGE ROTATION ANGLE AND SCALING FACTOR

Yu, Kun, School of Computer Science & Technology Southwest University of Science and Technology, China Yang, Rongsong, School of Computer Science & Technology Southwest University of Science and Technology, China Zeng, Hui, School of Computer Science & Technology Southwest University of Science and Technology, China Peng, Anjie, School of Computer Science & Technology Southwest University of Science and Technology, China

OD-B-TH2.3: UNDETECTABLE JPEG IMAGE BATCH REVERSIBLE DATA HIDING WITH CONTENT-ADAPTIVE PAYLOAD ALLOCATION

Wang, Yangguang, University of Science and Technology of China, China Li, Jinwei, University of Science and Technology of China, China Yao, Yuanzhi, University of Science and Technology of China, China Yu, Nenghai, University of Science and Technology of China, China

OD-B-TH2.4: WORKLOAD BASED MODEL OF LARGE SCALE 1:N BIOMETRICS MULTI-STEP NARROWING DOWN PROCESS

Aoki, Takahiro, Fujitsu Limited, Japan

OD-B-TH2.5: EVALUATION ON PALM VEIN RECOGNITION OF CHILDREN IN GROWING

Hama, Soichi, Fujitsu Limited, Japan

OD-B-TH2.6: CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY

Guo, Zhaojin, Xi’an Univeristy of Posts and Telecommunications, China Liu, Runji, Xi’an Univeristy of Posts and Telecommunications, China Lin, Yijun, Xi’an Univeristy of Posts and Telecommunications, China Chen, Feixiong, Xi’an Univeristy of Posts and Telecommunications, China Xiong, Chuxi, University of Waterloo, China Xie, Xiaoyan, Xi'an Univeristy of Posts and Telecommunications, China

OD-B-TH2.7: AN OVERLOADED MU-MIMO SIGNAL DETECTION METHOD USING PIECEWISE CONTINUOUS NONCONVEX SPARSE REGULARIZER

Hirayama, Atsuya, Osaka City University, Japan Hayashi, Kazunori, Kyoto University, Japan

OD-B-TH2.8: RECEIVED SIGNAL POWER BASED SENSOR ZONE ESTIMATION WITH MAXIMUM LIKELIHOOD APPROACH

Honda, Hiroki, Osaka City University, Japan Hayashi, Kazunori, Kyoto University, Japan Pabbisetty, Gurusanthosh, Toshiba Corporation, Japan Mori, Hiroki, Toshiba Corporation, Japan

OD-B-TH2.9: ANOMALY DETECTION FOR WIRELESS COMMUNICATION LINKS VIA DATA INTEGRITY MODELING

Nemati, Mahyar, Deakin University, Australia Park, Jihong, Deakin University, Australia Jeon, Moongu, Gwangju Institute of Science and Technology, Korea, Republic of Choi, Jinho, Deakin University, Australia

Thursday, December 16, 16:20 - 18:20 ∘ Live Session A (Hall)
LS-A-TH3 - Complex- and Hypercomplex-valued Adaptive Signal Processing
Thursday, December 16, 16:20 - 18:20 ∘ Live Session B (Annex)
LS-B-TH3 - Objective Measurements in Psychological and Cognitive Sciences

LS-B-TH3.1: MODELING THE DYNAMICS OF OBSERVATIONAL BEHAVIORS BASE ON OBSERVERS’ PERSONALITY TRAITS USING HIDDEN MARKOV MODELS

Xu, Kuangzhe, Kwansei Gakuin University, Japan Nagata, Noriko, Kwansei Gakuin University, Japan Matsuka, Toshihiko, Chiba University, Japan

LS-B-TH3.2: ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY

Xu, Kuangzhe, Kwansei Gakuin University, Japan Katahira, Kenji, Waseda University, Japan Yamazaki, Yoichi, Kwansei Gakuin University, Japan Zhang, Fan, Kwansei Gakuin University, Japan Nishida, Naoki, Kwansei Gakuin University, Japan Tamai, Yuichiro, Kwansei Gakuin University, Japan Matsuzaki, Naoyuki, Suntory Global Innovation Center Research Group, Japan Nagata, Noriko, Kwansei Gakuin University, Japan

LS-B-TH3.3: MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”

HATANO, Tomomi, Kwansei Gakuin University, Japan TAKEZAWA, Tomomi, Kwansei Gakuin University, Japan SUGIMOTO, Masashi, Kwansei Gakuin University, Japan XU, Kuangzhe, Kwansei Gakuin University, China MORIKAWA, Takashi, Kwansei Gakuin University, Japan AZUMA, Yasuhiro, Kwansei Gakuin University, Japan SHIBUTA, Kazuo, Kwansei Gakuin University, Japan NAGATA, Noriko, Kwansei Gakuin University, Japan

LS-B-TH3.4: AIZUCHI AS A SIGN OF INTERNAL INFORMATION PROCESSING AND ITS INTERPRETATIONS BY LISTENERS

Kawabata, Yoshiko, National Institute for Japanese Language and Linguistics, Japan Matsuka, Toshihiko, Chiba University, Japan

Thursday, December 16, 16:20 - 18:20 ∘ Live Session C (10A)
LS-C-TH3 - Recent Topics on Signal and Information Processing for Active Control of Sound

LS-C-TH3.1: A STUDY ON OPTIMAL FILTER OF FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM BASED ON ANALYSIS OF FREQUENCY RESPONSE

Iwai, Kenta, Ritsumeikan University, Japan Nishiura, Takanobu, Ritsumeikan University, Japan

LS-C-TH3.2: DESIGN AND EVALUATION OF ACTIVE NOISE CONTROL ON MACHINERY NOISE

Wen, Shulin, Nanyang Technological University, Singapore Nguyen, Duy Hai, Nanyang Technological University, Singapore Wang, Miqing, Nanyang Technological University, Singapore Gan, Woon-Seng, Nanyang Technological University, Singapore

LS-C-TH3.3: A SUBBAND ACTIVE NOISE CONTROL SYSTEM WITH AUTOMATIC TAP ASSIGNMENT IN CONSIDERATION OF PSYCHOACOUSTIC PROPERTIES

Yamanouchi, Satoshi, Kansai University, Japan Kajikawa, Yoshinobu, Kansai University, Japan

LS-C-TH3.4: A TRUE DIGITAL FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM WITH NO ANALOG-TO-DIGITAL AND DIGITAL-TO-ANALOG CONVERTERS

Li, Mingzhe, University of Electronic Science and Technology of China, China Shi, Chuang, University of Electronic Science and Technology of China, China Wang, Yue, University of Electronic Science and Technology of China, China

LS-C-TH3.5: DEVELOPMENT OF ACTIVE HEAR-THROUGH EQUALIZATION ALGORITHM FOR EARPHONES

Huang, Chong-Rui, Chung Yuan Christian University, Taiwan Chang, Cheng-Yuan, Chung Yuan Christian University, Taiwan Kuo, Sen M., Chung Yuan Christian University, Taiwan

Thursday, December 16, 16:20 - 18:20 ∘ Live Session D (Virtual)
LS-D-TH3 - Advanced Image, Video and Multimedia Processing using Deep Learning (2)

LS-D-TH3.1: ROBUSTNESS AGAINST ADVERSARY MODELS ON MNIST BY DEEP-Q REINFORCEMENT LEARNING BASED PARALLEL-GANS

Zhang, Rong, National Central University, Taiwan Chang, Pao-Chi, National Central University, Taiwan

LS-D-TH3.2: RATE-DISTORTION OPTIMIZED TEMPORAL SEGMENTATION USING REINFORCEMENT LEARNING FOR VIDEO CODING

Lee, Jung-Kyung, Ewha Womans University, Korea, Republic of Kim, Nayoung, Ewha Womans University, Korea, Republic of Kang, Je-Won, Ewha Womans University, Korea, Republic of

LS-D-TH3.3: A FUSION METHODOLOGY OF AKAZE AND NEURAL NETWORK FOR FINGERPRINT RECOGNITION

Raswa, Farchan Hakim, Universitas Gadjah Mada, Indonesia Harjoko, Agus, Universitas Gadjah Mada, Indonesia Chrisantonius, Chrisantonius, Universitas Gadjah Mada, Indonesia Wang, Jia-Ching, National Central University, Taiwan

LS-D-TH3.4: CONTEXT-BASED MATCHING REFINEMENT FOR PERSON SEARCH

Han, Byeong-Ju, UNIST, Korea, Republic of Yang, Jae-Won, UNIST, Korea, Republic of Lee, Oggyu, UNIST, Korea, Republic of Sim, Jae-Young, UNIST, Korea, Republic of

LS-D-TH3.5: PARTIAL FINGERPRINT ON COMBINED EVALUATION USING DEEP LEARNING AND FEATURE DESCRIPTOR

Chrisantonius, Chrisantonius, Universitas Gadjah Mada, Indonesia Priyambodo, Tri Kuntoro, Universitas Gadjah Mada, Indonesia Raswa, Farchan Hakim, Universitas Gadjah Mada, Indonesia Wang, Jia-Ching, National Central University, Taiwan

LS-D-TH3.6: ENVIRONMENT ADAPTIVE 3D POSE ESTIMATION MODEL AND LEARNING STRATEGY

Park, Yeseung, Yonsei University, Korea, Republic of Lee, Kyoungoh, Yonsei University, Korea, Republic of Lee, Sanghoon, Yonsei University, Korea, Republic of

Thursday, December 16, 16:20 - 18:20 ∘ On-Demand A
OD-A-TH3 - Speech Synthesis and Voice Conversion

OD-A-TH3.1: EMOTION-CONTROLLABLE SPEECH SYNTHESIS USING EMOTION SOFT LABELS AND FINE-GRAINED PROSODY FACTORS

Luo, Xuan, The University of Tokyo, Japan Takamichi, Shinnosuke, The University of Tokyo, Japan Koriyama, Tomoki, The University of Tokyo, Japan Saito, Yuki, The University of Tokyo, Japan Saruwatari, Hiroshi, The University of Tokyo, Japan

OD-A-TH3.2: CA-VC: A NOVEL ZERO-SHOT VOICE CONVERSION METHOD WITH CHANNEL ATTENTION

Xiao, Ruitong, south China university of technology, China Xing, Xiaofen, south China university of technology, China Yang, Jichen, south China university of technology, China Xu, Xiangmin, south China university of technology, China

OD-A-TH3.3: CONDITIONAL DEEP HIERARCHICAL VARIATIONAL AUTOENCODER FOR VOICE CONVERSION

Akuzawa, Kei, The University of Tokyo, Japan Onishi, Kotaro, The University of Electro-Communications, Tokyo, Japan Takiguchi, Keisuke, DeNA Co., Ltd., Japan Mametani, Kohki, DeNA Co., Ltd., Japan Mori, Koichiro, DeNA Co., Ltd., Japan

OD-A-TH3.4: NOISY-TO-NOISY VOICE CONVERSION FRAMEWORK WITH DENOISING MODEL

Xie, Chao, Nagoya University, Japan Wu, Yi-Chiao, Nagoya University, Japan Lumban Tobing, Patrick, Nagoya University, Japan Huang, Wen-Chin, Nagoya University, Japan Toda, Tomoki, Nagoya University, Japan

OD-A-TH3.5: ACOUSTIC SIMULATION OF BODY-CONDUCTED SPEECH AND ITS USE TO CONVERT ONE'S RECORDED VOICES TO ONE'S OWN VOICES

Chen, Ruiyan, The University of Tokyo, Japan Nishimura, Tazuko, The University of Tokyo, Japan Minematsu, Nobuaki, The University of Tokyo, Japan Saito, Daisuke, The University of Tokyo, Japan

OD-A-TH3.6: SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR

Lin, Yi-Chieh, National Yang Ming Chiao Tung University, Taiwan Han, Ji-Yan, National Yang Ming Chiao Tung University, Taiwan Lin, Yu-Min, National Yang Ming Chiao Tung University, Taiwan Zheng, Wei-Zhong, National Yang Ming Chiao Tung University, Taiwan Young, Shuenn-Tsong, MacKay Medical College, Taiwan Lai, Ying-Hui, National Yang Ming Chiao Tung University, Taiwan

OD-A-TH3.7: STARGAN-BASED EMOTIONAL VOICE CONVERSION FOR JAPANESE PHRASES

Moritani, Asuka, Ritsumeikan University, Japan Sakamoto, Shoki, Ritsumeikan University, Japan Ozaki, Ryo, Ritsumeikan University, Japan Kameoka, Hirokazu, NTT Corporation, Japan Taniguchi, Tadahiro, Ritsumeikan University, Japan

OD-A-TH3.8: UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS

Wu, Peter, Carnegie Mellon University, United States of America Liang, Paul, Carnegie Mellon University, United States of America Shi, Jiatong, Carnegie Mellon University, United States of America Salakhutdinov, Ruslan, Carnegie Mellon University, United States of America Watanabe, Shinji, Carnegie Mellon University, United States of America Morency, Louis-Philippe, Carnegie Mellon University, United States of America

OD-A-TH3.9: MULTI-SPEAKER TTS SYSTEM FOR LOW-RESOURCE LANGUAGE USING CROSS-LINGUAL TRANSFER LEARNING AND DATA AUGMENTATION

Byambadorj, Zolzaya, Tokushima University, Japan Nishimura, Ryota, Tokushima University, Japan Ayush, Altangerel, Mongolian University of Science and Technology, Mongolia Ohta, Kengo, National Institute of Technology, Anan College, Japan Kitaoka, Norihide, Toyohashi University of Technology, Japan

OD-A-TH3.10: TOWARDS UNSEEN SPEAKERS ZERO-SHOT VOICE CONVERSION WITH GENERATIVE ADVERSARIAL NETWORKS

Lu, WeiRui, South China University of Technology, China Xing, Xiaofen, South China University of Technology, China Xu, Xiangmin, South China University of Technology, China Zhang, Weibin, Shenzhen VoiceAI Technology Co. Ltd., China

OD-A-TH3.11: LOW-RESOURCE MANDARIN PROSODIC STRUCTURE PREDICTION USING SELF-TRAINING

Wang, Xingrui, Tokyo Institute of Technology, Japan Zhang, Bowen, Tokyo Institute of Technology, Japan Shinozaki, Takahiro, Tokyo Institute of Technology, Japan

OD-A-TH3.12: SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL

Zhao, Zeqing, Lenovo Research, China Chen, Xi, Lenovo Research, China Liu, Hui, Lenovo Research, China Wang, XuYang, Lenovo Research, China Yang, Lin, Lenovo Research, China Wang, Junjie, Lenovo Research, China

OD-A-TH3.13: INVESTIGATION OF TEXT-TO-SPEECH-BASED SYNTHETIC PARALLEL DATA FOR SEQUENCE-TO-SEQUENCE NON-PARALLEL VOICE CONVERSION

Ma, Ding, Nagoya University, Japan Huang, Wen-chin, Nagoya University, Japan Toda, Tomoki, Nagoya University, Japan

Thursday, December 16, 16:20 - 18:20 ∘ On-Demand B
OD-B-TH3 - Machine Learning and Data Analytics

OD-B-TH3.1: MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL

Yang, Fu-Rong, National Tsing Hua University, Taiwan Cho, Yin-Ping, National Tsing Hua University, Taiwan Yang, Yi-Hsuan, Academia Sinica, Taiwan Wu, Da-Yi, Academia Sinica, Taiwan Wu, Shan-Hung, National Tsing Hua University, Taiwan Liu, Yi-Wen, National Tsing Hua University, Taiwan

OD-B-TH3.2: TASK-AWARE BERT-BASED SENTIMENT ANALYSIS FROM MULTIPLE ESSENCES OF THE TEXT

Hsu, Jia-Hao, National Cheng Kung University, Taiwan Wu, Chung-Hsien, National Cheng Kung University, Taiwan Yang, Tsung-Hsien, Chunghwa Telecom Co., Ltd., Taiwan

OD-B-TH3.3: CONVOLUTIONAL AUTOENCODER BASED DEEP LEARNING MODEL FOR IDENTIFICATION OF RED PALM WEEVIL SIGNALS

S. R., PARVATHY, CENTRE FOR DEVELOPMENT OF ADVANCED COMPUTING, India JAYAN P., DEEPAK, CENTRE FOR DEVELOPMENT OF ADVANCED COMPUTING, India PATHROSE, NIMMY, CENTRE FOR DEVELOPMENT OF ADVANCED COMPUTING, India K. R., RAJESH, CENTRE FOR DEVELOPMENT OF ADVANCED COMPUTING, India

OD-B-TH3.4: AUGMENTATION-AGNOSTIC REGULARIZATION FOR UNSUPERVISED CONTRASTIVE LEARNING WITH ITS APPLICATION TO SPEAKER VERIFICATION

Inoue, Nakamasa, Tokyo Institute of Technology, Japan Maruyama, Tsubasa, Tokyo Institute of Technology, Japan Goto, Keita, Tokyo Institute of Technology, Japan

OD-B-TH3.5: DEEP LEARNING EVALUATION OF A STEGANOGRAPHIC ALGORITHM

Eze, Peter, University of Melbourne, Australia Udaya, Parampalli, University of Melbourne, Australia

OD-B-TH3.6: FAQ RETRIEVAL USING QUESTION-AWARE GRAPH CONVOLUTIONAL NETWORK AND CONTEXTUALIZED LANGUAGE MODEL

Tseng, Wan-Ting, National Taiwan Normal University, Taiwan Wu, Chin-Ying, National Taiwan Normal University, Taiwan Hsu, Yung-Chang, EZAI, Taiwan Chen, Berlin, National Taiwan Normal University, Taiwan

OD-B-TH3.7: 3D-GFE: A THREE-DIMENSIONAL GEOMETRIC-FEATURE EXTRACTOR FOR POINT CLOUD DATA

Chou, Yu-Chen, National Taiwan University, Taiwan Lin, Yen-Po, National Taiwan University, Taiwan Yeh, Yang-Ming, National Taiwan University, Taiwan Lu, Yi-Chang, National Taiwan University, Taiwan

OD-B-TH3.8: ATTENTION EDGECONV FOR 3D POINT CLOUD CLASSIFICATION

Lin, Yen-Po, National Taiwan University, Taiwan Yeh, Yang-Ming, National Taiwan University, Taiwan Chou, Yu-Chen, National Taiwan University, Taiwan Lu, Yi-Chang, National Taiwan University, Taiwan

OD-B-TH3.9: THE EFFECT OF DENSITY AND PLACEMENT OF BLE BEACONS ON INDOOR LOCATION AND MOTION DIRECTION ESTIMATION ACCURACY

Echizenya, Kaito, Yamagata University, Japan Kondo, Kazuhiro, Yamagata University, Japan

OD-B-TH3.10: MODEL-BASED SOFT ACTOR-CRITIC

Chien, Jen-Tzung, National Yang Ming Chiao Tung University, Taiwan Yang, Shu-Hsiang, National Yang Ming Chiao Tung University, Taiwan

OD-B-TH3.11: SELF-SUPERVISED LEARNING FOR ONLINE SPEAKER DIARIZATION

Chien, Jen-Tzung, National Yang Ming Chiao Tung University, Taiwan Luo, Sixun, National Yang Ming Chiao Tung University, Taiwan

OD-B-TH3.12: MULTI-RESOLUTION CONVOLUTIONAL RECURRENT NETWORKS

Chien, Jen-Tzung, National Yang Ming Chiao Tung University, Taiwan Huang, Yu-Min, National Yang Ming Chiao Tung University, Taiwan

OD-B-TH3.13: NETWORK INTRUSION DETECTION WITH IMPROVED FEATURE REPRESENTATION

Lee, Geonsu, Seoul National University, Korea, Republic of Rhee, Hochang, Seoul National University, Korea, Republic of Shim, Jae Hoon, Seoul National University, Korea, Republic of Koo, Hyung Il, AjouUniversity, Korea, Republic of Cho, Nam Ik, Seoul National University, Korea, Republic of

OD-B-TH3.14: 3D LANDMARK-BASED FACE DETECTION AND RECOGNITION SYSTEM FOR LARGE POSES

Tang, Ching-Tung, National Tsing Hua University, Taiwan Chiu, Ching-Te, National Tsing Hua University, Taiwan Chen, Wei-Jyun, National Tsing Hua University, Taiwan

OD-B-TH3.15: ENTAILMENT METHOD BASED ON TEMPLATE SELECTION FOR CHINESE TEXT FEW-SHOT LEARNING

Wang, Zeyuan, Beijing University of Posts and Telecommunications, China Wei, Zhiyu, Beijing University of Posts and Telecommunications, China Zhang, Lihui, Beijing University of Posts and Telecommunications, China Li, Ruifan, Beijing University of Posts and Telecommunications, China Ma, Zhanyu, Beijing University of Posts and Telecommunications, China

OD-B-TH3.16: IMAGE CAPTIONING BASED ON AN IMPROVED TRANSFORMER WITH IOU POSITION ENCODING

Li, Yazhou, Beijing University of Posts and Telecommunications, China Shi, Yihui, Beijing University of Posts and Telecommunications, China Liu, Yun, Beijing University of Posts and Telecommunications, China Li, Ruifan, Beijing University of Posts and Telecommunications, China Ma, Zhanyu, Beijing University of Posts and Telecommunications, China

Friday, December 17, 10:20 - 12:20 ∘ Live Session A (Hall)
LS-A-FR1 - Advanced Wireless Access Technologies and Data Analysis for IoT and Environmental Monitor
Friday, December 17, 10:20 - 12:20 ∘ Live Session B (Annex)
LS-B-FR1 - Advanced Signal Processing and Machine Learning for Audio and Speech Applications

LS-B-FR1.1: DEVELOPMENT OF A SYNTHETIC DATABASE FOR COMPACT NEURAL NETWORK CLASSIFICATION OF ACOUSTIC SCENES IN DEMENTIA CARE ENVIRONMENTS

Copiaco, Abigail, University of Wollongong in Dubai, United Arab Emirates Ritz, Christian, University of Wollongong, Australia Fasciani, Stefano, University of Oslo, Norway Abdulaziz, Nidhal, University of Wollongong in Dubai, United Arab Emirates

LS-B-FR1.2: REDUCING ALGORITHMIC DELAY USING LOW-OVERLAP WINDOW FOR ONLINE WAVE-U-NET

Nakaoka, Sotaro, Tsukuba University, Japan Li, Li, NTT Communication Science Laboratories, Japan Makino, Shoji, Waseda University, Japan Yamada, Takeshi, Tsukuba University, Japan

LS-B-FR1.3: FRAMEWISE FINITE IMPULSE RESPONSE FILTERING BASED ON TIME-FREQUENCY MASK FOR LOW-LATENCY SPEECH ENHANCEMENT

Haruta, Chiho, Tokyo Metropolitan University, Japan Ono, Nobutaka, Tokyo Metropolitan University, Japan Kinoshita, Yuma, Tokyo Metropolitan University, Japan

LS-B-FR1.4: CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS

Luo, Xueqin, Northwestern Polytechnical University, China Jin, Jilu, Northwestern Polytechnical University, China Huang, Gongping, Israel Institute of Technology, Israel Chen, Jingdong, Northwestern Polytechnical University, China Benesty, Jacob, University of Quebec, Canada Cohen, Israel, Israel Institute of Technology, Israel Zhang, Wen, Northwestern Polytechnical University, China

LS-B-FR1.5: MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS

Hasumi, Takuya, The University of Tokyo, Japan Nakamura, Tomohiko, The University of Tokyo, Japan Takamune, Norihiro, The University of Tokyo, Japan Saruwatari, Hiroshi, The University of Tokyo, Japan Kitamura, Daichi, National Institute of Technology, Kagawa College, Japan Takahashi, Yu, Yamaha Corporation, Japan Kondo, Kazunobu, Yamaha Corporation, Japan

Friday, December 17, 10:20 - 12:20 ∘ Live Session C (10A)
LS-C-FR1 - Data Hiding Techniques for Enriched Multimedia and Beyond

LS-C-FR1.1: TAMPERING DETECTION FOR SPEECH SIGNALS USING SYNCHRONIZATION CODE AND LSF-BASED WATERMARKS

Wang, Shengbei, Tiangong University, China Yuan, Weitao, Tiangong University, China Zhang, Zhen, Tiangong University, China Wang, Jianming, Tiangong University, China Unoki, Masashi, Japan Advanced Institute of Science and Technology, Japan

LS-C-FR1.2: IMPROVING SECURITY IN MCADAMS COEFFICIENT-BASED SPEAKER ANONYMIZATION BY WATERMARKING METHOD

Mawalim, Candy Olivia, Japan Advanced Institute of Science and Technology, Japan Unoki, Masashi, Japan Advanced Institute of Science and Technology, Japan

LS-C-FR1.3: HYBRIDIZATION OF SPEECH INFORMATION HIDING AND ENCRYPTION FOR DOUBLE-LAYER SECURITY IN SPEECH COMMUNICATION

Galajit, Kasorn, Sirindhorn International Institute of Technology, Thammasat University,, Thailand Karnjana, Jessada, NECTEC, National Science and Technology Development Agency,, Thailand aimmanee, Pakinee, Sirindhorn International Institute of Technology, Thammasat University,, Thailand Unoki, Masashi, Japan Advanced Institute of Science and Technology, Japan

LS-C-FR1.4: BSS-BASED EXTRACTION FOR ADDITIVE VIDEO WATERMARKING

Yokota, Akane, Yamaguchi University, Japan Kawamura, Masaki, Yamaguchi University, Japan

LS-C-FR1.5: DETECTION OF PERIODIC PILOT SIGNAL IN IMAGE WATERMARKING

Kawano, Rinka, Yamaguchi University, Japan Kawamura, Masaki, Yamaguchi University, Japan

LS-C-FR1.6: AN ACOUSTIC COMMUNICATION TECHNIQUE BASED ON AUDIO DATA HIDING UTILIZING ARTIFICIAL FLOWING WATER SOUNDS

Kojima, Tetsuya, National Institute of Technology, Tokyo College, Japan Muraoka, Naoyuki, National Institute of Technology, Tokyo College, Japan Matsuzaki, Raito, National Institute of Technology, Tokyo College, Japan

Friday, December 17, 10:20 - 12:20 ∘ On-Demand A
OD-A-FR1 - Speech and Singing Voice Analysis

OD-A-FR1.1: END-TO-END MANDARIN TONE CLASSIFICATION WITH SHORT TERM CONTEXT INFORMATION

Tang, Jiyang, Duke Kunshan University, China Li, Ming, Duke Kunshan University, China

OD-A-FR1.2: RETHINKING SINGING VOICE SEPARATION WITH SPECTRAL-TEMPORAL TRANSFORMER

Yu, Shuai, Fudan University, China Li, Chenxing, Kuai Shou, China Deng, Feng, Kuai Shou, China Wang, Xiaorui, Kuai Shou, China

OD-A-FR1.3: INVESTIGATING TIME-FREQUENCY REPRESENTATIONS FOR AUDIO FEATURE EXTRACTION IN SINGING TECHNIQUE CLASSIFICATION

Yamamoto, Yuya, University of Tsukuba, Japan Nam, Juhan, KAIST, Korea, Republic of Terasawa, Hiroko, University of Tsukuba, Japan Hiraga, Yuzuru, University of Tsukuba, Japan

OD-A-FR1.4: IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION

Kawahara, Hideki, Wakayama University, Japan Matsui, Toshie, Toyohashi University of Technology, Japan Yatabe, Kohei, Waseda University, Japan Sakakibara, Ken-Ichi, Health Science University of Hokkaido, Japan Tsuzaki, Minoru, Kyoto City University of Arts, Japan Morise, Masanori, Meiji University, Japan Irino, Toshio, Wakayama University, Japan

OD-A-FR1.5: TRAINING EXPLAINABLE SINGING QUALITY ASSESSMENT NETWORK WITH AUGMENTED DATA

Li, Jinhu, National University of Singapore, Singapore Gupta, Chitralekha, National University of Singapore, Singapore Li, Haizhou, National University of Singapore, Singapore

OD-A-FR1.6: TOWARDS REFERENCE-INDEPENDENT RHYTHM ASSESSMENT OF SOLO SINGING

Gupta, Chitralekha, National University of Singapore, Singapore Li, Jinhu, National University of Singapore, Singapore Li, Haizhou, National University of Singapore, Singapore

OD-A-FR1.7: NOISE ROBUST SINGING VOICE SYNTHESIS USING GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER

Xue, Heyang, Northwestern Polytechnical University, China Wu, Jie, Xiaomi Technology Co., Ltd., China Luan, Jian, Xiaomi Technology Co., Ltd., China Wang, Yujun, Xiaomi Technology Co., Ltd., China Xie, Lei, Northwestern Polytechnical University, China

OD-A-FR1.8: PITCH ESTIMATION ALGORITHM FOR NARROWBAND SPEECH SIGNAL USING PHASE DIFFERENCES BETWEEN HARMONICS

Hosoda, Yuya, Osaka university, Japan Kawamura, Arata, Kyoto Sangyo university, Japan Iiguni, Youji, Osaka university, Japan

OD-A-FR1.9: SVM-BASED EVALUATION OF THAI TONE IMITATIONS BY THAI-NAÏVE MANDARIN AND VIETNAMESE SPEAKERS

Chen, Juqiang, Western Sydney University, Australia Ni, Tianyi, The Ohio State University, United States of America Kasisopa, Benjawan, Western Sydney University, Australia Antoniou, Mark, Western Sydney University, Australia Best, Catherine, Western Sydney University, Australia

OD-A-FR1.10: ON AN IMPROVED F0 ESTIMATION BASED ON L2-NORM REGULARIZED TV-CAR SPEECH ANALYSIS

FUNAKI, KEIICHI, University of the Ryukyus, Japan

Friday, December 17, 10:20 - 12:20 ∘ On-Demand B
OD-B-FR1 - Computer Vision and Multimedia

OD-B-FR1.1: HIGH-QUALITY SINGLE IMAGE 3D FACIAL SHAPE RECONSTRUCTION VIA ROBUST ALBEDO ESTIMATION

Heo, Suwoong, Yonsei University, Korea, Republic of Song, Hyewon, Yonsei University, Korea, Republic of Kang, Jiwoo, Yonsei University, Korea, Republic of Lee, Sanghoon, Yonsei University, Korea, Republic of

OD-B-FR1.2: SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS

Huang, Huirong, Tsinghua University, China Wu, Zhiyong, Tsinghua University, China Kang, Shiyin, Tencent, China Dai, Dongyang, Tsinghua University, China Jia, Jia, Tsinghua University, China Fu, Tianxiao, Tencent, China Tuo, Deyi, Tencent, China Lei, Guangzhi, Tencent, China Liu, Peng, Tencent, China Su, Dan, Tencent, China Yu, Dong, Tencent, China Meng, Helen, The Chinese University of Hong Kong, China

OD-B-FR1.3: HMM-BASED LIP READING WITH STINGY RESIDUAL 3D CONVOLUTION

Zeng, Qifeng, University of Science and Technology of China, China Du, Jun, University of Science and Technology of China, China Wang, Zirui, Chongqing University of Posts and Telecommunications, China

OD-B-FR1.4: DEEP SIAMESE NETWORK FOR LOW-RESOLUTION FACE RECOGNITION

Lai, Shun-Cheung, The Hong Kong Polytechnic University, Hong Kong Lam, Kin-Man, The Hong Kong Polytechnic University, Hong Kong

OD-B-FR1.5: LEARN TO SKETCH: A FAST APPROACH FOR UNIVERSAL PHOTO SKETCH

Liu, Zhi-Song, Caritas Institute of Higher Education, Hong Kong, Hong Kong Siu, Wan-Chi, Hong Kong Polytechnic University, Hong Kong Chan, H. Anthony, Caritas Institute of Higher Education, Hong Kong, Hong Kong

OD-B-FR1.6: HEAD MOVEMENT PREDICTION USING FCNN

Shafi, Rabia, Northwestern Polytechnical University, Xi’an, China, China Shuai, Wan, Northwestern Polytechnical University, Xi’an, China, China Gong, Hao, Northwestern Polytechnical University, Xi’an, China, China Younus, Muhammad Usman, Université de Toulouse, Toulouse, France., France

OD-B-FR1.7: A STUDY ON VIRTUAL REALITY SICKNESS AND VISUAL ATTENTION

Lee, Jeonghaeng, Yonsei University, Korea, Republic of Kim, Woojae, Yonsei University, Korea, Republic of Kim, Jinwoo, Yonsei University, Korea, Republic of Lee, Sanghoon, Yonsei University, Korea, Republic of

OD-B-FR1.8: QUALITY OF INTERACTION ARISING FROM AUGMENTED REALITY CONTENT: A COMPREHENSIVE STUDY

Kim, Seongjean, Yonsei University, Korea, Republic of Kim, Jinwoo, Yonsei University, Korea, Republic of Lee, Sanghoon, Yonsei University, Korea, Republic of

OD-B-FR1.9: E-PIXELHOP: AN ENHANCED PIXELHOP METHOD FOR OBJECT CLASSIFICATION

Yang, Yijing, University of Southern California, United States of America Magoulianitis, Vasileios, University of Southern California, United States of America Kuo, C.-C. Jay, University of Southern California, United States of America

OD-B-FR1.10: REAL-TIME EDGE ATTENTION-BASED LEARNING FOR LOW-LIGHT ONE-STAGE OBJECT DETECTION

Pu, Yen-Yu, National Tsing Hua University, Taiwan Chiu, Ching-Te, National Tsing Hua University, Taiwan Wu, Shu-Yun, National Tsing Hua University, Taiwan

OD-B-FR1.11: CHECKERBOARD CORNER LOCALIZATION ACCELERATED WITH DEEP FALSE DETECTION FOR MULTI-CAMERA CALIBRATION

Kang, Jiwoo, Yonsei University, Korea, Republic of Yoon, Hyunse, Yonsei University, Korea, Republic of Lee, Seongmin, Yonsei University, Korea, Republic of Lee, Sanghoon, Yonsei University, Korea, Republic of

OD-B-FR1.12: STRATEGIES OF TRADITIONAL CHINESE CHARACTER RECOGNITION IN STREETSCAPE BASED ON DEEP LEARNING NETWORKS

Syu, Sin-Wun, National Central University, Taiwan Su, Po-Chyi, National Central University, Taiwan

Friday, December 17, 13:00 - 15:00 ∘ Live Session A (Hall)
LS-A-FR2 - Digital Filters and Its Applications

LS-A-FR2.2: LEARNING THE STATISTICAL MODEL OF THE NMF USING THE DEEP MULTIPLICATIVE UPDATE ALGORITHM WITH APPLICATIONS

Tanji, Hiroki, Meiji University, Japan Murakami, Takahiro, Meiji University, Japan

LS-A-FR2.3: AN IMPROVED PARAMETER FREE GENETIC ALGORITHM FOR CSD-FIR FILTER DESIGN

Kato, Ryota, School of Engineering, Tokyo Denki University, Japan Suyama, Kenji, School of Engineering, Tokyo Denki University, Japan

LS-A-FR2.4: A PROPOSAL TOWARD STANDARDIZATION OF DESIGN EXAMPLES FOR IIR FILTER DESIGN METHODS

Harigae, Yuta, School of Engineering, Tokyo Denki University, Japan Matumoto, Kazuki, School of Engineering, Tokyo Denki University, Japan Suyama, Kenji, School of Engineering, Tokyo Denki University, Japan

LS-A-FR2.5: ON OPTIMAL REALIZATIONS FOR ALL-PASS FRACTIONAL DELAY DIGITAL FILTERS

Koshita, Shunsuke, Hachinohe Institute of Technology, Japan

LS-A-FR2.6: LOW-PASS MAXIMALLY FLAT IIR DIGITAL DIFFERENTIATOR DESIGN WITH ARBITRARY FLATNESS DEGREE

Yoshida, Takashi, Tokyo Metropolitan College of Industrial Technology, Japan

Friday, December 17, 13:00 - 15:00 ∘ Live Session B (Annex)
LS-B-FR2 - Intelligent Approaches in Signal Processing for Multimedia Security (1)

LS-B-FR2.1: AN EXTENDED REVERSIBLE DATA HIDING METHOD FOR HDR IMAGES USING EDGE ESTIMATION

Ueda, Minagi, Chiba University, Japan Imaizumi, Shoko, Chiba University, Japan Wong, KokSheik, Monash University Malaysia, Malaysia

LS-B-FR2.2: IMAGE WATERMARKING BASED ON NON-NEWTONIAN EFFECT AND INTERPOLATED SWT-DWT

Khan, Ahmed, School of Information Technology, Monash University Malaysia, Malaysia, Malaysia Wong, KokSheik, School of Information Technology, Monash University Malaysia, Malaysia, Malaysia

LS-B-FR2.3: ACCESS CONTROL USING SPATIALLY INVARIANT PERMUTATION OF FEATURE MAPS FOR SEMANTIC SEGMENTATION MODELS

Ito, Hiroki, Tokyo Metropolitan University, Japan AprilPyone, MaungMaung, Tokyo Metropolitan University, Japan Kiya, Hitoshi, Tokyo Metropolitan University, Japan

LS-B-FR2.4: END-TO-END LEARNING FOR ENCRYPTED IMAGE RETRIEVAL

Feng, Qihua, Jinan University, China Li, Peiya, Jinan University, China Lu, ZhiXun, Jinan University, China Liu, Guan, Jinan University, China Huang, Feiran, Jinan University, China

LS-B-FR2.5: A PRIVACY-PRESERVING IMAGE RETRIEVAL SCHEME USING A CODEBOOK GENERATED FROM INDEPENDENT PLAIN-IMAGE DATASET

Iida, Kenta, Tokyo Metropolitan University, Japan Kiya, Hitoshi, Tokyo Metropolitan University, Japan

Friday, December 17, 13:00 - 15:00 ∘ Live Session C (10A)
LS-C-FR2 - Recent Advances in Acoustic & Biomedical Signal Processing (1)

LS-C-FR2.1: INTERNAL STATE ESTIMATION BY THERMAL IMAGE AND IDENTIFICATION OF FACE AND NOSE POSITION

Watanabe, Yuta, Chiba University, Japan Manabe, Yoshitsugu, Chiba University, Japan Yata, Noriko, Chiba University, Japan

LS-C-FR2.2: ON IMPROVING THE ACCURACY OF OBJECT DETECTION FOR HIGH RESOLUTION IMAGES BASED ON SSD

Irie, Kei, Tokyo Metropolitan University, Japan Qiu, Yicheng, Tokyo Metropolitan University, Japan Nishikawa, Kiyoshi, Tokyo Metropolitan University, Japan

LS-C-FR2.3: DETECTION OF NOTE ONSETS FROM EEG WHILE LISTENING TO MUSIC

Kumagai, Yuiko, Tokyo University of Agriculture and Technology, Japan Tanaka, Toshihisa, Tokyo University of Agriculture and Technology, Japan

LS-C-FR2.4: SPEECH ENHANCEMENT NETWORK WITH UNSUPERVISED ATTENTION USING INVARIANT INFORMATION CLUSTERING

Sugiura, Yosuke, Saitama University, Japan Nagamori, Shunta, Saitama University, Japan Shimamura, Tetsuya, Saitama University, Japan

Friday, December 17, 13:00 - 15:00 ∘ Live Session D (Virtual)
LS-D-FR2 - High Performance Intelligent Technologies for Image and Video Applications

LS-D-FR2.1: SEMI-SUPERVISED LEARNING FOR FACIAL LANDMARKS WITH CONFIDENCE AND AUGMENTATION SIFTING MECHANISMS

Chia, Hao-Wen, National Taiwan University, Taiwan Ding, Jian-Jiun, National Taiwan University, Taiwan

LS-D-FR2.2: DEEPFAKE ALGORITHM USING MULTIPLE NOISE MODALITIES WITH TWO-BRANCH PREDICTION NETWORK

Hsu, Hsuan-Wei, National Taiwan University, Taiwan Ding, Jian-Jiun, National Taiwan University, Taiwan

LS-D-FR2.3: DIGITAL MULTITONE IMAGE RECONSTRUCTION USING DEEP GENERATIVE ADVERSARIAL NETS

Guo, Jing-Ming, National Taiwan University of Science and Technology, Taiwan Seshathiri, Sankarasrinivasan, National Taiwan University of Science and Technology, Taiwan

LS-D-FR2.4: SMART FACIAL SKINCARE PRODUCTS USING COMPUTER VISION TECHNOLOGIES

Chan, Hung-Tse, National Ilan University, Taiwan Lin, Ting-Yu, National Cheng Kung University, Taiwan Deng, Shih-Chun, Taipei Private Dongshan High School, Taiwan Hsia, Chih-Hsien, National Ilan University, Taiwan Lai, Chin-Feng, National Cheng Kung University, Taiwan

LS-D-FR2.5: AN ATTENTION BASED EXPERT INSPECTION SYSTEM FOR SMART SCALP

Jhong, Sin-Ye, National Cheng Kung University, Taiwan Yang, Po-Yen, The Hong Kong University of Science and Technology, China Hsia, Chih-Hsien, National Ilan University, Taiwan

Friday, December 17, 13:00 - 15:00 ∘ On-Demand A
OD-A-FR2 - Acoustics and Sound Event Processing

OD-A-FR2.1: CNN-BASED DISCRIMINATIVE TRAINING FOR DOMAIN COMPENSATION IN ACOUSTIC EVENT DETECTION WITH FRAME-WISE CLASSIFIER

Tang, Tiantian, Shanghai Normal University, China Zhou, Xinyuan, Shanghai Normal University, China Long, Yanhua, Shanghai Normal University, China Li, Yijie, Unisound AI Technology Co., Ltd., China Liang, Jiaen, Unisound AI Technology Co., Ltd., China

OD-A-FR2.2: FREQUENCY AXIS POOLING METHOD FOR WEAKLY LABELED SOUND EVENT DETECTION AND CLASSIFICATION

Liu, Miao, Beijing Institute of Technology, China Wang, Jing, Beijing Institute of Technology, China Wang, Yujun, Xiaomi Inc., China Yang, Lidong, Inner Mongolia University of Science and Technology, China

OD-A-FR2.3: A MULTI-SOURCE LOCALIZATION METHOD BASED ON CLUSTERING AND OUTLIER REMOVAL

Gao, Shang, Beijing University of Technology, China Jia, Maoshen, Beijing University of Technology, China Bao, Changchun, Beijing University of Technology, China

OD-A-FR2.4: IMPULSIVE TIMING DETECTION BASED ON MULTI-FRAME PHASE VOTING FOR ACOUSTIC EVENT DETECTION

Mishima, Sakiko, NEC Corporation, Japan Kondo, Reishi, NEC Corporation, Japan

OD-A-FR2.5: MULTIPLE-EMBEDDING SEPARATION NETWORKS: SOUND CLASS-SPECIFIC FEATURE EXTRACTION FOR UNIVERSAL SOUND SEPARATION

Munakata, Hokuto, Osaka University, Japan Takeda, Ryu, Osaka University, Japan Komatani, Kazunori, Osaka University, Japan

OD-A-FR2.6: NARROW-EDGED BEAMFORMING USING MASKED PARAMETRIC ARRAY LOUDSPEAKERS

GENG, Yuting, Ritsumeikan University, Japan WANG, Haonan, Ritsumeikan University, Japan NAKAYAMA, Masato, Osaka Sangyo University, Japan NISHIURA, Takanobu, Ritsumeikan University, Japan

OD-A-FR2.7: COPRIME MICROPHONE ARRAYS FOR ESTIMATING SPEECH DIRECTION OF ARRIVAL USING DEEP LEARNING

Zhao, Jiahong, University of Wollongong, Australia Ritz, Christian, University of Wollongong, Australia

OD-A-FR2.8: A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT

Ooi, Kenneth, Nanyang Technological University, Singapore Watcharasupat, Karn, Nanyang Technological University, Singapore Peksi, Santi, Nanyang Technological University, Singapore Karnapi, Furi Andi, Nanyang Technological University, Singapore Ong, Zhen-Ting, Nanyang Technological University, Singapore Chua, Danny, Nanyang Technological University, Singapore Leow, Hui-Wen, Nanyang Technological University, Singapore Kwok, Li-Long, Nanyang Technological University, Singapore Ng, Xin-Lei, Nanyang Technological University, Singapore Loh, Zhen-Ann, Nanyang Technological University, Singapore Gan, Woon-Seng, Nanyang Technological University, Singapore

OD-A-FR2.9: FORMULATION OF MULTIDIMENSIONAL FREQUENCY CHARACTERISTICS OF SECOND-ORDER NONLINEAR IIR FILTER

Iwai, Kenta, Ritsumeikan University, Japan Kajikawa, Yoshinobu, Kansai University, Japan Nishiura, Takanobu, Ritsumeikan University, Japan

OD-A-FR2.10: TWO-STAGE PHASE RECONSTRUCTION USING DNN AND VON MISES DISTRIBUTION-BASED MAXIMUM LIKELIHOOD

Nguyen, Binh Thien, Ritsumeikan University, Japan Wakabayashi, Yukoh, Tokyo Metropolitan University, Japan Iwai, Kenta, Ritsumeikan University, Japan Nishiura, Takanobu, Ritsumeikan University, Japan

OD-A-FR2.11: SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT

Harada, Yuna, Ritsumeikan University, Japan Shimada, Naoto, Ritsumeikan University, Japan Wang, Haonan, Ritsumeikan University, Japan Iwai, Kenta, Ritsumeikan University, Japan Nakayama, Masato, Osaka Sangyo University, Japan Nishiura, Takanobu, Ritsumeikan University, Japan

OD-A-FR2.12: VIRTUAL SOUND SOURCE RENDERING BASED ON DISTANCE CONTROL TO PENETRATE LISTENERS USING SURROUND PARAMETRIC-ARRAY AND ELECTRODYNAMIC LOUDSPEAKERS

Ekawa, Takuma, Osaka Sangyo University, Japan Nakayama, Masato, Osaka Sangyo University, Japan Takahashi, Toru, Osaka Sangyo University, Japan

OD-A-FR2.13: SELF-ROTATION ANGLE ESTIMATION OF CIRCULAR MICROPHONE ARRAY BASED ON SOUND FIELD INTERPOLATION

Lian, Guansan, Tokyo Metropolitan University, Japan Wakabayashi, Yukoh, Tokyo Metropolitan University, Japan Nakashima, Taishi, Tokyo Metropolitan University, Japan Ono, Nobutaka, Tokyo Metropolitan University, Japan

Friday, December 17, 13:00 - 15:00 ∘ On-Demand B
OD-B-FR2 - Biomedical Signal Processing and Systems

OD-B-FR2.1: A SELF-ATTENTION-BASED ENSEMBLE CONVOLUTION NEURAL NETWORK APPROACH FOR SLEEP STAGE CLASSIFICATION WITH MERGED SPECTROGRAM

Kuo, Chih-En, National Chung Hsing University, Taiwan Liao, Po-Yu, Feng Chia University, Taiwan Lin, Yu-Syuan, Feng Chia University, Taiwan , , ,

OD-B-FR2.2: SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM

McCallan, Niamh, Ulster University, United Kingdom of Great Britain and Northern Ireland Davidson, Scot, Ulster University, United Kingdom of Great Britain and Northern Ireland Ng, Kok Yew, Ulster University, United Kingdom of Great Britain and Northern Ireland Biglarbeigi, Pardis, Ulster University, United Kingdom of Great Britain and Northern Ireland Finlay, Dewar, Ulster University, United Kingdom of Great Britain and Northern Ireland Lan, Boon Leong, Monash University, Malaysia McLaughlin, James, Ulster University, United Kingdom of Great Britain and Northern Ireland

OD-B-FR2.3: A RECOMMENDATION SYSTEMS APPROACH FOR DETECTING EPISTASIS IN GENOMIC SIGNALS

Banuelos, Mario, California State University, Fresno, United States of America Hernandez, Marissa, California State University, Fresno, United States of America

OD-B-FR2.4: UNDERSTANDING STRUCTURE INDUCED FUNCTIONAL CONNECTIVITY IN BRAIN USING EEG

Gupta, Shefali, Indian Institute of Technology, Delhi, India, India Gandhi, Tapan, Indian Institute of Technology, Delhi, India, India Sinha, Pawan, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA, India

OD-B-FR2.5: EFFECT OF VISUAL ATTENTION AND DRIVING EXPERIENCES ON THE EVENT-RELATED POTENTIAL P300 IN THE PERCEPTION OF TRAFFIC SCENES

Yamamoto, Kota, Chubu University, Japan Nobukawa, Sou, Chiba Institute of Technology, Japan Wagatsuma, Nobuhiko, Toho University, Japan Inagaki, Keiichiro, Chubu University, Japan , , ,

OD-B-FR2.6: TOWARD ESTIMATION OF ABNORMAL BRAKE IN AUTONOMOUS VEHICLES FROM ELECTROENCEPHALOGRAM AND HEART RATE INTERVAL

Sekiguchi, Erika, Tokyo University of Agriculture and Technology, Japan Kubota, Ken, JATCO Engineering Ltd, Japan Nakamura, Shun, CorLab Inc., Japan Makita, Kenichi, Innovative Technology Development Department, JATCO Ltd, Japan Tanaka, Toshihisa, Tokyo University of Agriculture and Technology, Japan

OD-B-FR2.7: SPEAKER TURN AWARE SIMILARITY SCORING FOR DIARIZATION OF SPEECH-BASED COGNITIVE ASSESSMENTS

Xu, Sean Shensheng, Shenzhen University, China Mak, Man-Wai, The Hong Kong Polytechnic University, Hong Kong Wong, Ka Ho, The Chinese University of Hong Kong, Hong Kong Meng, Helen, The Chinese University of Hong Kong, Hong Kong Kwok, Timothy C.Y., The Chinese University of Hong Kong, Hong Kong

Friday, December 17, 15:20 - 17:00 ∘ Live Session A (Hall)
LS-A-FR3 - Recent Advances in Deep Image Restoration

LS-A-FR3.1: MULTI-BAND NIR COLORIZATION USING STRUCTURE-AWARE NETWORK

Park, Min-Je, Korea University, Korea, Republic of Lee, Ju-Han, Korea University, Korea, Republic of Lee, Sang-Ho, Korea University, Korea, Republic of Kim, Jong-Ok, Korea University, Korea, Republic of

LS-A-FR3.2: PROXIMAL GRADIENT-BASED LOOP UNROLLING WITH INTERSCALE THRESHOLDING

Kobayashi, Ruiki, Niigata University, Japan Muramatsu, Shogo, Niigata University, Japan Ono, Shunsuke, Tokyo Institute of Technology University, Japan

LS-A-FR3.3: EDGE MAP-GUIDED SCALE-ITERATIVE IMAGE DEBLURRING

Min, Sung-Jun, Sogang University, Korea, Republic of Kang, Suk-Ju, Sogang University, Korea, Republic of

LS-A-FR3.4: SUPER-RESOLUTION IMAGING USING A FOCUS PIXEL SENSOR

Woo, Sung-Min, Korea University of Technology and Education, Korea, Republic of Ha, Jeong-Won, Korea University, Korea, Republic of Kim, Jong-Ok, Korea University, Korea, Republic of

LS-A-FR3.5: MULTI-VIEW VARIATIONAL AUTOENCODER FOR ROBUST CLASSIFICATION AGAINST IRRELEVANT DATA

Nishikawa, Daichi, Nagaoka University of Technology, Japan Harakawa, Ryosuke, Nagaoka University of Technology, Japan Iwahashi, Masahiro, Nagaoka University of Technology, Japan

Friday, December 17, 15:20 - 17:00 ∘ Live Session B (Annex)
LS-B-FR3 - Intelligent Approaches in Signal Processing for Multimedia Security (2)/Multimedia Security and Deepface

LS-B-FR3.1: A PROTECTION METHOD OF TRAINED CNN MODEL USING FEATURE MAPS TRANSFORMED WITH SECRET KEY FROM UNAUTHORIZED ACCESS

MaungMaung, AprilPyone, Tokyo Metropolitan University, Japan Kiya, Hitoshi, Tokyo Metropolitan University, Japan

LS-B-FR3.2: DERIVING A COMPACT ANALYTICAL MODEL FOR CAMERA RESPONSE FUNCTIONS WITH APPLICATION TO CHARTLESS RADIOMETRIC CALIBRATION

Qu, Zhenhua, Research Institute of China Telecom Ltd., China He, Ziqiang, Sun Yat-Sen University, China Kang, Xiangui, Sun Yat-Sen University, China

LS-B-FR3.3: A STUDY OF PRIVACY PROTECTION OF PHOTOS TAKEN BY A WIDE-ANGLE SURVEILLANCE CAMERA

Nakai, Koki, Okayama University, Japan Kuribayashi, Minoru, Okayama University, Japan Funabiki, Nobuo, Okayama University, Japan

LS-B-FR3.4: A PILOT EXPLORATION OF INDUSTRIAL VIDEO SCENE DATA EMBEDDING USING REAL-TIME MV-HEVC

Pang, Yik Siang, Tunku Abdul Rahman University College, Malaysia Tew, Yiqi, Tunku Abdul Rahman University College, Malaysia

LS-B-FR3.5: RELABEL, SCRAMBLE, SYNTHESIZE: A NOVEL COVERLESS STEGANOGRAPHY APPROACH VIA COLLAGE IMAGE

Ng, Koi Yee, University of Malaya, Malaysia Ong, Simying, University of Malaya, Malaysia Loh, Yuen Peng, Multimedia University, Malaysia Chan, Chee Seng, University of Malaya, Malaysia

Friday, December 17, 15:20 - 17:00 ∘ Live Session C (10A)
LS-C-FR3 - Recent Advances in Acoustic & Biomedical Signal Processing (2)

LS-C-FR3.1: EVENT-RELATED SPECTROGRAM REPRESENTATION OF EEG FOR CNN-BASED P300 SPELLER

Mussabayeva, Ayana, Nazarbayev University, Kazakhstan Ermaganbet, Zangar, Nazarbayev University, Kazakhstan Jamwal, Prashant Kumar, Nazarbayev University, Kazakhstan Akhtar, Muhammad Tahir, Nazarbayev University, Kazakhstan

LS-C-FR3.2: COST-EFFECTIVE PROPORTIONATE AFFINE PROJECTION ALGORITHM WITH VARIABLE PARAMETERS FOR ACOUSTIC FEEDBACK CANCELLATION

Okhassov, Timur, Nazarbayev University, Kazakhstan Jamwal, Prashant, Nazarbayev University, Kazakhstan Akhtar, Muhammad, Nazarbayev University, Kazakhstan

LS-C-FR3.3: SELF-SUPERVISED VISUAL TRANSFORMERS FOR BREAST CANCER DIAGNOSIS

Saidnassim, Nurbek, Nazarbayev University, Kazakhstan Abdikenov, Beibit, Relive Research LLP, Kazakhstan Kelesbekov, Rauan, Nazarbayev University, Kazakhstan Akhtar, Muhammad Tahir, Nazarbayev University, Kazakhstan Jamwal, Prashant, Nazarbayev University, Kazakhstan

LS-C-FR3.4: PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM

OCHI, Keiko, Kyoto University, Japan Kojima, Masaki, University of Tokyo, Japan Owada, Keiho, University of Tokyo, Japan Ono, Nobutaka, Tokyo Metropolitan University, Japan Sagayama, Shigeki, University of Tokyo, Japan Yamasue, Hidenori, Hamamatsu University School of Medicine, Japan

Friday, December 17, 15:20 - 17:00 ∘ Live Session D (Virtual)
LS-D-FR3 - Multimodal Learning for Biological and Biomedical Acoustic Signal Processing

LS-D-FR3.1: TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION

Liou, Yi-Syuan, Academia Sinica, Taiwan Huang, Wen-Chin, Nagoya University, Japan Yen, Ming-Chi, Academia Sinica, Taiwan Tsai, Shu-Wei, National Cheng Kung University Hospital, Taiwan Peng, Yu-Huai, Academia Sinica, Taiwan Toda, Tomoki, Nagoya University, Japan Tsao, Yu, Academia Sinica, Taiwan Wang, Hsin-Min, Academia Sinica, Taiwan

LS-D-FR3.2: ESTIMATION AND CORRECTION OF RELATIVE TRANSFER FUNCTION FOR BINAURAL SPEECH SEPARATION NETWORKS TO PRESERVE SPATIAL CUES

Feng, Zicheng, Southern University of Science and Technology, China Tsao, Yu, Academia Sinica, Taiwan Chen, Fei, Southern University of Science and Technology, China

LS-D-FR3.3: MIMO SPEECH COMPRESSION AND ENHANCEMENT BASED ON CONVOLUTIONAL DENOISING AUTOENCODER

Li, You-Jin, National Taiwan University, Taiwan Wang, Syu-Siang, Yuan Ze University, Taiwan Tsao, Yu, Academia Sinica, Taiwan Su, Borching, National Taiwan University, Taiwan

LS-D-FR3.4: PREDICTING PATIENT'S CHOICES OF HOSPITAL LEVELS USING DEEP LEARNING AND REPRESENTATION IMPROVEMENTS

Chen, Lichin, Academia Sinica, Taiwan Sheu, Ji-Tian, Chang Gung University, Taiwan Chuang, Yuh-Jue, Chang Gung University, Taiwan

LS-D-FR3.5: INSTRUMENTED ROMBERG TEST OF POSTURAL STABILITY IN PATIENTS WITH VESTIBULAR DISORDERS USING INERTIAL MEASUREMENT UNITS

Lin, Yu-Chieh, National Yang Ming Chiao Tung University, Taiwan Ting, Kuan-Chung, Taipei Veterans General Hospital, Taiwan Liu, Kai-Chun, Academia Sinica, Taiwan Hsieh, Chia-Yeh, Fu Jen Catholic University, Taiwan Chan, Chia-Tai, National Yang Ming Chiao Tung University, Taiwan

Friday, December 17, 15:20 - 17:00 ∘ On-Demand A
OD-A-FR3 - Speech Recognition and Spoken Language Processing

OD-A-FR3.1: ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION

Mao, Tingzhi, Xinjiang University, China Khassanov, Yerbolat, Nazarbayev University, Kazakhstan Pham, Van Tung, Nanyang Technological University, Singapore, Singapore Xu, Haihua, Nanyang Technological University, Singapore, Singapore Huang, Hao, Xinjiang University, China Wumaier, Aishan, Xinjiang University, China Chng, Eng Siong, Nanyang Technological University, Singapore, Singapore

OD-A-FR3.2: ENSEMBLE OF ONE MODEL: CREATING MODEL VARIATIONS FOR TRANSFORMER WITH LAYER PERMUTATION

Liaw, Andrew, National Cheng Kung University, Taiwan Hsu, Jia-Hao, National Cheng Kung University, Taiwan Wu, Chung-Hsien, National Cheng Kung University, Taiwan

OD-A-FR3.3: UNCERTAINTY ESTIMATION IN AUTOMATIC PRONUNCIATION ASSESSMENT WITH PSEUDO SAMPLES BASED ON DEEP KERNEL LEARNING

Lin, Binghuai, Tencent Technology Co., Ltd, China Wang, Liyuan, Tencent Technology Co., Ltd, China

OD-A-FR3.4: RETRIEVAL-ORIENTED E2E ASR MODELING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION

Kurokawa, Takumi, Shizuoka University, Japan Kai, Atsuhiko, Shizuoka University, Japan

OD-A-FR3.5: MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK

Peng, Yizhou, Xinjiang University, China Zhang, Jicheng, Xinjiang University, China Zhang, Haobo, Xinjiang University, China Xu, Haihua, Nanyang Technological University, Singapore Huang, Hao, Xinjiang University, China Li, Sheng, National Institute of Information and Communications Technology, Japan Chng, Eng Siong, Nanyang Technological University, Singapore

OD-A-FR3.6: IMPROVING END-TO-END MODELING FOR MISPRONUNCIATION DETECTION WITH EFFECTIVE AUGMENTATION MECHANISMS

Lo, Tien-Hong, National Taiwan Normal University, Taiwan Sung, Yao-Ting, National Taiwan Normal University, Taiwan Chen, Berlin, National Taiwan Normal University, Taiwan

OD-A-FR3.7: ZERO-SHOT DOMAIN ADAPTATION WITH INFERENCE RELATION PATHS FOR SPOKEN LANGUAGE UNDERSTANDING

Li, Sixia, Japan Advanced Institute of Science and Technology, Japan Dang, Jianwu, Japan Advanced Institute of Science and Technology, Japan

OD-A-FR3.8: END TO END SPOKEN LANGUAGE UNDERSTANDING USING PARTIAL DISENTANGLED SLOT EMBEDDING

Liu, Tan, University of Science and Technology of China, China Guo, Wu, University of Science and Technology of China, China

OD-A-FR3.9: MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE

Hatakeyama, Kazuki, Iwate Prefectural University, Japan Nishino, Masahiro, TOYOTA SYSTEMS CORPORATION, Japan Kojima, Kazunori, Iwate Prefectural University, Japan Lee, Shi-wook, AIST, Japan Itoh, Yoshiaki, Iwate Prefectural University, Japan

OD-A-FR3.10: SEPARABLE TEMPORAL CONVOLUTION PLUS TEMPORALLY POOLED ATTENTION FOR LIGHTWEIGHT HIGH-PERFORMANCE KEYWORD SPOTTING

Hu, Shenghua, Beijing Institute of Technology, China Wang, Jing, Beijing Institute of Technology, China Wang, Yujun, Xiaomi Inc., China Yang, Wenjing, Beijing Institute of Technology, China

OD-A-FR3.11: END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING

Horii, Koharu, Toyohashi University of Technology, Japan Fukuda, Meiko, Tokushima University, Japan Ohta, Kengo, National Institute of Technology, Anan College, Japan Nishimura, Ryota, Tokushima University, Japan Ogawa, Atsunori, Nippon Telegraph and Telephone Corporation, Japan Kitaoka, Norihide, Toyohashi University of Technology, Japan

OD-A-FR3.12: UNSUPERVISED SPOKEN TERM DISCOVERY USING WAV2VEC 2.0

Iwamoto, Yu, Tokyo Institute of Technology, Japan Shinozaki, Takahiro, Tokyo Institute of Technology, Japan

OD-A-FR3.13: EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS

Gong, Jian, Jiangsu University of Science and Technology, China Yu, Yameng, Jiangsu University of Science and Technology, China Bellamy, William, Jiangsu University of Science and Technology, China Wang, Feng, Jiangsu University of Science and Technology, China Ji, Xiaoli, Jiangsu University of Science and Technology, China

OD-A-FR3.14: MULTI-VIEW CONVOLUTION FOR LIPREADING

Maeda, Tsubasa, Gifu university, Japan Tamura, Satoshi, Gifu university, Japan

OD-A-FR3.15: OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES

Wang, Binling, Xiamen University, China Hu, Wenxuan, Xiamen University, China Li, Jing, Xiamen University, China Zhi, Yiming, Xiamen University, China Li, Zheng, Xiamen University, China Hong, Qingyang, Xiamen University, China Li, Lin, Xiamen University, China Wang, Dong, Tsinghua University, China Song, Liming, Speechocean, China Yang, Cheng, Speechocean, China

OD-A-FR3.16: CROSS-UTTERANCE RERANKING MODELS WITH BERT AND GRAPH CONVOLUTIONAL NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION

Chiu, Shih-Hsuan, National Taiwan Normal University, Taiwan Lo, Tien-Hong, National Taiwan Normal University, Taiwan Chao, Fu-An, National Taiwan Normal University, Taiwan Chen, Berlin, National Taiwan Normal University, Taiwan

Friday, December 17, 15:20 - 17:00 ∘ On-Demand B
OD-B-FR3 - Research Abstract

OD-B-FR3.1: FAST ALGORITHM FOR LOW-RANK TENSOR COMPLETION IN DELAY EMBEDDED SPACE

Yamamoto, Ryuki, Nagoya Institute of Technology, Japan Yokota, Tatsuya, Nagoya Institute of Technology, Japan Imakura, Akira, University of Tsukuba, Japan Hontani, Hidekata, Nagoya Institute of Technology, Japan

OD-B-FR3.2: PHASE CONTROL OF PARAMETRIC AEEAY LOUNDSPEAKER BY OPTIMIZING THE SIDEBAND WEIGHTING FUNCTIONS

OKANO, Ai, Kansai University, Japan Kajikawa, Yoshinobu, Kansai University, Japan

OD-B-FR3.3: STUDY ON GENERALIZATION PERFORMANCE OF DEEP IMAGE RESTORATION WITH UNFOLDING ON SMALL DATASETS

Itasaka, Tatsuki, Doshisha University, Japan Okuda, Masahiro, Doshisha University, Japan

OD-B-FR3.4: STABILITY OF A FINANCIAL SYSTEM VIA FINDING SYSTEMICALLY IMPORTANT FINANCIAL INSTITUTIONS

Zavialov, Igor, Nara Institute of Science and Technology, Japan Ikeda, Kazushi, Nara Institute of Science and Technology, Japan

Main Menu