APSIPA 2020
Author Index
A
Abdulla, Waleed
pg. 22
A-1-3.5 - BARK FREQUENCY SPECTRUM IN PARALLEL-FORM REMOTE ACTIVE NOISE CONTROL
pg. 1631
C-2-3.7 - GENERALISATION TECHNIQUES USING A VARIATIONAL CEAE FOR CLASSIFYING MANUKA HONEY QUALITY
Abdulla, Waleed H.
pg. 64
F-3-3.4 - SEGMENTATION OF PALM VEIN IMAGES USING U-NET
Abe, Masanobu
pg. 826
F-3-2.4 - MODULE COMPARISON OF TRANSFORMER-TTS FOR SPEAKER ADAPTATION BASED ON FINE-TUNING
Abe, Narishige
pg. 1430
B-3-2.3 - A NOVEL QUALITY ASSESSMENT METHOD FOR EYE MOVEMENT AUTHENTICATION
Abhayapala, Thushara
pg. 156
B-1-2.4 - ESTIMATING DRONE MOTOR RELATED ACOUSTIC TRANSFER FUNCTION: A PRELIMINARY INVESTIGATION
pg. 288
E-1-1.5 - ACTIVE NOISE CONTROL OVER MULTIPLE ZONES: ADAPTIVE ALGORITHM IN TIME DOMAIN
pg. 734
E-3-1.6 - ON THE USE OF THE RELATIVE TRANSFER FUNCTION FOR SOURCE SEPARATION USING TWO-CHANNEL RECORDINGS
pg. 694
F-2-3.6 - MODELLING ROOM REVERBERATION DIRECTIVITY USING VON MISES-FISHER MIXTURE DISTRIBUTION
Acharya, Rajul
pg. 538
F-2-1.3 - SUBBAND CHANNEL SELECTION USING TEO FOR REPLAY SPOOF DETECTION IN VOICE ASSISTANTS
Adachi, Koichi
pg. 1502
C-1-3.1 - PROBABILISTIC BINARY OFFLOADING FOR WIRELESS POWERED MOBILE EDGE COMPUTING SYSTEM
pg. 1513
C-1-3.3 - ESTIMATION OF DESIRED POWER AND UNDESIRED POWER USING CHIRP DEMODULATION AND EVALUATION OF ACCURACY
pg. 1460
C-1-1.4 - AUTONOMOUS DECENTRALIZED TRANSMISSION TIMING CONTROL IN WIRELESS SENSOR NETWORK
Agrawal, Dharmeshkumar
pg. 727
E-3-1.5 - IMPACT OF MINIMUM HYPERSPHERICAL ENERGY REGULARIZATION ON TIME-FREQUENCY DOMAIN NETWORKS FOR SINGING VOICE SEPARATION
Ahmed, Jameel
pg. 1132
D-1-3.6 - EVALUATION OF THE ENCODING ACCURACY OF THE PQ BASED HDR CONTENT DELIVERY FORMATS
Ahn, Seokhyun
pg. 1161
D-2-3.4 - DYNAMIC MATCHING OF LOCAL FEATURES FOR RE-IDENTIFICATION OF PEDESTRIANS
Ahsan, Zubair
pg. 1008
B-3-1.2 - PREDICTING EXPERTISE AMONG NOVICE PROGRAMMERS WITH PRIOR KNOWLEDGE ON PROGRAMMING TASKS
Ai, Yang
pg. 815
F-3-2.2 - ONLINE SPEAKER ADAPTATION FOR WAVENET-BASED NEURAL VOCODERS
Aikawa, Naoyuki
pg. 1
A-1-3.1 - AN IMPROVED METHOD FOR INSTANTANEOUS FREQUENCY ESTIMATION USING A FINITE ORDER HILBERT TRANSFORMER
Aing, Lee
pg. 1673
C-3-2.6 - DETECTING OBJECT SURFACE KEYPOINTS FROM A SINGLE RGB IMAGE VIA DEEP LEARNING NETWORK FOR 6DOF POSE ESTIMATION
Akagi, Masato
pg. 753
F-3-1.3 - ENHANCEMENT OF SPEECH INTELLIGIBILITY UNDER NOISY REVERBERANT CONDITIONS BASED ON MODULATION SPECTRUM CONCEPT
pg. 325
F-1-1.6 - DEEP MULTILAYER PERCEPTRONS FOR DIMENSIONAL SPEECH EMOTION RECOGNITION
Akhtar, Muhammad Tahir
pg. 222
C-2-2.2 - COMPARISON OF GENERIC AND SUBJECT-SPECIFIC TRAINING FOR FEATURES CLASSIFICATION IN P300 SPELLER
Amano, Masaki
pg. 1523
C-1-3.5 - SPECTRUM SHARING FOR INTERNET OF THINGS SYSTEM IN PERIODIC TRANSMISSION
Ando, Atsushi
pg. 319
F-1-1.5 - SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS
Aoki, Takafumi
pg. 1414
B-3-2.1 - PERFORMANCE EVALUATION OF FACE ANTI-SPOOFING METHOD USING DEEP METRIC LEARNING FROM A FEW FRAMES OF FACE VIDEO
Aono, Masaki
pg. 1081
D-1-2.3 - VISUAL SENTIMENT ANALYSIS FOR FEW-SHOT IMAGE CLASSIFICATION BASED ON METRIC LEARNING
pg. 1207
D-3-2.4 - PART-IN-WHOLE TYPE 3D PARTIAL SHAPE RETRIEVAL BASED ON CONNECTED FACES WITH POINTNET FEATURES
Arakawa, Kaoru
pg. 1145
D-2-3.1 - VISUAL TRACKING VIA SPATIAL-TEMPORAL REGULARIZED CORRELATION FILTERS WITH ADVANCED STATE ESTIMATION
Ardekani, Iman
pg. 57
F-3-3.3 - AN ACOUSTIC SIGNAL PROCESSING SYSTEM FOR IDENTIFICATION OF QUEEN-LESS BEEHIVES
Arnia, Fitri
pg. 924
B-1-3.4 - PERFORMANCE EVALUATION OF BINARY CLASSIFICATION OF TUBERCULOSIS THROUGH UNSHARP MASKING AND DEEP LEARNING TECHNIQUE
Asakawa, Tetsuya
pg. 1081
D-1-2.3 - VISUAL SENTIMENT ANALYSIS FOR FEW-SHOT IMAGE CLASSIFICATION BASED ON METRIC LEARNING
Asano, Futoshi
pg. 184
C-2-1.2 - AGE CLASSIFICATION OF EVACUEES AT TIMES OF DISASTER USING A VIBRATION SENSOR
Ashihara, Takanori
pg. 632
E-2-3.2 - END-TO-END AUTOMATIC SPEECH RECOGNITION WITH DEEP MUTUAL LEARNING
Atmaja, Bagus Tris
pg. 325
F-1-1.6 - DEEP MULTILAYER PERCEPTRONS FOR DIMENSIONAL SPEECH EMOTION RECOGNITION
B
Babaguchi, Noboru
pg. 1375
B-2-3.2 - DETECTION OF CLONED RECOGNIZERS: A DEFENDING METHOD AGAINST RECOGNIZER CLONING ATTACK
pg. 1400
B-2-3.6 - DEEP FACE RECOGNIZER PRIVACY ATTACK: MODEL INVERSION INITIALIZATION BY A DEEP GENERATIVE ADVERSARIAL DATA SPACE DISCRIMINATOR
Bai, Ruyu
pg. 416
E-1-3.2 - MULTI-BEAM DESIGN METHOD FOR A STEERABLE PARAMETRIC ARRAY LOUDSPEAKER
Banno, Hideki
pg. 174
C-2-1.1 - SIMULTANEOUS MEASUREMENT OF TIME-INVARIANT LINEAR AND NONLINEAR, AND RANDOM AND EXTRA RESPONSES USING FREQUENCY DOMAIN VARIANT OF VELVET NOISE
Banuelos, Mario
pg. 968
A-3-3.6 - A NEURAL NETWORK APPROACH FOR ANOMALY DETECTION IN GENOMIC SIGNALS
Bastine, Amy
pg. 694
F-2-3.6 - MODELLING ROOM REVERBERATION DIRECTIVITY USING VON MISES-FISHER MIXTURE DISTRIBUTION
Bates, Alice
pg. 734
E-3-1.6 - ON THE USE OF THE RELATIVE TRANSFER FUNCTION FOR SOURCE SEPARATION USING TWO-CHANNEL RECORDINGS
Bezzam, Eric
pg. 674
F-2-3.3 - A STUDY ON MORE REALISTIC ROOM SIMULATION FOR FAR-FIELD KEYWORD SPOTTING
Blu, Thierry
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
pg. 986
B-2-1.2 - FRI SENSING: 2D LOCALIZATION FROM 1D MOBILE SENSOR DATA
Boeck, Marion
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Botnar, René
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Byun, Kyunggeun
pg. 308
F-1-1.3 - SPEAKER-INVARIANT PSYCHOLOGICAL STRESS DETECTION USING ATTENTION-BASED NETWORK
Byun, Kyungguen
pg. 831
F-3-2.5 - EXCITGLOW: IMPROVING A WAVEGLOW-BASED NEURAL VOCODER WITH LINEAR PREDICTION ANALYSIS
C
Cadoux, Cyril
pg. 674
F-2-3.3 - A STUDY ON MORE REALISTIC ROOM SIMULATION FOR FAR-FIELD KEYWORD SPOTTING
Cai, Chengkai
pg. 449
F-1-3.1 - SPEECH ENHANCEMENT FOR OPTICAL LASER MICROPHONE WITH DEEP NEURAL NETWORK
Cao, Jie
pg. 1707
C-3-3.4 - ADAPTIVE MULTI-PROTOTYPE RELATION NETWORK
Cao, Siyi
pg. 211
C-2-1.6 - DIFFERENTIATED PROSODIC ADAPTION OF CHINESE AND ENGLISH POETRY: AN ACOUSTIC APPROACH TO READING OF CHINESE TANG POETRY AND SHAKESPEAREAN SONNETS
Chamnongthai, Kosin
pg. 1103
D-1-2.6 - FIXATIONAL FEATURE-BASED GAZE PATTERN RECOGNITION USING LONG SHORT-TERM MEMORY
Champagne, Benoit
pg. 764
F-3-1.5 - AN INTEGRATED CNN-GRU FRAMEWORK FOR COMPLEX RATIO MASK ESTIMATION IN SPEECH ENHANCEMENT
pg. 76
F-3-3.6 - ENHANCED CHANNEL TRACKING IN THZ BEAMSPACE MASSIVE MIMO: A DEEP CNN APPROACH
Chan, Kai-Hsuan
pg. 1170
D-2-3.5 - IMPLEMENTATION OF BI-RADS CLASSIFICATION AND PRIORITY PREDICTION FOR MAMMOGRAM PRE-SCREENING BASED ON MULTI-DECISION FRAMEWORK
Chan, Ken
pg. 381
F-1-2.3 - OPENNLU: OPEN-SOURCE WEB-INTERFACE NLU TOOLKIT FOR DEVELOPMENT OF CONVERSATIONAL AGENT
Chan, Yi-Ming
pg. 1594
C-2-3.1 - MERGING WELL-TRAINED DEEP CNN MODELS FOR EFFICIENT INFERENCE
Chan, Yui-Lam
pg. 1112
D-1-3.2 - ULTRA FAST SCREEN CONTENT CODING VIA RANDOM FOREST
Chang, Cheng-Sheng
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Chang, Cheng-Yuan
pg. 293
E-1-1.6 - IMPLEMENTATION OF FEEDFORWARD ACTIVE NOISE CONTROL TECHNIQUES FOR HEADPHONES
Chang, Chun-Min
pg. 314
F-1-1.4 - SENSING WITH CONTEXTS: CRYING REASON CLASSIFICATION FOR INFANT CARE CENTER WITH ENVIRONMENTAL FUSION
Chang, Pao Chi
pg. 88
F-3-3.8 - ACOUSTIC ECHO CANCELLATION BASED ON RECURRENT NEURAL NETWORK
Chen, Berlin
pg. 759
F-3-1.4 - EXPLORING FEATURE ENHANCEMENT IN THE MODULATION SPECTRUM DOMAIN VIA IDEAL RATIO MASK FOR ROBUST SPEECH RECOGNITION
Chen, Chih-Yang
pg. 1527
D-2-1.1 - DUAL ADAPTIVE MODULATION AND CODING FOR MITIGATING UE-UE INTERFERENCE IN HETEROGENEOUS TDD SLOT CONFIGURATIONS
Chen, Chu-Song
pg. 1594
C-2-3.1 - MERGING WELL-TRAINED DEEP CNN MODELS FOR EFFICIENT INFERENCE
pg. 1605
C-2-3.3 - EXTENDING CONDITIONAL CONVOLUTION STRUCTURES FOR ENHANCING MULTITASKING CONTINUAL LEARNING
Chen, Fei
pg. 894
B-1-1.4 - A TEMPORAL ENVELOPE-BASED SPEECH RECONSTRUCTION APPROACH WITH EEG SIGNALS DURING SPEECH IMAGERY
Chen, Houshou
pg. 1448
C-1-1.2 - CONSTRUCTION OF CYCLICALLY PERMUTABLE CODES FROM PRIME LENGTH CYCLIC CODES
Chen, Hsiang-Chun
pg. 314
F-1-1.4 - SENSING WITH CONTEXTS: CRYING REASON CLASSIFICATION FOR INFANT CARE CENTER WITH ENVIRONMENTAL FUSION
Chen, Huan-Yu
pg. 314
F-1-1.4 - SENSING WITH CONTEXTS: CRYING REASON CLASSIFICATION FOR INFANT CARE CENTER WITH ENVIRONMENTAL FUSION
Chen, Hwann-Tzong
pg. 1075
D-1-2.2 - HALLUCINATING SCENES
pg. 1087
D-1-2.4 - LEARNING DENSE CORRESPONDENCES VIA LOCAL AND NON-LOCAL FEATURE FUSION
Chen, Jiajia
pg. 1150
D-2-3.2 - A NEW POLARIZED IMAGE FUSION ALGORITHM BASED ON TWO-SCALE GUIDED FILTERING
pg. 6
A-1-3.2 - A NEW ALGORITHM TO DERIVE HARDWARE EFFICIENT INTEGER DISCRETE COSINE TRANSFORM FOR HEVC
pg. 1156
D-2-3.3 - AN IMPROVED GUIDED FILTERING ALGORITHM FOR POLARIZED IMAGES BY USING LOG OPERATOR
Chen, Jim Hao
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Chen, Kuan-Yu
pg. 386
F-1-2.4 - SPOKEN MULTIPLE-CHOICE QUESTION ANSWERING USING MULTI-TURN AUDIO-EXTRACTER BERT
Chen, Po-Yu
pg. 1234
D-2-2.1 - FUSION TECHNOLOGY OF RADAR AND RGB CAMERA SENSORS FOR OBJECT DETECTION AND TRACKING AND ITS EMBEDDED SYSTEM IMPLEMENTATION
Chen, Rilin
pg. 720
E-3-1.4 - INDEPENDENT VECTOR ANALYSIS FOR BLIND SPEECH SEPARATION USING COMPLEX GENERALIZED GAUSSIAN MIXTURE MODEL WITH WEIGHTED VARIANCE
Chen, Sheng
pg. 1549
D-2-1.5 - REALIZATION OF 5G NETWORK SLICING USING OPEN SOURCE SOFTWARES
Chen, Yan
pg. 11
A-1-3.3 - NON-LINE-OF-SIGHT IMAGING WITH RADIO SIGNALS
pg. 1617
C-2-3.5 - CAN-SIN: A CROSS-LAYER HETEROGENEOUS ACADEMIC NETWORK WITH SEMANTIC INFORMATION
pg. 161
B-1-2.5 - AN EVOLUTIONARY GAME THEORETICAL FRAMEWORK FOR DECISION FUSION IN THE PRESENCE OF BYZANTINES
Chen, Yanyang
pg. 584
E-2-2.5 - ACOUSTIC ANALYSIS OF NASALIZATION IN MANDARIN PRENASAL VOWELS PRODUCED BY WENZHOU AND RUGAO SPEAKERS
Chen, Ying
pg. 584
E-2-2.5 - ACOUSTIC ANALYSIS OF NASALIZATION IN MANDARIN PRENASAL VOWELS PRODUCED BY WENZHOU AND RUGAO SPEAKERS
Chen, Yu-Hao
pg. 1170
D-2-3.5 - IMPLEMENTATION OF BI-RADS CLASSIFICATION AND PRIORITY PREDICTION FOR MAMMOGRAM PRE-SCREENING BASED ON MULTI-DECISION FRAMEWORK
Chen, Zhaoqi
pg. 963
A-3-3.5 - BOWEL MOVEMENT SIGNAL MODELING AND PARAMETERS EXTRACTION
Cheng, Chia-Ming
pg. 1075
D-1-2.2 - HALLUCINATING SCENES
Cheng, Linjuan
pg. 769
F-3-1.6 - A TIME-DOMAIN MONAURAL SPEECH ENHANCEMENT WITH FEEDBACK LEARNING
Cheung, Tsun-hin
pg. 376
F-1-2.2 - SIMULTANEOUS FAKE NEWS AND TOPIC CLASSIFICATION VIA AUXILIARY TASK LEARNING
Chiang, Jui-Chiu
pg. 1128
D-1-3.5 - RATE-DISTORTION OPTIMIZATION FOR 360-DEGREE IMAGE CONSIDERING VISUAL ATTENTION
Chien, Jen-Tzung
pg. 1611
C-2-3.4 - MULTIPLE TARGET PREDICTION FOR DEEP REINFORCEMENT LEARNING
pg. 1713
C-3-3.5 - SUPPORTIVE AND SELF ATTENTIONS FOR IMAGE CAPTION
Chin, Wen-Chi
pg. 1087
D-1-2.4 - LEARNING DENSE CORRESPONDENCES VIA LOCAL AND NON-LOCAL FEATURE FUSION
Chiu, Ching-Te
pg. 71
F-3-3.5 - DEEP NEURAL NETWORK COMPRESSION WITH KNOWLEDGE DISTILLATION USING CROSS-LAYER MATRIX, KL DIVERGENCE AND OFFLINE ENSEMBLE
Cho, Keng-Pei
pg. 1448
C-1-1.2 - CONSTRUCTION OF CYCLICALLY PERMUTABLE CODES FROM PRIME LENGTH CYCLIC CODES
Cho, Nam Ik
pg. 1161
D-2-3.4 - DYNAMIC MATCHING OF LOCAL FEATURES FOR RE-IDENTIFICATION OF PEDESTRIANS
pg. 1067
D-1-2.1 - LOCAL BACKLIGHT DIMMING FOR LIQUID CRYSTAL DISPLAYS VIA CONVOLUTIONAL NEURAL NETWORK
Chou, Hsing-Hung
pg. 71
F-3-3.5 - DEEP NEURAL NETWORK COMPRESSION WITH KNOWLEDGE DISTILLATION USING CROSS-LAYER MATRIX, KL DIVERGENCE AND OFFLINE ENSEMBLE
Chou, Huang-Cheng
pg. 393
F-1-2.5 - "YOUR BEHAVIOR MAKES ME THINK IT IS A LIE": RECOGNIZING PERCEIVED DECEPTION USING MULTIMODAL DATA IN DIALOG GAMES
Chouksey, Mausam
pg. 1177
D-2-3.6 - VARIATIONAL MODE DECOMPOSITION BASED IMAGE SEGMENTATION USING SINE COSINE ALGORITHM
Chu, Chan-Chuan
pg. 1647
C-3-2.2 - MPOP600: A MANDARIN POPULAR SONG DATABASE WITH ALIGNED AUDIO, LYRICS, AND MUSICAL SCORES FOR SINGING VOICE SYNTHESIS
Chuang, Shang-Yi
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Chuang, Yi-Chin
pg. 346
E-1-2.3 - BEAT AND DOWNBEAT TRACKING OF SYMBOLIC MUSIC DATA USING DEEP RECURRENT NEURAL NETWORKS
Coronel, Carmina
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Cruz, Gastao
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Cui, Sanshuai
pg. 1352
B-2-2.3 - DENSELY CONNECTED CONVOLUTIONAL NETWORK FOR AUDIO SPOOFING DETECTION
D
Dai, Lirong
pg. 638
E-2-3.3 - ATTENTIVE FUSION ENHANCED AUDIO-VISUAL ENCODING FOR TRANSFORMER BASED ROBUST SPEECH RECOGNITION
Dai, Yuchao
pg. 150
B-1-2.3 - CLASS ATTENTION NETWORK FOR SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGES
Dang, Jianwu
pg. 881
B-1-1.2 - A MULTI-SUBJECT TEMPORAL-SPATIAL HYPER-ALIGNMENT METHOD FOR EEG-BASED NEURAL ENTRAINMENT TO SPEECH
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Das, Rohan Kumar
pg. 747
F-3-1.2 - CLASSIFICATION OF SPEECH WITH AND WITHOUT FACE MASK USING ACOUSTIC FEATURES
pg. 605
F-2-2.3 - HLT-NUS SUBMISSION FOR 2019 NIST MULTIMEDIA SPEAKER RECOGNITION EVALUATION
pg. 610
F-2-2.4 - EMOTION INVARIANT SPEAKER EMBEDDINGS FOR SPEAKER IDENTIFICATION WITH EMOTIONAL SPEECH
Dev Sarma, Biswajit
pg. 610
F-2-2.4 - EMOTION INVARIANT SPEAKER EMBEDDINGS FOR SPEAKER IDENTIFICATION WITH EMOTIONAL SPEECH
Ding, Ning
pg. 805
E-3-2.6 - ADAPTIVE NOISE SUPPRESSION FOR WAKE-WORD DETECTION BY TEMPORAL-DIFFERENCE GENERALIZED EIGENVALUE BEAMFORMER
Ding, Yi-Yang
pg. 556
F-2-1.6 - ADVERSARIAL POST-PROCESSING OF VOICE CONVERSION AGAINST SPOOFING DETECTION
Doya, Kenji
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Du, Jun
pg. 465
F-1-3.4 - FREQUENCY GATING: IMPROVED CONVOLUTIONAL NEURAL NETWORKS FOR SPEECH ENHANCEMENT IN THE TIME-FREQUENCY DOMAIN
Du, Yuwei
pg. 875
B-1-1.1 - CLASSIFICATION OF SEIZURE EEGS BASED ON SHORT-TIME FOURIER TRANSFORM AND HIDDEN MARKOV MODEL
Du, Zhuolin
pg. 92
A-2-3.1 - A PARALLELIZATION METHOD OF INCEPTION ARCHITECTURE BASED ON ARRAY PROCESSOR
pg. 104
A-2-3.2 - RSP-BT: AN ADVANCED PARALLEL METHOD FOR DEPTH MAP MOTION ESTIMATION
Du, Zongyang
pg. 507
E-2-1.4 - SPECTRUM AND PROSODY CONVERSION FOR CROSS-LINGUAL VOICE CONVERSION WITH CYCLEGAN
E
Echizen, Isao
pg. 1293
B-3-3.1 - A METHOD FOR IDENTIFYING ORIGIN OF DIGITAL IMAGES USING A CONVOLUTIONAL NEURAL NETWORK
pg. 1386
B-2-3.4 - DETECTION OF ADVERSARIAL EXAMPLES BASED ON SENSITIVITIES TO NOISE REMOVAL FILTER
pg. 1392
B-2-3.5 - A QR SYMBOL WITH ECDSA FOR BOTH PUBLIC AND SECRET AREAS USING RHOMBIC SUB-CELLS
pg. 1406
B-2-3.7 - COLOR TRANSFER TO ANONYMIZED GAIT IMAGES WHILE MAINTAINING ANONYMIZATION
Endo, Hideki
pg. 1466
C-1-1.5 - PACKET AGGREGATION BASED ON ENCRYPTION-THEN-COMPRESSION FOR HIGHLY EFFICIENT MULTI-HOP TRANSMISSION
Eshghi, Mohammad
pg. 572
E-2-2.3 - PHONEME EMBEDDINGS ON PREDICTING FUNDAMENTAL FREQUENCY PATTERN FOR ELECTROLARYNGEAL SPEECH
F
Fan, Meng
pg. 126
A-2-3.4 - OPTIMIZATION OF FALSE-OVERLAP DETECTION OF TILE ASSEMBLY IN TILE-BASED RENDERING
Fang, Chunyao
pg. 1033
D-1-1.1 - CLOUD RECOGNITION BASED ON LIGHTWEIGHT NEURAL NETWORK
Fang, Fuming
pg. 1293
B-3-3.1 - A METHOD FOR IDENTIFYING ORIGIN OF DIGITAL IMAGES USING A CONVOLUTIONAL NEURAL NETWORK
Feng, Hui
pg. 205
C-2-1.5 - SAMPLING POLICY DESIGN FOR TRACKING TIME-VARYING GRAPH SIGNALS WITH ADAPTIVE BUDGET ALLOCATION
Finke, Nils
pg. 189
C-2-1.3 - ON THE BEHAVIOUR OF PERMUTATION ENTROPY ON FRACTIONAL BROWNIAN MOTION IN A MULTIVARIATE SETTING
Fleming, Rachel
pg. 1310
B-3-3.4 - VEIN PATTERN VISUALISATION USING CONDITIONAL GENERATIVE ADVERSARIAL NETWORKS
Fu, Szu-Wei
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
pg. 482
F-1-3.7 - STOI-NET: A DEEP LEARNING BASED NON-INTRUSIVE SPEECH INTELLIGIBILITY ASSESSMENT MODEL
Fu, Zhonghua
pg. 716
E-3-1.3 - MULTI-CHANNEL SPEECH SEPARATION USING DEEP EMBEDDING WITH MULTILAYER BOOTSTRAP NETWORKS
Fuchikami, Manabu
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Fuh, Chiou-Shann
pg. 482
F-1-3.7 - STOI-NET: A DEEP LEARNING BASED NON-INTRUSIVE SPEECH INTELLIGIBILITY ASSESSMENT MODEL
Fujii, Takeo
pg. 1477
C-1-2.1 - COMPENSATION METHOD OF RECEIVED SIGNAL POWER OBSERVED BY SMARTPHONE FOR CROWDSENSED SPECTRUM DATABASE
pg. 1507
C-1-3.2 - SCHEDULING ALGORITHM CONSIDERING INTERFERENCE INTERVAL FOR LPWA
pg. 1513
C-1-3.3 - ESTIMATION OF DESIRED POWER AND UNDESIRED POWER USING CHIRP DEMODULATION AND EVALUATION OF ACCURACY
pg. 1519
C-1-3.4 - ON PLACEMENT OF END DEVICES IN LPWAN BASED WSN FOR ENVIRONMENTAL MONITORING APPLICATIONS
pg. 1497
C-1-2.4 - SPECIFICATION OF LINK QUALITY DEGRADATION IN WLAN BASED ON MCS AND RETRANSMISSION FLAG
pg. 1460
C-1-1.4 - AUTONOMOUS DECENTRALIZED TRANSMISSION TIMING CONTROL IN WIRELESS SENSOR NETWORK
Fujimura, Hiroshi
pg. 805
E-3-2.6 - ADAPTIVE NOISE SUPPRESSION FOR WAKE-WORD DETECTION BY TEMPORAL-DIFFERENCE GENERALIZED EIGENVALUE BEAMFORMER
Fukusaki, Takuto
pg. 100
A-2-3.1 - AN EVALUATION OF A CNN-BASED PARKING DETECTION SYSTEM WITH WEBCAMS
Fukushima, Norishige
pg. 934
B-1-3.6 - COMPARISON OF IMAGE FEATURES DESCRIPTIONS FOR DIAGNOSIS OF LEAF DISEASES
pg. 28
A-1-3.6 - AN EFFICIENT DESCRIPTION WITH HALIDE FOR IIR GAUSSIAN FILTER
Funabiki, Nobuo
pg. 1340
B-2-2.1 - DEEPWATERMARK: EMBEDDING WATERMARK INTO DNN MODEL
pg. 1381
B-2-3.3 - CLASSIFICATION OF VIDEO RECAPTURED FROM DISPLAY DEVICE
pg. 1386
B-2-3.4 - DETECTION OF ADVERSARIAL EXAMPLES BASED ON SENSITIVITIES TO NOISE REMOVAL FILTER
Funaki, Keiichi
pg. 568
E-2-2.2 - TV-CAR SPEECH ANALYSIS BASED ON THE L2-NORM REGULARIZATION IN THE TIME-DOMAIN AND FREQUENCY DOMAIN
Furuya, Ken’ichi
pg. 929
B-1-3.5 - HYPERPARAMETER TUNING OF THE SHUNT-MURMUR DISCRIMINATION ALGORITHM USING BAYESIAN OPTIMIZATION
G
Gao, Guanglai
pg. 52
F-3-3.2 - ROBUST SPEECH DEREVERBERATION BASED ON WPE AND DEEP LEARNING
Gao, Wen-Biao
pg. 255
C-3-1.2 - WINDOWED FRACTIONAL FOURIER TRANSFORM ON GRAPHS: FRACTIONAL TRANSLATION OPERATOR AND HAUSDORFF-YOUNG INEQUALITY
Garn, Heinrich
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Gatidis, Sergios
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Ge, Meng
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Geng, Yurong
pg. 126
A-2-3.4 - OPTIMIZATION OF FALSE-OVERLAP DETECTION OF TILE ASSEMBLY IN TILE-BASED RENDERING
Geng, Yuting
pg. 409
E-1-3.1 - EVALUATION OF A MULTI-WAY PARAMETRIC ARRAY LOUDSPEAKER BASED ON MULTIPLEXED DOUBLE SIDEBAND MODULATION
Gilliam, Christopher
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
pg. 992
B-2-1.3 - APPLICATION OF IMAGE PROCESSING AND CIRCULAR STATISTICS TO 3D CELLULAR ALIGNMENT
Gisselbrecht, Thibault
pg. 674
F-2-3.3 - A STUDY ON MORE REALISTIC ROOM SIMULATION FOR FAR-FIELD KEYWORD SPOTTING
Glos, Martin
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Gohil, Raj
pg. 437
E-1-3.6 - LEARNING BASED DOA ESTIMATION IN ADVERSE ACOUSTIC ENVIRONMENT USING CO-PRIME CIRCULAR MICROPHONE ARRAY
Gong, Jian
pg. 589
E-2-2.6 - TEMPORAL AND FORMANT TRAJECTORY ANALYSIS OF ENGLISH TENSE-LAX VOWELS PRODUCED BY NATIVE CHINESE SPEAKERS
Goto, Kana
pg. 858
E-3-3.3 - A STUDY ON GEOMETRICALLY CONSTRAINED IVA WITH AUXILIARY FUNCTION APPROACH AND VCD FOR IN-CAR COMMUNICATION
Goto, Keita
pg. 527
F-2-1.1 - QUASI-NEWTON ADVERSARIAL ATTACKS ON SPEAKER VERIFICATION SYSTEMS
pg. 1641
C-3-2.1 - SEMI-SUPERVISED CONTRASTIVE LEARNING WITH GENERALIZED CONTRASTIVE LOSS AND ITS APPLICATION TO SPEAKER RECOGNITION
pg. 600
F-2-2.2 - OPTIMIZING SPEAKER EMBEDDINGS USING META-TRAINING SETS
pg. 1693
C-3-3.2 - CLOSED-FORM PRE-TRAINING FOR SMALL-SAMPLE ENVIRONMENTAL SOUND RECOGNITION
Gou, Jiacheng
pg. 416
E-1-3.2 - MULTI-BEAM DESIGN METHOD FOR A STEERABLE PARAMETRIC ARRAY LOUDSPEAKER
Grixti-Cheng, Daniel
pg. 734
E-3-1.6 - ON THE USE OF THE RELATIVE TRANSFER FUNCTION FOR SOURCE SEPARATION USING TWO-CHANNEL RECORDINGS
Gu, Rongzhi
pg. 595
F-2-2.1 - CONTEXT-ADAPTIVE GAUSSIAN ATTENTION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Guo, Jiun-In
pg. 1234
D-2-2.1 - FUSION TECHNOLOGY OF RADAR AND RGB CAMERA SENSORS FOR OBJECT DETECTION AND TRACKING AND ITS EMBEDDED SYSTEM IMPLEMENTATION
Guo, Ruiming
pg. 986
B-2-1.2 - FRI SENSING: 2D LOCALIZATION FROM 1D MOBILE SENSOR DATA
Gupta, Chitralekha
pg. 492
E-2-1.2 - SPECTRAL FEATURES AND PITCH HISTOGRAM FOR AUTOMATIC SINGING QUALITY EVALUATION WITH CRNN
Gupta, Priyanka
pg. 543
F-2-1.4 - DESIGN OF VOICE PRIVACY SYSTEM USING LINEAR PREDICTION
H
Ha, Seong Jong
pg. 1279
D-3-3.6 - TEMPORAL ATTENTION FEATURE ENCODING FOR VIDEO CAPTIONING
Hamada, Yuri
pg. 958
A-3-3.4 - SHEET-TYPE DEVICE FOR UNCONSTRAINED HEART SOUND MEASUREMENT AND WHITE NOISE REDUCTION BY WIENER FILTER
Hammernik, Kerstin
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Han, Hyewon
pg. 308
F-1-1.3 - SPEAKER-INVARIANT PSYCHOLOGICAL STRESS DETECTION USING ATTENTION-BASED NETWORK
Han, Mengqiao
pg. 126
A-2-3.4 - OPTIMIZATION OF FALSE-OVERLAP DETECTION OF TILE ASSEMBLY IN TILE-BASED RENDERING
Hara, Sunao
pg. 826
F-3-2.4 - MODULE COMPARISON OF TRANSFORMER-TTS FOR SPEAKER ADAPTATION BASED ON FINE-TUNING
Hara, Takanori
pg. 1466
C-1-1.5 - PACKET AGGREGATION BASED ON ENCRYPTION-THEN-COMPRESSION FOR HIGHLY EFFICIENT MULTI-HOP TRANSMISSION
Hart, Rylea
pg. 1201
D-3-2.3 - THE VALIDITY OF A DUAL AZURE KINECT-BASED MOTION CAPTURE SYSTEM FOR GAIT ANALYSIS: A PRELIMINARY STUDY
Hasannezhad, Mojtaba
pg. 764
F-3-1.5 - AN INTEGRATED CNN-GRU FRAMEWORK FOR COMPLEX RATIO MASK ESTIMATION IN SPEECH ENHANCEMENT
Hattori, Gen
pg. 1601
C-2-3.2 - EFFICIENT DIVERSE RESPONSE GENERATION IN ATTENTION-BASED NEURAL CONVERSATIONAL MODEL WITH MAXIMUM MUTUAL INFORMATION
Hautamäki, Ville
pg. 1300
B-3-3.2 - COST SENSITIVE OPTIMIZATION OF DEEPFAKE DETECTOR
Hayakawa, Ryo
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
Hayashi, Kazunori
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
pg. 228
C-2-2.3 - OPTIMAL COMBINATION WEIGHT FOR SPARSE DIFFUSION LEAST-MEAN-SQUARE BASED ON CONSENSUS PROPAGATION
He, Mingyi
pg. 150
B-1-2.3 - CLASS ATTENTION NETWORK FOR SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGES
He, Ying
pg. 11
A-1-3.3 - NON-LINE-OF-SIGHT IMAGING WITH RADIO SIGNALS
Hegde, Rajesh
pg. 437
E-1-3.6 - LEARNING BASED DOA ESTIMATION IN ADVERSE ACOUSTIC ENVIRONMENT USING CO-PRIME CIRCULAR MICROPHONE ARRAY
Heo, Suwoong
pg. 1262
D-3-3.3 - IMAGE INPAINTING USING WEIGHTED MASK CONVOLUTION
Hidetake, Uwano
pg. 1017
B-3-1.3 - DISCOVERY OF EVENT-RELATED POTENTIALS DURING A COGNITIVE PROCESS OF COMPARISON OPERATION
Higashi, Akinori
pg. 1386
B-2-3.4 - DETECTION OF ADVERSARIAL EXAMPLES BASED ON SENSITIVITIES TO NOISE REMOVAL FILTER
Hioka, Yusuke
pg. 850
E-3-3.2 - SOURCE ENHANCEMENT FOR UNMANNED AERIAL VEHICLE RECORDING USING MULTI-SENSORY INFORMATION
Hirabayashi, Akira
pg. 339
E-1-2.2 - DEEP NEURAL NETWORK MODELING OF DISTORTION STOMP BOX USING SPECTRAL FEATURES
Hirasawa, Ryoichi
pg. 1347
B-2-2.2 - FLEXIBLE DATA HIDING AND EXTRACTION IN ETC IMAGES
Hirata, Kouji
pg. 1536
D-2-1.2 - OPTIMIZATION OF VIRTUAL MACHINE PLACEMENT FOR BALANCING NETWORK AND SERVER LOAD IN EDGE COMPUTING ENVIRONMENTS
pg. 1541
D-2-1.3 - PREDICTION METHOD OF MALWARE INFECTION SPREADING CONSIDERING NETWORK SCALE
pg. 1545
D-2-1.4 - JOINT OPTIMIZATION OF EDGE SERVER AND VIRTUAL MACHINE PLACEMENT IN EDGE COMPUTING ENVIRONMENTS
Hirayama, Atsuya
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
Ho, Tuanvu
pg. 753
F-3-1.3 - ENHANCEMENT OF SPEECH INTELLIGIBILITY UNDER NOISY REVERBERANT CONDITIONS BASED ON MODULATION SPECTRUM CONCEPT
Ho, Tzong-Shiann
pg. 1626
C-2-3.6 - NATURAL LANGUAGE PROCESSING METHODS FOR DETECTION OF INFLUENZA-LIKE ILLNESS FROM CHIEF COMPLAINTS
Honda, Hiroki
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
Honda, Kiyoshi
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Hong, Qingyang
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Horiike, Daiki
pg. 443
E-1-3.7 - ENERGY-BASED MULTIPLE SOURCE LOCALIZATION WITH BLINKIES
Hoshino, Junichi
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
Hosseini, Seyyed Saleh
pg. 76
F-3-3.6 - ENHANCED CHANNEL TRACKING IN THZ BEAMSPACE MASSIVE MIMO: A DEEP CNN APPROACH
Hou, Junfeng
pg. 638
E-2-3.3 - ATTENTIVE FUSION ENHANCED AUDIO-VISUAL ENCODING FOR TRANSFORMER BASED ROBUST SPEECH RECOGNITION
Hou, Qinhan
pg. 1657
C-3-2.4 - DECODING MUSIC GENRES BASED ON HIGH RESOLUTION BRAIN ACTIVITY INFORMATION
Hou, Yu-Hong
pg. 1247
D-2-2.3 - SCENE TEXT-LINE EXTRACTION WITH FULLY CONVOLUTIONAL NETWORK AND REFINED PROPOSALS
Hsieh, I-Ting
pg. 302
F-1-1.2 - ACOUSTIC AND TEXTUAL DATA AUGMENTATION FOR CODE-SWITCHING SPEECH RECOGNITION IN UNDER-RESOURCED LANGUAGE
Hsieh, Ting-I
pg. 1075
D-1-2.2 - HALLUCINATING SCENES
Hsieh, Tsun-An
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Hsu, Jia-Hao
pg. 1048
D-1-1.3 - ATTENTIVELY-COUPLED LONG SHORT-TERM MEMORY FOR AUDIO-VISUAL EMOTION RECOGNITION
pg. 1626
C-2-3.6 - NATURAL LANGUAGE PROCESSING METHODS FOR DETECTION OF INFLUENZA-LIKE ILLNESS FROM CHIEF COMPLAINTS
Hsu, Ruei-Hau
pg. 1578
D-3-1.4 - PRIVACY-PRESERVING DATA SHARING WITH ATTRIBUTE-BASED PRIVATE MATCHING BASED ON EDGE COMPUTATION IN THE INTERNET-OF-THINGS
Hu, Bo
pg. 205
C-2-1.5 - SAMPLING POLICY DESIGN FOR TRACKING TIME-VARYING GRAPH SIGNALS WITH ADAPTIVE BUDGET ALLOCATION
Hu, Chuanzhan
pg. 92
A-2-3.1 - A PARALLELIZATION METHOD OF INCEPTION ARCHITECTURE BASED ON ARRAY PROCESSOR
pg. 104
A-2-3.2 - RSP-BT: AN ADVANCED PARALLEL METHOD FOR DEPTH MAP MOTION ESTIMATION
Hu, Hong
pg. 161
B-1-2.5 - AN EVOLUTIONARY GAME THEORETICAL FRAMEWORK FOR DECISION FUSION IN THE PRESENCE OF BYZANTINES
pg. 1617
C-2-3.5 - CAN-SIN: A CROSS-LAYER HETEROGENEOUS ACADEMIC NETWORK WITH SEMANTIC INFORMATION
Hu, Yang
pg. 11
A-1-3.3 - NON-LINE-OF-SIGHT IMAGING WITH RADIO SIGNALS
Hu, Yu
pg. 556
F-2-1.6 - ADVERSARIAL POST-PROCESSING OF VOICE CONVERSION AGAINST SPOOFING DETECTION
Hu, Yu-Hsaing
pg. 1578
D-3-1.4 - PRIVACY-PRESERVING DATA SHARING WITH ATTRIBUTE-BASED PRIVATE MATCHING BASED ON EDGE COMPUTATION IN THE INTERNET-OF-THINGS
Huang, Chong-Rui
pg. 293
E-1-1.6 - IMPLEMENTATION OF FEEDFORWARD ACTIVE NOISE CONTROL TECHNIQUES FOR HEADPHONES
Huang, Chun-Kai
pg. 1588
D-3-1.5 - COORDINATED DOWNLINK/UPLINK TRANSMISSION ASSIGNMENT AND DYNAMIC SWITCHING IN HYBRID TDD SYSTEM
Huang, Lin
pg. 492
E-2-1.2 - SPECTRAL FEATURES AND PITCH HISTOGRAM FOR AUTOMATIC SINGING QUALITY EVALUATION WITH CRNN
Huang, Po-Yu
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Huang, Qiuchen
pg. 815
F-3-2.2 - ONLINE SPEAKER ADAPTATION FOR WAVENET-BASED NEURAL VOCODERS
Huang, Rong
pg. 1293
B-3-3.1 - A METHOD FOR IDENTIFYING ORIGIN OF DIGITAL IMAGES USING A CONVOLUTIONAL NEURAL NETWORK
Huang, Yan-Hao
pg. 1075
D-1-2.2 - HALLUCINATING SCENES
pg. 1087
D-1-2.4 - LEARNING DENSE CORRESPONDENCES VIA LOCAL AND NON-LOCAL FEATURE FUSION
Huang, Yi-Chin
pg. 837
F-3-2.6 - PERSONALIZED END-TO-END MANDARIN SPEECH SYNTHESIS USING SMALL-SIZED CORPUS
Huh, Jungwoo
pg. 1274
D-3-3.5 - DATA REDUCTION USING CLUSTER SAMPLING
Hung, Kuo-Hsuan
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Hung, Po-Yen
pg. 1611
C-2-3.4 - MULTIPLE TARGET PREDICTION FOR DEEP REINFORCEMENT LEARNING
Hwang, Min-Jae
pg. 810
F-3-2.1 - LP-WAVENET: LINEAR PREDICTION-BASED WAVENET SPEECH SYNTHESIS
pg. 831
F-3-2.5 - EXCITGLOW: IMPROVING A WAVEGLOW-BASED NEURAL VOCODER WITH LINEAR PREDICTION ANALYSIS
I
Ibi, Shinsuke
pg. 1483
C-1-2.2 - 3D CONVOLUTIONAL NEURAL NETWORK-AIDED INDOOR POSITIONING BASED ON FINGERPRINTS OF BLE RSSI
Ihori, Mana
pg. 632
E-2-3.2 - END-TO-END AUTOMATIC SPEECH RECOGNITION WITH DEEP MUTUAL LEARNING
pg. 1054
D-1-1.4 - UNSUPERVISED DOMAIN ADVERSARIAL TRAINING IN ANGULAR SPACE FOR FACIAL EXPRESSION RECOGNITION
Iida, Hidehiro
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
Iida, Kenta
pg. 1436
B-3-2.3 - A PRIVACY-PRESERVING CONTENT-BASED IMAGE RETRIEVAL SCHEME ALLOWING MIXED USE OF ENCRYPTED AND PLAIN IMAGES
Iida, Soichiro
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
Iimori, Hiroki
pg. 1453
C-1-1.3 - LOW-COMPLEXITY ROBUST BEAMFORMING WITH BLOCKAGE PREDICTION FOR MILLIMETER-WAVE COMMUNICATIONS
Ikeda, Daizo
pg. 578
E-2-2.4 - A DATA AUGMENTATION TECHNIQUE FOR AUTOMATIC DETECTION OF CHEWING SIDE AND SWALLOWING
Ikeda, Kazushi
pg. 939
A-3-3.1 - MATHEMATICAL MODEL OF HORSE AND RIDER INTERACTION DURING HORSE JUMPING
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
Ikehara, Masaaki
pg. 1222
D-3-2.6 - RAPID AND ACCURATE LOCAL GAUSSIAN NOISE REMOVAL
Ikutani, Yoshiharu
pg. 1017
B-3-1.3 - DISCOVERY OF EVENT-RELATED POTENTIALS DURING A COGNITIVE PROCESS OF COMPARISON OPERATION
Imaizumi, Ryo
pg. 297
F-1-1.1 - DIALECT-AWARE MODELING FOR END-TO-END JAPANESE DIALECT SPEECH RECOGNITION
Imaizumi, Shoko
pg. 1347
B-2-2.2 - FLEXIBLE DATA HIDING AND EXTRACTION IN ETC IMAGES
Imoto, Keisuke
pg. 701
F-2-3.7 - EXPERIMENTAL INVESTIGATION OF ROBUSTNESS OF SPATIAL CEPSTRUM FEATURES UNDER VARIOUS RECORDING CONDITIONS
Inaba, Haruki
pg. 135
A-2-3.4 - WIRELESS CHANNEL MEASUREMENT SYSTEM USING ZYNQ ULTRASCALE+ RFSOC FOR MIMO AND D2D COMMUNICATION SYSTEMS
Inoue, Katsuki
pg. 826
F-3-2.4 - MODULE COMPARISON OF TRANSFORMER-TTS FOR SPEAKER ADAPTATION BASED ON FINE-TUNING
Inoue, Nakamasa
pg. 1641
C-3-2.1 - SEMI-SUPERVISED CONTRASTIVE LEARNING WITH GENERALIZED CONTRASTIVE LOSS AND ITS APPLICATION TO SPEAKER RECOGNITION
pg. 527
F-2-1.1 - QUASI-NEWTON ADVERSARIAL ATTACKS ON SPEAKER VERIFICATION SYSTEMS
pg. 600
F-2-2.2 - OPTIMIZING SPEAKER EMBEDDINGS USING META-TRAINING SETS
pg. 1693
C-3-3.2 - CLOSED-FORM PRE-TRAINING FOR SMALL-SAMPLE ENVIRONMENTAL SOUND RECOGNITION
Inoue, Takao
pg. 972
A-3-3.7 - EVALUATION OF THE PRESSURE MEASUREMENT FUNCTION OF AN IMPLANTABLE MULTIMODALITY PROBE
Ise, Tomohiko
pg. 46
F-3-3.1 - NOISE SUPPRESSION USING A DIFFERENTIAL-TYPE MICROPHONE ARRAY AND TWO-DIMENSIONAL AMPLITUDE AND PHASE SPECTRA
Ishi, Carlos Toshinori
pg. 1060
D-1-1.5 - 3D SKELETAL MOVEMENT ENHANCED EMOTION RECOGNITION NETWORK
Ishibashi, Koji
pg. 1453
C-1-1.3 - LOW-COMPLEXITY ROBUST BEAMFORMING WITH BLOCKAGE PREDICTION FOR MILLIMETER-WAVE COMMUNICATIONS
pg. 1466
C-1-1.5 - PACKET AGGREGATION BASED ON ENCRYPTION-THEN-COMPRESSION FOR HIGHLY EFFICIENT MULTI-HOP TRANSMISSION
Ishiguro, Hiroshi
pg. 1060
D-1-1.5 - 3D SKELETAL MOVEMENT ENHANCED EMOTION RECOGNITION NETWORK
Ishizuka, Ryoto
pg. 359
E-1-2.5 - TATUM-LEVEL DRUM TRANSCRIPTION BASED ON A CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH LANGUAGE MODEL-BASED REGULARIZED TRAINING
Itani, Shunji
pg. 1317
B-3-3.5 - MULTIMODAL PERSONAL EAR AUTHENTICATION USING MULTIPLE SENSOR INFORMATION
Itasaka, Tatsuki
pg. 17
A-1-3.4 - CONSTRAINED DESIGN OF TWO-DIMENSIONAL FIR FILTERS WITH SPARSE COEFFICIENTS
Ito, Hiroki
pg. 1420
B-3-2.1 - A FRAMEWORK FOR TRANSFORMATION NETWORK TRAINING IN COORDINATION WITH SEMI-TRUSTED CLOUD PROVIDER FOR PRIVACY-PRESERVING DEEP NEURAL NETWORKS
Ito, Koichi
pg. 1414
B-3-2.1 - PERFORMANCE EVALUATION OF FACE ANTI-SPOOFING METHOD USING DEEP METRIC LEARNING FROM A FEW FRAMES OF FACE VIDEO
pg. 1087
D-1-2.4 - LEARNING DENSE CORRESPONDENCES VIA LOCAL AND NON-LOCAL FEATURE FUSION
Ito, Satoshi
pg. 909
B-1-3.1 - DEEP-LEARNING-BASED MR COMPRESSED SENSING USING NON-RANDOMLY UNDER-SAMPLED SIGNAL IN NONLINEAR PHASE ENCODING IMAGING
Itoh, Yoshiaki
pg. 649
E-2-3.5 - REDUCTION OF SPEECH DATA POSTERIORGRAMS BY COMPRESSING MAXIMUM-LIKELIHOOD STATE SEQUENCES IN QUERY BY EXAMPLE
Iwabuchi, Wataru
pg. 1207
D-3-2.4 - PART-IN-WHOLE TYPE 3D PARTIAL SHAPE RETRIEVAL BASED ON CONNECTED FACES WITH POINTNET FEATURES
Iwai, Kenta
pg. 449
F-1-3.1 - SPEECH ENHANCEMENT FOR OPTICAL LASER MICROPHONE WITH DEEP NEURAL NETWORK
pg. 662
F-2-3.1 - HARMONIC STRUCTURE MASK FOR SPEECH ENHANCEMENT USING SPARSITY REGULARIZATION
pg. 266
E-1-1.1 - STUDY ON FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM WITH OPTICAL LASER MICROPHONE TO DETECT REFERENCE SIGNAL WITH SHORT DELAY
pg. 272
E-1-1.2 - FEEDFORWARD ACTIVE NOISE CONTROL WITH COHERENCE-ADJUSTING FILTER FOR IMPROVING NOISE REDUCTION PERFORMANCE UNDER LOW-COHERENCE CONDITION
Iwamura, Keiichi
pg. 1392
B-2-3.5 - A QR SYMBOL WITH ECDSA FOR BOTH PUBLIC AND SECRET AREAS USING RHOMBIC SUB-CELLS
J
Jamwal, Prashant Kumar
pg. 222
C-2-2.2 - COMPARISON OF GENERIC AND SUBJECT-SPECIFIC TRAINING FOR FEATURES CLASSIFICATION IN P300 SPELLER
Jang, Mingyu
pg. 1274
D-3-3.5 - DATA REDUCTION USING CLUSTER SAMPLING
Jelfs, Beth
pg. 992
B-2-1.3 - APPLICATION OF IMAGE PROCESSING AND CIRCULAR STATISTICS TO 3D CELLULAR ALIGNMENT
Jeong, Se-Won
pg. 1193
D-3-2.2 - MULTISCALE SALIENCY DETECTION FOR COLORED 3D POINT CLOUDS BASED ON RANDOM WALK
Jha, Rajib Kumar
pg. 1177
D-2-3.6 - VARIATIONAL MODE DECOMPOSITION BASED IMAGE SEGMENTATION USING SINE COSINE ALGORITHM
Jhang, Zih-Jian
pg. 1087
D-1-2.4 - LEARNING DENSE CORRESPONDENCES VIA LOCAL AND NON-LOCAL FEATURE FUSION
Ji, Jiafang
pg. 211
C-2-1.6 - DIFFERENTIATED PROSODIC ADAPTION OF CHINESE AND ENGLISH POETRY: AN ACOUSTIC APPROACH TO READING OF CHINESE TANG POETRY AND SHAKESPEAREAN SONNETS
Ji, Jinchen
pg. 681
F-2-3.4 - A VARIABLE STEP SIZE IMPROVED MULTIBAND-STRUCTURED SUBBAND ADAPTIVE FEEDBACK CANCELLATION SCHEME FOR HEARING AIDS
Ji, Xiaoli
pg. 589
E-2-2.6 - TEMPORAL AND FORMANT TRAJECTORY ANALYSIS OF ENGLISH TENSE-LAX VOWELS PRODUCED BY NATIVE CHINESE SPEAKERS
Jia, Kebin
pg. 1033
D-1-1.1 - CLOUD RECOGNITION BASED ON LIGHTWEIGHT NEURAL NETWORK
Jia, Xupeng
pg. 711
E-3-1.2 - OPTIMAL SCALE-INVARIANT SIGNAL-TO-NOISE RATIO AND CURRICULUM LEARNING FOR MONAURAL MULTI-SPEAKER SPEECH SEPARATION IN NOISY ENVIRONMENT
pg. 477
F-1-3.6 - A DEEP LEARNING-BASED TIME-DOMAIN APPROACH FOR NON-INTRUSIVE SPEECH QUALITY ASSESSMENT
Jia, Zhuoying
pg. 278
E-1-1.3 - EFFECT OF CROSS-CHANNEL CONTROL FILTERS IN MULTI-CHANNEL FEEDBACK ACTIVE NOISE CONTROL
Jiang, Lin
pg. 118
A-2-3.3 - FAST INTER-FRAME PREDICTION BASED ARRAY PROCESSOR FOR DEPTH MAPS IN 3D-HEVC
Jiang, Yu
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Jiang, Yuan
pg. 556
F-2-1.6 - ADVERSARIAL POST-PROCESSING OF VOICE CONVERSION AGAINST SPOOFING DETECTION
Jin, Jing
pg. 875
B-1-1.1 - CLASSIFICATION OF SEIZURE EEGS BASED ON SHORT-TIME FOURIER TRANSFORM AND HIDDEN MARKOV MODEL
Jinzai, Ryoga
pg. 421
E-1-3.3 - APPLYING VIRTUAL MICROPHONES TO TRIANGULAR MICROPHONE ARRAY IN IN-CAR COMMUNICATION
Jo, Junho
pg. 1067
D-1-2.1 - LOCAL BACKLIGHT DIMMING FOR LIQUID CRYSTAL DISPLAYS VIA CONVOLUTIONAL NEURAL NETWORK
Jun, Jinyoung
pg. 1287
D-3-3.8 - HUMAN POSE ESTIMATION USING SKELETAL HEATMAPS
Jung, Myunghun
pg. 739
F-3-1.1 - DYNAMIC NOISE EMBEDDING: NOISE AWARE TRAINING AND ADAPTATION FOR SPEECH ENHANCEMENT
Jung, Sang Mok
pg. 143
B-1-2.2 - A PARALLEL ADAPTIVE FILTERING ALGORITHM BASED ON THE MEAN-SQUARE DEVIATION ANALYSIS FOR LARGE-SCALE DATA
Jung, Youngmoon
pg. 739
F-3-1.1 - DYNAMIC NOISE EMBEDDING: NOISE AWARE TRAINING AND ADAPTATION FOR SPEECH ENHANCEMENT
K
Kaburagi, Takashi
pg. 958
A-3-3.4 - SHEET-TYPE DEVICE FOR UNCONSTRAINED HEART SOUND MEASUREMENT AND WHITE NOISE REDUCTION BY WIENER FILTER
Kaburaki, Aoto
pg. 1460
C-1-1.4 - AUTONOMOUS DECENTRALIZED TRANSMISSION TIMING CONTROL IN WIRELESS SENSOR NETWORK
Kagoshima, Takehiko
pg. 805
E-3-2.6 - ADAPTIVE NOISE SUPPRESSION FOR WAKE-WORD DETECTION BY TEMPORAL-DIFFERENCE GENERALIZED EIGENVALUE BEAMFORMER
Kai, Atsuhiko
pg. 654
E-2-3.6 - EFFECTS OF END-TO-END ASR AND SCORE FUSION MODEL LEARNING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Kaichi, Ayumi
pg. 1519
C-1-3.4 - ON PLACEMENT OF END DEVICES IN LPWAN BASED WSN FOR ENVIRONMENTAL MONITORING APPLICATIONS
Kajikawa, Yoshinobu
pg. 1317
B-3-3.5 - MULTIMODAL PERSONAL EAR AUTHENTICATION USING MULTIPLE SENSOR INFORMATION
Kamakari, Kodai
pg. 1381
B-2-3.3 - CLASSIFICATION OF VIDEO RECAPTURED FROM DISPLAY DEVICE
Kamble, Madhu
pg. 543
F-2-1.4 - DESIGN OF VOICE PRIVACY SYSTEM USING LINEAR PREDICTION
Kameoka, Hirokazu
pg. 572
E-2-2.3 - PHONEME EMBEDDINGS ON PREDICTING FUNDAMENTAL FREQUENCY PATTERN FOR ELECTROLARYNGEAL SPEECH
Kamio, Akinori
pg. 1497
C-1-2.4 - SPECIFICATION OF LINK QUALITY DEGRADATION IN WLAN BASED ON MCS AND RETRANSMISSION FLAG
Kamiyama, Hosana
pg. 319
F-1-1.5 - SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS
Kamo, Keigo
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Kan, Yao-Chiang
pg. 1561
D-3-1.1 - LORA-BASED AIR QUALITY MONITORING SYSTEM USING CHATBOT
Kang, Hong-Goo
pg. 810
F-3-2.1 - LP-WAVENET: LINEAR PREDICTION-BASED WAVENET SPEECH SYNTHESIS
pg. 308
F-1-1.3 - SPEAKER-INVARIANT PSYCHOLOGICAL STRESS DETECTION USING ATTENTION-BASED NETWORK
pg. 831
F-3-2.5 - EXCITGLOW: IMPROVING A WAVEGLOW-BASED NEURAL VOCODER WITH LINEAR PREDICTION ANALYSIS
Kang, Jewon
pg. 1279
D-3-3.6 - TEMPORAL ATTENTION FEATURE ENCODING FOR VIDEO CAPTIONING
pg. 1283
D-3-3.7 - SUPER-RESOLUTION OF MULTI-VIEW ERP 360-DEGREE IMAGES WITH TWO-STAGE DISPARITY REFINEMENT
Kang, Jiwoo
pg. 1262
D-3-3.3 - IMAGE INPAINTING USING WEIGHTED MASK CONVOLUTION
Kang, Li-Wei
pg. 1247
D-2-2.3 - SCENE TEXT-LINE EXTRACTION WITH FULLY CONVOLUTIONAL NETWORK AND REFINED PROPOSALS
Kang, Xiangui
pg. 1352
B-2-2.3 - DENSELY CONNECTED CONVOLUTIONAL NETWORK FOR AUDIO SPOOFING DETECTION
pg. 1442
B-3-2.4 - A GENERATIVE ADVERSARIAL NETWORK FRAMEWORK FOR JPEG ANTI-FORENSICS
Kaniusas, Eugenijus
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Karttunen, Janne
pg. 1300
B-3-3.2 - COST SENSITIVE OPTIMIZATION OF DEEPFAKE DETECTOR
Kathuria, Nitin
pg. 1472
C-1-1.6 - 24 GHZ FLEXIBLE LCP ANTENNA ARRAY FOR RADAR-BASED NONCONTACT VITAL SIGN MONITORING
Kato, Masaharu
pg. 371
F-1-2.1 - LANGUAGE MODEL ADAPTATION FOR EMOTIONAL SPEECH RECOGNITION USING TWEET DATA
Kato, Tsuneo
pg. 1601
C-2-3.2 - EFFICIENT DIVERSE RESPONSE GENERATION IN ATTENTION-BASED NEURAL CONVERSATIONAL MODEL WITH MAXIMUM MUTUAL INFORMATION
Kaur, Navjot
pg. 76
F-3-3.6 - ENHANCED CHANNEL TRACKING IN THZ BEAMSPACE MASSIVE MIMO: A DEEP CNN APPROACH
Kawabata, Tomotaka
pg. 114
A-2-3.3 - AN EVALUATION OF HIGH-THROUGHPUT SCALABLE RADIX-4 FFT PROCESSOR ARCHITECTURE USING FIXED-POINT ARITHMETIC
Kawahara, Hideki
pg. 174
C-2-1.1 - SIMULTANEOUS MEASUREMENT OF TIME-INVARIANT LINEAR AND NONLINEAR, AND RANDOM AND EXTRA RESPONSES USING FREQUENCY DOMAIN VARIANT OF VELVET NOISE
Kawahara, Tatsuya
pg. 775
E-3-2.1 - INTEGRATION OF SEMI-BLIND SPEECH SOURCE SEPARATION AND VOICE ACTIVITY DETECTION FOR FLEXIBLE SPOKEN DIALOGUE
pg. 788
E-3-2.3 - COMPUTER-RESOURCE-AWARE DEEP SPEECH SEPARATION WITH A RUN-TIME-SPECIFIED NUMBER OF BLSTM LAYERS
pg. 800
E-3-2.5 - END-TO-END MUSIC-MIXED SPEECH RECOGNITION
Kawamura, Taiga
pg. 701
F-2-3.7 - EXPERIMENTAL INVESTIGATION OF ROBUSTNESS OF SPATIAL CEPSTRUM FEATURES UNDER VARIOUS RECORDING CONDITIONS
Kawata, Kento
pg. 1381
B-2-3.3 - CLASSIFICATION OF VIDEO RECAPTURED FROM DISPLAY DEVICE
Keivanmarz, Ali
pg. 1310
B-3-3.4 - VEIN PATTERN VISUALISATION USING CONDITIONAL GENERATIVE ADVERSARIAL NETWORKS
Khan, Ishtiaq Rasool
pg. 1132
D-1-3.6 - EVALUATION OF THE ENCODING ACCURACY OF THE PQ BASED HDR CONTENT DELIVERY FORMATS
Khong, Andy W. H.
pg. 841
E-3-3.1 - A JOINT-LOSS APPROACH FOR SPEECH ENHANCEMENT VIA SINGLE-CHANNEL NEURAL NETWORK AND MVDR BEAMFORMER
Khosravy, Mahdi
pg. 1400
B-2-3.6 - DEEP FACE RECOGNIZER PRIVACY ATTACK: MODEL INVERSION INITIALIZATION BY A DEEP GENERATIVE ADVERSARIAL DATA SPACE DISCRIMINATOR
Kim, Chang-Su
pg. 1287
D-3-3.8 - HUMAN POSE ESTIMATION USING SKELETAL HEATMAPS
Kim, Hee-Jae
pg. 1283
D-3-3.7 - SUPER-RESOLUTION OF MULTI-VIEW ERP 360-DEGREE IMAGES WITH TWO-STAGE DISPARITY REFINEMENT
Kim, Hoirin
pg. 739
F-3-1.1 - DYNAMIC NOISE EMBEDDING: NOISE AWARE TRAINING AND ADAPTATION FOR SPEECH ENHANCEMENT
Kim, Jong-Ok
pg. 1257
D-3-3.2 - PROGRESSIVE DEEP NETWORK WITH CHANNEL BACK-PROJECTION FOR HYPERSPECTRAL RECOVERY FROM RGB
Kim, Nayoung
pg. 1279
D-3-3.6 - TEMPORAL ATTENTION FEATURE ENCODING FOR VIDEO CAPTIONING
Kimata, Hideaki
pg. 1107
D-1-3.1 - SUBJECTIVE QUALITY DRIVEN IMAGE ENCODING METHOD USING IMAGE COMPLETION
Kimura, Asateru
pg. 1414
B-3-2.1 - PERFORMANCE EVALUATION OF FACE ANTI-SPOOFING METHOD USING DEEP METRIC LEARNING FROM A FEW FRAMES OF FACE VIDEO
Kimura, Tomotaka
pg. 1536
D-2-1.2 - OPTIMIZATION OF VIRTUAL MACHINE PLACEMENT FOR BALANCING NETWORK AND SERVER LOAD IN EDGE COMPUTING ENVIRONMENTS
pg. 1541
D-2-1.3 - PREDICTION METHOD OF MALWARE INFECTION SPREADING CONSIDERING NETWORK SCALE
pg. 1545
D-2-1.4 - JOINT OPTIMIZATION OF EDGE SERVER AND VIRTUAL MACHINE PLACEMENT IN EDGE COMPUTING ENVIRONMENTS
Kinoshita, Yuma
pg. 1420
B-3-2.1 - A FRAMEWORK FOR TRANSFORMATION NETWORK TRAINING IN COORDINATION WITH SEMI-TRUSTED CLOUD PROVIDER FOR PRIVACY-PRESERVING DEEP NEURAL NETWORKS
pg. 443
E-1-3.7 - ENERGY-BASED MULTIPLE SOURCE LOCALIZATION WITH BLINKIES
pg. 1139
D-1-3.7 - CHECKERBOARD-ARTIFACT-FREE IMAGE-ENHANCEMENT NETWORK CONSIDERING LOCAL AND GLOBAL FEATURES
Kishida, Yuki
pg. 1601
C-2-3.2 - EFFICIENT DIVERSE RESPONSE GENERATION IN ATTENTION-BASED NEURAL CONVERSATIONAL MODEL WITH MAXIMUM MUTUAL INFORMATION
Kishioka, Keita
pg. 1541
D-2-1.3 - PREDICTION METHOD OF MALWARE INFECTION SPREADING CONSIDERING NETWORK SCALE
Kita, Shunsuke
pg. 1317
B-3-3.5 - MULTIMODAL PERSONAL EAR AUTHENTICATION USING MULTIPLE SENSOR INFORMATION
Kitagishi, Yuki
pg. 319
F-1-1.5 - SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS
Kitahara, Daichi
pg. 339
E-1-2.2 - DEEP NEURAL NETWORK MODELING OF DISTORTION STOMP BOX USING SPECTRAL FEATURES
Kitamura, Daichi
pg. 781
E-3-2.2 - DNN-BASED PERMUTATION SOLVER FOR FREQUENCY-DOMAIN INDEPENDENT COMPONENT ANALYSIS IN TWO-SOURCE MIXTURE CASE
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Kiya, Hitoshi
pg. 1347
B-2-2.2 - FLEXIBLE DATA HIDING AND EXTRACTION IN ETC IMAGES
pg. 1420
B-3-2.1 - A FRAMEWORK FOR TRANSFORMATION NETWORK TRAINING IN COORDINATION WITH SEMI-TRUSTED CLOUD PROVIDER FOR PRIVACY-PRESERVING DEEP NEURAL NETWORKS
pg. 1369
B-2-3.1 - AN EXTENSION OF ENCRYPTION-INSPIRED ADVERSARIAL DEFENSE WITH SECRET KEYS AGAINST ADVERSARIAL EXAMPLES
pg. 297
F-1-1.1 - DIALECT-AWARE MODELING FOR END-TO-END JAPANESE DIALECT SPEECH RECOGNITION
pg. 1436
B-3-2.3 - A PRIVACY-PRESERVING CONTENT-BASED IMAGE RETRIEVAL SCHEME ALLOWING MIXED USE OF ENCRYPTED AND PLAIN IMAGES
pg. 1304
B-3-3.3 - VISUAL SECURITY EVALUATION OF LEARNABLE IMAGE ENCRYPTION METHODS AGAINST CIPHERTEXT-ONLY ATTACKS
pg. 1139
D-1-3.7 - CHECKERBOARD-ARTIFACT-FREE IMAGE-ENHANCEMENT NETWORK CONSIDERING LOCAL AND GLOBAL FEATURES
Kloesch, Gerhard
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Ko, Bing-Cheng
pg. 1578
D-3-1.4 - PRIVACY-PRESERVING DATA SHARING WITH ATTRIBUTE-BASED PRIVATE MATCHING BASED ON EDGE COMPUTATION IN THE INTERNET-OF-THINGS
Kobashikawa, Satoshi
pg. 319
F-1-1.5 - SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS
Kobayashi, Akio
pg. 460
F-1-3.3 - SPEECH ENHANCEMENT FOR DEMODULATED SIGNALS UNDER MULTIPATH FADING COMMUNICATION CHANNELS
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
Kobayashi, Gaku
pg. 1513
C-1-3.3 - ESTIMATION OF DESIRED POWER AND UNDESIRED POWER USING CHIRP DEMODULATION AND EVALUATION OF ACCURACY
Kobayashi, Kazuhiro
pg. 572
E-2-2.3 - PHONEME EMBEDDINGS ON PREDICTING FUNDAMENTAL FREQUENCY PATTERN FOR ELECTROLARYNGEAL SPEECH
Kobayashi, Takuya
pg. 1502
C-1-3.1 - PROBABILISTIC BINARY OFFLOADING FOR WIRELESS POWERED MOBILE EDGE COMPUTING SYSTEM
Kobayashi, Tetsunori
pg. 1226
D-3-2.7 - EFFICIENT HUMAN-IN-THE-LOOP OBJECT DETECTION USING BI-DIRECTIONAL DEEP SORT AND ANNOTATION-FREE SEGMENT IDENTIFICATION
Kodama, Yuya
pg. 1216
D-3-2.5 - FIXED-POINT ARITHMETIC OF L2-NORM APPROXIMATION FOR 2-TUPLE ARRAYS WITH ROTATED L1-NORM EVALUATION
Koguchi, Junya
pg. 487
E-2-1.1 - PJS: PHONEME-BALANCED JAPANESE SINGING-VOICE CORPUS
Kohn, Bernhard
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Kojima, Kazunori
pg. 649
E-2-3.5 - REDUCTION OF SPEECH DATA POSTERIORGRAMS BY COMPRESSING MAXIMUM-LIKELIHOOD STATE SEQUENCES IN QUERY BY EXAMPLE
Komatsu, Tatsuya
pg. 788
E-3-2.3 - COMPUTER-RESOURCE-AWARE DEEP SPEECH SEPARATION WITH A RUN-TIME-SPECIFIED NUMBER OF BLSTM LAYERS
Kondo, Hiroki
pg. 654
E-2-3.6 - EFFECTS OF END-TO-END ASR AND SCORE FUSION MODEL LEARNING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Kondo, Kazunobu
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Kosaka, Tetsuo
pg. 371
F-1-2.1 - LANGUAGE MODEL ADAPTATION FOR EMOTIONAL SPEECH RECOGNITION USING TWEET DATA
Kotta, Harsh
pg. 538
F-2-1.3 - SUBBAND CHANNEL SELECTION USING TEO FOR REPLAY SPOOF DETECTION IN VOICE ASSISTANTS
Krishnamani, Divya Bharathi
pg. 905
B-1-1.6 - GEOMETRIC FEATURES BASED MUSCLE FATIGUE ANALYSIS USING LOW FREQUENCY BAND IN SURFACE ELECTROMYOGRAPHIC SIGNALS
Kubo, Rieko
pg. 753
F-3-1.3 - ENHANCEMENT OF SPEECH INTELLIGIBILITY UNDER NOISY REVERBERANT CONDITIONS BASED ON MODULATION SPECTRUM CONCEPT
Kubo, Takatomi
pg. 1017
B-3-1.3 - DISCOVERY OF EVENT-RELATED POTENTIALS DURING A COGNITIVE PROCESS OF COMPARISON OPERATION
Kubo, Yuki
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Kudo, Shinobu
pg. 1107
D-1-3.1 - SUBJECTIVE QUALITY DRIVEN IMAGE ENCODING METHOD USING IMAGE COMPLETION
Kukanov, Ivan
pg. 1300
B-3-3.2 - COST SENSITIVE OPTIMIZATION OF DEEPFAKE DETECTOR
Kumagai, Satoshi
pg. 958
A-3-3.4 - SHEET-TYPE DEVICE FOR UNCONSTRAINED HEART SOUND MEASUREMENT AND WHITE NOISE REDUCTION BY WIENER FILTER
Kuo, C.-C. Jay
pg. 1698
C-3-3.3 - NITES: A NON-PARAMETRIC INTERPRETABLE TEXTURE SYNTHESIS METHOD
Kuo, Chia-Chih
pg. 386
F-1-2.4 - SPOKEN MULTIPLE-CHOICE QUESTION ANSWERING USING MULTI-TURN AUDIO-EXTRACTER BERT
Kuo, Heng-Cheng
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Kuo, Sen M.
pg. 293
E-1-1.6 - IMPLEMENTATION OF FEEDFORWARD ACTIVE NOISE CONTROL TECHNIQUES FOR HEADPHONES
Kuo, Tien-Ying
pg. 1243
D-2-2.2 - CHROMA COMPONENT GENERATION OF GRAY IMAGES USING MULTI-SCALE CONVOLUTIONAL NEURAL NETWORK
Kuribayashi, Minoru
pg. 1340
B-2-2.1 - DEEPWATERMARK: EMBEDDING WATERMARK INTO DNN MODEL
pg. 1381
B-2-3.3 - CLASSIFICATION OF VIDEO RECAPTURED FROM DISPLAY DEVICE
pg. 1386
B-2-3.4 - DETECTION OF ADVERSARIAL EXAMPLES BASED ON SENSITIVITIES TO NOISE REMOVAL FILTER
Kurihara, Yosuke
pg. 958
A-3-3.4 - SHEET-TYPE DEVICE FOR UNCONSTRAINED HEART SOUND MEASUREMENT AND WHITE NOISE REDUCTION BY WIENER FILTER
Kuroda, Hiroki
pg. 339
E-1-2.2 - DEEP NEURAL NETWORK MODELING OF DISTORTION STOMP BOX USING SPECTRAL FEATURES
Kurokawa, Takumi
pg. 654
E-2-3.6 - EFFECTS OF END-TO-END ASR AND SCORE FUSION MODEL LEARNING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Kwong, Ngai-Wing
pg. 1112
D-1-3.2 - ULTRA FAST SCREEN CONTENT CODING VIA RANDOM FOREST
Küstner, Thomas
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
L
Lai, Hong-Lun
pg. 1571
D-3-1.3 - A DESIGN FRAMEWORK OF AUTOMATIC DEPLOYMENT FOR 5G NETWORK SLICING
Lai, Hsin-Yi
pg. 1170
D-2-3.5 - IMPLEMENTATION OF BI-RADS CLASSIFICATION AND PRIORITY PREDICTION FOR MAMMOGRAM PRE-SCREENING BASED ON MULTI-DECISION FRAMEWORK
Lai, Ming-Jay
pg. 1571
D-3-1.3 - A DESIGN FRAMEWORK OF AUTOMATIC DEPLOYMENT FOR 5G NETWORK SLICING
Lai, Wen-Ping
pg. 1571
D-3-1.3 - A DESIGN FRAMEWORK OF AUTOMATIC DEPLOYMENT FOR 5G NETWORK SLICING
Lai, Yu-Kuen
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Lam, Kin-man
pg. 376
F-1-2.2 - SIMULTANEOUS FAKE NEWS AND TOPIC CLASSIFICATION VIA AUXILIARY TASK LEARNING
Lee, Byung-Uk
pg. 1283
D-3-3.7 - SUPER-RESOLUTION OF MULTI-VIEW ERP 360-DEGREE IMAGES WITH TWO-STAGE DISPARITY REFINEMENT
Lee, Chi-Chun
pg. 314
F-1-1.4 - SENSING WITH CONTEXTS: CRYING REASON CLASSIFICATION FOR INFANT CARE CENTER WITH ENVIRONMENTAL FUSION
pg. 900
B-1-1.5 - FROM INTENDED TO SUBJECTIVE: A CONDITIONAL TENSOR FUSION NETWORK FOR RECOGNIZING SELF-REPORTED EMOTION USING PHYSIOLOGY
pg. 393
F-1-2.5 - "YOUR BEHAVIOR MAKES ME THINK IT IS A LIE": RECOGNIZING PERCEIVED DECEPTION USING MULTIMODAL DATA IN DIALOG GAMES
Lee, Chul
pg. 1268
D-3-3.4 - MOIRÉ ARTIFACTS REMOVAL IN SCREEN-SHOT IMAGES VIA MULTIPLE DOMAIN LEARNING
Lee, Chung-Nan
pg. 1549
D-2-1.5 - REALIZATION OF 5G NETWORK SLICING USING OPEN SOURCE SOFTWARES
Lee, Ho-Ping
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Lee, Jae-Han
pg. 1287
D-3-3.8 - HUMAN POSE ESTIMATION USING SKELETAL HEATMAPS
Lee, Jia-Hong
pg. 1594
C-2-3.1 - MERGING WELL-TRAINED DEEP CNN MODELS FOR EFFICIENT INFERENCE
Lee, Joohyung
pg. 739
F-3-1.1 - DYNAMIC NOISE EMBEDDING: NOISE AWARE TRAINING AND ADAPTATION FOR SPEECH ENHANCEMENT
Lee, Junghsi
pg. 1561
D-3-1.1 - LORA-BASED AIR QUALITY MONITORING SYSTEM USING CHATBOT
Lee, Kyoungoh
pg. 1274
D-3-3.5 - DATA REDUCTION USING CLUSTER SAMPLING
Lee, Ming-Feng
pg. 1549
D-2-1.5 - REALIZATION OF 5G NETWORK SLICING USING OPEN SOURCE SOFTWARES
Lee, Sang-Ho
pg. 1257
D-3-3.2 - PROGRESSIVE DEEP NETWORK WITH CHANNEL BACK-PROJECTION FOR HYPERSPECTRAL RECOVERY FROM RGB
Lee, Sanghoon
pg. 1262
D-3-3.3 - IMAGE INPAINTING USING WEIGHTED MASK CONVOLUTION
pg. 1274
D-3-3.5 - DATA REDUCTION USING CLUSTER SAMPLING
Lee, Seongmin
pg. 1262
D-3-3.3 - IMAGE INPAINTING USING WEIGHTED MASK CONVOLUTION
Lee, Shi-wook
pg. 649
E-2-3.5 - REDUCTION OF SPEECH DATA POSTERIORGRAMS BY COMPRESSING MAXIMUM-LIKELIHOOD STATE SEQUENCES IN QUERY BY EXAMPLE
Lee, Sz-Yuan
pg. 36
A-1-3.7 - DOPPLER CENTROID ESTIMATION WITH QUALITY ASSESSMENT FOR REAL-TIME SAR IMAGING
Lee, Yi-Jhe
pg. 1647
C-3-2.2 - MPOP600: A MANDARIN POPULAR SONG DATABASE WITH ALIGNED AUDIO, LYRICS, AND MUSICAL SCORES FOR SINGING VOICE SYNTHESIS
Lee, Yu-Chieh
pg. 36
A-1-3.7 - DOPPLER CENTROID ESTIMATION WITH QUALITY ASSESSMENT FOR REAL-TIME SAR IMAGING
Leglaive, Simon
pg. 686
F-2-3.5 - LOCALIZATION CUES PRESERVATION IN HEARING AIDS BY COMBINING NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION
Lei, Xuejing
pg. 1698
C-3-3.3 - NITES: A NON-PARAMETRIC INTERPRETABLE TEXTURE SYNTHESIS METHOD
Leow, Chee Siang
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
Li, Andong
pg. 769
F-3-1.6 - A TIME-DOMAIN MONAURAL SPEECH ENHANCEMENT WITH FEEDBACK LEARNING
Li, Bing-Zhao
pg. 255
C-3-1.2 - WINDOWED FRACTIONAL FOURIER TRANSFORM ON GRAPHS: FRACTIONAL TRANSLATION OPERATOR AND HAUSDORFF-YOUNG INEQUALITY
pg. 260
C-3-1.3 - A NOVEL ISAR IMAGING ALGORITHM FOR MANEUVERING TARGET BASED ON PARAMETER ESTIMATION METHOD
Li, Dongmei
pg. 711
E-3-1.2 - OPTIMAL SCALE-INVARIANT SIGNAL-TO-NOISE RATIO AND CURRICULUM LEARNING FOR MONAURAL MULTI-SPEAKER SPEECH SEPARATION IN NOISY ENVIRONMENT
pg. 477
F-1-3.6 - A DEEP LEARNING-BASED TIME-DOMAIN APPROACH FOR NON-INTRUSIVE SPEECH QUALITY ASSESSMENT
Li, Haizhou
pg. 492
E-2-1.2 - SPECTRAL FEATURES AND PITCH HISTOGRAM FOR AUTOMATIC SINGING QUALITY EVALUATION WITH CRNN
pg. 747
F-3-1.2 - CLASSIFICATION OF SPEECH WITH AND WITHOUT FACE MASK USING ACOUSTIC FEATURES
pg. 605
F-2-2.3 - HLT-NUS SUBMISSION FOR 2019 NIST MULTIMEDIA SPEAKER RECOGNITION EVALUATION
pg. 507
E-2-1.4 - SPECTRUM AND PROSODY CONVERSION FOR CROSS-LINGUAL VOICE CONVERSION WITH CYCLEGAN
pg. 514
E-2-1.5 - VAW-GAN FOR SINGING VOICE CONVERSION WITH NON-PARALLEL TRAINING DATA
Li, Hao
pg. 52
F-3-3.2 - ROBUST SPEECH DEREVERBERATION BASED ON WPE AND DEEP LEARNING
pg. 197
C-2-1.4 - MODELING DECISION PROCESS IN MULTI-AGENT SYSTEMS: A GRAPHICAL MARKOV GAME BASED APPROACH
Li, Hongliang
pg. 1096
D-1-2.5 - BLIND TONE-MAPPED IMAGE QUALITY ASSESSMENT AND ENHANCEMENT VIA DISENTANGLED REPRESENTATION LEARNING
Li, Huiyong
pg. 278
E-1-1.3 - EFFECT OF CROSS-CHANNEL CONTROL FILTERS IN MULTI-CHANNEL FEEDBACK ACTIVE NOISE CONTROL
pg. 283
E-1-1.4 - SIMULTANEOUS VARIABLE PERTURBATION METHOD FOR THE ACTIVE NOISE CONTROL SYSTEM WITH A WIRELESS ERROR MICROPHONE
Li, Junru
pg. 1122
D-1-3.4 - SSIM MOTIVATED QUALITY CONTROL FOR VERSATILE VIDEO CODING
Li, Li
pg. 858
E-3-3.3 - A STUDY ON GEOMETRICALLY CONSTRAINED IVA WITH AUXILIARY FUNCTION APPROACH AND VCD FOR IN-CAR COMMUNICATION
Li, Lin
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Li, MingHui
pg. 1666
C-3-2.5 - 3D POINT CLOUD LABELING TOOL FOR DRIVING AUTOMATICALLY
Li, Ruohao
pg. 1653
C-3-2.3 - IMPROVING KEYWORDS SPOTTING PERFORMANCE IN NOISE WITH AUGMENTED DATASET FROM VOCODED SPEECH
Li, Xiaodong
pg. 769
F-3-1.6 - A TIME-DOMAIN MONAURAL SPEECH ENHANCEMENT WITH FEEDBACK LEARNING
Li, Xiaoxu
pg. 1707
C-3-3.4 - ADAPTIVE MULTI-PROTOTYPE RELATION NETWORK
pg. 1719
C-3-3.6 - ANTI-NOISE RELATION NETWORK FOR FEW-SHOT LEARNING
Li, You-Jin
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Li, Yuejiang
pg. 197
C-2-1.4 - MODELING DECISION PROCESS IN MULTI-AGENT SYSTEMS: A GRAPHICAL MARKOV GAME BASED APPROACH
pg. 1617
C-2-3.5 - CAN-SIN: A CROSS-LAYER HETEROGENEOUS ACADEMIC NETWORK WITH SEMANTIC INFORMATION
Li, Zheng
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Li, Zhibin
pg. 944
A-3-3.2 - HUMAN HAND MOVEMENT RECOGNITION BASED ON HMM WITH HYPERPARAMETERS OPTIMIZED BY MAXIMUM MUTUAL INFORMATION
Li, Zhonghua
pg. 1352
B-2-2.3 - DENSELY CONNECTED CONVOLUTIONAL NETWORK FOR AUDIO SPOOFING DETECTION
Liang, Jhe-Hao
pg. 1527
D-2-1.1 - DUAL ADAPTIVE MODULATION AND CODING FOR MITIGATING UE-UE INTERFERENCE IN HETEROGENEOUS TDD SLOT CONFIGURATIONS
Liang, Jiangnan
pg. 416
E-1-3.2 - MULTI-BEAM DESIGN METHOD FOR A STEERABLE PARAMETRIC ARRAY LOUDSPEAKER
Liang, Kai Wen
pg. 88
F-3-3.8 - ACOUSTIC ECHO CANCELLATION BASED ON RECURRENT NEURAL NETWORK
Liao, Chien-Feng
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Liao, Yi-Ping
pg. 71
F-3-3.5 - DEEP NEURAL NETWORK COMPRESSION WITH KNOWLEDGE DISTILLATION USING CROSS-LAYER MATRIX, KL DIVERGENCE AND OFFLINE ENSEMBLE
Lie, Wen-Nung
pg. 1128
D-1-3.5 - RATE-DISTORTION OPTIMIZATION FOR 360-DEGREE IMAGE CONSIDERING VISUAL ATTENTION
pg. 1673
C-3-2.6 - DETECTING OBJECT SURFACE KEYPOINTS FROM A SINGLE RGB IMAGE VIA DEEP LEARNING NETWORK FOR 6DOF POSE ESTIMATION
Lim, Hyungseob
pg. 831
F-3-2.5 - EXCITGLOW: IMPROVING A WAVEGLOW-BASED NEURAL VOCODER WITH LINEAR PREDICTION ANALYSIS
Lin, Chun-Long
pg. 1448
C-1-1.2 - CONSTRUCTION OF CYCLICALLY PERMUTABLE CODES FROM PRIME LENGTH CYCLIC CODES
Lin, Guan-Wei
pg. 1578
D-3-1.4 - PRIVACY-PRESERVING DATA SHARING WITH ATTRIBUTE-BASED PRIVATE MATCHING BASED ON EDGE COMPUTATION IN THE INTERNET-OF-THINGS
Lin, Hsueh-Chun
pg. 1561
D-3-1.1 - LORA-BASED AIR QUALITY MONITORING SYSTEM USING CHATBOT
Lin, Jia Cheng
pg. 1234
D-2-2.1 - FUSION TECHNOLOGY OF RADAR AND RGB CAMERA SENSORS FOR OBJECT DETECTION AND TRACKING AND ITS EMBEDDED SYSTEM IMPLEMENTATION
Lin, Jing-Hua
pg. 561
E-2-2.1 - HARMONIC PRESERVING NEURAL NETWORKS FOR EFFICIENT AND ROBUST MULTIPITCH ESTIMATION
Lin, Po-Chiang
pg. 1557
D-2-1.6 - CELL OUTAGE DETECTION USING DEEP CONVOLUTIONAL AUTOENCODER IN MOBILE COMMUNICATION NETWORKS
Lin, Ting-An
pg. 1713
C-3-3.5 - SUPPORTIVE AND SELF ATTENTIONS FOR IMAGE CAPTION
Lin, Wen-Hsueh
pg. 1527
D-2-1.1 - DUAL ADAPTIVE MODULATION AND CODING FOR MITIGATING UE-UE INTERFERENCE IN HETEROGENEOUS TDD SLOT CONFIGURATIONS
Lin, Yiqing
pg. 161
B-1-2.5 - AN EVOLUTIONARY GAME THEORETICAL FRAMEWORK FOR DECISION FUSION IN THE PRESENCE OF BYZANTINES
Lin, Yu-Chen
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Lin, Yu-Jau
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Ling, Zhen-Hua
pg. 556
F-2-1.6 - ADVERSARIAL POST-PROCESSING OF VOICE CONVERSION AGAINST SPOOFING DETECTION
Ling, Zhenhua
pg. 815
F-3-2.2 - ONLINE SPEAKER ADAPTATION FOR WAVENET-BASED NEURAL VOCODERS
Liu, Bin
pg. 1043
D-1-1.2 - MICRO-EXPRESSION RECOGNITION BASED ON MULTIPLE AGGREGATION NETWORKS
Liu, Chaoran
pg. 1060
D-1-1.5 - 3D SKELETAL MOVEMENT ENHANCED EMOTION RECOGNITION NETWORK
Liu, Chun-Tai
pg. 1588
D-3-1.5 - COORDINATED DOWNLINK/UPLINK TRANSMISSION ASSIGNMENT AND DYNAMIC SWITCHING IN HYBRID TDD SYSTEM
Liu, Conggui
pg. 794
E-3-2.4 - SELF-ATTENTION FOR MULTI-CHANNEL SPEECH SEPARATION IN NOISY AND REVERBERANT ENVIRONMENTS
Liu, K. J. Ray
pg. 41
A-1-3.8 - DRIVER ARRIVAL SENSING FOR SMART CAR USING WIFI FINE TIME MEASUREMENTS
Liu, Li
pg. 1442
B-3-2.4 - A GENERATIVE ADVERSARIAL NETWORK FRAMEWORK FOR JPEG ANTI-FORENSICS
Liu, Li-Juan
pg. 556
F-2-1.6 - ADVERSARIAL POST-PROCESSING OF VOICE CONVERSION AGAINST SPOOFING DETECTION
Liu, Pengyu
pg. 1033
D-1-1.1 - CLOUD RECOGNITION BASED ON LIGHTWEIGHT NEURAL NETWORK
Liu, Te-Lung
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Liu, Yan
pg. 875
B-1-1.1 - CLASSIFICATION OF SEIZURE EEGS BASED ON SHORT-TIME FOURIER TRANSFORM AND HIDDEN MARKOV MODEL
Liu, Yi-Wen
pg. 1647
C-3-2.2 - MPOP600: A MANDARIN POPULAR SONG DATABASE WITH ALIGNED AUDIO, LYRICS, AND MUSICAL SCORES FOR SINGING VOICE SYNTHESIS
Liu, Yuxin
pg. 1707
C-3-3.4 - ADAPTIVE MULTI-PROTOTYPE RELATION NETWORK
pg. 1719
C-3-3.6 - ANTI-NOISE RELATION NETWORK FOR FEW-SHOT LEARNING
Llave, Adrien
pg. 686
F-2-3.5 - LOCALIZATION CUES PRESERVATION IN HEARING AIDS BY COMBINING NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION
Lu, Jian Xian
pg. 1234
D-2-2.1 - FUSION TECHNOLOGY OF RADAR AND RGB CAMERA SENSORS FOR OBJECT DETECTION AND TRACKING AND ITS EMBEDDED SYSTEM IMPLEMENTATION
Lu, Junchen
pg. 514
E-2-1.5 - VAW-GAN FOR SINGING VOICE CONVERSION WITH NON-PARALLEL TRAINING DATA
Lu, Yen-Ju
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Lumban Tobing, Patrick
pg. 520
E-2-1.6 - CROSS-LINGUAL VOICE CONVERSION USING A CYCLIC VARIATIONAL AUTO-ENCODER AND A WAVENET VOCODER
Luo, Shang-Bao
pg. 386
F-1-2.4 - SPOKEN MULTIPLE-CHOICE QUESTION ANSWERING USING MULTI-TURN AUDIO-EXTRACTER BERT
Lv, Zhao
pg. 1043
D-1-1.2 - MICRO-EXPRESSION RECOGNITION BASED ON MULTIPLE AGGREGATION NETWORKS
M
Ma, Chao
pg. 711
E-3-1.2 - OPTIMAL SCALE-INVARIANT SIGNAL-TO-NOISE RATIO AND CURRICULUM LEARNING FOR MONAURAL MULTI-SPEAKER SPEECH SEPARATION IN NOISY ENVIRONMENT
Ma, Hui
pg. 1725
C-3-3.7 - SMALL DATA-DRIVEN ELECTRICAL INSULATOR DEFECT DETECTION
Ma, Siwei
pg. 1122
D-1-3.4 - SSIM MOTIVATED QUALITY CONTROL FOR VERSATILE VIDEO CODING
Ma, Xiang
pg. 944
A-3-3.2 - HUMAN HAND MOVEMENT RECOGNITION BASED ON HMM WITH HYPERPARAMETERS OPTIMIZED BY MAXIMUM MUTUAL INFORMATION
Ma, Xinxin
pg. 365
E-1-2.6 - DEEP SEMANTIC ENCODER-DECODER NETWORK FOR ACOUSTIC SCENE CLASSIFICATION WITH MULTIPLE DEVICES
Ma, Yong
pg. 365
E-1-2.6 - DEEP SEMANTIC ENCODER-DECODER NETWORK FOR ACOUSTIC SCENE CLASSIFICATION WITH MULTIPLE DEVICES
Ma, Yunru
pg. 1201
D-3-2.3 - THE VALIDITY OF A DUAL AZURE KINECT-BASED MOTION CAPTURE SYSTEM FOR GAIT ANALYSIS: A PRELIMINARY STUDY
Ma, Zhanyu
pg. 1689
C-3-3.1 - SPEAKER VERIFICATION SYSTEM BASED ON DEFORMABLE CNN AND TIME-FREQUENCY ATTENTION
pg. 1707
C-3-3.4 - ADAPTIVE MULTI-PROTOTYPE RELATION NETWORK
pg. 1719
C-3-3.6 - ANTI-NOISE RELATION NETWORK FOR FEW-SHOT LEARNING
Mace, Brian
pg. 850
E-3-3.2 - SOURCE ENHANCEMENT FOR UNMANNED AERIAL VEHICLE RECORDING USING MULTI-SENSORY INFORMATION
Madhavi, Maulik
pg. 381
F-1-2.3 - OPENNLU: OPEN-SOURCE WEB-INTERFACE NLU TOOLKIT FOR DEVELOPMENT OF CONVERSATIONAL AGENT
pg. 644
E-2-3.4 - QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING GENERATIVE ADVERSARIAL NETWORK
Madono, Koki
pg. 1226
D-3-2.7 - EFFICIENT HUMAN-IN-THE-LOOP OBJECT DETECTION USING BI-DIRECTIONAL DEEP SORT AND ANNOTATION-FREE SEGMENT IDENTIFICATION
Makino, Shoji
pg. 858
E-3-3.3 - A STUDY ON GEOMETRICALLY CONSTRAINED IVA WITH AUXILIARY FUNCTION APPROACH AND VCD FOR IN-CAR COMMUNICATION
pg. 421
E-1-3.3 - APPLYING VIRTUAL MICROPHONES TO TRIANGULAR MICROPHONE ARRAY IN IN-CAR COMMUNICATION
Makishima, Naoki
pg. 1054
D-1-1.4 - UNSUPERVISED DOMAIN ADVERSARIAL TRAINING IN ANGULAR SPACE FOR FACIAL EXPRESSION RECOGNITION
Malligere Shivanna, Vinay
pg. 1234
D-2-2.1 - FUSION TECHNOLOGY OF RADAR AND RGB CAMERA SENSORS FOR OBJECT DETECTION AND TRACKING AND ITS EMBEDDED SYSTEM IMPLEMENTATION
Manamperi, Wageesha
pg. 156
B-1-2.4 - ESTIMATING DRONE MOTOR RELATED ACOUSTIC TRANSFER FUNCTION: A PRELIMINARY INVESTIGATION
Mandl, Magdalena
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Marattukalam, Felix
pg. 64
F-3-3.4 - SEGMENTATION OF PALM VEIN IMAGES USING U-NET
Marcia, Roummel
pg. 968
A-3-3.6 - A NEURAL NETWORK APPROACH FOR ANOMALY DETECTION IN GENOMIC SIGNALS
Masumura, Ryo
pg. 297
F-1-1.1 - DIALECT-AWARE MODELING FOR END-TO-END JAPANESE DIALECT SPEECH RECOGNITION
pg. 632
E-2-3.2 - END-TO-END AUTOMATIC SPEECH RECOGNITION WITH DEEP MUTUAL LEARNING
pg. 1054
D-1-1.4 - UNSUPERVISED DOMAIN ADVERSARIAL TRAINING IN ANGULAR SPACE FOR FACIAL EXPRESSION RECOGNITION
Masuyama, Yoshiki
pg. 788
E-3-2.3 - COMPUTER-RESOURCE-AWARE DEEP SPEECH SEPARATION WITH A RUN-TIME-SPECIFIED NUMBER OF BLSTM LAYERS
Matsumoto, Tomoya
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Matsumoto, Toshiyuki
pg. 958
A-3-3.4 - SHEET-TYPE DEVICE FOR UNCONSTRAINED HEART SOUND MEASUREMENT AND WHITE NOISE REDUCTION BY WIENER FILTER
Matsuoka, Ryo
pg. 17
A-1-3.4 - CONSTRAINED DESIGN OF TWO-DIMENSIONAL FIR FILTERS WITH SPARSE COEFFICIENTS
Matsushima, Taiki
pg. 1477
C-1-2.1 - COMPENSATION METHOD OF RECEIVED SIGNAL POWER OBSERVED BY SMARTPHONE FOR CROWDSENSED SPECTRUM DATABASE
Maulina, Novi
pg. 924
B-1-3.4 - PERFORMANCE EVALUATION OF BINARY CLASSIFICATION OF TUBERCULOSIS THROUGH UNSHARP MASKING AND DEEP LEARNING TECHNIQUE
MaungMaung, AprilPyone
pg. 1369
B-2-3.1 - AN EXTENSION OF ENCRYPTION-INSPIRED ADVERSARIAL DEFENSE WITH SECRET KEYS AGAINST ADVERSARIAL EXAMPLES
Mawalim, Candy Olivia
pg. 1321
B-3-3.6 - SPEECH INFORMATION HIDING BY MODIFICATION OF LSF QUANTIZATION INDEX IN CELP CODEC
Meng, Fanman
pg. 1096
D-1-2.5 - BLIND TONE-MAPPED IMAGE QUALITY ASSESSMENT AND ENHANCEMENT VIA DISENTANGLED REPRESENTATION LEARNING
Mimura, Masato
pg. 800
E-3-2.5 - END-TO-END MUSIC-MIXED SPEECH RECOGNITION
Mineno, Hiroshi
pg. 578
E-2-2.4 - A DATA AUGMENTATION TECHNIQUE FOR AUTOMATIC DETECTION OF CHEWING SIDE AND SWALLOWING
Miyata, Toma
pg. 1
A-1-3.1 - AN IMPROVED METHOD FOR INSTANTANEOUS FREQUENCY ESTIMATION USING A FINITE ORDER HILBERT TRANSFORMER
Miyazaki, Ryoichi
pg. 701
F-2-3.7 - EXPERIMENTAL INVESTIGATION OF ROBUSTNESS OF SPATIAL CEPSTRUM FEATURES UNDER VARIOUS RECORDING CONDITIONS
Mizumachi, Mitsunori
pg. 174
C-2-1.1 - SIMULTANEOUS MEASUREMENT OF TIME-INVARIANT LINEAR AND NONLINEAR, AND RANDOM AND EXTRA RESPONSES USING FREQUENCY DOMAIN VARIANT OF VELVET NOISE
Moeller, Ralf
pg. 189
C-2-1.3 - ON THE BEHAVIOUR OF PERMUTATION ENTROPY ON FRACTIONAL BROWNIAN MOTION IN A MULTIVARIATE SETTING
Mohr, Marisa
pg. 189
C-2-1.3 - ON THE BEHAVIOUR OF PERMUTATION ENTROPY ON FRACTIONAL BROWNIAN MOTION IN A MULTIVARIATE SETTING
Montlouis, Webert
pg. 963
A-3-3.5 - BOWEL MOVEMENT SIGNAL MODELING AND PARAMETERS EXTRACTION
Mori, Takeshi
pg. 319
F-1-1.5 - SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS
Mori, Yuto
pg. 1375
B-2-3.2 - DETECTION OF CLONED RECOGNIZERS: A DEFENDING METHOD AGAINST RECOGNIZER CLONING ATTACK
Morinobu, Shigeru
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Morise, Masanori
pg. 487
E-2-1.1 - PJS: PHONEME-BALANCED JAPANESE SINGING-VOICE CORPUS
pg. 174
C-2-1.1 - SIMULTANEOUS MEASUREMENT OF TIME-INVARIANT LINEAR AND NONLINEAR, AND RANDOM AND EXTRA RESPONSES USING FREQUENCY DOMAIN VARIANT OF VELVET NOISE
pg. 821
F-3-2.3 - IMPLEMENTATION OF SEQUENTIAL REAL-TIME WAVEFORM GENERATOR FOR HIGH-QUALITY VOCODER
Muchtar, Kahlil
pg. 924
B-1-3.4 - PERFORMANCE EVALUATION OF BINARY CLASSIFICATION OF TUBERCULOSIS THROUGH UNSHARP MASKING AND DEEP LEARNING TECHNIQUE
Munadi, Khairul
pg. 924
B-1-3.4 - PERFORMANCE EVALUATION OF BINARY CLASSIFICATION OF TUBERCULOSIS THROUGH UNSHARP MASKING AND DEEP LEARNING TECHNIQUE
Munir, Muhammad Waqas
pg. 22
A-1-3.5 - BARK FREQUENCY SPECTRUM IN PARALLEL-FORM REMOTE ACTIVE NOISE CONTROL
Murai, Keisuke
pg. 1017
B-3-1.3 - DISCOVERY OF EVENT-RELATED POTENTIALS DURING A COGNITIVE PROCESS OF COMPARISON OPERATION
Muramatsu, Shogo
pg. 1216
D-3-2.5 - FIXED-POINT ARITHMETIC OF L2-NORM APPROXIMATION FOR 2-TUPLE ARRAYS WITH ROTATED L1-NORM EVALUATION
pg. 1182
D-2-3.7 - IMAGE RESTORATION BY GROUP SPARSITY WITH UNION OF HIERARCHICAL DIRLOTS
Mussabayeva, Ayana
pg. 222
C-2-2.2 - COMPARISON OF GENERIC AND SUBJECT-SPECIFIC TRAINING FOR FEATURES CLASSIFICATION IN P300 SPELLER
N
Nagasawa, Yurina
pg. 1541
D-2-1.3 - PREDICTION METHOD OF MALWARE INFECTION SPREADING CONSIDERING NETWORK SCALE
Naghibzadeh-Jalali, Anahid
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Nakadai, Kazuhiro
pg. 184
C-2-1.2 - AGE CLASSIFICATION OF EVACUEES AT TIMES OF DISASTER USING A VIBRATION SENSOR
Nakahara, Yusuke
pg. 1222
D-3-2.6 - RAPID AND ACCURATE LOCAL GAUSSIAN NOISE REMOVAL
Nakai-Kasai, Ayano
pg. 228
C-2-2.3 - OPTIMAL COMBINATION WEIGHT FOR SPARSE DIFFUSION LEAST-MEAN-SQUARE BASED ON CONSENSUS PROPAGATION
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
Nakamura, Akihiro
pg. 578
E-2-2.4 - A DATA AUGMENTATION TECHNIQUE FOR AUTOMATIC DETECTION OF CHEWING SIDE AND SWALLOWING
Nakamura, Eita
pg. 500
E-2-1.3 - A VARIATIONAL AUTOENCODER FOR JOINT CHORD AND KEY ESTIMATION FROM AUDIO CHROMAGRAMS
pg. 359
E-1-2.5 - TATUM-LEVEL DRUM TRANSCRIPTION BASED ON A CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH LANGUAGE MODEL-BASED REGULARIZED TRAINING
Nakamura, Kazuaki
pg. 1375
B-2-3.2 - DETECTION OF CLONED RECOGNIZERS: A DEFENDING METHOD AGAINST RECOGNIZER CLONING ATTACK
Nakamura, Kazuki
pg. 1400
B-2-3.6 - DEEP FACE RECOGNIZER PRIVACY ATTACK: MODEL INVERSION INITIALIZATION BY A DEEP GENERATIVE ADVERSARIAL DATA SPACE DISCRIMINATOR
Nakano, Teppei
pg. 1226
D-3-2.7 - EFFICIENT HUMAN-IN-THE-LOOP OBJECT DETECTION USING BI-DIRECTIONAL DEEP SORT AND ANNOTATION-FREE SEGMENT IDENTIFICATION
Nakashika, Toru
pg. 471
F-1-3.5 - GAMMA BOLTZMANN MACHINE FOR SIMULTANEOUSLY MODELING LINEAR- AND LOG-AMPLITUDE SPECTRA
Nakatani, Hikaru
pg. 520
E-2-1.6 - CROSS-LINGUAL VOICE CONVERSION USING A CYCLIC VARIATIONAL AUTO-ENCODER AND A WAVENET VOCODER
NAKAYAMA, Masato
pg. 409
E-1-3.1 - EVALUATION OF A MULTI-WAY PARAMETRIC ARRAY LOUDSPEAKER BASED ON MULTIPLEXED DOUBLE SIDEBAND MODULATION
Nangu, Shota
pg. 1536
D-2-1.2 - OPTIMIZATION OF VIRTUAL MACHINE PLACEMENT FOR BALANCING NETWORK AND SERVER LOAD IN EDGE COMPUTING ENVIRONMENTS
Narieda, Shusuke
pg. 1519
C-1-3.4 - ON PLACEMENT OF END DEVICES IN LPWAN BASED WSN FOR ENVIRONMENTAL MONITORING APPLICATIONS
Naruse, Hiroshi
pg. 1519
C-1-3.4 - ON PLACEMENT OF END DEVICES IN LPWAN BASED WSN FOR ENVIRONMENTAL MONITORING APPLICATIONS
Natori, Takahiro
pg. 1
A-1-3.1 - AN IMPROVED METHOD FOR INSTANTANEOUS FREQUENCY ESTIMATION USING A FINITE ORDER HILBERT TRANSFORMER
Ngan, King Ngi
pg. 1096
D-1-2.5 - BLIND TONE-MAPPED IMAGE QUALITY ASSESSMENT AND ENHANCEMENT VIA DISENTANGLED REPRESENTATION LEARNING
Ngo, Thuanvan
pg. 753
F-3-1.3 - ENHANCEMENT OF SPEECH INTELLIGIBILITY UNDER NOISY REVERBERANT CONDITIONS BASED ON MODULATION SPECTRUM CONCEPT
Nguyen, Anh H. T.
pg. 841
E-3-3.1 - A JOINT-LOSS APPROACH FOR SPEECH ENHANCEMENT VIA SINGLE-CHANNEL NEURAL NETWORK AND MVDR BEAMFORMER
Nguyen, Huy
pg. 1386
B-2-3.4 - DETECTION OF ADVERSARIAL EXAMPLES BASED ON SENSITIVITIES TO NOISE REMOVAL FILTER
Nguyen, Huy H.
pg. 1293
B-3-3.1 - A METHOD FOR IDENTIFYING ORIGIN OF DIGITAL IMAGES USING A CONVOLUTIONAL NEURAL NETWORK
Nguyen, Manh Hung
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Nie, Kaibao
pg. 1653
C-3-2.3 - IMPROVING KEYWORDS SPOTTING PERFORMANCE IN NOISE WITH AUGMENTED DATASET FROM VOCODED SPEECH
Nishigaki, Masakatsu
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
Nishijima, Keisuke
pg. 929
B-1-3.5 - HYPERPARAMETER TUNING OF THE SHUNT-MURMUR DISCRIMINATION ALGORITHM USING BAYESIAN OPTIMIZATION
Nishikawa, Kiyoshi
pg. 216
C-2-2.1 - LOW COMPLEXITY IMPLEMENTATION METHOD FOR THE ADAPTIVE FILTERS BASED ON THE GAUSSIAN MODEL
Nishikimi, Ryo
pg. 359
E-1-2.5 - TATUM-LEVEL DRUM TRANSCRIPTION BASED ON A CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH LANGUAGE MODEL-BASED REGULARIZED TRAINING
Nishimura, Masafumi
pg. 578
E-2-2.4 - A DATA AUGMENTATION TECHNIQUE FOR AUTOMATIC DETECTION OF CHEWING SIDE AND SWALLOWING
Nishio, Keita
pg. 958
A-3-3.4 - SHEET-TYPE DEVICE FOR UNCONSTRAINED HEART SOUND MEASUREMENT AND WHITE NOISE REDUCTION BY WIENER FILTER
NISHIURA, Takanobu
pg. 409
E-1-3.1 - EVALUATION OF A MULTI-WAY PARAMETRIC ARRAY LOUDSPEAKER BASED ON MULTIPLEXED DOUBLE SIDEBAND MODULATION
Nishiura, Takanobu
pg. 266
E-1-1.1 - STUDY ON FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM WITH OPTICAL LASER MICROPHONE TO DETECT REFERENCE SIGNAL WITH SHORT DELAY
pg. 662
F-2-3.1 - HARMONIC STRUCTURE MASK FOR SPEECH ENHANCEMENT USING SPARSITY REGULARIZATION
pg. 449
F-1-3.1 - SPEECH ENHANCEMENT FOR OPTICAL LASER MICROPHONE WITH DEEP NEURAL NETWORK
pg. 272
E-1-1.2 - FEEDFORWARD ACTIVE NOISE CONTROL WITH COHERENCE-ADJUSTING FILTER FOR IMPROVING NOISE REDUCTION PERFORMANCE UNDER LOW-COHERENCE CONDITION
Nishizaki, Hiromitsu
pg. 621
F-2-2.6 - ANALYSIS OF BIT SEQUENCE REPRESENTATION FOR SOUND CLASSIFICATION
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
Nitta, Naoko
pg. 1375
B-2-3.2 - DETECTION OF CLONED RECOGNIZERS: A DEFENDING METHOD AGAINST RECOGNIZER CLONING ATTACK
pg. 1400
B-2-3.6 - DEEP FACE RECOGNIZER PRIVACY ATTACK: MODEL INVERSION INITIALIZATION BY A DEEP GENERATIVE ADVERSARIAL DATA SPACE DISCRIMINATOR
Niu, Mingyue
pg. 1043
D-1-1.2 - MICRO-EXPRESSION RECOGNITION BASED ON MULTIPLE AGGREGATION NETWORKS
Noda, Fumiya
pg. 929
B-1-3.5 - HYPERPARAMETER TUNING OF THE SHUNT-MURMUR DISCRIMINATION ALGORITHM USING BAYESIAN OPTIMIZATION
O
Obaidellah, Unaizah
pg. 1008
B-3-1.2 - PREDICTING EXPERTISE AMONG NOVICE PROGRAMMERS WITH PRIOR KNOWLEDGE ON PROGRAMMING TASKS
Ogawa, Tetsuji
pg. 1226
D-3-2.7 - EFFICIENT HUMAN-IN-THE-LOOP OBJECT DETECTION USING BI-DIRECTIONAL DEEP SORT AND ANNOTATION-FREE SEGMENT IDENTIFICATION
Oh, Suhyeon
pg. 831
F-3-2.5 - EXCITGLOW: IMPROVING A WAVEGLOW-BASED NEURAL VOCODER WITH LINEAR PREDICTION ANALYSIS
Ohgane, Takeo
pg. 100
A-2-3.1 - AN EVALUATION OF A CNN-BASED PARKING DETECTION SYSTEM WITH WEBCAMS
pg. 110
A-2-3.2 - AN EVALUATION OF DESIGN FRAMEWORK FOR MIN-SUM IRREGULAR LDPC DECODERS
Ohki, Tetsushi
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
Ohta, Ken
pg. 578
E-2-2.4 - A DATA AUGMENTATION TECHNIQUE FOR AUTOMATIC DETECTION OF CHEWING SIDE AND SWALLOWING
Ohta, Mai
pg. 1513
C-1-3.3 - ESTIMATION OF DESIRED POWER AND UNDESIRED POWER USING CHIRP DEMODULATION AND EVALUATION OF ACCURACY
pg. 1497
C-1-2.4 - SPECIFICATION OF LINK QUALITY DEGRADATION IN WLAN BASED ON MCS AND RETRANSMISSION FLAG
pg. 1460
C-1-1.4 - AUTONOMOUS DECENTRALIZED TRANSMISSION TIMING CONTROL IN WIRELESS SENSOR NETWORK
pg. 1523
C-1-3.5 - SPECTRUM SHARING FOR INTERNET OF THINGS SYSTEM IN PERIODIC TRANSMISSION
Okabe, Ryo
pg. 1453
C-1-1.3 - LOW-COMPLEXITY ROBUST BEAMFORMING WITH BLOCKAGE PREDICTION FOR MILLIMETER-WAVE COMMUNICATIONS
Okada, Go
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Okada, Satoshi
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Okamoto, Yasumasa
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Okawa, Masaki
pg. 621
F-2-2.6 - ANALYSIS OF BIT SEQUENCE REPRESENTATION FOR SOUND CLASSIFICATION
Okuda, Masahiro
pg. 17
A-1-3.4 - CONSTRAINED DESIGN OF TWO-DIMENSIONAL FIR FILTERS WITH SPARSE COEFFICIENTS
Okudera, Ryosuke
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
Ong, Simying
pg. 1361
B-2-2.4 - DATA EMBEDDING METHOD USING PHOTO EFFECTS WITH RESISTANCE TO COMPRESSION
Ong, Yi Fan
pg. 381
F-1-2.3 - OPENNLU: OPEN-SOURCE WEB-INTERFACE NLU TOOLKIT FOR DEVELOPMENT OF CONVERSATIONAL AGENT
Ono, Naoki
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
Ono, Nobutaka
pg. 863
E-3-3.4 - DYNAMIC SYNCHRONOUS AVERAGING FOR ENHANCEMENT OF PERIODIC SIGNAL UNDER SAMPLING FREQUENCY VARIATION
pg. 701
F-2-3.7 - EXPERIMENTAL INVESTIGATION OF ROBUSTNESS OF SPATIAL CEPSTRUM FEATURES UNDER VARIOUS RECORDING CONDITIONS
pg. 443
E-1-3.7 - ENERGY-BASED MULTIPLE SOURCE LOCALIZATION WITH BLINKIES
Oostermeijer, Koen
pg. 465
F-1-3.4 - FREQUENCY GATING: IMPROVED CONVOLUTIONAL NEURAL NETWORKS FOR SPEECH ENHANCEMENT IN THE TIME-FREQUENCY DOMAIN
Orihashi, Shota
pg. 1107
D-1-3.1 - SUBJECTIVE QUALITY DRIVEN IMAGE ENCODING METHOD USING IMAGE COMPLETION
pg. 1054
D-1-1.4 - UNSUPERVISED DOMAIN ADVERSARIAL TRAINING IN ANGULAR SPACE FOR FACIAL EXPRESSION RECOGNITION
OUCHI, Shohei
pg. 909
B-1-3.1 - DEEP-LEARNING-BASED MR COMPRESSED SENSING USING NON-RANDOMLY UNDER-SAMPLED SIGNAL IN NONLINEAR PHASE ENCODING IMAGING
Ouchi, Yumo
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
Ouyang, Zhiheng
pg. 764
F-3-1.5 - AN INTEGRATED CNN-GRU FRAMEWORK FOR COMPLEX RATIO MASK ESTIMATION IN SPEECH ENHANCEMENT
Ozawa, Kenji
pg. 46
F-3-3.1 - NOISE SUPPRESSION USING A DIFFERENTIAL-TYPE MICROPHONE ARRAY AND TWO-DIMENSIONAL AMPLITUDE AND PHASE SPECTRA
P
P.A., Karthick
pg. 905
B-1-1.6 - GEOMETRIC FEATURES BASED MUSCLE FATIGUE ANALYSIS USING LOW FREQUENCY BAND IN SURFACE ELECTROMYOGRAPHIC SIGNALS
Paliwal, Kuldip K.
pg. 667
F-2-3.2 - DEEP RESIDUAL NETWORK-BASED AUGMENTED KALMAN FILTER FOR SPEECH ENHANCEMENT
Pan, Jen-Yi
pg. 1527
D-2-1.1 - DUAL ADAPTIVE MODULATION AND CODING FOR MITIGATING UE-UE INTERFERENCE IN HETEROGENEOUS TDD SLOT CONFIGURATIONS
pg. 1588
D-3-1.5 - COORDINATED DOWNLINK/UPLINK TRANSMISSION ASSIGNMENT AND DYNAMIC SWITCHING IN HYBRID TDD SYSTEM
Pan, Jiazhen
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Pan, Lei
pg. 1725
C-3-3.7 - SMALL DATA-DRIVEN ELECTRICAL INSULATOR DEFECT DETECTION
Pao, Wei-Chen
pg. 1527
D-2-1.1 - DUAL ADAPTIVE MODULATION AND CODING FOR MITIGATING UE-UE INTERFERENCE IN HETEROGENEOUS TDD SLOT CONFIGURATIONS
pg. 1588
D-3-1.5 - COORDINATED DOWNLINK/UPLINK TRANSMISSION ASSIGNMENT AND DYNAMIC SWITCHING IN HYBRID TDD SYSTEM
Park, Hyunkook
pg. 1268
D-3-3.4 - MOIRÉ ARTIFACTS REMOVAL IN SCREEN-SHOT IMAGES VIA MULTIPLE DOMAIN LEARNING
Park, Jae Sung
pg. 1067
D-1-2.1 - LOCAL BACKLIGHT DIMMING FOR LIQUID CRYSTAL DISPLAYS VIA CONVOLUTIONAL NEURAL NETWORK
Park, Min-Je
pg. 1257
D-3-3.2 - PROGRESSIVE DEEP NETWORK WITH CHANNEL BACK-PROJECTION FOR HYPERSPECTRAL RECOVERY FROM RGB
Park, Ye seung
pg. 1274
D-3-3.5 - DATA REDUCTION USING CLUSTER SAMPLING
Patil, Ankur T.
pg. 532
F-2-1.2 - SIGNIFICANCE OF CMVN FOR REPLAY SPOOF DETECTION
pg. 538
F-2-1.3 - SUBBAND CHANNEL SELECTION USING TEO FOR REPLAY SPOOF DETECTION IN VOICE ASSISTANTS
Patil, Hemant
pg. 644
E-2-3.4 - QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING GENERATIVE ADVERSARIAL NETWORK
Patil, Hemant A.
pg. 532
F-2-1.2 - SIGNIFICANCE OF CMVN FOR REPLAY SPOOF DETECTION
pg. 538
F-2-1.3 - SUBBAND CHANNEL SELECTION USING TEO FOR REPLAY SPOOF DETECTION IN VOICE ASSISTANTS
pg. 353
E-1-2.4 - SYMMETRY IN THE STRUCTURE OF MUSICAL NODES
pg. 543
F-2-1.4 - DESIGN OF VOICE PRIVACY SYSTEM USING LINEAR PREDICTION
Peng, Junyi
pg. 595
F-2-2.1 - CONTEXT-ADAPTIVE GAUSSIAN ATTENTION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Peng, Renhua
pg. 769
F-3-1.6 - A TIME-DOMAIN MONAURAL SPEECH ENHANCEMENT WITH FEEDBACK LEARNING
Peng, Rui
pg. 57
F-3-3.3 - AN ACOUSTIC SIGNAL PROCESSING SYSTEM FOR IDENTIFICATION OF QUEEN-LESS BEEHIVES
Penzel, Thomas
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Phillips, Tessa
pg. 1631
C-2-3.7 - GENERALISATION TECHNIQUES USING A VARIATIONAL CEAE FOR CLASSIFYING MANUKA HONEY QUALITY
Ping, Yeh-Hong
pg. 1557
D-2-1.6 - CELL OUTAGE DETECTION USING DEEP CONVOLUTIONAL AUTOENCODER IN MOBILE COMMUNICATION NETWORKS
Pradhan, Biswajeet
pg. 924
B-1-3.4 - PERFORMANCE EVALUATION OF BINARY CLASSIFICATION OF TUBERCULOSIS THROUGH UNSHARP MASKING AND DEEP LEARNING TECHNIQUE
Pradhan, Somanath
pg. 681
F-2-3.4 - A VARIABLE STEP SIZE IMPROVED MULTIBAND-STRUCTURED SUBBAND ADAPTIVE FEEDBACK CANCELLATION SCHEME FOR HEARING AIDS
Prajapati, Gauri
pg. 543
F-2-1.4 - DESIGN OF VOICE PRIVACY SYSTEM USING LINEAR PREDICTION
Prieto, Claudia
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Q
Qi, Haikun
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Qin, Boyu
pg. 6
A-1-3.2 - A NEW ALGORITHM TO DERIVE HARDWARE EFFICIENT INTEGER DISCRETE COSINE TRANSFORM FOR HEVC
Qiu, Xiaojun
pg. 681
F-2-3.4 - A VARIABLE STEP SIZE IMPROVED MULTIBAND-STRUCTURED SUBBAND ADAPTIVE FEEDBACK CANCELLATION SCHEME FOR HEARING AIDS
R
R, Sreeraj
pg. 644
E-2-3.4 - QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING GENERATIVE ADVERSARIAL NETWORK
Raikar, Aditya
pg. 437
E-1-3.6 - LEARNING BASED DOA ESTIMATION IN ADVERSE ACOUSTIC ENVIRONMENT USING CO-PRIME CIRCULAR MICROPHONE ARRAY
Rao, Wei
pg. 605
F-2-2.3 - HLT-NUS SUBMISSION FOR 2019 NIST MULTIMEDIA SPEAKER RECOGNITION EVALUATION
Rao, Zhibo
pg. 150
B-1-2.3 - CLASS ATTENTION NETWORK FOR SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGES
Ren, Yanzhen
pg. 1331
B-3-3.7 - A SECURE OPUS PULSE STEGANOGRAPHIC SCHEME BASED ON MESSAGE TRANSFORM
Ringhofer, Monamie
pg. 939
A-3-3.1 - MATHEMATICAL MODEL OF HORSE AND RIDER INTERACTION DURING HORSE JUMPING
Ritz, Christian
pg. 426
E-1-3.4 - SEMI-ADAPTIVE BEAMFORMING FOR CO-PRIME CIRCULAR MICROPHONE ARRAYS
Routray, Gyanajyoti
pg. 437
E-1-3.6 - LEARNING BASED DOA ESTIMATION IN ADVERSE ACOUSTIC ENVIRONMENT USING CO-PRIME CIRCULAR MICROPHONE ARRAY
Roy, Sujan Kumar
pg. 667
F-2-3.2 - DEEP RESIDUAL NETWORK-BASED AUGMENTED KALMAN FILTER FOR SPEECH ENHANCEMENT
Rueckert, Daniel
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
S
Saeki, Kazuya
pg. 371
F-1-2.1 - LANGUAGE MODEL ADAPTATION FOR EMOTIONAL SPEECH RECOGNITION USING TWEET DATA
Saito, Takato
pg. 578
E-2-2.4 - A DATA AUGMENTATION TECHNIQUE FOR AUTOMATIC DETECTION OF CHEWING SIDE AND SWALLOWING
Sakakibara, Ken-Ichi
pg. 174
C-2-1.1 - SIMULTANEOUS MEASUREMENT OF TIME-INVARIANT LINEAR AND NONLINEAR, AND RANDOM AND EXTRA RESPONSES USING FREQUENCY DOMAIN VARIANT OF VELVET NOISE
Sakashita, Kazuki
pg. 1182
D-2-3.7 - IMAGE RESTORATION BY GROUP SPARSITY WITH UNION OF HIERARCHICAL DIRLOTS
Samarasinghe, Prasanga
pg. 156
B-1-2.4 - ESTIMATING DRONE MOTOR RELATED ACOUSTIC TRANSFER FUNCTION: A PRELIMINARY INVESTIGATION
pg. 734
E-3-1.6 - ON THE USE OF THE RELATIVE TRANSFER FUNCTION FOR SOURCE SEPARATION USING TWO-CHANNEL RECORDINGS
Sampei, Seiichi
pg. 1483
C-1-2.2 - 3D CONVOLUTIONAL NEURAL NETWORK-AIDED INDOOR POSITIONING BASED ON FINGERPRINTS OF BLE RSSI
Sano, Yuta
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
Saruwatari, Hiroshi
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Sasaki, Tetsuya
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
Sato, Yoshinao
pg. 794
E-3-2.4 - SELF-ATTENTION FOR MULTI-CHANNEL SPEECH SEPARATION IN NOISY AND REVERBERANT ENVIRONMENTS
Sawyer, Erica
pg. 968
A-3-3.6 - A NEURAL NETWORK APPROACH FOR ANOMALY DETECTION IN GENOMIC SIGNALS
Scheibler, Robin
pg. 705
E-3-1.1 - OVER-DETERMINED SPEECH SOURCE SEPARATION AND DEREVERBERATION
pg. 674
F-2-3.3 - A STUDY ON MORE REALISTIC ROOM SIMULATION FOR FAR-FIELD KEYWORD SPOTTING
pg. 443
E-1-3.7 - ENERGY-BASED MULTIPLE SOURCE LOCALIZATION WITH BLINKIES
Schindler, Alexander
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Seet, Boon-Chong
pg. 1472
C-1-1.6 - 24 GHZ FLEXIBLE LCP ANTENNA ARRAY FOR RADAR-BASED NONCONTACT VITAL SIGN MONITORING
Segawa, Hanako
pg. 421
E-1-3.3 - APPLYING VIRTUAL MICROPHONES TO TRIANGULAR MICROPHONE ARRAY IN IN-CAR COMMUNICATION
Seidel, Stefan
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Seiya, Shunya
pg. 1679
C-3-2.7 - INTERVENTION FORCE-BASED IMITATION LEARNING FOR AUTONOMOUS NAVIGATION IN DYNAMIC ENVIRONMENTS
Senda, Hirotaka
pg. 1497
C-1-2.4 - SPECIFICATION OF LINK QUALITY DEGRADATION IN WLAN BASED ON MCS AND RETRANSMISSION FLAG
Seta, Shogo
pg. 1222
D-3-2.6 - RAPID AND ACCURATE LOCAL GAUSSIAN NOISE REMOVAL
Shah, Neil
pg. 644
E-2-3.4 - QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING GENERATIVE ADVERSARIAL NETWORK
pg. 727
E-3-1.5 - IMPACT OF MINIMUM HYPERSPHERICAL ENERGY REGULARIZATION ON TIME-FREQUENCY DOMAIN NETWORKS FOR SINGING VOICE SEPARATION
Shah, Nirmesh
pg. 644
E-2-3.4 - QUERY-BY-EXAMPLE SPOKEN TERM DETECTION USING GENERATIVE ADVERSARIAL NETWORK
Shao, Yunfei
pg. 365
E-1-2.6 - DEEP SEMANTIC ENCODER-DECODER NETWORK FOR ACOUSTIC SCENE CLASSIFICATION WITH MULTIPLE DEVICES
Sharifzadeh, Hamid
pg. 57
F-3-3.3 - AN ACOUSTIC SIGNAL PROCESSING SYSTEM FOR IDENTIFICATION OF QUEEN-LESS BEEHIVES
pg. 1310
B-3-3.4 - VEIN PATTERN VISUALISATION USING CONDITIONAL GENERATIVE ADVERSARIAL NETWORKS
She, Wenxiang
pg. 1043
D-1-1.2 - MICRO-EXPRESSION RECOGNITION BASED ON MULTIPLE AGGREGATION NETWORKS
Shen, Xubang
pg. 118
A-2-3.3 - FAST INTER-FRAME PREDICTION BASED ARRAY PROCESSOR FOR DEPTH MAPS IN 3D-HEVC
Sheng, Bo
pg. 1201
D-3-2.3 - THE VALIDITY OF A DUAL AZURE KINECT-BASED MOTION CAPTURE SYSTEM FOR GAIT ANALYSIS: A PRELIMINARY STUDY
Shi, Chuang
pg. 416
E-1-3.2 - MULTI-BEAM DESIGN METHOD FOR A STEERABLE PARAMETRIC ARRAY LOUDSPEAKER
pg. 278
E-1-1.3 - EFFECT OF CROSS-CHANNEL CONTROL FILTERS IN MULTI-CHANNEL FEEDBACK ACTIVE NOISE CONTROL
pg. 283
E-1-1.4 - SIMULTANEOUS VARIABLE PERTURBATION METHOD FOR THE ACTIVE NOISE CONTROL SYSTEM WITH A WIRELESS ERROR MICROPHONE
Shi, Jiaqi
pg. 1060
D-1-1.5 - 3D SKELETAL MOVEMENT ENHANCED EMOTION RECOGNITION NETWORK
SHIMAMURA, Tetsuya
pg. 242
C-2-2.5 - ADVERSARIAL TRAINING USING INTER/INTRA-ATTENTION ARCHITECTURE FOR SPEECH ENHANCEMENT NETWORK
Shimizu, Yu
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Shimochi, Saeka
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
Shin, Hyeon-Kyeong
pg. 308
F-1-1.3 - SPEAKER-INVARIANT PSYCHOLOGICAL STRESS DETECTION USING ATTENTION-BASED NETWORK
Shiomi, Yuya
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
Shiota, Sayaka
pg. 297
F-1-1.1 - DIALECT-AWARE MODELING FOR END-TO-END JAPANESE DIALECT SPEECH RECOGNITION
Shiozawa, Koichiro
pg. 46
F-3-3.1 - NOISE SUPPRESSION USING A DIFFERENTIAL-TYPE MICROPHONE ARRAY AND TWO-DIMENSIONAL AMPLITUDE AND PHASE SPECTRA
Siddiq, Asif
pg. 1132
D-1-3.6 - EVALUATION OF THE ENCODING ACCURACY OF THE PQ BASED HDR CONTENT DELIVERY FORMATS
Sillanpää, Hannu
pg. 1300
B-3-3.2 - COST SENSITIVE OPTIMIZATION OF DEEPFAKE DETECTOR
Sim, Jae-Young
pg. 1252
D-3-3.1 - DEEP LEARNING BASED DEPTH ESTIMATION AND RECONSTRUCTION OF LIGHT FIELD IMAGES
pg. 1193
D-3-2.2 - MULTISCALE SALIENCY DETECTION FOR COLORED 3D POINT CLOUDS BASED ON RANDOM WALK
Sindi, Suzanne
pg. 968
A-3-3.6 - A NEURAL NETWORK APPROACH FOR ANOMALY DETECTION IN GENOMIC SIGNALS
Singh, Shrishti
pg. 543
F-2-1.4 - DESIGN OF VOICE PRIVACY SYSTEM USING LINEAR PREDICTION
Sirichotedumrong, Warit
pg. 1304
B-3-3.3 - VISUAL SECURITY EVALUATION OF LEARNABLE IMAGE ENCRYPTION METHODS AGAINST CIPHERTEXT-ONLY ATTACKS
Sisman, Berrak
pg. 507
E-2-1.4 - SPECTRUM AND PROSODY CONVERSION FOR CROSS-LINGUAL VOICE CONVERSION WITH CYCLEGAN
pg. 514
E-2-1.5 - VAW-GAN FOR SINGING VOICE CONVERSION WITH NON-PARALLEL TRAINING DATA
Soh, Jae Woong
pg. 1067
D-1-2.1 - LOCAL BACKLIGHT DIMMING FOR LIQUID CRYSTAL DISPLAYS VIA CONVOLUTIONAL NEURAL NETWORK
Song, Eunwoo
pg. 810
F-3-2.1 - LP-WAVENET: LINEAR PREDICTION-BASED WAVENET SPEECH SYNTHESIS
pg. 831
F-3-2.5 - EXCITGLOW: IMPROVING A WAVEGLOW-BASED NEURAL VOCODER WITH LINEAR PREDICTION ANALYSIS
Song, Hui
pg. 118
A-2-3.3 - FAST INTER-FRAME PREDICTION BASED ARRAY PROCESSOR FOR DEPTH MAPS IN 3D-HEVC
Song, Liming
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Song, YuXin
pg. 1725
C-3-3.7 - SMALL DATA-DRIVEN ELECTRICAL INSULATOR DEFECT DETECTION
Soong, Frank
pg. 810
F-3-2.1 - LP-WAVENET: LINEAR PREDICTION-BASED WAVENET SPEECH SYNTHESIS
Sri-iesaranusorn, Panyawut
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
Stefanic-Kejik, Andrijana
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Su, Dan
pg. 720
E-3-1.4 - INDEPENDENT VECTOR ANALYSIS FOR BLIND SPEECH SEPARATION USING COMPLEX GENERALIZED GAUSSIAN MIXTURE MODEL WITH WEIGHTED VARIANCE
Su, Feng-Guang
pg. 1188
D-3-2.1 - DIVERSE AUDIO-TO-IMAGE GENERATION VIA SEMANTICS AND FEATURE CONSISTENCY
Su, Juyn-Da
pg. 82
F-3-3.7 - PROCESSING ELEMENT ARCHITECTURE DESIGN FOR DEEP REINFORCEMENT LEARNING WITH FLEXIBLE BLOCK FLOATING POINT EXPLOITING SIGNAL STATISTICS
Su, Li
pg. 561
E-2-2.1 - HARMONIC PRESERVING NEURAL NETWORKS FOR EFFICIENT AND ROBUST MULTIPITCH ESTIMATION
pg. 346
E-1-2.3 - BEAT AND DOWNBEAT TRACKING OF SYMBOLIC MUSIC DATA USING DEEP RECURRENT NEURAL NETWORKS
Su, Po-Chyi
pg. 1247
D-2-2.3 - SCENE TEXT-LINE EXTRACTION WITH FULLY CONVOLUTIONAL NETWORK AND REFINED PROPOSALS
Sugimoto, Ayaka
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
SUGIURA, Yosuke
pg. 242
C-2-2.5 - ADVERSARIAL TRAINING USING INTER/INTRA-ATTENTION ARCHITECTURE FOR SPEECH ENHANCEMENT NETWORK
Sumiyoshi, Kyosuke
pg. 863
E-3-3.4 - DYNAMIC SYNCHRONOUS AVERAGING FOR ENHANCEMENT OF PERIODIC SIGNAL UNDER SAMPLING FREQUENCY VARIATION
Sun, Huiyuan
pg. 694
F-2-3.6 - MODELLING ROOM REVERBERATION DIRECTIVITY USING VON MISES-FISHER MIXTURE DISTRIBUTION
Sun, Wei
pg. 1352
B-2-2.3 - DENSELY CONNECTED CONVOLUTIONAL NETWORK FOR AUDIO SPOOFING DETECTION
pg. 1442
B-3-2.4 - A GENERATIVE ADVERSARIAL NETWORK FRAMEWORK FOR JPEG ANTI-FORENSICS
Sunil Phatnani, Kirtana
pg. 353
E-1-2.4 - SYMMETRY IN THE STRUCTURE OF MUSICAL NODES
Susun, Dingkai
pg. 1725
C-3-3.7 - SMALL DATA-DRIVEN ELECTRICAL INSULATOR DEFECT DETECTION
Suzuki, Jimpu
pg. 110
A-2-3.2 - AN EVALUATION OF DESIGN FRAMEWORK FOR MIN-SUM IRREGULAR LDPC DECODERS
Suzuki, Michiyasu
pg. 972
A-3-3.7 - EVALUATION OF THE PRESSURE MEASUREMENT FUNCTION OF AN IMPLANTABLE MULTIMODALITY PROBE
Suzuki, Taizo
pg. 1118
D-1-3.3 - TWO-LAYER LOSSLESS CODING OF HDR IMAGES SPECIALIZED FOR RADIANCE FORMAT
Swaminathan, Ramakrishnan
pg. 905
B-1-1.6 - GEOMETRIC FEATURES BASED MUSCLE FATIGUE ANALYSIS USING LOW FREQUENCY BAND IN SURFACE ELECTROMYOGRAPHIC SIGNALS
Séguier, Renaud
pg. 686
F-2-3.5 - LOCALIZATION CUES PRESERVATION IN HEARING AIDS BY COMBINING NOISE REDUCTION AND DYNAMIC RANGE COMPRESSION
T
Tachioka, Yuuki
pg. 627
E-2-3.1 - PRIVACY PRESERVING ACOUSTIC MODEL TRAINING FOR SPEECH RECOGNITION
Tahir Akhtar, Muhammad
pg. 236
C-2-2.4 - EXPLOITING THE RULES OF THE TF-MUSIC AND SPATIAL SMOOTHING TO ENHANCE THE DOA ESTIMATION FOR COHERENT AND NON-STATIONARY SOURCES
Takagi, Gen
pg. 1001
B-3-1.1 - PREDICTION OF SOCIAL MALADAPTATION USING EMOTIONAL ENTRAINMENT OF DISGUST DURING COMPREHENSIVE PSYCHIATRIC INTERVIEWS
Takagi, Hiroyasu
pg. 28
A-1-3.6 - AN EFFICIENT DESCRIPTION WITH HALIDE FOR IIR GAUSSIAN FILTER
Takahashi, Riki
pg. 421
E-1-3.3 - APPLYING VIRTUAL MICROPHONES TO TRIANGULAR MICROPHONE ARRAY IN IN-CAR COMMUNICATION
pg. 858
E-3-3.3 - A STUDY ON GEOMETRICALLY CONSTRAINED IVA WITH AUXILIARY FUNCTION APPROACH AND VCD FOR IN-CAR COMMUNICATION
Takahashi, Takumi
pg. 1483
C-1-2.2 - 3D CONVOLUTIONAL NEURAL NETWORK-AIDED INDOOR POSITIONING BASED ON FINGERPRINTS OF BLE RSSI
Takahashi, Yu
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Takamichi, Shinnosuke
pg. 487
E-2-1.1 - PJS: PHONEME-BALANCED JAPANESE SINGING-VOICE CORPUS
Takamune, Norihiro
pg. 869
E-3-3.5 - JOINT-DIAGONALIZABILITY-CONSTRAINED MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION BASED ON MULTIVARIATE COMPLEX STUDENT'S T-DISTRIBUTION
Takamura, Masahiro
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Takao, Keisuke
pg. 1
A-1-3.1 - AN IMPROVED METHOD FOR INSTANTANEOUS FREQUENCY ESTIMATION USING A FINITE ORDER HILBERT TRANSFORMER
Takashima, Akihiko
pg. 632
E-2-3.2 - END-TO-END AUTOMATIC SPEECH RECOGNITION WITH DEEP MUTUAL LEARNING
pg. 1054
D-1-1.4 - UNSUPERVISED DOMAIN ADVERSARIAL TRAINING IN ANGULAR SPACE FOR FACIAL EXPRESSION RECOGNITION
Takeda, Ayaka
pg. 1545
D-2-1.4 - JOINT OPTIMIZATION OF EDGE SERVER AND VIRTUAL MACHINE PLACEMENT IN EDGE COMPUTING ENVIRONMENTS
Takeda, Kazuya
pg. 520
E-2-1.6 - CROSS-LINGUAL VOICE CONVERSION USING A CYCLIC VARIATIONAL AUTO-ENCODER AND A WAVENET VOCODER
pg. 1679
C-3-2.7 - INTERVENTION FORCE-BASED IMITATION LEARNING FOR AUTONOMOUS NAVIGATION IN DYNAMIC ENVIRONMENTS
Takeuchi, Eijiro
pg. 1679
C-3-2.7 - INTERVENTION FORCE-BASED IMITATION LEARNING FOR AUTONOMOUS NAVIGATION IN DYNAMIC ENVIRONMENTS
Takyu, Osamu
pg. 1513
C-1-3.3 - ESTIMATION OF DESIRED POWER AND UNDESIRED POWER USING CHIRP DEMODULATION AND EVALUATION OF ACCURACY
pg. 1460
C-1-1.4 - AUTONOMOUS DECENTRALIZED TRANSMISSION TIMING CONTROL IN WIRELESS SENSOR NETWORK
pg. 1497
C-1-2.4 - SPECIFICATION OF LINK QUALITY DEGRADATION IN WLAN BASED ON MCS AND RETRANSMISSION FLAG
Tan, Zhi-Wei
pg. 841
E-3-3.1 - A JOINT-LOSS APPROACH FOR SPEECH ENHANCEMENT VIA SINGLE-CHANNEL NEURAL NETWORK AND MVDR BEAMFORMER
Tanaka, Kou
pg. 572
E-2-2.3 - PHONEME EMBEDDINGS ON PREDICTING FUNDAMENTAL FREQUENCY PATTERN FOR ELECTROLARYNGEAL SPEECH
Tanaka, Takuro
pg. 1340
B-2-2.1 - DEEPWATERMARK: EMBEDDING WATERMARK INTO DNN MODEL
Tanaka, Tomohiro
pg. 632
E-2-3.2 - END-TO-END AUTOMATIC SPEECH RECOGNITION WITH DEEP MUTUAL LEARNING
pg. 1054
D-1-1.4 - UNSUPERVISED DOMAIN ADVERSARIAL TRAINING IN ANGULAR SPACE FOR FACIAL EXPRESSION RECOGNITION
Tanaka, Yuichi
pg. 139
B-1-2.1 - LEARNING GRAPHS WITH MULTIPLE TEMPORAL RESOLUTIONS
Tang, Xiaoli
pg. 288
E-1-1.5 - ACTIVE NOISE CONTROL OVER MULTIPLE ZONES: ADAPTIVE ALGORITHM IN TIME DOMAIN
Tang, Xinyu
pg. 720
E-3-1.4 - INDEPENDENT VECTOR ANALYSIS FOR BLIND SPEECH SEPARATION USING COMPLEX GENERALIZED GAUSSIAN MIXTURE MODEL WITH WEIGHTED VARIANCE
TANG, ZHAO-QIAN
pg. 1145
D-2-3.1 - VISUAL TRACKING VIA SPATIAL-TEMPORAL REGULARIZED CORRELATION FILTERS WITH ADVANCED STATE ESTIMATION
Tang, Zhiyuan
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Tanida, Ryuichi
pg. 1107
D-1-3.1 - SUBJECTIVE QUALITY DRIVEN IMAGE ENCODING METHOD USING IMAGE COMPLETION
Tao, Jianhua
pg. 1043
D-1-1.2 - MICRO-EXPRESSION RECOGNITION BASED ON MULTIPLE AGGREGATION NETWORKS
Tao, Ruijie
pg. 605
F-2-2.3 - HLT-NUS SUBMISSION FOR 2019 NIST MULTIMEDIA SPEAKER RECOGNITION EVALUATION
Taromaru, Makoto
pg. 1523
C-1-3.5 - SPECTRUM SHARING FOR INTERNET OF THINGS SYSTEM IN PERIODIC TRANSMISSION
Tasaki, Kodai
pg. 1483
C-1-2.2 - 3D CONVOLUTIONAL NEURAL NETWORK-AIDED INDOOR POSITIONING BASED ON FINGERPRINTS OF BLE RSSI
Tawara, Naohiro
pg. 319
F-1-1.5 - SPEAKER AGE ESTIMATION USING AGE-DEPENDENT INSENSITIVE LOSS
Teraura, Nobuyuki
pg. 1392
B-2-3.5 - A QR SYMBOL WITH ECDSA FOR BOTH PUBLIC AND SECRET AREAS USING RHOMBIC SUB-CELLS
Tian, Tao
pg. 1707
C-3-3.4 - ADAPTIVE MULTI-PROTOTYPE RELATION NETWORK
Tian, Yufei
pg. 1617
C-2-3.5 - CAN-SIN: A CROSS-LAYER HETEROGENEOUS ACADEMIC NETWORK WITH SEMANTIC INFORMATION
Tian, Yuru
pg. 247
C-3-1.1 - IMAGE SEGMENTATION METHOD BASED ON FRACTIONAL VARYING-ORDER DIFFERENTIAL
Tieu, Ngoc-Dung T.
pg. 1406
B-2-3.7 - COLOR TRANSFER TO ANONYMIZED GAIT IMAGES WHILE MAINTAINING ANONYMIZATION
Toda, Tomoki
pg. 572
E-2-2.3 - PHONEME EMBEDDINGS ON PREDICTING FUNDAMENTAL FREQUENCY PATTERN FOR ELECTROLARYNGEAL SPEECH
pg. 520
E-2-1.6 - CROSS-LINGUAL VOICE CONVERSION USING A CYCLIC VARIATIONAL AUTO-ENCODER AND A WAVENET VOCODER
Togami, Masahito
pg. 705
E-3-1.1 - OVER-DETERMINED SPEECH SOURCE SEPARATION AND DEREVERBERATION
pg. 775
E-3-2.1 - INTEGRATION OF SEMI-BLIND SPEECH SOURCE SEPARATION AND VOICE ACTIVITY DETECTION FOR FLEXIBLE SPOKEN DIALOGUE
pg. 788
E-3-2.3 - COMPUTER-RESOURCE-AWARE DEEP SPEECH SEPARATION WITH A RUN-TIME-SPECIFIED NUMBER OF BLSTM LAYERS
Tran, Linh T. T.
pg. 841
E-3-3.1 - A JOINT-LOSS APPROACH FOR SPEECH ENHANCEMENT VIA SINGLE-CHANNEL NEURAL NETWORK AND MVDR BEAMFORMER
Tsai, Cheng-Lin
pg. 1566
D-3-1.2 - REAL-TIME DDOS ATTACK DETECTION USING SKETCH-BASED ENTROPY ESTIMATION ON THE NETFPGA SUME PLATFORM
Tsai, Pei-Yun
pg. 82
F-3-3.7 - PROCESSING ELEMENT ARCHITECTURE DESIGN FOR DEEP REINFORCEMENT LEARNING WITH FLEXIBLE BLOCK FLOATING POINT EXPLOITING SIGNAL STATISTICS
pg. 36
A-1-3.7 - DOPPLER CENTROID ESTIMATION WITH QUALITY ASSESSMENT FOR REAL-TIME SAR IMAGING
Tsai, Yao Cheng
pg. 88
F-3-3.8 - ACOUSTIC ECHO CANCELLATION BASED ON RECURRENT NEURAL NETWORK
Tsang, Sik-Ho
pg. 1112
D-1-3.2 - ULTRA FAST SCREEN CONTENT CODING VIA RANDOM FOREST
Tsao, Yu
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
pg. 482
F-1-3.7 - STOI-NET: A DEEP LEARNING BASED NON-INTRUSIVE SPEECH INTELLIGIBILITY ASSESSMENT MODEL
Tsuchiya, Sota
pg. 1466
C-1-1.5 - PACKET AGGREGATION BASED ON ENCRYPTION-THEN-COMPRESSION FOR HIGHLY EFFICIENT MULTI-HOP TRANSMISSION
Tsuruo, Asahi
pg. 939
A-3-3.1 - MATHEMATICAL MODEL OF HORSE AND RIDER INTERACTION DURING HORSE JUMPING
Tsutsui, Hiroshi
pg. 100
A-2-3.1 - AN EVALUATION OF A CNN-BASED PARKING DETECTION SYSTEM WITH WEBCAMS
pg. 110
A-2-3.2 - AN EVALUATION OF DESIGN FRAMEWORK FOR MIN-SUM IRREGULAR LDPC DECODERS
pg. 114
A-2-3.3 - AN EVALUATION OF HIGH-THROUGHPUT SCALABLE RADIX-4 FFT PROCESSOR ARCHITECTURE USING FIXED-POINT ARITHMETIC
pg. 135
A-2-3.4 - WIRELESS CHANNEL MEASUREMENT SYSTEM USING ZYNQ ULTRASCALE+ RFSOC FOR MIMO AND D2D COMMUNICATION SYSTEMS
Tu, Cheng-Hao
pg. 1605
C-2-3.3 - EXTENDING CONDITIONAL CONVOLUTION STRUCTURES FOR ENHANCING MULTITASKING CONTINUAL LEARNING
Tu, Weiping
pg. 1331
B-3-3.7 - A SECURE OPUS PULSE STEGANOGRAPHIC SCHEME BASED ON MESSAGE TRANSFORM
U
Uehara, Kota
pg. 1425
B-3-2.2 - STUDY ON POSSIBILITY OF ESTIMATING SMARTPHONE INPUTS FROM TAP SOUNDS
Umebayashi, Kenta
pg. 1519
C-1-3.4 - ON PLACEMENT OF END DEVICES IN LPWAN BASED WSN FOR ENVIRONMENTAL MONITORING APPLICATIONS
Unoki, Masashi
pg. 753
F-3-1.3 - ENHANCEMENT OF SPEECH INTELLIGIBILITY UNDER NOISY REVERBERANT CONDITIONS BASED ON MODULATION SPECTRUM CONCEPT
pg. 1321
B-3-3.6 - SPEECH INFORMATION HIDING BY MODIFICATION OF LSF QUANTIZATION INDEX IN CELP CODEC
Utsuro, Takehito
pg. 403
F-1-2.6 - SPOKEN DIALOG TRAINING SYSTEM FOR CUSTOMER SERVICE IMPROVEMENT
V
Vien, An Gia
pg. 1268
D-3-3.4 - MOIRÉ ARTIFACTS REMOVAL IN SCREEN-SHOT IMAGES VIA MULTIPLE DOMAIN LEARNING
W
Wakabayashi, Yukoh
pg. 863
E-3-3.4 - DYNAMIC SYNCHRONOUS AVERAGING FOR ENHANCEMENT OF PERIODIC SIGNAL UNDER SAMPLING FREQUENCY VARIATION
pg. 443
E-1-3.7 - ENERGY-BASED MULTIPLE SOURCE LOCALIZATION WITH BLINKIES
Wakashima, Kobun
pg. 1001
B-3-1.1 - PREDICTION OF SOCIAL MALADAPTATION USING EMOTIONAL ENTRAINMENT OF DISGUST DURING COMPREHENSIVE PSYCHIATRIC INTERVIEWS
Wake, Masaya
pg. 775
E-3-2.1 - INTEGRATION OF SEMI-BLIND SPEECH SOURCE SEPARATION AND VOICE ACTIVITY DETECTION FOR FLEXIBLE SPOKEN DIALOGUE
Wakuya, Manami
pg. 972
A-3-3.7 - EVALUATION OF THE PRESSURE MEASUREMENT FUNCTION OF AN IMPLANTABLE MULTIMODALITY PROBE
Wan, Timmy S.T.
pg. 1594
C-2-3.1 - MERGING WELL-TRAINED DEEP CNN MODELS FOR EFFICIENT INFERENCE
Wang, Anqi
pg. 92
A-2-3.1 - A PARALLELIZATION METHOD OF INCEPTION ARCHITECTURE BASED ON ARRAY PROCESSOR
pg. 104
A-2-3.2 - RSP-BT: AN ADVANCED PARALLEL METHOD FOR DEPTH MAP MOTION ESTIMATION
pg. 118
A-2-3.3 - FAST INTER-FRAME PREDICTION BASED ARRAY PROCESSOR FOR DEPTH MAPS IN 3D-HEVC
Wang, Beibei
pg. 41
A-1-3.8 - DRIVER ARRIVAL SENSING FOR SMART CAR USING WIFI FINE TIME MEASUREMENTS
Wang, Chun-Huang
pg. 302
F-1-1.2 - ACOUSTIC AND TEXTUAL DATA AUGMENTATION FOR CODE-SWITCHING SPEECH RECOGNITION IN UNDER-RESOURCED LANGUAGE
Wang, Dong
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Wang, Feng
pg. 589
E-2-2.6 - TEMPORAL AND FORMANT TRAJECTORY ANALYSIS OF ENGLISH TENSE-LAX VOWELS PRODUCED BY NATIVE CHINESE SPEAKERS
Wang, Haonan
pg. 662
F-2-3.1 - HARMONIC STRUCTURE MASK FOR SPEECH ENHANCEMENT USING SPARSITY REGULARIZATION
Wang, Hsin-Min
pg. 482
F-1-3.7 - STOI-NET: A DEEP LEARNING BASED NON-INTRUSIVE SPEECH INTELLIGIBILITY ASSESSMENT MODEL
Wang, Jiazheng
pg. 584
E-2-2.5 - ACOUSTIC ANALYSIS OF NASALIZATION IN MANDARIN PRENASAL VOWELS PRODUCED BY WENZHOU AND RUGAO SPEAKERS
Wang, Lei
pg. 1096
D-1-2.5 - BLIND TONE-MAPPED IMAGE QUALITY ASSESSMENT AND ENHANCEMENT VIA DISENTANGLED REPRESENTATION LEARNING
Wang, Lina
pg. 1331
B-3-3.7 - A SECURE OPUS PULSE STEGANOGRAPHIC SCHEME BASED ON MESSAGE TRANSFORM
Wang, Longbiao
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Wang, Meng
pg. 1122
D-1-3.4 - SSIM MOTIVATED QUALITY CONTROL FOR VERSATILE VIDEO CODING
Wang, Mingxi
pg. 888
B-1-1.3 - DECODING AUDITORY FREQUENCIES AND DIRECTIONS BASED ON BRAIN FUNCTIONAL FEATURES
Wang, Qiang
pg. 875
B-1-1.1 - CLASSIFICATION OF SEIZURE EEGS BASED ON SHORT-TIME FOURIER TRANSFORM AND HIDDEN MARKOV MODEL
pg. 944
A-3-3.2 - HUMAN HAND MOVEMENT RECOGNITION BASED ON HMM WITH HYPERPARAMETERS OPTIMIZED BY MAXIMUM MUTUAL INFORMATION
Wang, Qing
pg. 465
F-1-3.4 - FREQUENCY GATING: IMPROVED CONVOLUTIONAL NEURAL NETWORKS FOR SPEECH ENHANCEMENT IN THE TIME-FREQUENCY DOMAIN
Wang, Shengbei
pg. 1321
B-3-3.6 - SPEECH INFORMATION HIDING BY MODIFICATION OF LSF QUANTIZATION INDEX IN CELP CODEC
Wang, Shiqi
pg. 1122
D-1-3.4 - SSIM MOTIVATED QUALITY CONTROL FOR VERSATILE VIDEO CODING
Wang, Syu-Siang
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Wang, Xi
pg. 810
F-3-2.1 - LP-WAVENET: LINEAR PREDICTION-BASED WAVENET SPEECH SYNTHESIS
Wang, Xiuyuan
pg. 211
C-2-1.6 - DIFFERENTIATED PROSODIC ADAPTION OF CHINESE AND ENGLISH POETRY: AN ACOUSTIC APPROACH TO READING OF CHINESE TANG POETRY AND SHAKESPEAREAN SONNETS
Wang, Xiyuan
pg. 720
E-3-1.4 - INDEPENDENT VECTOR ANALYSIS FOR BLIND SPEECH SEPARATION USING COMPLEX GENERALIZED GAUSSIAN MIXTURE MODEL WITH WEIGHTED VARIANCE
Wang, Yanan
pg. 1601
C-2-3.2 - EFFICIENT DIVERSE RESPONSE GENERATION IN ATTENTION-BASED NEURAL CONVERSATIONAL MODEL WITH MAXIMUM MUTUAL INFORMATION
Wang, Yikang
pg. 621
F-2-2.6 - ANALYSIS OF BIT SEQUENCE REPRESENTATION FOR SOUND CLASSIFICATION
Wang, Yu-Chiang Frank
pg. 1188
D-3-2.1 - DIVERSE AUDIO-TO-IMAGE GENERATION VIA SEMANTICS AND FEATURE CONSISTENCY
Wang, Yue
pg. 1122
D-1-3.4 - SSIM MOTIVATED QUALITY CONTROL FOR VERSATILE VIDEO CODING
Wang, Zheng
pg. 1352
B-2-2.3 - DENSELY CONNECTED CONVOLUTIONAL NETWORK FOR AUDIO SPOOFING DETECTION
Waqas, Muhammad
pg. 934
B-1-3.6 - COMPARISON OF IMAGE FEATURES DESCRIPTIONS FOR DIAGNOSIS OF LEAF DISEASES
Wei, Liangfa
pg. 638
E-2-3.3 - ATTENTIVE FUSION ENHANCED AUDIO-VISUAL ENCODING FOR TRANSFORMER BASED ROBUST SPEECH RECOGNITION
Wei, Yu-Jen
pg. 1243
D-2-2.2 - CHROMA COMPONENT GENERATION OF GRAY IMAGES USING MULTI-SCALE CONVOLUTIONAL NEURAL NETWORK
Wen, Ruoshi
pg. 944
A-3-3.2 - HUMAN HAND MOVEMENT RECOGNITION BASED ON HMM WITH HYPERPARAMETERS OPTIMIZED BY MAXIMUM MUTUAL INFORMATION
Weng, Ting-Chia
pg. 1626
C-2-3.6 - NATURAL LANGUAGE PROCESSING METHODS FOR DETECTION OF INFLUENZA-LIKE ILLNESS FROM CHIEF COMPLAINTS
Wiesmeyr, Christoph
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
William, Bellamy
pg. 589
E-2-2.6 - TEMPORAL AND FORMANT TRAJECTORY ANALYSIS OF ENGLISH TENSE-LAX VOWELS PRODUCED BY NATIVE CHINESE SPEAKERS
Wimmer, Markus
pg. 919
B-1-3.3 - COMPARISON OF PSG SIGNALS AND RESPIRATORY MOVEMENT SIGNAL VIA 3D CAMERA IN DETECTING SLEEP RESPIRATORY EVENTS BY LSTM MODELS
Wong, KokSheik
pg. 1361
B-2-2.4 - DATA EMBEDDING METHOD USING PHOTO EFFECTS WITH RESISTANCE TO COMPRESSION
Woo, Jeongwoo
pg. 800
E-3-2.5 - END-TO-END MUSIC-MIXED SPEECH RECOGNITION
Wu, Anqi
pg. 211
C-2-1.6 - DIFFERENTIATED PROSODIC ADAPTION OF CHINESE AND ENGLISH POETRY: AN ACOUSTIC APPROACH TO READING OF CHINESE TANG POETRY AND SHAKESPEAREAN SONNETS
Wu, Bo-Yen
pg. 1527
D-2-1.1 - DUAL ADAPTIVE MODULATION AND CODING FOR MITIGATING UE-UE INTERFERENCE IN HETEROGENEOUS TDD SLOT CONFIGURATIONS
Wu, Cheng-En
pg. 1594
C-2-3.1 - MERGING WELL-TRAINED DEEP CNN MODELS FOR EFFICIENT INFERENCE
pg. 1605
C-2-3.3 - EXTENDING CONDITIONAL CONVOLUTION STRUCTURES FOR ENHANCING MULTITASKING CONTINUAL LEARNING
Wu, Chung-Hsien
pg. 302
F-1-1.2 - ACOUSTIC AND TEXTUAL DATA AUGMENTATION FOR CODE-SWITCHING SPEECH RECOGNITION IN UNDER-RESOURCED LANGUAGE
pg. 1048
D-1-1.3 - ATTENTIVELY-COUPLED LONG SHORT-TERM MEMORY FOR AUDIO-VISUAL EMOTION RECOGNITION
pg. 1626
C-2-3.6 - NATURAL LANGUAGE PROCESSING METHODS FOR DETECTION OF INFLUENZA-LIKE ILLNESS FROM CHIEF COMPLAINTS
Wu, Han-Yu
pg. 1561
D-3-1.1 - LORA-BASED AIR QUALITY MONITORING SYSTEM USING CHATBOT
Wu, Hongde
pg. 894
B-1-1.4 - A TEMPORAL ENVELOPE-BASED SPEECH RECONSTRUCTION APPROACH WITH EEG SIGNALS DURING SPEECH IMAGERY
Wu, Jianming
pg. 1601
C-2-3.2 - EFFICIENT DIVERSE RESPONSE GENERATION IN ATTENTION-BASED NEURAL CONVERSATIONAL MODEL WITH MAXIMUM MUTUAL INFORMATION
Wu, Jianyuan
pg. 1442
B-3-2.4 - A GENERATIVE ADVERSARIAL NETWORK FRAMEWORK FOR JPEG ANTI-FORENSICS
Wu, Jijie
pg. 1719
C-3-3.6 - ANTI-NOISE RELATION NETWORK FOR FEW-SHOT LEARNING
Wu, Meng-Che
pg. 759
F-3-1.4 - EXPLORING FEATURE ENHANCEMENT IN THE MODULATION SPECTRUM DOMAIN VIA IDEAL RATIO MASK FOR ROBUST SPEECH RECOGNITION
Wu, Ming
pg. 1725
C-3-3.7 - SMALL DATA-DRIVEN ELECTRICAL INSULATOR DEFECT DETECTION
Wu, Qingbo
pg. 1096
D-1-2.5 - BLIND TONE-MAPPED IMAGE QUALITY ASSESSMENT AND ENHANCEMENT VIA DISENTANGLED REPRESENTATION LEARNING
Wu, Shan-Hung
pg. 1647
C-3-2.2 - MPOP600: A MANDARIN POPULAR SONG DATABASE WITH ALIGNED AUDIO, LYRICS, AND MUSICAL SCORES FOR SINGING VOICE SYNTHESIS
Wu, Shuang
pg. 881
B-1-1.2 - A MULTI-SUBJECT TEMPORAL-SPATIAL HYPER-ALIGNMENT METHOD FOR EEG-BASED NEURAL ENTRAINMENT TO SPEECH
Wu, Yiming
pg. 500
E-2-1.3 - A VARIATIONAL AUTOENCODER FOR JOINT CHORD AND KEY ESTIMATION FROM AUDIO CHROMAGRAMS
X
Xi, Jingwei
pg. 432
E-1-3.5 - FULL-SPHERE BINAURAL SOUND SOURCE LOCALIZATION USING MULTI-TASK NEURAL NETWORK
Xiao, Qin
pg. 211
C-2-1.6 - DIFFERENTIATED PROSODIC ADAPTION OF CHINESE AND ENGLISH POETRY: AN ACOUSTIC APPROACH TO READING OF CHINESE TANG POETRY AND SHAKESPEAREAN SONNETS
Xie, Fei
pg. 1150
D-2-3.2 - A NEW POLARIZED IMAGE FUSION ALGORITHM BASED ON TWO-SCALE GUIDED FILTERING
Xie, Rong
pg. 278
E-1-1.3 - EFFECT OF CROSS-CHANNEL CONTROL FILTERS IN MULTI-CHANNEL FEEDBACK ACTIVE NOISE CONTROL
pg. 283
E-1-1.4 - SIMULTANEOUS VARIABLE PERTURBATION METHOD FOR THE ACTIVE NOISE CONTROL SYSTEM WITH A WIRELESS ERROR MICROPHONE
Xie, Xiaoyan
pg. 92
A-2-3.1 - A PARALLELIZATION METHOD OF INCEPTION ARCHITECTURE BASED ON ARRAY PROCESSOR
pg. 104
A-2-3.2 - RSP-BT: AN ADVANCED PARALLEL METHOD FOR DEPTH MAP MOTION ESTIMATION
pg. 118
A-2-3.3 - FAST INTER-FRAME PREDICTION BASED ARRAY PROCESSOR FOR DEPTH MAPS IN 3D-HEVC
Xie, Xuan
pg. 205
C-2-1.5 - SAMPLING POLICY DESIGN FOR TRACKING TIME-VARYING GRAPH SIGNALS WITH ADAPTIVE BUDGET ALLOCATION
Xin, Hong-cai
pg. 260
C-3-1.3 - A NOVEL ISAR IMAGING ALGORITHM FOR MANEUVERING TARGET BASED ON PARAMETER ESTIMATION METHOD
Xu, Linfeng
pg. 1096
D-1-2.5 - BLIND TONE-MAPPED IMAGE QUALITY ASSESSMENT AND ENHANCEMENT VIA DISENTANGLED REPRESENTATION LEARNING
Xu, Yijie
pg. 332
E-1-2.1 - A DEEP MUSIC GENRES CLASSIFICATION MODEL BASED ON CNN WITH SQUEEZE & EXCITATION BLOCK
Xu, Yizhong
pg. 211
C-2-1.6 - DIFFERENTIATED PROSODIC ADAPTION OF CHINESE AND ENGLISH POETRY: AN ACOUSTIC APPROACH TO READING OF CHINESE TANG POETRY AND SHAKESPEAREAN SONNETS
Xue, Di
pg. 589
E-2-2.6 - TEMPORAL AND FORMANT TRAJECTORY ANALYSIS OF ENGLISH TENSE-LAX VOWELS PRODUCED BY NATIVE CHINESE SPEAKERS
Y
Yamada, Hiroyoshi
pg. 1216
D-3-2.5 - FIXED-POINT ARITHMETIC OF L2-NORM APPROXIMATION FOR 2-TUPLE ARRAYS WITH ROTATED L1-NORM EVALUATION
Yamada, Koki
pg. 139
B-1-2.1 - LEARNING GRAPHS WITH MULTIPLE TEMPORAL RESOLUTIONS
Yamada, Shigefumi
pg. 1430
B-3-2.3 - A NOVEL QUALITY ASSESSMENT METHOD FOR EYE MOVEMENT AUTHENTICATION
Yamada, Takeshi
pg. 858
E-3-3.3 - A STUDY ON GEOMETRICALLY CONSTRAINED IVA WITH AUXILIARY FUNCTION APPROACH AND VCD FOR IN-CAR COMMUNICATION
pg. 421
E-1-3.3 - APPLYING VIRTUAL MICROPHONES TO TRIANGULAR MICROPHONE ARRAY IN IN-CAR COMMUNICATION
Yamagishi, Junichi
pg. 1293
B-3-3.1 - A METHOD FOR IDENTIFYING ORIGIN OF DIGITAL IMAGES USING A CONVOLUTIONAL NEURAL NETWORK
pg. 1406
B-2-3.7 - COLOR TRANSFER TO ANONYMIZED GAIT IMAGES WHILE MAINTAINING ANONYMIZATION
Yamaguchi, Takuro
pg. 1222
D-3-2.6 - RAPID AND ACCURATE LOCAL GAUSSIAN NOISE REMOVAL
Yamaji, Shuhei
pg. 781
E-3-2.2 - DNN-BASED PERMUTATION SOLVER FOR FREQUENCY-DOMAIN INDEPENDENT COMPONENT ANALYSIS IN TWO-SOURCE MIXTURE CASE
Yamakawa, Toshitaka
pg. 972
A-3-3.7 - EVALUATION OF THE PRESSURE MEASUREMENT FUNCTION OF AN IMPLANTABLE MULTIMODALITY PROBE
Yamamoto, Shinya
pg. 939
A-3-3.1 - MATHEMATICAL MODEL OF HORSE AND RIDER INTERACTION DURING HORSE JUMPING
Yamashita, Masaru
pg. 914
B-1-3.2 - CONSTRUCTION OF EFFECTIVE HMMS FOR CLASSIFICATION BETWEEN NORMAL AND ABNORMAL RESPIRATION
Yamashita, Toru
pg. 184
C-2-1.2 - AGE CLASSIFICATION OF EVACUEES AT TIMES OF DISASTER USING A VIBRATION SENSOR
Yamashita, Yoichi
pg. 449
F-1-3.1 - SPEECH ENHANCEMENT FOR OPTICAL LASER MICROPHONE WITH DEEP NEURAL NETWORK
Yamawaki, Shigeto
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Yamazaki, Yudai
pg. 1507
C-1-3.2 - SCHEDULING ALGORITHM CONSIDERING INTERFERENCE INTERVAL FOR LPWA
Yan, Bi-Cheng
pg. 759
F-3-1.4 - EXPLORING FEATURE ENHANCEMENT IN THE MODULATION SPECTRUM DOMAIN VIA IDEAL RATIO MASK FOR ROBUST SPEECH RECOGNITION
Yan, Fang-Jia
pg. 255
C-3-1.2 - WINDOWED FRACTIONAL FOURIER TRANSFORM ON GRAPHS: FRACTIONAL TRANSLATION OPERATOR AND HAUSDORFF-YOUNG INEQUALITY
Yan, Jintao
pg. 1719
C-3-3.6 - ANTI-NOISE RELATION NETWORK FOR FEW-SHOT LEARNING
Yang, Bin
pg. 976
B-2-1.1 - DEEP-LEARNING BASED MOTION-CORRECTED IMAGE RECONSTRUCTION IN 4D MAGNETIC RESONANCE IMAGING OF THE BODY TRUNK
Yang, Bowen
pg. 126
A-2-3.4 - OPTIMIZATION OF FALSE-OVERLAP DETECTION OF TILE ASSEMBLY IN TILE-BASED RENDERING
Yang, Cheng
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Yang, Cheng-Yu
pg. 1128
D-1-3.5 - RATE-DISTORTION OPTIMIZATION FOR 360-DEGREE IMAGE CONSIDERING VISUAL ATTENTION
Yang, Fu-Rong
pg. 1647
C-3-2.2 - MPOP600: A MANDARIN POPULAR SONG DATABASE WITH ALIGNED AUDIO, LYRICS, AND MUSICAL SCORES FOR SINGING VOICE SYNTHESIS
Yang, Hao-Chun
pg. 900
B-1-1.5 - FROM INTENDED TO SUBJECTIVE: A CONDITIONAL TENSOR FUSION NETWORK FOR RECOGNIZING SELF-REPORTED EMOTION USING PHYSIOLOGY
Yang, Jichen
pg. 605
F-2-2.3 - HLT-NUS SUBMISSION FOR 2019 NIST MULTIMEDIA SPEAKER RECOGNITION EVALUATION
Yang, Kai
pg. 1118
D-1-3.3 - TWO-LAYER LOSSLESS CODING OF HDR IMAGES SPECIALIZED FOR RADIANCE FORMAT
Yang, Kun
pg. 92
A-2-3.1 - A PARALLELIZATION METHOD OF INCEPTION ARCHITECTURE BASED ON ARRAY PROCESSOR
Yang, Pei-Tse
pg. 1188
D-3-2.1 - DIVERSE AUDIO-TO-IMAGE GENERATION VIA SEMANTICS AND FEATURE CONSISTENCY
Yang, Ting-Ya
pg. 1448
C-1-1.2 - CONSTRUCTION OF CYCLICALLY PERMUTABLE CODES FROM PRIME LENGTH CYCLIC CODES
Yang, Xiaochen
pg. 1719
C-3-3.6 - ANTI-NOISE RELATION NETWORK FOR FEW-SHOT LEARNING
Yang, Yichen
pg. 432
E-1-3.5 - FULL-SPHERE BINAURAL SOUND SOURCE LOCALIZATION USING MULTI-TASK NEURAL NETWORK
Yang, Ziye
pg. 716
E-3-1.3 - MULTI-CHANNEL SPEECH SEPARATION USING DEEP EMBEDDING WITH MULTILAYER BOOTSTRAP NETWORKS
Yanti, Budi
pg. 924
B-1-3.4 - PERFORMANCE EVALUATION OF BINARY CLASSIFICATION OF TUBERCULOSIS THROUGH UNSHARP MASKING AND DEEP LEARNING TECHNIQUE
Yasugi, Takuya
pg. 135
A-2-3.4 - WIRELESS CHANNEL MEASUREMENT SYSTEM USING ZYNQ ULTRASCALE+ RFSOC FOR MIMO AND D2D COMMUNICATION SYSTEMS
Yasukawa, Hideki
pg. 1490
C-1-2.3 - AN OVERLOADED IOT SIGNAL DETECTION METHOD USING NON-CONVEX SPARSE REGULARIZERS
Yatabe, Kohei
pg. 471
F-1-3.5 - GAMMA BOLTZMANN MACHINE FOR SIMULTANEOUSLY MODELING LINEAR- AND LOG-AMPLITUDE SPECTRA
Yatkin, Emrah
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
Yatsu, Ryota
pg. 1466
C-1-1.5 - PACKET AGGREGATION BASED ON ENCRYPTION-THEN-COMPRESSION FOR HIGHLY EFFICIENT MULTI-HOP TRANSMISSION
Yeamkuan, Suparat
pg. 1103
D-1-2.6 - FIXATIONAL FEATURE-BASED GAZE PATTERN RECOGNITION USING LONG SHORT-TERM MEMORY
Yen, Benjamin
pg. 850
E-3-3.2 - SOURCE ENHANCEMENT FOR UNMANNED AERIAL VEHICLE RECORDING USING MULTI-SENSORY INFORMATION
Yeong, William K.W.
pg. 1361
B-2-2.4 - DATA EMBEDDING METHOD USING PHOTO EFFECTS WITH RESISTANCE TO COMPRESSION
Yokota, Takashi
pg. 649
E-2-3.5 - REDUCTION OF SPEECH DATA POSTERIORGRAMS BY COMPRESSING MAXIMUM-LIKELIHOOD STATE SEQUENCES IN QUERY BY EXAMPLE
Yokotani, Kenji
pg. 1001
B-3-1.1 - PREDICTION OF SOCIAL MALADAPTATION USING EMOTIONAL ENTRAINMENT OF DISGUST DURING COMPREHENSIVE PSYCHIATRIC INTERVIEWS
Yokoyama, Kai
pg. 216
C-2-2.1 - LOW COMPLEXITY IMPLEMENTATION METHOD FOR THE ADAPTIVE FILTERS BASED ON THE GAUSSIAN MODEL
Yokoyama, Tomoya
pg. 1679
C-3-2.7 - INTERVENTION FORCE-BASED IMITATION LEARNING FOR AUTONOMOUS NAVIGATION IN DYNAMIC ENVIRONMENTS
Yoshida, Taichi
pg. 1118
D-1-3.3 - TWO-LAYER LOSSLESS CODING OF HDR IMAGES SPECIALIZED FOR RADIANCE FORMAT
Yoshii, Kazuyoshi
pg. 775
E-3-2.1 - INTEGRATION OF SEMI-BLIND SPEECH SOURCE SEPARATION AND VOICE ACTIVITY DETECTION FOR FLEXIBLE SPOKEN DIALOGUE
pg. 788
E-3-2.3 - COMPUTER-RESOURCE-AWARE DEEP SPEECH SEPARATION WITH A RUN-TIME-SPECIFIED NUMBER OF BLSTM LAYERS
pg. 500
E-2-1.3 - A VARIATIONAL AUTOENCODER FOR JOINT CHORD AND KEY ESTIMATION FROM AUDIO CHROMAGRAMS
pg. 800
E-3-2.5 - END-TO-END MUSIC-MIXED SPEECH RECOGNITION
pg. 359
E-1-2.5 - TATUM-LEVEL DRUM TRANSCRIPTION BASED ON A CONVOLUTIONAL RECURRENT NEURAL NETWORK WITH LANGUAGE MODEL-BASED REGULARIZED TRAINING
Yoshimoto, Junichiro
pg. 952
A-3-3.3 - QUANTIFICATION ANALYSIS OF BEHAVIORAL CHANGES AFTER SCIATIC NERVE LIGATION IN RATS
pg. 1023
B-3-1.4 - MAXIMUM CREDIBILITY VOTING (MCV): AN INTEGRATIVE APPROACH FOR ACCURATE DIAGNOSIS OF MAJOR DEPRESSIVE DISORDER FROM CLINICALLY READILY AVAILABLE DATA
Yoshimoto, Kento
pg. 339
E-1-2.2 - DEEP NEURAL NETWORK MODELING OF DISTORTION STOMP BOX USING SPECTRAL FEATURES
You, Bin-Yen
pg. 1243
D-2-2.2 - CHROMA COMPONENT GENERATION OF GRAY IMAGES USING MULTI-SCALE CONVOLUTIONAL NEURAL NETWORK
Yu, Bo
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Yu, Cheng
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
pg. 605
F-2-2.3 - HLT-NUS SUBMISSION FOR 2019 NIST MULTIMEDIA SPEAKER RECOGNITION EVALUATION
Yu, Chin-Yun
pg. 561
E-2-2.1 - HARMONIC PRESERVING NEURAL NETWORKS FOR EFFICIENT AND ROBUST MULTIPITCH ESTIMATION
Yu, Hong
pg. 1689
C-3-3.1 - SPEAKER VERIFICATION SYSTEM BASED ON DEFORMABLE CNN AND TIME-FREQUENCY ATTENTION
pg. 1707
C-3-3.4 - ADAPTIVE MULTI-PROTOTYPE RELATION NETWORK
Yuan, Chenhan
pg. 837
F-3-2.6 - PERSONALIZED END-TO-END MANDARIN SPEECH SYNTHESIS USING SMALL-SIZED CORPUS
Yuan, Zhongxing
pg. 283
E-1-1.4 - SIMULTANEOUS VARIABLE PERTURBATION METHOD FOR THE ACTIVE NOISE CONTROL SYSTEM WITH A WIRELESS ERROR MICROPHONE
Yun, Jae-Seong
pg. 1252
D-3-3.1 - DEEP LEARNING BASED DEPTH ESTIMATION AND RECONSTRUCTION OF LIGHT FIELD IMAGES
pg. 1193
D-3-2.2 - MULTISCALE SALIENCY DETECTION FOR COLORED 3D POINT CLOUDS BASED ON RANDOM WALK
Z
Zeng, Guan-Xin
pg. 1247
D-2-2.3 - SCENE TEXT-LINE EXTRACTION WITH FULLY CONVOLUTIONAL NETWORK AND REFINED PROPOSALS
Zeng, Xiaolu
pg. 41
A-1-3.8 - DRIVER ARRIVAL SENSING FOR SMART CAR USING WIFI FINE TIME MEASUREMENTS
Zeng, Yi-Chong
pg. 1170
D-2-3.5 - IMPLEMENTATION OF BI-RADS CLASSIFICATION AND PRIORITY PREDICTION FOR MAMMOGRAM PRE-SCREENING BASED ON MULTI-DECISION FRAMEWORK
Zezario, Ryandhimas
pg. 482
F-1-3.7 - STOI-NET: A DEEP LEARNING BASED NON-INTRUSIVE SPEECH INTELLIGIBILITY ASSESSMENT MODEL
Zezario, Ryandhimas E.
pg. 455
F-1-3.2 - BOOSTING OBJECTIVE SCORES OF A SPEECH ENHANCEMENT MODEL BY METRICGAN POST-PROCESSING
Zhagypar, Ruslan
pg. 236
C-2-2.4 - EXPLOITING THE RULES OF THE TF-MUSIC AND SPATIAL SMOOTHING TO ENHANCE THE DOA ESTIMATION FOR COHERENT AND NON-STATIONARY SOURCES
Zhagyparova, Kalamkas
pg. 236
C-2-2.4 - EXPLOITING THE RULES OF THE TF-MUSIC AND SPATIAL SMOOTHING TO ENHANCE THE DOA ESTIMATION FOR COHERENT AND NON-STATIONARY SOURCES
Zhan, Le
pg. 1156
D-2-3.3 - AN IMPROVED GUIDED FILTERING ALGORITHM FOR POLARIZED IMAGES BY USING LOG OPERATOR
Zhang, Dongheng
pg. 11
A-1-3.3 - NON-LINE-OF-SIGHT IMAGING WITH RADIO SIGNALS
Zhang, Gaoyan
pg. 881
B-1-1.2 - A MULTI-SUBJECT TEMPORAL-SPATIAL HYPER-ALIGNMENT METHOD FOR EEG-BASED NEURAL ENTRAINMENT TO SPEECH
pg. 888
B-1-1.3 - DECODING AUDITORY FREQUENCIES AND DIRECTIONS BASED ON BRAIN FUNCTIONAL FEATURES
pg. 1657
C-3-2.4 - DECODING MUSIC GENRES BASED ON HIGH RESOLUTION BRAIN ACTIVITY INFORMATION
Zhang, Haoran
pg. 595
F-2-2.1 - CONTEXT-ADAPTIVE GAUSSIAN ATTENTION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
Zhang, Jie
pg. 638
E-2-3.3 - ATTENTIVE FUSION ENHANCED AUDIO-VISUAL ENCODING FOR TRANSFORMER BASED ROBUST SPEECH RECOGNITION
Zhang, Jihui
pg. 156
B-1-2.4 - ESTIMATING DRONE MOTOR RELATED ACOUSTIC TRANSFER FUNCTION: A PRELIMINARY INVESTIGATION
pg. 288
E-1-1.5 - ACTIVE NOISE CONTROL OVER MULTIPLE ZONES: ADAPTIVE ALGORITHM IN TIME DOMAIN
pg. 694
F-2-3.6 - MODELLING ROOM REVERBERATION DIRECTIVITY USING VON MISES-FISHER MIXTURE DISTRIBUTION
Zhang, Jing-Xuan
pg. 556
F-2-1.6 - ADVERSARIAL POST-PROCESSING OF VOICE CONVERSION AGAINST SPOOFING DETECTION
Zhang, Li
pg. 1122
D-1-3.4 - SSIM MOTIVATED QUALITY CONTROL FOR VERSATILE VIDEO CODING
Zhang, Liang
pg. 1033
D-1-1.1 - CLOUD RECOGNITION BASED ON LIGHTWEIGHT NEURAL NETWORK
Zhang, Lijun
pg. 432
E-1-3.5 - FULL-SPHERE BINAURAL SOUND SOURCE LOCALIZATION USING MULTI-TASK NEURAL NETWORK
Zhang, Rongqing
pg. 170
B-1-2.6 - A MATCH PURSUIT BASED METHOD ADAPTED TO OVERCOMPLETE DICTIONARY FOR COMPRESSIVE SPECTRAL IMAGING
Zhang, Sulin
pg. 616
F-2-2.5 - A PITCH-AWARE SPEAKER EXTRACTION SERIAL NETWORK
Zhang, Wei-Qiang
pg. 365
E-1-2.6 - DEEP SEMANTIC ENCODER-DECODER NETWORK FOR ACOUSTIC SCENE CLASSIFICATION WITH MULTIPLE DEVICES
Zhang, Wen
pg. 432
E-1-3.5 - FULL-SPHERE BINAURAL SOUND SOURCE LOCALIZATION USING MULTI-TASK NEURAL NETWORK
Zhang, Xiao-Lei
pg. 716
E-3-1.3 - MULTI-CHANNEL SPEECH SEPARATION USING DEEP EMBEDDING WITH MULTILAYER BOOTSTRAP NETWORKS
Zhang, Xinya
pg. 584
E-2-2.5 - ACOUSTIC ANALYSIS OF NASALIZATION IN MANDARIN PRENASAL VOWELS PRODUCED BY WENZHOU AND RUGAO SPEAKERS
Zhang, Xueliang
pg. 52
F-3-3.2 - ROBUST SPEECH DEREVERBERATION BASED ON WPE AND DEEP LEARNING
Zhang, Yanshan
pg. 247
C-3-1.1 - IMAGE SEGMENTATION METHOD BASED ON FRACTIONAL VARYING-ORDER DIFFERENTIAL
pg. 1666
C-3-2.5 - 3D POINT CLOUD LABELING TOOL FOR DRIVING AUTOMATICALLY
Zhang, Yanxin
pg. 1201
D-3-2.3 - THE VALIDITY OF A DUAL AZURE KINECT-BASED MOTION CAPTURE SYSTEM FOR GAIT ANALYSIS: A PRELIMINARY STUDY
Zhang, Yiming
pg. 1689
C-3-3.1 - SPEAKER VERIFICATION SYSTEM BASED ON DEFORMABLE CNN AND TIME-FREQUENCY ATTENTION
Zhang, Zhuo
pg. 881
B-1-1.2 - A MULTI-SUBJECT TEMPORAL-SPATIAL HYPER-ALIGNMENT METHOD FOR EEG-BASED NEURAL ENTRAINMENT TO SPEECH
Zhao, Ganning
pg. 1698
C-3-3.3 - NITES: A NON-PARAMETRIC INTERPRETABLE TEXTURE SYNTHESIS METHOD
Zhao, H. Vicky
pg. 1617
C-2-3.5 - CAN-SIN: A CROSS-LAYER HETEROGENEOUS ACADEMIC NETWORK WITH SEMANTIC INFORMATION
Zhao, H. Vicky
pg. 197
C-2-1.4 - MODELING DECISION PROCESS IN MULTI-AGENT SYSTEMS: A GRAPHICAL MARKOV GAME BASED APPROACH
pg. 161
B-1-2.5 - AN EVOLUTIONARY GAME THEORETICAL FRAMEWORK FOR DECISION FUSION IN THE PRESENCE OF BYZANTINES
Zhao, Jiahong
pg. 426
E-1-3.4 - SEMI-ADAPTIVE BEAMFORMING FOR CO-PRIME CIRCULAR MICROPHONE ARRAYS
Zhao, Miao
pg. 550
F-2-1.5 - AP20-OLR CHALLENGE: THREE TASKS AND THEIR BASELINES
Zhao, Shengjie
pg. 170
B-1-2.6 - A MATCH PURSUIT BASED METHOD ADAPTED TO OVERCOMPLETE DICTIONARY FOR COMPRESSIVE SPECTRAL IMAGING
Zheng, Chengshi
pg. 769
F-3-1.6 - A TIME-DOMAIN MONAURAL SPEECH ENHANCEMENT WITH FEEDBACK LEARNING
Zhong, Shan
pg. 1331
B-3-3.7 - A SECURE OPUS PULSE STEGANOGRAPHIC SCHEME BASED ON MESSAGE TRANSFORM
Zhou, Di
pg. 881
B-1-1.2 - A MULTI-SUBJECT TEMPORAL-SPATIAL HYPER-ALIGNMENT METHOD FOR EEG-BASED NEURAL ENTRAINMENT TO SPEECH
Zhou, Kun
pg. 507
E-2-1.4 - SPECTRUM AND PROSODY CONVERSION FOR CROSS-LINGUAL VOICE CONVERSION WITH CYCLEGAN
pg. 514
E-2-1.5 - VAW-GAN FOR SINGING VOICE CONVERSION WITH NON-PARALLEL TRAINING DATA
Zhou, Wuneng
pg. 332
E-1-2.1 - A DEEP MUSIC GENRES CLASSIFICATION MODEL BASED ON CNN WITH SQUEEZE & EXCITATION BLOCK
Zhou, Yi
pg. 720
E-3-1.4 - INDEPENDENT VECTOR ANALYSIS FOR BLIND SPEECH SEPARATION USING COMPLEX GENERALIZED GAUSSIAN MIXTURE MODEL WITH WEIGHTED VARIANCE
Zhu, Jianchen
pg. 170
B-1-2.6 - A MATCH PURSUIT BASED METHOD ADAPTED TO OVERCOMPLETE DICTIONARY FOR COMPRESSIVE SPECTRAL IMAGING
Zhu, Shengli
pg. 1725
C-3-3.7 - SMALL DATA-DRIVEN ELECTRICAL INSULATOR DEFECT DETECTION
Zhu, Wei-Ping
pg. 764
F-3-1.5 - AN INTEGRATED CNN-GRU FRAMEWORK FOR COMPLEX RATIO MASK ESTIMATION IN SPEECH ENHANCEMENT
Zhu, Yun
pg. 104
A-2-3.2 - RSP-BT: AN ADVANCED PARALLEL METHOD FOR DEPTH MAP MOTION ESTIMATION
pg. 118
A-2-3.3 - FAST INTER-FRAME PREDICTION BASED ARRAY PROCESSOR FOR DEPTH MAPS IN 3D-HEVC
Zou, Yuexian
pg. 595
F-2-2.1 - CONTEXT-ADAPTIVE GAUSSIAN ATTENTION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION