APSIPA 2021
Author Index
A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
A
Abdikenov, Beibit
pg. 423
LS-C-FR3.3 - SELF-SUPERVISED VISUAL TRANSFORMERS FOR BREAST CANCER DIAGNOSIS
Abdulaziz, Nidhal
pg. 1202
LS-B-FR1.1 - DEVELOPMENT OF A SYNTHETIC DATABASE FOR COMPACT NEURAL NETWORK CLASSIFICATION OF ACOUSTIC SCENES IN DEMENTIA CARE ENVIRONMENTS
Adachi, Koichi
pg. 1963
LS-A-FR1.4 - OFFLOADING SELECTION WITH UNEQUAL TIMESLOT IN MOBILE EDGE COMPUTING
Ahmed, Imran
pg. 1499
LS-A-WE1.1 - AN EFFICIENT IMAGE PROCESSING AND MACHINE LEARNING BASED TECHNIQUE FOR SKIN LESION SEGMENTATION AND CLASSIFICATION
aimmanee, Pakinee
pg. 1634
LS-C-FR1.3 - HYBRIDIZATION OF SPEECH INFORMATION HIDING AND ENCRYPTION FOR DOUBLE-LAYER SECURITY IN SPEECH COMMUNICATION
Akagi, Masato
pg. 700
OD-A-TH2.2 - HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION
pg. 36
OD-B-WE1.6 - STUDY ON SIMULTANEOUS ESTIMATION OF GLOTTAL SOURCE AND VOCAL TRACT PARAMETERS BY ARMAX-LF MODEL FOR SPEECH ANALYSIS/SYNTHESIS
pg. 731
OD-A-TH2.7 - AUTOMATIC NATURALNESS RECOGNITION FROM ACTED SPEECH USING NEURAL NETWORKS
Akazawa, Teruaki
pg. 1571
LS-C-TH2.1 - SPATIALLY VARYING WHITE BALANCING FOR MIXED AND NON-UNIFORM ILLUMINANTS
Akhtar, Muhammad
pg. 416
LS-C-FR3.2 - COST-EFFECTIVE PROPORTIONATE AFFINE PROJECTION ALGORITHM WITH VARIABLE PARAMETERS FOR ACOUSTIC FEEDBACK CANCELLATION
Akhtar, Muhammad Tahir
pg. 410
LS-C-FR3.1 - EVENT-RELATED SPECTROGRAM REPRESENTATION OF EEG FOR CNN-BASED P300 SPELLER
pg. 423
LS-C-FR3.3 - SELF-SUPERVISED VISUAL TRANSFORMERS FOR BREAST CANCER DIAGNOSIS
Akuzawa, Kei
pg. 808
OD-A-TH3.3 - CONDITIONAL DEEP HIERARCHICAL VARIATIONAL AUTOENCODER FOR VOICE CONVERSION
Anami, Shunki
pg. 1405
OD-B-TH1.9 - NOISE REMOVAL FOR DYNAMIC MODE DECOMPOSITION BASED ON PLUG-AND-PLAY ADMM
Anand, Anubhav
pg. 756
OD-A-TH2.11 - FILTERS KNOW HOW YOU FEEL: EXPLAINING INTERMEDIATE SPEECH EMOTION CLASSIFICATION REPRESENTATIONS
Antoniou, Mark
pg. 926
OD-A-FR1.9 - SVM-BASED EVALUATION OF THAI TONE IMITATIONS BY THAI-NAÏVE MANDARIN AND VIETNAMESE SPEAKERS
Aoki, Takafumi
pg. 1762
LS-B-WE2.1 - A COMPREHENSIVE STUDY OF FACE RECOGNITION USING DEEP LEARNING
Aoki, Takahiro
pg. 1729
OD-B-TH2.4 - WORKLOAD BASED MODEL OF LARGE SCALE 1:N BIOMETRICS MULTI-STEP NARROWING DOWN PROCESS
AprilPyone, MaungMaung
pg. 1833
LS-B-FR2.3 - ACCESS CONTROL USING SPATIALLY INVARIANT PERMUTATION OF FEATURE MAPS FOR SEMANTIC SEGMENTATION MODELS
Arronde Pérez, Dailys
pg. 264
OD-B-WE2.6 - HIGH-ACCURACY RECONSTRUCTION OF PERIODIC SIGNALS BASED ON COMPRESSIVE SENSING
Atmaja, Bagus Tris
pg. 731
OD-A-TH2.7 - AUTOMATIC NATURALNESS RECOGNITION FROM ACTED SPEECH USING NEURAL NETWORKS
Ayush, Altangerel
pg. 849
OD-A-TH3.9 - MULTI-SPEAKER TTS SYSTEM FOR LOW-RESOURCE LANGUAGE USING CROSS-LINGUAL TRANSFER LEARNING AND DATA AUGMENTATION
AZUMA, Yasuhiro
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
B
Babaguchi, Noboru
pg. 1800
LS-B-TH1.3 - MODEL INVERSION ATTACK AGAINST A FACE RECOGNITION SYSTEM IN A BLACK-BOX SETTING
Bai, Jisheng
pg. 1144
LS-A-TH2.2 - DUAL-PATH TRANSFORMER FOR MACHINE CONDITION MONITORING
Bai, Ye
pg. 454
OD-A-WE1.5 - ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION
Banuelos, Mario
pg. 1277
OD-B-FR2.3 - A RECOMMENDATION SYSTEMS APPROACH FOR DETECTING EPISTASIS IN GENOMIC SIGNALS
Bao, Changchun
pg. 950
OD-A-FR2.3 - A MULTI-SOURCE LOCALIZATION METHOD BASED ON CLUSTERING AND OUTLIER REMOVAL
Barche, Purva
pg. 737
OD-A-TH2.8 - COMPARATIVE STUDY OF FILTER BANKS TO IMPROVE THE PERFORMANCE OF VOICE DISORDER ASSESSMENT SYSTEMS USING LTAS FEATURES
Bekooij, Marco
pg. 44
OD-B-WE1.7 - LOW-POWER BOOTH MULTIPLICATION WITHOUT DYNAMIC RANGE DETECTION IN FFTS FOR FMCW RADAR SIGNAL PROCESSING
pg. 55
OD-B-WE1.9 - AN OPTIMAL VARIABLE-LATENCY ARCHITECTURE FOR DETERMINISTIC APPROACHES TO STOCHASTIC COMPUTING WITH UNARY BIT STREAM PRESERVING PROPERTIES
pg. 299
OD-B-WE2.12 - ENHANCED LOOP-WEAKENED BELIEF PROPAGATION ALGORITHM FOR PERFORMANCE ENHANCED POLAR CODE DECODERS
pg. 318
OD-B-WE2.15 - COMPUTATIONAL COMPLEXITY REDUCED BELIEF PROPAGATION ALGORITHM FOR POLAR CODE DECODERS
Bellamy, William
pg. 1087
OD-A-FR3.13 - EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS
Benesty, Jacob
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
pg. 49
OD-B-WE1.8 - KRONECKER PRODUCT ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS
Berwo, Michael Abebe
pg. 1519
LS-A-WE1.4 - AUTOMOTIVE ENGINE CYLINDER HEAD CRACK DETECTION: CANNY EDGE DETECTION WITH MORPHOLOGICAL DILATION
Best, Catherine
pg. 926
OD-A-FR1.9 - SVM-BASED EVALUATION OF THAI TONE IMITATIONS BY THAI-NAÏVE MANDARIN AND VIETNAMESE SPEAKERS
Biglarbeigi, Pardis
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
Blanch, Marc Gorriz
pg. 1422
OD-B-TH1.12 - SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING
Bonet, David
pg. 351
LS-B-TH2.5 - CHANNEL-WISE EARLY STOPPING WITHOUT A VALIDATION SET VIA NNK POLYTOPE INTERPOLATION
Brouwers, Vincent
pg. 1563
LS-C-WE2.5 - VIDEO-BASED SPORTS ACTIVITY RECOGNITION FOR CHILDREN
Byambadorj, Zolzaya
pg. 849
OD-A-TH3.9 - MULTI-SPEAKER TTS SYSTEM FOR LOW-RESOURCE LANGUAGE USING CROSS-LINGUAL TRANSFER LEARNING AND DATA AUGMENTATION
C
Cai, Cheng-Yu
pg. 17
OD-B-WE1.3 - DUAL-CHANNEL DRUM SEPARATION FOR LOW-COST DRUM RECORDING USING NON-NEGATIVE MATRIX FACTORIZATION
Cai, Weicheng
pg. 1133
LS-D-WE1.5 - A UNIFIED DEEP SPEAKER EMBEDDING FRAMEWORK FOR MIXED-BANDWIDTH SPEECH DATA
Cai, Yunqi
pg. 1121
LS-D-WE1.3 - AN MAP ESTIMATION FOR BETWEEN-CLASS VARIANCE
Cao, Jianting
pg. 1323
LS-C-WE1.4 - MULTI-FEATURE FUSION FOR EPILEPTIC FOCUS LOCALIZATION BASED ON TENSOR REPRESENTATION
Casanova, Lionel F. Gonzalez
pg. 1903
LS-B-WE1.5 - GENERALIZED CLASSIFICATION OF DNS OVER HTTPS TRAFFIC WITH DEEP LEARNING
CHAI, Miaomiao
pg. 121
LS-D-TH1.4 - AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR
Chai, Miaomiao
pg. 127
LS-D-TH1.5 - A RECONFIGURABLE PARALLELIZATION OF GENERATIVE ADVERSARIAL NETWORKS BASED ON ARRAY PROCESSOR
Chakraborty, Dipanita
pg. 1576
LS-C-TH2.2 - SEMANTICALLY RELEVANT SCENE DETECTION USING DEEP LEARNING
Chamnongthai, Kosin
pg. 1576
LS-C-TH2.2 - SEMANTICALLY RELEVANT SCENE DETECTION USING DEEP LEARNING
Chan, Bo-Cheng
pg. 269
OD-B-WE2.7 - SEMI-SUPERVISED SOUND EVENT DETECTION USING SELF-ATTENTION AND MULTIPLE TECHNIQUES OF CONSISTENCY TRAINING
Chan, Chee Seng
pg. 1877
LS-B-FR3.5 - RELABEL, SCRAMBLE, SYNTHESIZE: A NOVEL COVERLESS STEGANOGRAPHY APPROACH VIA COLLAGE IMAGE
Chan, Chia-Tai
pg. 1258
LS-D-FR3.5 - INSTRUMENTED ROMBERG TEST OF POSTURAL STABILITY IN PATIENTS WITH VESTIBULAR DISORDERS USING INERTIAL MEASUREMENT UNITS
Chan, H. Anthony
pg. 1450
OD-B-FR1.5 - LEARN TO SKETCH: A FAST APPROACH FOR UNIVERSAL PHOTO SKETCH
Chan, Hung-Tse
pg. 1674
LS-D-FR2.4 - SMART FACIAL SKINCARE PRODUCTS USING COMPUTER VISION TECHNOLOGIES
Chang, Cheng-Yuan
pg. 1197
LS-C-TH3.5 - DEVELOPMENT OF ACTIVE HEAR-THROUGH EQUALIZATION ALGORITHM FOR EARPHONES
Chang, Kai-Po
pg. 1942
LS-D-WE2.6 - AN ENTROPY-BASED DDOS ATTACK DETECTION AND CLASSIFICATION WITH HIERARCHICAL TEMPORAL MEMORY
Chang, Pao-Chi
pg. 1590
LS-D-TH3.1 - ROBUSTNESS AGAINST ADVERSARY MODELS ON MNIST BY DEEP-Q REINFORCEMENT LEARNING BASED PARALLEL-GANS
pg. 141
LS-D-TH2.1 - NON-PARALLEL VOICE CONVERSION WITH GENERATIVE ATTENTIONAL NETWORKS
Chao, Fu-An
pg. 1104
OD-A-FR3.16 - CROSS-UTTERANCE RERANKING MODELS WITH BERT AND GRAPH CONVOLUTIONAL NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION
Chen, Berlin
pg. 2006
OD-B-TH3.6 - FAQ RETRIEVAL USING QUESTION-AWARE GRAPH CONVOLUTIONAL NETWORK AND CONTEXTUALIZED LANGUAGE MODEL
pg. 1049
OD-A-FR3.6 - IMPROVING END-TO-END MODELING FOR MISPRONUNCIATION DETECTION WITH EFFECTIVE AUGMENTATION MECHANISMS
pg. 518
OD-A-WE1.15 - AN EMPIRICAL STUDY ON TRANSFORMER-BASED END-TO-END SPEECH RECOGNITION WITH NOVEL DECODER MASKING
pg. 1104
OD-A-FR3.16 - CROSS-UTTERANCE RERANKING MODELS WITH BERT AND GRAPH CONVOLUTIONAL NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION
Chen, Changsheng
pg. 1708
OD-B-TH2.1 - CROSS-DOMAIN RECAPTURED DOCUMENT DETECTION WITH TEXTURE AND REFLECTANCE CHARACTERISTICS
Chen, Chen
pg. 679
OD-A-TH1.12 - TIME DOMAIN SPEECH ENHANCEMENT WITH ATTENTIVE MULTI-SCALE APPROACH
Chen, Chia-Ping
pg. 269
OD-B-WE2.7 - SEMI-SUPERVISED SOUND EVENT DETECTION USING SELF-ATTENTION AND MULTIPLE TECHNIQUES OF CONSISTENCY TRAINING
Chen, Cynthia
pg. 2085
LS-C-TH1.3 - PERSONALIZED LEARNING USING MULTIPLE KERNEL MODELS
Chen, Fei
pg. 1239
LS-D-FR3.2 - ESTIMATION AND CORRECTION OF RELATIVE TRANSFER FUNCTION FOR BINAURAL SPEECH SEPARATION NETWORKS TO PRESERVE SPATIAL CUES
Chen, Feixiong
OD-B-TH2.6 - CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY
Chen, Jianfeng
pg. 1144
LS-A-TH2.2 - DUAL-PATH TRANSFORMER FOR MACHINE CONDITION MONITORING
Chen, Jingdong
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
pg. 49
OD-B-WE1.8 - KRONECKER PRODUCT ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS
Chen, Junqi
pg. 1111
LS-D-WE1.1 - ATTENTION-BASED MULTI-CHANNEL SPEAKER VERIFICATION WITH AD-HOC MICROPHONE ARRAYS
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
Chen, Juqiang
pg. 926
OD-A-FR1.9 - SVM-BASED EVALUATION OF THAI TONE IMITATIONS BY THAI-NAÏVE MANDARIN AND VIETNAMESE SPEAKERS
Chen, Kuan-Tzu
pg. 1931
LS-D-WE2.4 - A PARKING MONITORING SYSTEM USING FMCW RADARS
Chen, Lichin
pg. 1251
LS-D-FR3.4 - PREDICTING PATIENT'S CHOICES OF HOSPITAL LEVELS USING DEEP LEARNING AND REPRESENTATION IMPROVEMENTS
Chen, Liyuan
pg. 1305
LS-C-WE1.1 - MICROPHONE ARRAY SPEECH SEPARATION ALGORITHM BASED ON DNN
Chen, Ruiyan
pg. 821
OD-A-TH3.5 - ACOUSTIC SIMULATION OF BODY-CONDUCTED SPEECH AND ITS USE TO CONVERT ONE'S RECORDED VOICES TO ONE'S OWN VOICES
Chen, Wei-Jyun
pg. 2055
OD-B-TH3.14 - 3D LANDMARK-BASED FACE DETECTION AND RECOGNITION SYSTEM FOR LARGE POSES
Chen, Xi
pg. 864
OD-A-TH3.12 - SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL
Chen, Xijin
pg. 1305
LS-C-WE1.1 - MICROPHONE ARRAY SPEECH SEPARATION ALGORITHM BASED ON DNN
Chen, Yijiang
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
Chen, Ying
pg. 1328
LS-C-WE1.5 - ADHD CLASSIFICATION VIA AUTO-ENCODING NETWORK WITH NON-IMAGING DATA FUSION
Chen, Yucheng
pg. 282
OD-B-WE2.9 - NEARBY-PERSON OCCLUSION DATA AUGMENTATION FOR HUMAN POSE ESTIMATION WITH NON-EXTRA ANNOTATIONS
Cheng, Yu-Wei
pg. 1883
LS-B-WE1.1 - DEEP REINFORCEMENT LEARNING FOR NPDCCH PERIOD ADJUSTMENT IN NB-IOT NETWORKS
Chia, Hao-Wen
pg. 1658
LS-D-FR2.1 - SEMI-SUPERVISED LEARNING FOR FACIAL LANDMARKS WITH CONFIDENCE AND AUGMENTATION SIFTING MECHANISMS
Chida, Tsukasa
pg. 1953
LS-A-FR1.2 - FUNDAMENTAL INVESTIGATION OF BACKOFF CONTROL METHOD FOR FAIR COMMUNICATION OPPORTUNITY OF MMW WBAN IN OVERCROWDED ENVIRONMENT
Chien, Jen-Tzung
pg. 2028
OD-B-TH3.10 - MODEL-BASED SOFT ACTOR-CRITIC
pg. 2036
OD-B-TH3.11 - SELF-SUPERVISED LEARNING FOR ONLINE SPEAKER DIARIZATION
pg. 2043
OD-B-TH3.12 - MULTI-RESOLUTION CONVOLUTIONAL RECURRENT NETWORKS
Chiracharit, Werapon
pg. 1576
LS-C-TH2.2 - SEMANTICALLY RELEVANT SCENE DETECTION USING DEEP LEARNING
Chiu, Ching-Te
pg. 1483
OD-B-FR1.10 - REAL-TIME EDGE ATTENTION-BASED LEARNING FOR LOW-LIGHT ONE-STAGE OBJECT DETECTION
pg. 2055
OD-B-TH3.14 - 3D LANDMARK-BASED FACE DETECTION AND RECOGNITION SYSTEM FOR LARGE POSES
Chiu, Hsuan-Sheng
pg. 518
OD-A-WE1.15 - AN EMPIRICAL STUDY ON TRANSFORMER-BASED END-TO-END SPEECH RECOGNITION WITH NOVEL DECODER MASKING
Chiu, Shih-Hsuan
pg. 1104
OD-A-FR3.16 - CROSS-UTTERANCE RERANKING MODELS WITH BERT AND GRAPH CONVOLUTIONAL NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION
Chiu, Tse Wei
pg. 141
LS-D-TH2.1 - NON-PARALLEL VOICE CONVERSION WITH GENERATIVE ATTENTIONAL NETWORKS
Chng, Eng Siong
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
pg. 497
OD-A-WE1.12 - MULTITASK-BASED JOINT LEARNING APPROACH TO ROBUST ASR FOR RADIO COMMUNICATION SPEECH
pg. 679
OD-A-TH1.12 - TIME DOMAIN SPEECH ENHANCEMENT WITH ATTENTIVE MULTI-SCALE APPROACH
pg. 786
OD-A-TH2.16 - END-TO-END SPEAKER AGE AND HEIGHT ESTIMATION USING ATTENTION MECHANISM AND TRIPLET LOSS
Cho, Nam Ik
pg. 151
LS-D-TH2.3 - RESIDUAL DILATED U-NET WITH SPATIALLY ADAPTIVE NORMALIZATION FOR THE RESTORATION OF UNDER DISPLAY CAMERA IMAGES
pg. 158
LS-D-TH2.4 - LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS
pg. 2049
OD-B-TH3.13 - NETWORK INTRUSION DETECTION WITH IMPROVED FEATURE REPRESENTATION
Cho, Sunwoo
pg. 151
LS-D-TH2.3 - RESIDUAL DILATED U-NET WITH SPATIALLY ADAPTIVE NORMALIZATION FOR THE RESTORATION OF UNDER DISPLAY CAMERA IMAGES
Cho, Yin-Ping
pg. 1975
OD-B-TH3.1 - MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL
Choi, Jinho
pg. 1756
OD-B-TH2.9 - ANOMALY DETECTION FOR WIRELESS COMMUNICATION LINKS VIA DATA INTEGRITY MODELING
CHOI, Samuel
pg. 1528
LS-A-WE1.5 - ACCELERATION OF PDS–BASED HIGH–DIMENSIONAL SIGNAL RESTORATION
Choi, Whan
pg. 164
LS-D-TH2.5 - FACIAL VIDEO FRAME INTERPOLATION COMBINING SYMMETRIC AND ASYMMETRIC MOTIONS
Chou, Wei-Hung
pg. 1912
LS-D-WE2.1 - AN ADAPTIVE RANK SELECTION METHOD IN 3GPP 5G NR SYSTEMS
pg. 1889
LS-B-WE1.2 - A THRESHOLD-BASED SCHEDULING AND POWER CONTROL DESIGN ON IMT-2020 EVALUATION
pg. 1917
LS-D-WE2.2 - A LOW COMPLEXITY PMI SELECTION SCHEME FOR 3GPP 5G NR FDD SYSTEMS
Chou, Yu-Chen
pg. 2013
OD-B-TH3.7 - 3D-GFE: A THREE-DIMENSIONAL GEOMETRIC-FEATURE EXTRACTOR FOR POINT CLOUD DATA
pg. 2018
OD-B-TH3.8 - ATTENTION EDGECONV FOR 3D POINT CLOUD CLASSIFICATION
Chrisantonius, Chrisantonius
pg. 1602
LS-D-TH3.3 - A FUSION METHODOLOGY OF AKAZE AND NEURAL NETWORK FOR FINGERPRINT RECOGNITION
pg. 1611
LS-D-TH3.5 - PARTIAL FINGERPRINT ON COMBINED EVALUATION USING DEEP LEARNING AND FEATURE DESCRIPTOR
Chu, Chenhui
pg. 433
OD-A-WE1.1 - ON THE USE OF SPEAKER INFORMATION FOR AUTOMATIC SPEECH RECOGNITION IN SPEAKER-IMBALANCED CORPORA
Chu, Shih-Chuan
pg. 536
OD-A-WE2.3 - SPEECH ENHANCEMENT BASED ON MASKING APPROACH CONSIDERING SPEECH QUALITY AND ACOUSTIC CONFIDENCE FOR NOISY SPEECH RECOGNITION
Chua, Danny
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Chuang, Ching-Chih
pg. 1883
LS-B-WE1.1 - DEEP REINFORCEMENT LEARNING FOR NPDCCH PERIOD ADJUSTMENT IN NB-IOT NETWORKS
Chuang, Yuh-Jue
pg. 1251
LS-D-FR3.4 - PREDICTING PATIENT'S CHOICES OF HOSPITAL LEVELS USING DEEP LEARNING AND REPRESENTATION IMPROVEMENTS
Chung, Haesoo
pg. 151
LS-D-TH2.3 - RESIDUAL DILATED U-NET WITH SPATIALLY ADAPTIVE NORMALIZATION FOR THE RESTORATION OF UNDER DISPLAY CAMERA IMAGES
Cohen, Israel
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
pg. 49
OD-B-WE1.8 - KRONECKER PRODUCT ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS
Copiaco, Abigail
pg. 1202
LS-B-FR1.1 - DEVELOPMENT OF A SYNTHETIC DATABASE FOR COMPACT NEURAL NETWORK CLASSIFICATION OF ACOUSTIC SCENES IN DEMENTIA CARE ENVIRONMENTS
Cui, Linfeng
pg. 1386
OD-B-TH1.5 - INTRA CODING TOOL PRUNING FOR REDUCING COMPLEXITY OF VVC SCREEN CONTENT CODING
D
Dai, Dongyang
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Dai, Wei
pg. 100
LS-D-TH1.1 - IMPROVED FRUIT FLY OPTIMIZATION ALGORITHM BASED ON SIMULATED ANNEALING IN NEURAL NETWORK
Dai, Yuchao
pg. 282
OD-B-WE2.9 - NEARBY-PERSON OCCLUSION DATA AUGMENTATION FOR HUMAN POSE ESTIMATION WITH NON-EXTRA ANNOTATIONS
Dang, Jianwu
pg. 438
OD-A-WE1.2 - SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION
pg. 700
OD-A-TH2.2 - HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION
pg. 36
OD-B-WE1.6 - STUDY ON SIMULTANEOUS ESTIMATION OF GLOTTAL SOURCE AND VOCAL TRACT PARAMETERS BY ARMAX-LF MODEL FOR SPEECH ANALYSIS/SYNTHESIS
pg. 1056
OD-A-FR3.7 - ZERO-SHOT DOMAIN ADAPTATION WITH INFERENCE RELATION PATHS FOR SPOKEN LANGUAGE UNDERSTANDING
Das, Rohan Kumar
pg. 484
OD-A-WE1.10 - SIGNIFICANCE OF DATA AUGMENTATION FOR IMPROVING CLEFT LIP AND PALATE SPEECH RECOGNITION
Davidson, Scot
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
Deng, Feng
pg. 884
OD-A-FR1.2 - RETHINKING SINGING VOICE SEPARATION WITH SPECTRAL-TEMPORAL TRANSFORMER
Deng, Junyong
pg. 133
LS-D-TH1.6 - PERFORMANCE CHARACTERIZATION OF RASTERIZATION ALGORITHMS FOR RECONFIGURABLE GRAPHICS PROCESSOR
Deng, Shih-Chun
pg. 1674
LS-D-FR2.4 - SMART FACIAL SKINCARE PRODUCTS USING COMPUTER VISION TECHNOLOGIES
Ding, Jian-Jiun
pg. 1658
LS-D-FR2.1 - SEMI-SUPERVISED LEARNING FOR FACIAL LANDMARKS WITH CONFIDENCE AND AUGMENTATION SIFTING MECHANISMS
pg. 1662
LS-D-FR2.2 - DEEPFAKE ALGORITHM USING MULTIPLE NOISE MODALITIES WITH TWO-BRANCH PREDICTION NETWORK
Doyran, Metehan
pg. 1563
LS-C-WE2.5 - VIDEO-BASED SPORTS ACTIVITY RECOGNITION FOR CHILDREN
Du, Jun
pg. 1438
OD-B-FR1.3 - HMM-BASED LIP READING WITH STINGY RESIDUAL 3D CONVOLUTION
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Du, Zhuolin
pg. 127
LS-D-TH1.5 - A RECONFIGURABLE PARALLELIZATION OF GENERATIVE ADVERSARIAL NETWORKS BASED ON ARRAY PROCESSOR
E
Echizen, Isao
pg. 1815
LS-B-TH1.5 - FEATURE EXTRACTION BASED ON DENOISING AUTO ENCODER FOR CLASSIFICATION OF ADVERSARIAL EXAMPLES
Echizenya, Kaito
pg. 2023
OD-B-TH3.9 - THE EFFECT OF DENSITY AND PLACEMENT OF BLE BEACONS ON INDOOR LOCATION AND MOTION DIRECTION ESTIMATION ACCURACY
Ekawa, Takuma
pg. 30
OD-B-WE1.5 - MOVING SOUND SOURCE TRACKING IN WIDE SPACE BY MULTIPLE MICROPHONE ARRAYS
pg. 1008
OD-A-FR2.12 - VIRTUAL SOUND SOURCE RENDERING BASED ON DISTANCE CONTROL TO PENETRATE LISTENERS USING SURROUND PARAMETRIC-ARRAY AND ELECTRODYNAMIC LOUDSPEAKERS
Ermaganbet, Zangar
pg. 410
LS-C-FR3.1 - EVENT-RELATED SPECTROGRAM REPRESENTATION OF EEG FOR CNN-BASED P300 SPELLER
Ewert, Stephan
pg. 1311
LS-C-WE1.2 - EXPLORING ARTIFACT REJECTION FOR HIGH-PULSE RATE ELECTRICALLY EVOKED AUDITORY STEADY STATE RESPONSES IN COCHLEAR IMPLANTS USERS
Eze, Peter
pg. 1999
OD-B-TH3.5 - DEEP LEARNING EVALUATION OF A STEGANOGRAPHIC ALGORITHM
F
Fan, Cunhang
pg. 438
OD-A-WE1.2 - SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION
Fan, Lichun
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
FANG, Lin
pg. 121
LS-D-TH1.4 - AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR
Fang, Lingfeng
pg. 1585
LS-C-TH2.4 - IMPLEMENTATION OF AVS3 MULTICAST SYSTEM BASED ON EMBMS
Fang, Xin
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Fang, Yong
pg. 1506
LS-A-WE1.2 - DISTRIBUTED ARITHMETIC CODING FOR SOURCES WITH HIDDEN MARKOV CORRELATION
pg. 1519
LS-A-WE1.4 - AUTOMOTIVE ENGINE CYLINDER HEAD CRACK DETECTION: CANNY EDGE DETECTION WITH MORPHOLOGICAL DILATION
Fasciani, Stefano
pg. 1202
LS-B-FR1.1 - DEVELOPMENT OF A SYNTHETIC DATABASE FOR COMPACT NEURAL NETWORK CLASSIFICATION OF ACOUSTIC SCENES IN DEMENTIA CARE ENVIRONMENTS
Feng, Qihua
pg. 1839
LS-B-FR2.4 - END-TO-END LEARNING FOR ENCRYPTED IMAGE RETRIEVAL
Feng, Xinyang
pg. 541
OD-A-WE2.4 - DNN-BASED LINEAR PREDICTION RESIDUAL ENHANCEMENT FOR SPEECH DEREVERBERATION
Feng, Yan
pg. 93
LS-A-WE2.6 - IMBALANCED SAMPLE FEATURE ENHANCEMENT OF HYPERSPECTRAL IMAGERY CLASSIFICATION
Feng, Zicheng
pg. 1239
LS-D-FR3.2 - ESTIMATION AND CORRECTION OF RELATIVE TRANSFER FUNCTION FOR BINAURAL SPEECH SEPARATION NETWORKS TO PRESERVE SPATIAL CUES
Finlay, Dewar
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
Fu, Tianxiao
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Fujieda, Masaru
pg. 603
OD-A-WE2.14 - COMPARATIVE STUDY ON DNN-BASED MINIMUM VARIANCE BEAMFORMING ROBUST TO SMALL MOVEMENTS OF SOUND SOURCES
Fujii, Takeo
pg. 1958
LS-A-FR1.3 - INTRA-SYSTEM INTERFERENCE AVOIDANCE FOR PACKET-LEVEL INDEX MODULATION IN INTERNET OF THINGS
Fujimura, Hiroshi
pg. 23
OD-B-WE1.4 - MASK-BASED BEAMFORMING USING COMPLEX-VALUED NEURAL NETWORK FOR RECOGNITION OF SPATIAL TARGET SPEECH
Fujisawa, Masaya
pg. 275
OD-B-WE2.8 - NONLINEAR SVM-TYPE AUTOMATIC DICISION ALGORITHM IN NOISY ENVIRONMENT FOR HAMMERING TEST SYSTEM
Fujiwara, Kento
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
Fujiwara, Koichi
pg. 1348
LS-A-TH1.3 - PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE
Fukuda, Meiko
pg. 1077
OD-A-FR3.11 - END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
Fukumori, Kosuke
pg. 1546
LS-C-WE2.2 - INFANT POSTURE ASSESSMENT BASED ON ROTATIONAL KEYPOINT DETECTION
Fukumori, Takahiro
pg. 725
OD-A-TH2.6 - SPEECH EMOTION RECOGNITION WITH FUSION OF ACOUSTIC- AND LINGUISTIC-FEATURE-BASED DECISIONS
Fukumoto, Katsuki
pg. 324
LS-B-TH2.1 - NODE CLUSTERING OF TIME-VARYING GRAPHS BASED ON TEMPORAL LABEL SMOOTHNESS
Fukushima, Norishige
pg. 63
LS-A-WE2.1 - DOMAIN SPECIFIC DESCRIPTION IN HALIDE FOR RANDOMIZED IMAGE CONVOLUTION
pg. 74
LS-A-WE2.3 - ACCELERATING FINITE IMPULSE RESPONSE FILTERING USING TENSOR CORES
pg. 88
LS-A-WE2.5 - COLOR TRANSFORMATION FOR COMPRESSIVE COMPUTING IN IMAGE FILTERING
Funabiki, Nobuo
pg. 1865
LS-B-FR3.3 - A STUDY OF PRIVACY PROTECTION OF PHOTOS TAKEN BY A WIDE-ANGLE SURVEILLANCE CAMERA
pg. 1808
LS-B-TH1.4 - FEATURE EXTRACTION SUITABLE FOR DOUBLE JPEG COMPRESSION ANALYSIS BASED ON STATISTICAL BIAS OBSERVATION OF DCT COEFFICIENTS
pg. 1815
LS-B-TH1.5 - FEATURE EXTRACTION BASED ON DENOISING AUTO ENCODER FOR CLASSIFICATION OF ADVERSARIAL EXAMPLES
Funahashi, Isana
pg. 1381
OD-B-TH1.4 - HIGH REFLECTION REMOVAL USING CNN WITH DETECTION AND ESTIMATION
FUNAKI, KEIICHI
pg. 932
OD-A-FR1.10 - ON AN IMPROVED F0 ESTIMATION BASED ON L2-NORM REGULARIZED TV-CAR SPEECH ANALYSIS
G
Galajit, Kasorn
pg. 1634
LS-C-FR1.3 - HYBRIDIZATION OF SPEECH INFORMATION HIDING AND ENCRYPTION FOR DOUBLE-LAYER SECURITY IN SPEECH COMMUNICATION
Gan, Woon-Seng
pg. 1180
LS-C-TH3.2 - DESIGN AND EVALUATION OF ACTIVE NOISE CONTROL ON MACHINERY NOISE
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Gandhi, Tapan
pg. 1281
OD-B-FR2.4 - UNDERSTANDING STRUCTURE INDUCED FUNCTIONAL CONNECTIVITY IN BRAIN USING EEG
Gao, Fei
pg. 559
OD-A-WE2.7 - LOW-POWER CONVOLUTIONAL RECURRENT NEURAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT
Gao, Peng
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Gao, Shang
pg. 950
OD-A-FR2.3 - A MULTI-SOURCE LOCALIZATION METHOD BASED ON CLUSTERING AND OUTLIER REMOVAL
Gao, Yanlong
pg. 93
LS-A-WE2.6 - IMBALANCED SAMPLE FEATURE ENHANCEMENT OF HYPERSPECTRAL IMAGERY CLASSIFICATION
Garg, Sparsh
pg. 761
OD-A-TH2.12 - DETECTING MULTIPLE DISFLUENCIES FROM SPEECH USING PRE-LINGUISTIC AUTOMATIC SYLLABIFICATION WITH ACOUSTIC AND PROSODY FEATURES
GENG, Yuting
pg. 968
OD-A-FR2.6 - NARROW-EDGED BEAMFORMING USING MASKED PARAMETRIC ARRAY LOUDSPEAKERS
gogineni, Vinay Chakravarthi
pg. 2072
LS-C-TH1.1 - GRAPH KERNEL RECURSIVE LEAST-SQUARES ALGORITHMS
Gong, Hao
pg. 1458
OD-B-FR1.6 - HEAD MOVEMENT PREDICTION USING FCNN
Gong, Jian
pg. 1087
OD-A-FR3.13 - EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS
Goto, Keita
pg. 1993
OD-B-TH3.4 - AUGMENTATION-AGNOSTIC REGULARIZATION FOR UNSUPERVISED CONTRASTIVE LEARNING WITH ITS APPLICATION TO SPEAKER VERIFICATION
Guan, Haixin
pg. 559
OD-A-WE2.7 - LOW-POWER CONVOLUTIONAL RECURRENT NEURAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT
Guan, Shanzheng
pg. 1111
LS-D-WE1.1 - ATTENTION-BASED MULTI-CHANNEL SPEAKER VERIFICATION WITH AD-HOC MICROPHONE ARRAYS
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 630
OD-A-TH1.4 - MINIMUM-VOLUME REGULARIZED ILRMA FOR BLIND AUDIO SOURCE SEPARATION
Guo, Jing-Ming
pg. 1670
LS-D-FR2.3 - DIGITAL MULTITONE IMAGE RECONSTRUCTION USING DEEP GENERATIVE ADVERSARIAL NETS
pg. 1580
LS-C-TH2.3 - DIGITAL HALFTONE CLASSIFICATION USING SIMPLIFIED CNN AND STOCHASTIC STATISTICS
Guo, Liyong
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Guo, Min
pg. 750
OD-A-TH2.10 - A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION
Guo, Taiyang
pg. 700
OD-A-TH2.2 - HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION
Guo, Wu
pg. 1062
OD-A-FR3.8 - END TO END SPOKEN LANGUAGE UNDERSTANDING USING PARTIAL DISENTANGLED SLOT EMBEDDING
Guo, You Sheng
pg. 141
LS-D-TH2.1 - NON-PARALLEL VOICE CONVERSION WITH GENERATIVE ATTENTIONAL NETWORKS
Guo, Zhaojin
OD-B-TH2.6 - CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY
Gupta, Chitralekha
pg. 904
OD-A-FR1.5 - TRAINING EXPLAINABLE SINGING QUALITY ASSESSMENT NETWORK WITH AUGMENTED DATA
pg. 912
OD-A-FR1.6 - TOWARDS REFERENCE-INDEPENDENT RHYTHM ASSESSMENT OF SOLO SINGING
Gupta, Shefali
pg. 1281
OD-B-FR2.4 - UNDERSTANDING STRUCTURE INDUCED FUNCTIONAL CONNECTIVITY IN BRAIN USING EEG
Gupta, Siddhant
pg. 775
OD-A-TH2.14 - DEEP CONVOLUTIONAL NEURAL NETWORK FOR VOICE LIVENESS DETECTION
Gurugubelli, Krishna
pg. 737
OD-A-TH2.8 - COMPARATIVE STUDY OF FILTER BANKS TO IMPROVE THE PERFORMANCE OF VOICE DISORDER ASSESSMENT SYSTEMS USING LTAS FEATURES
H
Ha, Jeong-Won
pg. 1698
LS-A-FR3.4 - SUPER-RESOLUTION IMAGING USING A FOCUS PIXEL SENSOR
Hama, Soichi
pg. 1736
OD-B-TH2.5 - EVALUATION ON PALM VEIN RECOGNITION OF CHILDREN IN GROWING
Hamamoto, Takayuki
pg. 88
LS-A-WE2.5 - COLOR TRANSFORMATION FOR COMPRESSIVE COMPUTING IN IMAGE FILTERING
Han, Byeong-Ju
pg. 1607
LS-D-TH3.4 - CONTEXT-BASED MATCHING REFINEMENT FOR PERSON SEARCH
Han, Ji-Yan
pg. 829
OD-A-TH3.6 - SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR
Han, Jiao
pg. 1121
LS-D-WE1.3 - AN MAP ESTIMATION FOR BETWEEN-CLASS VARIANCE
Hanamoto, Kazuya
pg. 295
OD-B-WE2.11 - FEEDBACK QUANTIZATION AND BIT ALLOCATION FOR NETWORKED CONTROL SYSTEMS WITH RATE LIMITED CHANNELS
Harada, Yuna
pg. 1000
OD-A-FR2.11 - SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT
Harakawa, Ryosuke
pg. 1703
LS-A-FR3.5 - MULTI-VIEW VARIATIONAL AUTOENCODER FOR ROBUST CLASSIFICATION AGAINST IRRELEVANT DATA
pg. 1536
LS-A-WE1.6 - PRODUCT QUANTIZATION TO REDUCE ENTROPY OF LABELS FOR FAST AND ACCURATE IMAGE RETRIEVAL
Harigae, Yuta
pg. 218
LS-A-FR2.4 - A PROPOSAL TOWARD STANDARDIZATION OF DESIGN EXAMPLES FOR IIR FILTER DESIGN METHODS
Harjoko, Agus
pg. 1602
LS-D-TH3.3 - A FUSION METHODOLOGY OF AKAZE AND NEURAL NETWORK FOR FINGERPRINT RECOGNITION
Haruta, Chiho
pg. 1215
LS-B-FR1.3 - FRAMEWISE FINITE IMPULSE RESPONSE FILTERING BASED ON TIME-FREQUENCY MASK FOR LOW-LATENCY SPEECH ENHANCEMENT
Hasumi, Takuya
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
Hatakeyama, Kazuki
pg. 1067
OD-A-FR3.9 - MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE
HATANO, Tomomi
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Hayakawa, Daichi
pg. 23
OD-B-WE1.4 - MASK-BASED BEAMFORMING USING COMPLEX-VALUED NEURAL NETWORK FOR RECOGNITION OF SPATIAL TARGET SPEECH
Hayashi, Kazunori
pg. 1741
OD-B-TH2.7 - AN OVERLOADED MU-MIMO SIGNAL DETECTION METHOD USING PIECEWISE CONTINUOUS NONCONVEX SPARSE REGULARIZER
pg. 1748
OD-B-TH2.8 - RECEIVED SIGNAL POWER BASED SENSOR ZONE ESTIMATION WITH MAXIMUM LIKELIHOOD APPROACH
He, Mingyi
pg. 282
OD-B-WE2.9 - NEARBY-PERSON OCCLUSION DATA AUGMENTATION FOR HUMAN POSE ESTIMATION WITH NON-EXTRA ANNOTATIONS
HE, Wanqi
pg. 121
LS-D-TH1.4 - AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR
He, Zheng
pg. 1511
LS-A-WE1.3 - MULTI-RESIDUAL FEATURE FUSION NETWORK FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
He, Ziqiang
pg. 1858
LS-B-FR3.2 - DERIVING A COMPACT ANALYTICAL MODEL FOR CAMERA RESPONSE FUNCTIONS WITH APPLICATION TO CHARTLESS RADIOMETRIC CALIBRATION
He, Zunwen
pg. 541
OD-A-WE2.4 - DNN-BASED LINEAR PREDICTION RESIDUAL ENHANCEMENT FOR SPEECH DEREVERBERATION
Heo, Suwoong
pg. 1428
OD-B-FR1.1 - HIGH-QUALITY SINGLE IMAGE 3D FACIAL SHAPE RECONSTRUCTION VIA ROBUST ALBEDO ESTIMATION
Hernandez, Marissa
pg. 1277
OD-B-FR2.3 - A RECOMMENDATION SYSTEMS APPROACH FOR DETECTING EPISTASIS IN GENOMIC SIGNALS
Herranz, Luis
pg. 1422
OD-B-TH1.12 - SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING
Higuchi, Yosuke
pg. 477
OD-A-WE1.9 - AN INVESTIGATION OF ENHANCING CTC MODEL FOR TRIGGERED ATTENTION-BASED STREAMING ASR
Hiraga, Yuzuru
pg. 890
OD-A-FR1.3 - INVESTIGATING TIME-FREQUENCY REPRESENTATIONS FOR AUDIO FEATURE EXTRACTION IN SINGING TECHNIQUE CLASSIFICATION
Hirata, Kouji
pg. 1895
LS-B-WE1.3 - IMPLEMENTATION OF A FAST FAILURE RECOVERY METHOD CONSIDERING LOAD DISTRIBUTION FOR NETWORK SLICING
pg. 1899
LS-B-WE1.4 - MULTI-ARMED BANDIT-BASED ROUTING METHOD FOR IN-NETWORK CACHING
pg. 1908
LS-B-WE1.6 - INHIBITION MODELING OF FUTURE MALWARE DIFFUSION WITH AN EVOLUTIONARY GAME THEORY
Hirayama, Atsuya
pg. 1741
OD-B-TH2.7 - AN OVERLOADED MU-MIMO SIGNAL DETECTION METHOD USING PIECEWISE CONTINUOUS NONCONVEX SPARSE REGULARIZER
Hirayama, Naotaka
pg. 1963
LS-A-FR1.4 - OFFLOADING SELECTION WITH UNEQUAL TIMESLOT IN MOBILE EDGE COMPUTING
Hirose, Akira
pg. 174
LS-A-TH3.1 - GENERALIZATION CHARACTERISTICS OF COMPLEX-VALUED RESERVOIR COMPUTING FOR INTERFEROMETRIC SYNTHETIC APERTURE RADAR APPLICATIONS
pg. 193
LS-A-TH3.4 - ADAPTIVE SUBSURFACE IMAGING BASED ON PEAK PHASE-PROFILE: THE SIGNIFICANCE IN SEPARATION OF SCATTERING PHASE FROM PROPAGATION PHASE
pg. 200
LS-A-TH3.5 - DISCUSSION ON THE ORIGIN OF THE STRENGTH OF PHASOR QUATERNION SELF-ORGANIZING MAP
Hiruma, Nobuhiko
pg. 9
OD-B-WE1.2 - ADAPTIVE FEEDBACK CANCELLATION BASED ON PREDICTION ERROR METHOD USING INTERAURAL LEVEL DIFFERENCES IN HEARING DEVICE
Hisatsune, Kazuki
pg. 1338
LS-A-TH1.1 - REAL-TIME MONITORING SYSTEM TO EVALUATE EXERCISE LOAD, HYPOXIC LOAD, AND SAFETY IN A NORMOBARIC HYPOXIC ROOM
Hogg, Aidan
pg. 705
OD-A-TH2.3 - A STUDY OF SALIENT MODULATION DOMAIN FEATURES FOR SPEAKER IDENTIFICATION
Honda, Hiroki
pg. 1748
OD-B-TH2.8 - RECEIVED SIGNAL POWER BASED SENSOR ZONE ESTIMATION WITH MAXIMUM LIKELIHOOD APPROACH
Hong Nguyen, Huy
pg. 1815
LS-B-TH1.5 - FEATURE EXTRACTION BASED ON DENOISING AUTO ENCODER FOR CLASSIFICATION OF ADVERSARIAL EXAMPLES
Hong, Qian-Bei
pg. 619
OD-A-TH1.2 - IMPROVEMENT OF SPATIAL AMBIGUITY IN MULTI-CHANNEL SPEECH SEPARATION USING CHANNEL ATTENTION
Hong, Qingyang
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Hontani, Hidekata
OD-B-FR3.1 - FAST ALGORITHM FOR LOW-RANK TENSOR COMPLETION IN DELAY EMBEDDED SPACE
Hori, Kouki
pg. 275
OD-B-WE2.8 - NONLINEAR SVM-TYPE AUTOMATIC DICISION ALGORITHM IN NOISY ENVIRONMENT FOR HAMMERING TEST SYSTEM
Horii, Koharu
pg. 1077
OD-A-FR3.11 - END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
Hosoda, Yuya
pg. 920
OD-A-FR1.8 - PITCH ESTIMATION ALGORITHM FOR NARROWBAND SPEECH SIGNAL USING PHASE DIFFERENCES BETWEEN HARMONICS
Hou, Jingyong
pg. 672
OD-A-TH1.11 - TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING
Hou, Nana
pg. 497
OD-A-WE1.12 - MULTITASK-BASED JOINT LEARNING APPROACH TO ROBUST ASR FOR RADIO COMMUNICATION SPEECH
pg. 679
OD-A-TH1.12 - TIME DOMAIN SPEECH ENHANCEMENT WITH ATTENTIVE MULTI-SCALE APPROACH
Hsia, Chih-Hsien
pg. 1674
LS-D-FR2.4 - SMART FACIAL SKINCARE PRODUCTS USING COMPUTER VISION TECHNOLOGIES
pg. 1678
LS-D-FR2.5 - AN ATTENTION BASED EXPERT INSPECTION SYSTEM FOR SMART SCALP
Hsieh, Chia-Yeh
pg. 1258
LS-D-FR3.5 - INSTRUMENTED ROMBERG TEST OF POSTURAL STABILITY IN PATIENTS WITH VESTIBULAR DISORDERS USING INERTIAL MEASUREMENT UNITS
Hsu, Hsuan-Wei
pg. 1662
LS-D-FR2.2 - DEEPFAKE ALGORITHM USING MULTIPLE NOISE MODALITIES WITH TWO-BRANCH PREDICTION NETWORK
Hsu, Jia-Hao
pg. 1982
OD-B-TH3.2 - TASK-AWARE BERT-BASED SENTIMENT ANALYSIS FROM MULTIPLE ESSENCES OF THE TEXT
pg. 1026
OD-A-FR3.2 - ENSEMBLE OF ONE MODEL: CREATING MODEL VARIATIONS FOR TRANSFORMER WITH LAYER PERMUTATION
Hsu, Yung-Chang
pg. 2006
OD-B-TH3.6 - FAQ RETRIEVAL USING QUESTION-AWARE GRAPH CONVOLUTIONAL NETWORK AND CONTEXTUALIZED LANGUAGE MODEL
Hu, Chuanzhan
pg. 106
LS-D-TH1.2 - AN IMPLEMENTATION METHOD OF HEVC DATAFLOW GRAPH BASED ON RECONFIGURABLE PROCESSER
Hu, Hongmei
pg. 1311
LS-C-WE1.2 - EXPLORING ARTIFACT REJECTION FOR HIGH-PULSE RATE ELECTRICALLY EVOKED AUDITORY STEADY STATE RESPONSES IN COCHLEAR IMPLANTS USERS
Hu, Kaixi
pg. 1551
LS-C-WE2.3 - TEXT DESCRIPTION GENERATION FROM VIDEOS VIA DEEP SEMANTIC MODELS
Hu, Shenghua
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
pg. 1072
OD-A-FR3.10 - SEPARABLE TEMPORAL CONVOLUTION PLUS TEMPORALLY POOLED ATTENTION FOR LIGHTWEIGHT HIGH-PERFORMANCE KEYWORD SPOTTING
Hu, Shun
pg. 1386
OD-B-TH1.5 - INTRA CODING TOOL PRUNING FOR REDUCING COMPLEXITY OF VVC SCREEN CONTENT CODING
Hu, Wenxuan
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Hu, Xinhui
pg. 700
OD-A-TH2.2 - HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION
Hu, Yanxin
pg. 672
OD-A-TH1.11 - TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING
Huang, Chong-Rui
pg. 1197
LS-C-TH3.5 - DEVELOPMENT OF ACTIVE HEAR-THROUGH EQUALIZATION ALGORITHM FOR EARPHONES
Huang, Feiran
pg. 1839
LS-B-FR2.4 - END-TO-END LEARNING FOR ENCRYPTED IMAGE RETRIEVAL
Huang, Gongping
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
pg. 49
OD-B-WE1.8 - KRONECKER PRODUCT ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS
Huang, Hao
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
Huang, Huirong
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Huang, Pin-Tuan
pg. 719
OD-A-TH2.5 - GENERATION OF SPEAKER REPRESENTATIONS USING HETEROGENEOUS TRAINING BATCH ASSEMBLY
Huang, Shuai
pg. 2085
LS-C-TH1.3 - PERSONALIZED LEARNING USING MULTIPLE KERNEL MODELS
Huang, Wen-Chin
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
pg. 814
OD-A-TH3.4 - NOISY-TO-NOISY VOICE CONVERSION FRAMEWORK WITH DENOISING MODEL
Huang, Wen-chin
pg. 870
OD-A-TH3.13 - INVESTIGATION OF TEXT-TO-SPEECH-BASED SYNTHETIC PARALLEL DATA FOR SEQUENCE-TO-SEQUENCE NON-PARALLEL VOICE CONVERSION
Huang, Yih-Fang
pg. 2072
LS-C-TH1.1 - GRAPH KERNEL RECURSIVE LEAST-SQUARES ALGORITHMS
Huang, Yu-Min
pg. 2043
OD-B-TH3.12 - MULTI-RESOLUTION CONVOLUTIONAL RECURRENT NETWORKS
Huang, Yukai
pg. 750
OD-A-TH2.10 - A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION
Huo, Guanying
pg. 1375
OD-B-TH1.3 - UNDERWATER IMAGE DEHAZING BASED ON DISPARITY ESTIMATION AND COLOR CONSTRAINT
I
Iida, Kenta
pg. 1846
LS-B-FR2.5 - A PRIVACY-PRESERVING IMAGE RETRIEVAL SCHEME USING A CODEBOOK GENERATED FROM INDEPENDENT PLAIN-IMAGE DATASET
Iiguni, Youji
pg. 920
OD-A-FR1.8 - PITCH ESTIMATION ALGORITHM FOR NARROWBAND SPEECH SIGNAL USING PHASE DIFFERENCES BETWEEN HARMONICS
Ikeda, Kazushi
pg. 1353
LS-A-TH1.4 - MATHEMATICAL MODEL OF A HORSE AND THE RIDER DURING A JUMP
OD-B-FR3.4 - STABILITY OF A FINANCIAL SYSTEM VIA FINDING SYSTEMICALLY IMPORTANT FINANCIAL INSTITUTIONS
pg. 1357
LS-A-TH1.5 - EVALUATION OF THE EFFECT OF TRANSFER LEARNING TO MULTI-INSTANCE DETECTION OF MONKEYS
Ikehara, Masaaki
pg. 1381
OD-B-TH1.4 - HIGH REFLECTION REMOVAL USING CNN WITH DETECTION AND ESTIMATION
Imaizumi, Shoko
pg. 1821
LS-B-FR2.1 - AN EXTENDED REVERSIBLE DATA HIDING METHOD FOR HDR IMAGES USING EDGE ESTIMATION
pg. 1794
LS-B-TH1.2 - A FLEXIBLE REVERSIBLE DATA HIDING METHOD IN COMPRESSIBLE ENCRYPTED IMAGES
Imakura, Akira
OD-B-FR3.1 - FAST ALGORITHM FOR LOW-RANK TENSOR COMPLETION IN DELAY EMBEDDED SPACE
Imaoka, Hitoshi
pg. 1781
LS-B-WE2.4 - COMPARATIVE STUDY OF FEATURE EXTRACTION METHOD FOR EMOTIONAL CLASSIFICATION BY MICRO-EXPRESSIONS
Imoto, Hirochika
pg. 1343
LS-A-TH1.2 - PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE
Imoto, Keisuke
pg. 1156
LS-A-TH2.4 - MULTITASK LEARNING OF ACOUSTIC SCENES AND EVENTS USING DYNAMIC WEIGHT ADAPTATION BASED ON MULTI-FOCAL LOSS
pg. 1161
LS-A-TH2.5 - INVESTIGATION ON SPATIAL AND FREQUENCY-BASED FEATURES FOR ASYNCHRONOUS ACOUSTIC SCENE ANALYSIS
Imtiaz, Izbaila
pg. 1499
LS-A-WE1.1 - AN EFFICIENT IMAGE PROCESSING AND MACHINE LEARNING BASED TECHNIQUE FOR SKIN LESION SEGMENTATION AND CLASSIFICATION
Inagaki, Keiichiro
pg. 1289
OD-B-FR2.5 - EFFECT OF VISUAL ATTENTION AND DRIVING EXPERIENCES ON THE EVENT-RELATED POTENTIAL P300 IN THE PERCEPTION OF TRAFFIC SCENES
Inatsu, Nao
pg. 1348
LS-A-TH1.3 - PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE
Inoue, Nakamasa
pg. 1993
OD-B-TH3.4 - AUGMENTATION-AGNOSTIC REGULARIZATION FOR UNSUPERVISED CONTRASTIVE LEARNING WITH ITS APPLICATION TO SPEAKER VERIFICATION
Inoue, Takao
pg. 1343
LS-A-TH1.2 - PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE
Irie, Kei
pg. 392
LS-C-FR2.2 - ON IMPROVING THE ACCURACY OF OBJECT DETECTION FOR HIGH RESOLUTION IMAGES BASED ON SSD
Irino, Toshio
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
Itasaka, Tatsuki
OD-B-FR3.3 - STUDY ON GENERALIZATION PERFORMANCE OF DEEP IMAGE RESTORATION WITH UNFOLDING ON SMALL DATASETS
Ito, Hiroki
pg. 1833
LS-B-FR2.3 - ACCESS CONTROL USING SPATIALLY INVARIANT PERMUTATION OF FEATURE MAPS FOR SEMANTIC SEGMENTATION MODELS
Ito, Koichi
pg. 1762
LS-B-WE2.1 - A COMPREHENSIVE STUDY OF FACE RECOGNITION USING DEEP LEARNING
Ito, Yusuke
pg. 1899
LS-B-WE1.4 - MULTI-ARMED BANDIT-BASED ROUTING METHOD FOR IN-NETWORK CACHING
Itoh, Yoshiaki
pg. 1067
OD-A-FR3.9 - MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE
Iwahashi, Masahiro
pg. 1703
LS-A-FR3.5 - MULTI-VIEW VARIATIONAL AUTOENCODER FOR ROBUST CLASSIFICATION AGAINST IRRELEVANT DATA
pg. 1536
LS-A-WE1.6 - PRODUCT QUANTIZATION TO REDUCE ENTROPY OF LABELS FOR FAST AND ACCURATE IMAGE RETRIEVAL
Iwai, Kenta
pg. 1173
LS-C-TH3.1 - A STUDY ON OPTIMAL FILTER OF FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM BASED ON ANALYSIS OF FREQUENCY RESPONSE
pg. 989
OD-A-FR2.9 - FORMULATION OF MULTIDIMENSIONAL FREQUENCY CHARACTERISTICS OF SECOND-ORDER NONLINEAR IIR FILTER
pg. 995
OD-A-FR2.10 - TWO-STAGE PHASE RECONSTRUCTION USING DNN AND VON MISES DISTRIBUTION-BASED MAXIMUM LIKELIHOOD
pg. 1000
OD-A-FR2.11 - SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT
Iwamae, Reina
pg. 1156
LS-A-TH2.4 - MULTITASK LEARNING OF ACOUSTIC SCENES AND EVENTS USING DYNAMIC WEIGHT ADAPTATION BASED ON MULTI-FOCAL LOSS
Iwamoto, Yu
pg. 1082
OD-A-FR3.12 - UNSUPERVISED SPOKEN TERM DISCOVERY USING WAV2VEC 2.0
Iwano, Koji
pg. 624
OD-A-TH1.3 - NOISE-TOLERANT TIME-DOMAIN SPEECH SEPARATION WITH NOISE BASES
J
Jamwal, Prashant
pg. 416
LS-C-FR3.2 - COST-EFFECTIVE PROPORTIONATE AFFINE PROJECTION ALGORITHM WITH VARIABLE PARAMETERS FOR ACOUSTIC FEEDBACK CANCELLATION
pg. 423
LS-C-FR3.3 - SELF-SUPERVISED VISUAL TRANSFORMERS FOR BREAST CANCER DIAGNOSIS
Jamwal, Prashant Kumar
pg. 410
LS-C-FR3.1 - EVENT-RELATED SPECTROGRAM REPRESENTATION OF EEG FOR CNN-BASED P300 SPELLER
Jang, Yeong Il
pg. 158
LS-D-TH2.4 - LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS
JAYAN P., DEEPAK
pg. 1987
OD-B-TH3.3 - CONVOLUTIONAL AUTOENCODER BASED DEEP LEARNING MODEL FOR IDENTIFICATION OF RED PALM WEEVIL SIGNALS
JEON, Gwanggil
pg. 1528
LS-A-WE1.5 - ACCELERATION OF PDS–BASED HIGH–DIMENSIONAL SIGNAL RESTORATION
Jeon, Gwanggil
pg. 1499
LS-A-WE1.1 - AN EFFICIENT IMAGE PROCESSING AND MACHINE LEARNING BASED TECHNIQUE FOR SKIN LESION SEGMENTATION AND CLASSIFICATION
pg. 1511
LS-A-WE1.3 - MULTI-RESIDUAL FEATURE FUSION NETWORK FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
Jeon, Moongu
pg. 1756
OD-B-TH2.9 - ANOMALY DETECTION FOR WIRELESS COMMUNICATION LINKS VIA DATA INTEGRITY MODELING
Jhong, Sin-Ye
pg. 1678
LS-D-FR2.5 - AN ATTENTION BASED EXPERT INSPECTION SYSTEM FOR SMART SCALP
Ji, Tiannan
pg. 1422
OD-B-TH1.12 - SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING
Ji, Xiaoli
pg. 1087
OD-A-FR3.13 - EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS
Jia, Chen
pg. 689
OD-A-TH1.14 - SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS
Jia, Jia
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Jia, Maoshen
pg. 950
OD-A-FR2.3 - A MULTI-SOURCE LOCALIZATION METHOD BASED ON CLUSTERING AND OUTLIER REMOVAL
Jiang, Aimin
pg. 1333
LS-C-WE1.6 - ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON SPARSE AUTOENCODER
JIANG, BINGHONG
pg. 113
LS-D-TH1.3 - AN IMPROVED NAIVE BAYES MODEL FOR AIR TEMPERATURE PREDICTION
Jiang, Dongmei
pg. 305
OD-B-WE2.13 - POSITIONAL-SPECTRAL-TEMPORAL ATTENTION IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR EEG EMOTION RECOGNITION
Jiang, Junping
pg. 1328
LS-C-WE1.5 - ADHD CLASSIFICATION VIA AUTO-ENCODING NETWORK WITH NON-IMAGING DATA FUSION
Jiang, Lin
pg. 106
LS-D-TH1.2 - AN IMPLEMENTATION METHOD OF HEVC DATAFLOW GRAPH BASED ON RECONFIGURABLE PROCESSER
Jin, Jilu
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
K
K. R., RAJESH
pg. 1987
OD-B-TH3.3 - CONVOLUTIONAL AUTOENCODER BASED DEEP LEARNING MODEL FOR IDENTIFICATION OF RED PALM WEEVIL SIGNALS
Kagoshima, Takehiko
pg. 23
OD-B-WE1.4 - MASK-BASED BEAMFORMING USING COMPLEX-VALUED NEURAL NETWORK FOR RECOGNITION OF SPATIAL TARGET SPEECH
Kai, Atsuhiko
pg. 1037
OD-A-FR3.4 - RETRIEVAL-ORIENTED E2E ASR MODELING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Kajikawa, Yoshinobu
OD-B-FR3.2 - PHASE CONTROL OF PARAMETRIC AEEAY LOUNDSPEAKER BY OPTIMIZING THE SIDEBAND WEIGHTING FUNCTIONS
pg. 1187
LS-C-TH3.3 - A SUBBAND ACTIVE NOISE CONTROL SYSTEM WITH AUTOMATIC TAP ASSIGNMENT IN CONSIDERATION OF PSYCHOACOUSTIC PROPERTIES
pg. 259
OD-B-WE2.5 - STATISTICAL-MECHANICAL ANALYSIS OF ADAPTIVE VOLTERRA FILTER FOR TIME-VARYING UNKNOWN SYSTEM
pg. 989
OD-A-FR2.9 - FORMULATION OF MULTIDIMENSIONAL FREQUENCY CHARACTERISTICS OF SECOND-ORDER NONLINEAR IIR FILTER
Kamble, Madhu
pg. 491
OD-A-WE1.11 - TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Kameda, Suguru
pg. 1953
LS-A-FR1.2 - FUNDAMENTAL INVESTIGATION OF BACKOFF CONTROL METHOD FOR FAIR COMMUNICATION OPPORTUNITY OF MMW WBAN IN OVERCROWDED ENVIRONMENT
Kameoka, Hirokazu
pg. 836
OD-A-TH3.7 - STARGAN-BASED EMOTIONAL VOICE CONVERSION FOR JAPANESE PHRASES
Kan, Yao-Chiang
pg. 1931
LS-D-WE2.4 - A PARKING MONITORING SYSTEM USING FMCW RADARS
Kang, Hong-Goo
pg. 591
OD-A-WE2.12 - STACKED U-NET WITH HIGH-LEVEL FEATURE TRANSFER FOR PARAMETER EFFICIENT SPEECH ENHANCEMENT
Kang, Je-Won
pg. 1598
LS-D-TH3.2 - RATE-DISTORTION OPTIMIZED TEMPORAL SEGMENTATION USING REINFORCEMENT LEARNING FOR VIDEO CODING
Kang, Jiwoo
pg. 1428
OD-B-FR1.1 - HIGH-QUALITY SINGLE IMAGE 3D FACIAL SHAPE RECONSTRUCTION VIA ROBUST ALBEDO ESTIMATION
pg. 1488
OD-B-FR1.11 - CHECKERBOARD CORNER LOCALIZATION ACCELERATED WITH DEEP FALSE DETECTION FOR MULTI-CAMERA CALIBRATION
Kang, Shiyin
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Kang, Suk-Ju
pg. 1693
LS-A-FR3.3 - EDGE MAP-GUIDED SCALE-ITERATIVE IMAGE DEBLURRING
Kang, Xiangui
pg. 1858
LS-B-FR3.2 - DERIVING A COMPACT ANALYTICAL MODEL FOR CAMERA RESPONSE FUNCTIONS WITH APPLICATION TO CHARTLESS RADIOMETRIC CALIBRATION
Karnapi, Furi Andi
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Karnjana, Jessada
pg. 1634
LS-C-FR1.3 - HYBRIDIZATION OF SPEECH INFORMATION HIDING AND ENCRYPTION FOR DOUBLE-LAYER SECURITY IN SPEECH COMMUNICATION
Kasisopa, Benjawan
pg. 926
OD-A-FR1.9 - SVM-BASED EVALUATION OF THAI TONE IMITATIONS BY THAI-NAÏVE MANDARIN AND VIETNAMESE SPEAKERS
Katagiri, Kazuhiro
pg. 603
OD-A-WE2.14 - COMPARATIVE STUDY ON DNN-BASED MINIMUM VARIANCE BEAMFORMING ROBUST TO SMALL MOVEMENTS OF SOUND SOURCES
Katahira, Kenji
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Kato, Koki
pg. 1781
LS-B-WE2.4 - COMPARATIVE STUDY OF FEATURE EXTRACTION METHOD FOR EMOTIONAL CLASSIFICATION BY MICRO-EXPRESSIONS
Kato, Ryota
pg. 212
LS-A-FR2.3 - AN IMPROVED PARAMETER FREE GENETIC ALGORITHM FOR CSD-FIR FILTER DESIGN
Kaushik, Manav
pg. 786
OD-A-TH2.16 - END-TO-END SPEAKER AGE AND HEIGHT ESTIMATION USING ATTENTION MECHANISM AND TRIPLET LOSS
Kawabata, Yoshiko
pg. 380
LS-B-TH3.4 - AIZUCHI AS A SIGN OF INTERNAL INFORMATION PROCESSING AND ITS INTERPRETATIONS BY LISTENERS
Kawahara, Hideki
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
Kawahara, Tatsuya
pg. 433
OD-A-WE1.1 - ON THE USE OF SPEAKER INFORMATION FOR AUTOMATIC SPEECH RECOGNITION IN SPEAKER-IMBALANCED CORPORA
pg. 438
OD-A-WE1.2 - SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION
pg. 465
OD-A-WE1.7 - AN END-TO-END MODEL FROM SPEECH TO CLEAN TRANSCRIPT FOR PARLIAMENTARY MEETINGS
Kawai, Hiroya
pg. 1762
LS-B-WE2.1 - A COMPREHENSIVE STUDY OF FACE RECOGNITION USING DEEP LEARNING
Kawai, Hisashi
pg. 769
OD-A-TH2.13 - SIAMESE NEURAL NETWORK WITH JOINT BAYESIAN MODEL STRUCTURE FOR SPEAKER VERIFICATION
Kawamura, Arata
pg. 920
OD-A-FR1.8 - PITCH ESTIMATION ALGORITHM FOR NARROWBAND SPEECH SIGNAL USING PHASE DIFFERENCES BETWEEN HARMONICS
Kawamura, Kei
pg. 70
LS-A-WE2.2 - FAST STILL PICTURE CODING FOR VVC
Kawamura, Masaki
pg. 1640
LS-C-FR1.4 - BSS-BASED EXTRACTION FOR ADDITIVE VIDEO WATERMARKING
pg. 1647
LS-C-FR1.5 - DETECTION OF PERIODIC PILOT SIGNAL IN IMAGE WATERMARKING
Kawano, Rinka
pg. 1647
LS-C-FR1.5 - DETECTION OF PERIODIC PILOT SIGNAL IN IMAGE WATERMARKING
Ke, Xiaoquan
pg. 743
OD-A-TH2.9 - DUAL DROPOUT RANKING OF LINGUISTIC FEATURES FOR ALZHEIMER’S DISEASE RECOGNITION
Kelesbekov, Rauan
pg. 423
LS-C-FR3.3 - SELF-SUPERVISED VISUAL TRANSFORMERS FOR BREAST CANCER DIAGNOSIS
Khan, Ahmed
pg. 1828
LS-B-FR2.2 - IMAGE WATERMARKING BASED ON NON-NEWTONIAN EFFECT AND INTERPOLATED SWT-DWT
Khassanov, Yerbolat
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
Khoria, Kuldeep
pg. 775
OD-A-TH2.14 - DEEP CONVOLUTIONAL NEURAL NETWORK FOR VOICE LIVENESS DETECTION
Kidani, Yoshitaka
pg. 70
LS-A-WE2.2 - FAST STILL PICTURE CODING FOR VVC
Kim, Chang-Su
pg. 164
LS-D-TH2.5 - FACIAL VIDEO FRAME INTERPOLATION COMBINING SYMMETRIC AND ASYMMETRIC MOTIONS
Kim, Jintae
pg. 164
LS-D-TH2.5 - FACIAL VIDEO FRAME INTERPOLATION COMBINING SYMMETRIC AND ASYMMETRIC MOTIONS
Kim, Jinwoo
pg. 1465
OD-B-FR1.7 - A STUDY ON VIRTUAL REALITY SICKNESS AND VISUAL ATTENTION
pg. 1470
OD-B-FR1.8 - QUALITY OF INTERACTION ARISING FROM AUGMENTED REALITY CONTENT: A COMPREHENSIVE STUDY
Kim, Jong-Ok
pg. 1682
LS-A-FR3.1 - MULTI-BAND NIR COLORIZATION USING STRUCTURE-AWARE NETWORK
pg. 1698
LS-A-FR3.4 - SUPER-RESOLUTION IMAGING USING A FOCUS PIXEL SENSOR
Kim, Nayoung
pg. 1598
LS-D-TH3.2 - RATE-DISTORTION OPTIMIZED TEMPORAL SEGMENTATION USING REINFORCEMENT LEARNING FOR VIDEO CODING
Kim, Seongjean
pg. 1470
OD-B-FR1.8 - QUALITY OF INTERACTION ARISING FROM AUGMENTED REALITY CONTENT: A COMPREHENSIVE STUDY
Kim, Seyun
pg. 158
LS-D-TH2.4 - LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS
Kim, Woojae
pg. 1465
OD-B-FR1.7 - A STUDY ON VIRTUAL REALITY SICKNESS AND VISUAL ATTENTION
Kimata, Hideaki
pg. 1
OD-B-WE1.1 - FAST-PARALLEL SINGULAR VALUE THRESHOLDING FOR MANY SMALL MATRICES BASED ON GEOMETRIC FEATURE OF SINGULAR VALUES
Kimura, Tomotaka
pg. 1899
LS-B-WE1.4 - MULTI-ARMED BANDIT-BASED ROUTING METHOD FOR IN-NETWORK CACHING
pg. 1908
LS-B-WE1.6 - INHIBITION MODELING OF FUTURE MALWARE DIFFUSION WITH AN EVOLUTIONARY GAME THEORY
Kinoshita, Yuma
pg. 1571
LS-C-TH2.1 - SPATIALLY VARYING WHITE BALANCING FOR MIXED AND NON-UNIFORM ILLUMINANTS
pg. 1215
LS-B-FR1.3 - FRAMEWISE FINITE IMPULSE RESPONSE FILTERING BASED ON TIME-FREQUENCY MASK FOR LOW-LATENCY SPEECH ENHANCEMENT
pg. 1167
LS-A-TH2.6 - ANALYSIS ON ROLES OF DNNS IN END-TO-END ACOUSTIC SCENE ANALYSIS FRAMEWORK WITH DISTRIBUTED SOUND-TO-LIGHT CONVERSION DEVICES
pg. 585
OD-A-WE2.11 - CAUSAL DISTORTIONLESS RESPONSE BEAMFORMING BY ALTERNATING DIRECTION METHOD OF MULTIPLIERS
Kitahara, Masaki
pg. 1
OD-B-WE1.1 - FAST-PARALLEL SINGULAR VALUE THRESHOLDING FOR MANY SMALL MATRICES BASED ON GEOMETRIC FEATURE OF SINGULAR VALUES
Kitamura, Daichi
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
pg. 651
OD-A-TH1.8 - PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
Kitaoka, Norihide
pg. 849
OD-A-TH3.9 - MULTI-SPEAKER TTS SYSTEM FOR LOW-RESOURCE LANGUAGE USING CROSS-LINGUAL TRANSFER LEARNING AND DATA AUGMENTATION
pg. 1077
OD-A-FR3.11 - END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
pg. 503
OD-A-WE1.13 - ADVANCED LANGUAGE MODEL FUSION METHOD FOR ENCODER-DECODER MODEL IN JAPANESE SPEECH RECOGNITION
Kitazumi, Koki
pg. 1949
LS-A-FR1.1 - MEASUREMENT OF CO2 IN OUTDOOR ENVIRONMENTS USING LPWAN BASED WSN AND ITS TIME CORRELATION CHARACTERISTICS
Kiya, Hitoshi
pg. 1571
LS-C-TH2.1 - SPATIALLY VARYING WHITE BALANCING FOR MIXED AND NON-UNIFORM ILLUMINANTS
pg. 1851
LS-B-FR3.1 - A PROTECTION METHOD OF TRAINED CNN MODEL USING FEATURE MAPS TRANSFORMED WITH SECRET KEY FROM UNAUTHORIZED ACCESS
pg. 1794
LS-B-TH1.2 - A FLEXIBLE REVERSIBLE DATA HIDING METHOD IN COMPRESSIBLE ENCRYPTED IMAGES
pg. 1833
LS-B-FR2.3 - ACCESS CONTROL USING SPATIALLY INVARIANT PERMUTATION OF FEATURE MAPS FOR SEMANTIC SEGMENTATION MODELS
pg. 1846
LS-B-FR2.5 - A PRIVACY-PRESERVING IMAGE RETRIEVAL SCHEME USING A CODEBOOK GENERATED FROM INDEPENDENT PLAIN-IMAGE DATASET
pg. 1161
LS-A-TH2.5 - INVESTIGATION ON SPATIAL AND FREQUENCY-BASED FEATURES FOR ASYNCHRONOUS ACOUSTIC SCENE ANALYSIS
Kobayashi, Kazuhiro
pg. 546
OD-A-WE2.5 - MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL
Kobayashi, Ruiki
pg. 1687
LS-A-FR3.2 - PROXIMAL GRADIENT-BASED LOOP UNROLLING WITH INTERSCALE THRESHOLDING
Kobayashi, Takuya
pg. 1963
LS-A-FR1.4 - OFFLOADING SELECTION WITH UNEQUAL TIMESLOT IN MOBILE EDGE COMPUTING
Kobayashi, Tetsunori
pg. 477
OD-A-WE1.9 - AN INVESTIGATION OF ENHANCING CTC MODEL FOR TRIGGERED ATTENTION-BASED STREAMING ASR
pg. 603
OD-A-WE2.14 - COMPARATIVE STUDY ON DNN-BASED MINIMUM VARIANCE BEAMFORMING ROBUST TO SMALL MOVEMENTS OF SOUND SOURCES
KODAMA, Yuya
pg. 1528
LS-A-WE1.5 - ACCELERATION OF PDS–BASED HIGH–DIMENSIONAL SIGNAL RESTORATION
Koh, Yeong Jun
pg. 146
LS-D-TH2.2 - UNPAIRED IMAGE DEMOIRÉING BASED ON CYCLIC MOIRÉ LEARNING
Kojima, Atsushi
pg. 460
OD-A-WE1.6 - LARGE-CONTEXT AUTOMATIC SPEECH RECOGNITION BASED ON RNN TRANSDUCER
Kojima, Kazunori
pg. 1067
OD-A-FR3.9 - MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE
Kojima, Masaki
pg. 428
LS-C-FR3.4 - PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM
Kojima, Tetsuya
pg. 1653
LS-C-FR1.6 - AN ACOUSTIC COMMUNICATION TECHNIQUE BASED ON AUDIO DATA HIDING UTILIZING ARTIFICIAL FLOWING WATER SOUNDS
Komatani, Kazunori
pg. 248
OD-B-WE2.3 - SPATIAL NORMALIZATION TO REDUCE POSITIONAL COMPLEXITY IN DIRECTION-AIDED SUPERVISED BINAURAL SOUND SOURCE SEPARATION
pg. 961
OD-A-FR2.5 - MULTIPLE-EMBEDDING SEPARATION NETWORKS: SOUND CLASS-SPECIFIC FEATURE EXTRACTION FOR UNIVERSAL SOUND SEPARATION
Komatsu, Tatsuya
pg. 1139
LS-A-TH2.1 - COMPARISON OF LOW COMPLEXITY SELF-ATTENTION MECHANISMS FOR ACOUSTIC EVENT DETECTION
Kondo, Kazuhiro
pg. 2023
OD-B-TH3.9 - THE EFFECT OF DENSITY AND PLACEMENT OF BLE BEACONS ON INDOOR LOCATION AND MOTION DIRECTION ESTIMATION ACCURACY
pg. 608
OD-A-WE2.15 - IMPROVEMENTS TO NON-INTRUSIVE INTELLIGIBILITY PREDICTION FOR REVERBERANT SPEECH
Kondo, Kazunobu
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
pg. 651
OD-A-TH1.8 - PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION
Kondo, Reishi
pg. 956
OD-A-FR2.4 - IMPULSIVE TIMING DETECTION BASED ON MULTI-FRAME PHASE VOTING FOR ACOUSTIC EVENT DETECTION
Kondo, Takumi
pg. 74
LS-A-WE2.3 - ACCELERATING FINITE IMPULSE RESPONSE FILTERING USING TENSOR CORES
Konishi, Bungo
pg. 174
LS-A-TH3.1 - GENERALIZATION CHARACTERISTICS OF COMPLEX-VALUED RESERVOIR COMPUTING FOR INTERFEROMETRIC SYNTHETIC APERTURE RADAR APPLICATIONS
Koo, Hyung Il
pg. 2049
OD-B-TH3.13 - NETWORK INTRUSION DETECTION WITH IMPROVED FEATURE REPRESENTATION
Koriyama, Tomoki
pg. 794
OD-A-TH3.1 - EMOTION-CONTROLLABLE SPEECH SYNTHESIS USING EMOTION SOFT LABELS AND FINE-GRAINED PROSODY FACTORS
Koshita, Shunsuke
pg. 222
LS-A-FR2.5 - ON OPTIMAL REALIZATIONS FOR ALL-PASS FRACTIONAL DELAY DIGITAL FILTERS
Koyama, Yu
pg. 288
OD-B-WE2.10 - DENSE DEPTHMAP PREDICTION FROM ULTRASONIC SENSORS
Krishna, Gurugubelli
pg. 761
OD-A-TH2.12 - DETECTING MULTIPLE DISFLUENCIES FROM SPEECH USING PRE-LINGUISTIC AUTOMATIC SYLLABIFICATION WITH ACOUSTIC AND PROSODY FEATURES
Kubo, Masahiro
pg. 1781
LS-B-WE2.4 - COMPARATIVE STUDY OF FEATURE EXTRACTION METHOD FOR EMOTIONAL CLASSIFICATION BY MICRO-EXPRESSIONS
Kubo, Takatomi
pg. 1348
LS-A-TH1.3 - PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE
pg. 1357
LS-A-TH1.5 - EVALUATION OF THE EFFECT OF TRANSFER LEARNING TO MULTI-INSTANCE DETECTION OF MONKEYS
Kubota, Ken
pg. 1294
OD-B-FR2.6 - TOWARD ESTIMATION OF ABNORMAL BRAKE IN AUTONOMOUS VEHICLES FROM ELECTROENCEPHALOGRAM AND HEART RATE INTERVAL
Kugiyama, Koyo
pg. 259
OD-B-WE2.5 - STATISTICAL-MECHANICAL ANALYSIS OF ADAPTIVE VOLTERRA FILTER FOR TIME-VARYING UNKNOWN SYSTEM
Kuh, Anthony
pg. 2085
LS-C-TH1.3 - PERSONALIZED LEARNING USING MULTIPLE KERNEL MODELS
pg. 2089
LS-C-TH1.4 - REAL TIME KERNEL LEARNING FOR SENSOR NETWORKS USING PRINCIPLES OF FEDERATED LEARNING
Kumagai, Yuiko
pg. 400
LS-C-FR2.3 - DETECTION OF NOTE ONSETS FROM EEG WHILE LISTENING TO MUSIC
Kuo, C.-C. Jay
pg. 1475
OD-B-FR1.9 - E-PIXELHOP: AN ENHANCED PIXELHOP METHOD FOR OBJECT CLASSIFICATION
Kuo, Chih-En
pg. 1262
OD-B-FR2.1 - A SELF-ATTENTION-BASED ENSEMBLE CONVOLUTION NEURAL NETWORK APPROACH FOR SLEEP STAGE CLASSIFICATION WITH MERGED SPECTROGRAM
Kuo, Sen M.
pg. 1197
LS-C-TH3.5 - DEVELOPMENT OF ACTIVE HEAR-THROUGH EQUALIZATION ALGORITHM FOR EARPHONES
Kuo, Tien-Ying
pg. 1391
OD-B-TH1.6 - IMAGE COMPRESSION ARCHITECTURE WITH BUILT-IN LIGHTWEIGHT MODEL
Kuribayashi, Minoru
pg. 1786
LS-B-TH1.1 - DETECTING DEEPFAKE VIDEOS USING DIGITAL WATERMARKING
pg. 1865
LS-B-FR3.3 - A STUDY OF PRIVACY PROTECTION OF PHOTOS TAKEN BY A WIDE-ANGLE SURVEILLANCE CAMERA
pg. 1808
LS-B-TH1.4 - FEATURE EXTRACTION SUITABLE FOR DOUBLE JPEG COMPRESSION ANALYSIS BASED ON STATISTICAL BIAS OBSERVATION OF DCT COEFFICIENTS
pg. 1815
LS-B-TH1.5 - FEATURE EXTRACTION BASED ON DENOISING AUTO ENCODER FOR CLASSIFICATION OF ADVERSARIAL EXAMPLES
Kurokawa, Takumi
pg. 1037
OD-A-FR3.4 - RETRIEVAL-ORIENTED E2E ASR MODELING FOR IMPROVED QUERY-BY-EXAMPLE SPOKEN TERM DETECTION
Kuroki, Takuma
pg. 1363
LS-A-TH1.6 - SEMI-SUPERVISED ESTIMATION OF DRIVING BEHAVIORS USING ROBUST TIME-CONTRASTIVE LEARNING
Kuroki, Yoshimitsu
pg. 1400
OD-B-TH1.8 - A CONSENSUS FRAMEWORK FOR CONVOLUTIONAL DICTIONARY LEARNING BASED ON L1 NORM ERROR
Kwan, Hon Keung
pg. 1333
LS-C-WE1.6 - ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON SPARSE AUTOENCODER
Kwok, Li-Long
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Kwok, Timothy C.Y.
pg. 1299
OD-B-FR2.7 - SPEAKER TURN AWARE SIMILARITY SCORING FOR DIARIZATION OF SPEECH-BASED COGNITIVE ASSESSMENTS
L
Lai, Chin-Feng
pg. 1674
LS-D-FR2.4 - SMART FACIAL SKINCARE PRODUCTS USING COMPUTER VISION TECHNOLOGIES
Lai, Hong-Lun
pg. 1935
LS-D-WE2.5 - A SEMI-EMPIRICAL DATA-RATE ESTIMATION METHOD OF 5G RAN SLICING
Lai, Ming-Jay
pg. 1935
LS-D-WE2.5 - A SEMI-EMPIRICAL DATA-RATE ESTIMATION METHOD OF 5G RAN SLICING
Lai, Shun-Cheung
pg. 1444
OD-B-FR1.4 - DEEP SIAMESE NETWORK FOR LOW-RESOLUTION FACE RECOGNITION
Lai, Wen-Ping
pg. 1935
LS-D-WE2.5 - A SEMI-EMPIRICAL DATA-RATE ESTIMATION METHOD OF 5G RAN SLICING
Lai, Ying-Hui
pg. 829
OD-A-TH3.6 - SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR
Lai, Yu-Kuen
pg. 1942
LS-D-WE2.6 - AN ENTROPY-BASED DDOS ATTACK DETECTION AND CLASSIFICATION WITH HIERARCHICAL TEMPORAL MEMORY
Lam, Kin-Man
pg. 1444
OD-B-FR1.4 - DEEP SIAMESE NETWORK FOR LOW-RESOLUTION FACE RECOGNITION
Lan, Boon Leong
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
Le, Thi Phuong
pg. 170
LS-D-TH2.6 - FACE ANTI-SPOOFING USING MULTI-BRANCH CNN
Lee, Chul
pg. 146
LS-D-TH2.2 - UNPAIRED IMAGE DEMOIRÉING BASED ON CYCLIC MOIRÉ LEARNING
Lee, Chung-Nan
pg. 1923
LS-D-WE2.3 - REALIZING 5G NETWORK SLICING PROVISIONING WITH OPEN SOURCE SOFTWARE
Lee, Geonsu
pg. 158
LS-D-TH2.4 - LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS
pg. 2049
OD-B-TH3.13 - NETWORK INTRUSION DETECTION WITH IMPROVED FEATURE REPRESENTATION
Lee, Hung-Shin
pg. 719
OD-A-TH2.5 - GENERATION OF SPEAKER REPRESENTATIONS USING HETEROGENEOUS TRAINING BATCH ASSEMBLY
Lee, Jeonghaeng
pg. 1465
OD-B-FR1.7 - A STUDY ON VIRTUAL REALITY SICKNESS AND VISUAL ATTENTION
Lee, Jinyoung
pg. 591
OD-A-WE2.12 - STACKED U-NET WITH HIGH-LEVEL FEATURE TRANSFER FOR PARAMETER EFFICIENT SPEECH ENHANCEMENT
Lee, Ju-Han
pg. 1682
LS-A-FR3.1 - MULTI-BAND NIR COLORIZATION USING STRUCTURE-AWARE NETWORK
Lee, Jung-Kyung
pg. 1598
LS-D-TH3.2 - RATE-DISTORTION OPTIMIZED TEMPORAL SEGMENTATION USING REINFORCEMENT LEARNING FOR VIDEO CODING
Lee, Junghsi
pg. 1931
LS-D-WE2.4 - A PARKING MONITORING SYSTEM USING FMCW RADARS
Lee, Kuan-Lin
pg. 1923
LS-D-WE2.3 - REALIZING 5G NETWORK SLICING PROVISIONING WITH OPEN SOURCE SOFTWARE
Lee, Kyoungoh
pg. 1615
LS-D-TH3.6 - ENVIRONMENT ADAPTIVE 3D POSE ESTIMATION MODEL AND LEARNING STRATEGY
Lee, Ming-Feng
pg. 1923
LS-D-WE2.3 - REALIZING 5G NETWORK SLICING PROVISIONING WITH OPEN SOURCE SOFTWARE
Lee, Oggyu
pg. 1607
LS-D-TH3.4 - CONTEXT-BASED MATCHING REFINEMENT FOR PERSON SEARCH
Lee, Sang-Ho
pg. 1682
LS-A-FR3.1 - MULTI-BAND NIR COLORIZATION USING STRUCTURE-AWARE NETWORK
Lee, Sanghoon
pg. 1428
OD-B-FR1.1 - HIGH-QUALITY SINGLE IMAGE 3D FACIAL SHAPE RECONSTRUCTION VIA ROBUST ALBEDO ESTIMATION
pg. 1615
LS-D-TH3.6 - ENVIRONMENT ADAPTIVE 3D POSE ESTIMATION MODEL AND LEARNING STRATEGY
pg. 1465
OD-B-FR1.7 - A STUDY ON VIRTUAL REALITY SICKNESS AND VISUAL ATTENTION
pg. 1470
OD-B-FR1.8 - QUALITY OF INTERACTION ARISING FROM AUGMENTED REALITY CONTENT: A COMPREHENSIVE STUDY
pg. 1488
OD-B-FR1.11 - CHECKERBOARD CORNER LOCALIZATION ACCELERATED WITH DEEP FALSE DETECTION FOR MULTI-CAMERA CALIBRATION
Lee, Seongmin
pg. 1488
OD-B-FR1.11 - CHECKERBOARD CORNER LOCALIZATION ACCELERATED WITH DEEP FALSE DETECTION FOR MULTI-CAMERA CALIBRATION
Lee, Shi-wook
pg. 1067
OD-A-FR3.9 - MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE
Leglaive, Simon
pg. 684
OD-A-TH1.13 - ON SPEECH SPARSITY FOR COMPUTATIONAL EFFICIENCY AND NOISE REDUCTION IN HEARING AIDS
Lei, Guangzhi
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Lei, Ling
pg. 1127
LS-D-WE1.4 - MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION
Lei, Xin
pg. 672
OD-A-TH1.11 - TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING
Leow, Hui-Wen
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Li, Andong
pg. 553
OD-A-WE2.6 - INCORPORATING MULTI-TARGET IN MULTI-STAGE SPEECH ENHANCEMENT MODEL FOR BETTER GENERALIZATION
Li, Chenxing
pg. 884
OD-A-FR1.2 - RETHINKING SINGING VOICE SEPARATION WITH SPECTRAL-TEMPORAL TRANSFORMER
Li, Chunhao
pg. 1585
LS-C-TH2.4 - IMPLEMENTATION OF AVS3 MULTICAST SYSTEM BASED ON EMBMS
Li, Guanyu
pg. 1121
LS-D-WE1.3 - AN MAP ESTIMATION FOR BETWEEN-CLASS VARIANCE
Li, Haizhou
pg. 904
OD-A-FR1.5 - TRAINING EXPLAINABLE SINGING QUALITY ASSESSMENT NETWORK WITH AUGMENTED DATA
pg. 912
OD-A-FR1.6 - TOWARDS REFERENCE-INDEPENDENT RHYTHM ASSESSMENT OF SOLO SINGING
Li, Hongfeng
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
Li, Jinchao
pg. 743
OD-A-TH2.9 - DUAL DROPOUT RANKING OF LINGUISTIC FEATURES FOR ALZHEIMER’S DISEASE RECOGNITION
Li, Jing
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Li, Jinhu
pg. 904
OD-A-FR1.5 - TRAINING EXPLAINABLE SINGING QUALITY ASSESSMENT NETWORK WITH AUGMENTED DATA
pg. 912
OD-A-FR1.6 - TOWARDS REFERENCE-INDEPENDENT RHYTHM ASSESSMENT OF SOLO SINGING
Li, Jinwei
pg. 1722
OD-B-TH2.3 - UNDETECTABLE JPEG IMAGE BATCH REVERSIBLE DATA HIDING WITH CONTENT-ADAPTIVE PAYLOAD ALLOCATION
Li, Kai
pg. 36
OD-B-WE1.6 - STUDY ON SIMULTANEOUS ESTIMATION OF GLOTTAL SOURCE AND VOCAL TRACT PARAMETERS BY ARMAX-LF MODEL FOR SPEECH ANALYSIS/SYNTHESIS
Li, Ke
pg. 750
OD-A-TH2.10 - A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION
Li, Lantian
pg. 1121
LS-D-WE1.3 - AN MAP ESTIMATION FOR BETWEEN-CLASS VARIANCE
pg. 713
OD-A-TH2.4 - A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS
pg. 780
OD-A-TH2.15 - HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION
Li, Li
pg. 1210
LS-B-FR1.2 - REDUCING ALGORITHMIC DELAY USING LOW-OVERLAP WINDOW FOR ONLINE WAVE-U-NET
pg. 597
OD-A-WE2.13 - EXTENSION OF VIRTUAL MICROPHONE TECHNIQUE TO MULTIPLE REAL MICROPHONES AND INVESTIGATION OF THE IMPACT OF PHASE AND AMPLITUDE INTERPOLATION ON SPEECH ENHANCEMENT
Li, Lin
pg. 1551
LS-C-WE2.3 - TEXT DESCRIPTION GENERATION FROM VIDEOS VIA DEEP SEMANTIC MODELS
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Li, Min
pg. 1328
LS-C-WE1.5 - ADHD CLASSIFICATION VIA AUTO-ENCODING NETWORK WITH NON-IMAGING DATA FUSION
Li, Ming
pg. 878
OD-A-FR1.1 - END-TO-END MANDARIN TONE CLASSIFICATION WITH SHORT TERM CONTEXT INFORMATION
pg. 1133
LS-D-WE1.5 - A UNIFIED DEEP SPEAKER EMBEDDING FRAMEWORK FOR MIXED-BANDWIDTH SPEECH DATA
Li, Mingzhe
pg. 1192
LS-C-TH3.4 - A TRUE DIGITAL FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM WITH NO ANALOG-TO-DIGITAL AND DIGITAL-TO-ANALOG CONVERTERS
Li, Nuo
pg. 541
OD-A-WE2.4 - DNN-BASED LINEAR PREDICTION RESIDUAL ENHANCEMENT FOR SPEECH DEREVERBERATION
Li, Peiya
pg. 1839
LS-B-FR2.4 - END-TO-END LEARNING FOR ENCRYPTED IMAGE RETRIEVAL
Li, Qingwu
pg. 1375
OD-B-TH1.3 - UNDERWATER IMAGE DEHAZING BASED ON DISPARITY ESTIMATION AND COLOR CONSTRAINT
Li, Ruifan
pg. 2060
OD-B-TH3.15 - ENTAILMENT METHOD BASED ON TEMPLATE SELECTION FOR CHINESE TEXT FEW-SHOT LEARNING
pg. 2066
OD-B-TH3.16 - IMAGE CAPTIONING BASED ON AN IMPROVED TRANSFORMER WITH IOU POSITION ENCODING
Li, Sheng
pg. 433
OD-A-WE1.1 - ON THE USE OF SPEAKER INFORMATION FOR AUTOMATIC SPEECH RECOGNITION IN SPEAKER-IMBALANCED CORPORA
pg. 438
OD-A-WE1.2 - SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
Li, Shengqiang
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 443
OD-A-WE1.3 - CONFORMER-BASED END-TO-END SPEECH RECOGNITION WITH ROTARY POSITION EMBEDDING
pg. 448
OD-A-WE1.4 - EFFICIENT CONFORMER-BASED SPEECH RECOGNITION WITH LINEAR ATTENTION
Li, Sixia
pg. 1056
OD-A-FR3.7 - ZERO-SHOT DOMAIN ADAPTATION WITH INFERENCE RELATION PATHS FOR SPOKEN LANGUAGE UNDERSTANDING
Li, Xiaodong
pg. 530
OD-A-WE2.2 - A ROBUST MAXIMUM LIKELIHOOD DISTORTIONLESS RESPONSE BEAMFORMER BASED ON A COMPLEX GENERALIZED GAUSSIAN DISTRIBUTION
LI, Xingfeng
pg. 700
OD-A-TH2.2 - HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION
Li, Yazhou
pg. 2066
OD-B-TH3.16 - IMAGE CAPTIONING BASED ON AN IMPROVED TRANSFORMER WITH IOU POSITION ENCODING
Li, Yijie
pg. 939
OD-A-FR2.1 - CNN-BASED DISCRIMINATIVE TRAINING FOR DOMAIN COMPENSATION IN ACOUSTIC EVENT DETECTION WITH FRAME-WISE CLASSIFIER
Li, Yongwei
pg. 36
OD-B-WE1.6 - STUDY ON SIMULTANEOUS ESTIMATION OF GLOTTAL SOURCE AND VOCAL TRACT PARAMETERS BY ARMAX-LF MODEL FOR SPEECH ANALYSIS/SYNTHESIS
Li, You-Jin
pg. 1245
LS-D-FR3.3 - MIMO SPEECH COMPRESSION AND ENHANCEMENT BASED ON CONVOLUTIONAL DENOISING AUTOENCODER
Li, Zheng
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Lian, Guansan
pg. 1016
OD-A-FR2.13 - SELF-ROTATION ANGLE ESTIMATION OF CIRCULAR MICROPHONE ARRAY BASED ON SOUND FIELD INTERPOLATION
Liang, Chengdong
pg. 1111
LS-D-WE1.1 - ATTENTION-BASED MULTI-CHANNEL SPEAKER VERIFICATION WITH AD-HOC MICROPHONE ARRAYS
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
Liang, Jiaen
pg. 939
OD-A-FR2.1 - CNN-BASED DISCRIMINATIVE TRAINING FOR DOMAIN COMPENSATION IN ACOUSTIC EVENT DETECTION WITH FRAME-WISE CLASSIFIER
Liang, Mengnan
pg. 1333
LS-C-WE1.6 - ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON SPARSE AUTOENCODER
Liang, Paul
pg. 841
OD-A-TH3.8 - UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS
Liao, Po-Yu
pg. 1262
OD-B-FR2.1 - A SELF-ATTENTION-BASED ENSEMBLE CONVOLUTION NEURAL NETWORK APPROACH FOR SLEEP STAGE CLASSIFICATION WITH MERGED SPECTROGRAM
Liaw, Andrew
pg. 1026
OD-A-FR3.2 - ENSEMBLE OF ONE MODEL: CREATING MODEL VARIATIONS FOR TRANSFORMER WITH LAYER PERMUTATION
Lin, Binghuai
pg. 1031
OD-A-FR3.3 - UNCERTAINTY ESTIMATION IN AUTOMATIC PRONUNCIATION ASSESSMENT WITH PSEUDO SAMPLES BASED ON DEEP KERNEL LEARNING
Lin, Hsueh-Chun
pg. 1931
LS-D-WE2.4 - A PARKING MONITORING SYSTEM USING FMCW RADARS
Lin, Jhih-Jhou
pg. 1391
OD-B-TH1.6 - IMAGE COMPRESSION ARCHITECTURE WITH BUILT-IN LIGHTWEIGHT MODEL
Lin, Po-Chiang
pg. 1903
LS-B-WE1.5 - GENERALIZED CLASSIFICATION OF DNS OVER HTTPS TRAFFIC WITH DEEP LEARNING
Lin, Qingjian
pg. 689
OD-A-TH1.14 - SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS
Lin, Ting-Yu
pg. 1674
LS-D-FR2.4 - SMART FACIAL SKINCARE PRODUCTS USING COMPUTER VISION TECHNOLOGIES
Lin, Yen-Po
pg. 2013
OD-B-TH3.7 - 3D-GFE: A THREE-DIMENSIONAL GEOMETRIC-FEATURE EXTRACTOR FOR POINT CLOUD DATA
pg. 2018
OD-B-TH3.8 - ATTENTION EDGECONV FOR 3D POINT CLOUD CLASSIFICATION
Lin, Yi-Chieh
pg. 829
OD-A-TH3.6 - SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR
Lin, Yijun
OD-B-TH2.6 - CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY
Lin, Yinyi
OD-B-TH1.1 - COMPUTATION REDUCTION FOR HEVC INTER PREDICTION
Lin, Yu-Chieh
pg. 1258
LS-D-FR3.5 - INSTRUMENTED ROMBERG TEST OF POSTURAL STABILITY IN PATIENTS WITH VESTIBULAR DISORDERS USING INERTIAL MEASUREMENT UNITS
Lin, Yu-Min
pg. 829
OD-A-TH3.6 - SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR
Lin, Yu-Syuan
pg. 1262
OD-B-FR2.1 - A SELF-ATTENTION-BASED ENSEMBLE CONVOLUTION NEURAL NETWORK APPROACH FOR SLEEP STAGE CLASSIFICATION WITH MERGED SPECTROGRAM
Lin, Yun-Wen
pg. 536
OD-A-WE2.3 - SPEECH ENHANCEMENT BASED ON MASKING APPROACH CONSIDERING SPEECH QUALITY AND ACOUSTIC CONFIDENCE FOR NOISY SPEECH RECOGNITION
Ling, Zhen-hua
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Liou, Yi-Syuan
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
Liu, Cong
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Liu, Guan
pg. 1839
LS-B-FR2.4 - END-TO-END LEARNING FOR ENCRYPTED IMAGE RETRIEVAL
Liu, Hu
pg. 1541
LS-C-WE2.1 - DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION
Liu, Hui
pg. 864
OD-A-TH3.12 - SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL
Liu, Jianquan
pg. 1556
LS-C-WE2.4 - VIEW-INVARIANT FEATURE USING POSE INFORMATION AND FLEXIBLE MATCHING ALGORITHM FOR ACTION RETRIEVAL
Liu, Jiyao
pg. 305
OD-B-WE2.13 - POSITIONAL-SPECTRAL-TEMPORAL ATTENTION IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR EEG EMOTION RECOGNITION
Liu, Kai-Chun
pg. 1258
LS-D-FR3.5 - INSTRUMENTED ROMBERG TEST OF POSTURAL STABILITY IN PATIENTS WITH VESTIBULAR DISORDERS USING INERTIAL MEASUREMENT UNITS
Liu, Miao
pg. 945
OD-A-FR2.2 - FREQUENCY AXIS POOLING METHOD FOR WEAKLY LABELED SOUND EVENT DETECTION AND CLASSIFICATION
Liu, Peng
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Liu, Runji
OD-B-TH2.6 - CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY
Liu, Shupei
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
Liu, Tan
pg. 1062
OD-A-FR3.8 - END TO END SPOKEN LANGUAGE UNDERSTANDING USING PARTIAL DISENTANGLED SLOT EMBEDDING
Liu, Xiaofeng
pg. 1333
LS-C-WE1.6 - ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON SPARSE AUTOENCODER
Liu, Yan
pg. 1375
OD-B-TH1.3 - UNDERWATER IMAGE DEHAZING BASED ON DISPARITY ESTIMATION AND COLOR CONSTRAINT
Liu, Yang
pg. 1317
LS-C-WE1.3 - DEPRESSION SEVERITY LEVEL CLASSIFICATION USING MULTITASK LEARNING OF GENDER RECOGNITION
Liu, Yi-Wen
pg. 1975
OD-B-TH3.1 - MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL
Liu, Yun
pg. 2066
OD-B-TH3.16 - IMAGE CAPTIONING BASED ON AN IMPROVED TRANSFORMER WITH IOU POSITION ENCODING
Liu, Zhi-Song
pg. 1450
OD-B-FR1.5 - LEARN TO SKETCH: A FAST APPROACH FOR UNIVERSAL PHOTO SKETCH
Llave, Adrien
pg. 684
OD-A-TH1.13 - ON SPEECH SPARSITY FOR COMPUTATIONAL EFFICIENCY AND NOISE REDUCTION IN HEARING AIDS
Lo, Tien-Hong
pg. 1049
OD-A-FR3.6 - IMPROVING END-TO-END MODELING FOR MISPRONUNCIATION DETECTION WITH EFFECTIVE AUGMENTATION MECHANISMS
pg. 1104
OD-A-FR3.16 - CROSS-UTTERANCE RERANKING MODELS WITH BERT AND GRAPH CONVOLUTIONAL NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION
Loh, Yuen Peng
pg. 1877
LS-B-FR3.5 - RELABEL, SCRAMBLE, SYNTHESIZE: A NOVEL COVERLESS STEGANOGRAPHY APPROACH VIA COLLAGE IMAGE
Loh, Zhen-Ann
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Long, Yanhua
pg. 939
OD-A-FR2.1 - CNN-BASED DISCRIMINATIVE TRAINING FOR DOMAIN COMPENSATION IN ACOUSTIC EVENT DETECTION WITH FRAME-WISE CLASSIFIER
Lu, Chung-Li
pg. 269
OD-B-WE2.7 - SEMI-SUPERVISED SOUND EVENT DETECTION USING SELF-ATTENTION AND MULTIPLE TECHNIQUES OF CONSISTENCY TRAINING
Lu, WeiRui
pg. 854
OD-A-TH3.10 - TOWARDS UNSEEN SPEAKERS ZERO-SHOT VOICE CONVERSION WITH GENERATIVE ADVERSARIAL NETWORKS
Lu, Xiaoyong
pg. 1317
LS-C-WE1.3 - DEPRESSION SEVERITY LEVEL CLASSIFICATION USING MULTITASK LEARNING OF GENDER RECOGNITION
Lu, Xugang
pg. 769
OD-A-TH2.13 - SIAMESE NEURAL NETWORK WITH JOINT BAYESIAN MODEL STRUCTURE FOR SPEAKER VERIFICATION
Lu, Yen-Ju
pg. 659
OD-A-TH1.9 - A STUDY ON SPEECH ENHANCEMENT BASED ON DIFFUSION PROBABILISTIC MODEL
Lu, Yi-Chang
pg. 2013
OD-B-TH3.7 - 3D-GFE: A THREE-DIMENSIONAL GEOMETRIC-FEATURE EXTRACTOR FOR POINT CLOUD DATA
pg. 2018
OD-B-TH3.8 - ATTENTION EDGECONV FOR 3D POINT CLOUD CLASSIFICATION
Lu, ZhiXun
pg. 1839
LS-B-FR2.4 - END-TO-END LEARNING FOR ENCRYPTED IMAGE RETRIEVAL
Luan, Jian
OD-A-FR1.7 - NOISE ROBUST SINGING VOICE SYNTHESIS USING GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER
Lumban Tobing, Patrick
pg. 814
OD-A-TH3.4 - NOISY-TO-NOISY VOICE CONVERSION FRAMEWORK WITH DENOISING MODEL
Luo, Sixun
pg. 2036
OD-B-TH3.11 - SELF-SUPERVISED LEARNING FOR ONLINE SPEAKER DIARIZATION
Luo, Xuan
pg. 794
OD-A-TH3.1 - EMOTION-CONTROLLABLE SPEECH SYNTHESIS USING EMOTION SOFT LABELS AND FINE-GRAINED PROSODY FACTORS
Luo, Xueqin
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
M
Ma, Ding
pg. 870
OD-A-TH3.13 - INVESTIGATION OF TEXT-TO-SPEECH-BASED SYNTHETIC PARALLEL DATA FOR SEQUENCE-TO-SEQUENCE NON-PARALLEL VOICE CONVERSION
Ma, Duo
pg. 679
OD-A-TH1.12 - TIME DOMAIN SPEECH ENHANCEMENT WITH ATTENTIVE MULTI-SCALE APPROACH
pg. 497
OD-A-WE1.12 - MULTITASK-BASED JOINT LEARNING APPROACH TO ROBUST ASR FOR RADIO COMMUNICATION SPEECH
Ma, Qingqing
pg. 133
LS-D-TH1.6 - PERFORMANCE CHARACTERIZATION OF RASTERIZATION ALGORITHMS FOR RECONFIGURABLE GRAPHICS PROCESSOR
Ma, Zhanyu
pg. 2060
OD-B-TH3.15 - ENTAILMENT METHOD BASED ON TEMPLATE SELECTION FOR CHINESE TEXT FEW-SHOT LEARNING
pg. 2066
OD-B-TH3.16 - IMAGE CAPTIONING BASED ON AN IMPROVED TRANSFORMER WITH IOU POSITION ENCODING
Maeda, Tsubasa
pg. 1092
OD-A-FR3.14 - MULTI-VIEW CONVOLUTION FOR LIPREADING
Maeda, Yoshihiro
pg. 74
LS-A-WE2.3 - ACCELERATING FINITE IMPULSE RESPONSE FILTERING USING TENSOR CORES
pg. 88
LS-A-WE2.5 - COLOR TRANSFORMATION FOR COMPRESSIVE COMPUTING IN IMAGE FILTERING
Magoulianitis, Vasileios
pg. 1475
OD-B-FR1.9 - E-PIXELHOP: AN ENHANCED PIXELHOP METHOD FOR OBJECT CLASSIFICATION
Mahmood, Jabar
pg. 1519
LS-A-WE1.4 - AUTOMOTIVE ENGINE CYLINDER HEAD CRACK DETECTION: CANNY EDGE DETECTION WITH MORPHOLOGICAL DILATION
Maity, Sudhamay
pg. 511
OD-A-WE1.14 - CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION
Mak, Man-Wai
pg. 1299
OD-B-FR2.7 - SPEAKER TURN AWARE SIMILARITY SCORING FOR DIARIZATION OF SPEECH-BASED COGNITIVE ASSESSMENTS
pg. 743
OD-A-TH2.9 - DUAL DROPOUT RANKING OF LINGUISTIC FEATURES FOR ALZHEIMER’S DISEASE RECOGNITION
Makino, Shoji
pg. 1210
LS-B-FR1.2 - REDUCING ALGORITHMIC DELAY USING LOW-OVERLAP WINDOW FOR ONLINE WAVE-U-NET
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
pg. 597
OD-A-WE2.13 - EXTENSION OF VIRTUAL MICROPHONE TECHNIQUE TO MULTIPLE REAL MICROPHONES AND INVESTIGATION OF THE IMPACT OF PHASE AND AMPLITUDE INTERPOLATION ON SPEECH ENHANCEMENT
Makita, Kenichi
pg. 1294
OD-B-FR2.6 - TOWARD ESTIMATION OF ABNORMAL BRAKE IN AUTONOMOUS VEHICLES FROM ELECTROENCEPHALOGRAM AND HEART RATE INTERVAL
Mametani, Kohki
pg. 808
OD-A-TH3.3 - CONDITIONAL DEEP HIERARCHICAL VARIATIONAL AUTOENCODER FOR VOICE CONVERSION
Manabe, Yoshitsugu
pg. 386
LS-C-FR2.1 - INTERNAL STATE ESTIMATION BY THERMAL IMAGE AND IDENTIFICATION OF FACE AND NOSE POSITION
Mao, Tingzhi
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
Maruyama, Tsubasa
pg. 1993
OD-B-TH3.4 - AUGMENTATION-AGNOSTIC REGULARIZATION FOR UNSUPERVISED CONTRASTIVE LEARNING WITH ITS APPLICATION TO SPEAKER VERIFICATION
Masuyama, Yoshiki
pg. 585
OD-A-WE2.11 - CAUSAL DISTORTIONLESS RESPONSE BEAMFORMING BY ALTERNATING DIRECTION METHOD OF MULTIPLIERS
Matsui, Toshie
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
Matsuka, Toshihiko
pg. 359
LS-B-TH3.1 - MODELING THE DYNAMICS OF OBSERVATIONAL BEHAVIORS BASE ON OBSERVERS’ PERSONALITY TRAITS USING HIDDEN MARKOV MODELS
pg. 380
LS-B-TH3.4 - AIZUCHI AS A SIGN OF INTERNAL INFORMATION PROCESSING AND ITS INTERPRETATIONS BY LISTENERS
Matsuoka, Ryo
pg. 1405
OD-B-TH1.9 - NOISE REMOVAL FOR DYNAMIC MODE DECOMPOSITION BASED ON PLUG-AND-PLAY ADMM
Matsuura, Mitsuyasu
pg. 288
OD-B-WE2.10 - DENSE DEPTHMAP PREDICTION FROM ULTRASONIC SENSORS
Matsuzaki, Naoyuki
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Matsuzaki, Raito
pg. 1653
LS-C-FR1.6 - AN ACOUSTIC COMMUNICATION TECHNIQUE BASED ON AUDIO DATA HIDING UTILIZING ARTIFICIAL FLOWING WATER SOUNDS
Matumoto, Kazuki
pg. 218
LS-A-FR2.4 - A PROPOSAL TOWARD STANDARDIZATION OF DESIGN EXAMPLES FOR IIR FILTER DESIGN METHODS
MaungMaung, AprilPyone
pg. 1851
LS-B-FR3.1 - A PROTECTION METHOD OF TRAINED CNN MODEL USING FEATURE MAPS TRANSFORMED WITH SECRET KEY FROM UNAUTHORIZED ACCESS
Mawalim, Candy Olivia
pg. 1627
LS-C-FR1.2 - IMPROVING SECURITY IN MCADAMS COEFFICIENT-BASED SPEAKER ANONYMIZATION BY WATERMARKING METHOD
McCallan, Niamh
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
McKnight, Simon
pg. 705
OD-A-TH2.3 - A STUDY OF SALIENT MODULATION DOMAIN FEATURES FOR SPEAKER IDENTIFICATION
McLaughlin, James
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
Megías, David
pg. 1786
LS-B-TH1.1 - DETECTING DEEPFAKE VIDEOS USING DIGITAL WATERMARKING
Mehrotra, Utkarsh
pg. 761
OD-A-TH2.12 - DETECTING MULTIPLE DISFLUENCIES FROM SPEECH USING PRE-LINGUISTIC AUTOMATIC SYLLABIFICATION WITH ACOUSTIC AND PROSODY FEATURES
Meng, Helen
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
pg. 1299
OD-B-FR2.7 - SPEAKER TURN AWARE SIMILARITY SCORING FOR DIARIZATION OF SPEECH-BASED COGNITIVE ASSESSMENTS
Meng, Helen M.
pg. 743
OD-A-TH2.9 - DUAL DROPOUT RANKING OF LINGUISTIC FEATURES FOR ALZHEIMER’S DISEASE RECOGNITION
Meng, Weixin
pg. 530
OD-A-WE2.2 - A ROBUST MAXIMUM LIKELIHOOD DISTORTIONLESS RESPONSE BEAMFORMER BASED ON A COMPLEX GENERALIZED GAUSSIAN DISTRIBUTION
Meng, Xiaojin Meng
pg. 1328
LS-C-WE1.5 - ADHD CLASSIFICATION VIA AUTO-ENCODING NETWORK WITH NON-IMAGING DATA FUSION
Meteer, Oğuz
pg. 44
OD-B-WE1.7 - LOW-POWER BOOTH MULTIPLICATION WITHOUT DYNAMIC RANGE DETECTION IN FFTS FOR FMCW RADAR SIGNAL PROCESSING
pg. 55
OD-B-WE1.9 - AN OPTIMAL VARIABLE-LATENCY ARCHITECTURE FOR DETERMINISTIC APPROACHES TO STOCHASTIC COMPUTING WITH UNARY BIT STREAM PRESERVING PROPERTIES
Mimura, Masato
pg. 433
OD-A-WE1.1 - ON THE USE OF SPEAKER INFORMATION FOR AUTOMATIC SPEECH RECOGNITION IN SPEAKER-IMBALANCED CORPORA
pg. 465
OD-A-WE1.7 - AN END-TO-END MODEL FROM SPEECH TO CLEAN TRANSCRIPT FOR PARLIAMENTARY MEETINGS
Min, Sung-Jun
pg. 1693
LS-A-FR3.3 - EDGE MAP-GUIDED SCALE-ITERATIVE IMAGE DEBLURRING
Minematsu, Nobuaki
pg. 821
OD-A-TH3.5 - ACOUSTIC SIMULATION OF BODY-CONDUCTED SPEECH AND ITS USE TO CONVERT ONE'S RECORDED VOICES TO ONE'S OWN VOICES
Mirishkar, Ganesh S
pg. 511
OD-A-WE1.14 - CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION
Misawa, Sota
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
Mishima, Sakiko
pg. 956
OD-A-FR2.4 - IMPULSIVE TIMING DETECTION BASED ON MULTI-FRAME PHASE VOTING FOR ACOUSTIC EVENT DETECTION
Misugi, Takeru
pg. 1895
LS-B-WE1.3 - IMPLEMENTATION OF A FAST FAILURE RECOVERY METHOD CONSIDERING LOAD DISTRIBUTION FOR NETWORK SLICING
Miura, Hideyoshi
pg. 1908
LS-B-WE1.6 - INHIBITION MODELING OF FUTURE MALWARE DIFFUSION WITH AN EVOLUTIONARY GAME THEORY
Miyoshi, Seiji
pg. 259
OD-B-WE2.5 - STATISTICAL-MECHANICAL ANALYSIS OF ADAPTIVE VOLTERRA FILTER FOR TIME-VARYING UNKNOWN SYSTEM
Mizobuchi, Yusaku
pg. 651
OD-A-TH1.8 - PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION
Mizoguchi, Takehiko
pg. 179
LS-A-TH3.2 - A HYPERCOMPLEX TENSOR-SVD AND ITS APPLICATION
Mohr, Marisa
pg. 240
OD-B-WE2.2 - ORDERING PRINCIPAL COMPONENTS OF MULTIVARIATE FRACTIONAL BROWNIAN MOTION FOR SOLVING INVERSE PROBLEMS
Morency, Louis-Philippe
pg. 841
OD-A-TH3.8 - UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS
Mori, Daiki
pg. 503
OD-A-WE1.13 - ADVANCED LANGUAGE MODEL FUSION METHOD FOR ENCODER-DECODER MODEL IN JAPANESE SPEECH RECOGNITION
Mori, Hiroki
pg. 1748
OD-B-TH2.8 - RECEIVED SIGNAL POWER BASED SENSOR ZONE ESTIMATION WITH MAXIMUM LIKELIHOOD APPROACH
Mori, Koichiro
pg. 808
OD-A-TH3.3 - CONDITIONAL DEEP HIERARCHICAL VARIATIONAL AUTOENCODER FOR VOICE CONVERSION
MORIKAWA, Takashi
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Morise, Masanori
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
Moritani, Asuka
pg. 836
OD-A-TH3.7 - STARGAN-BASED EMOTIONAL VOICE CONVERSION FOR JAPANESE PHRASES
Motomura, Ryota
pg. 1794
LS-B-TH1.2 - A FLEXIBLE REVERSIBLE DATA HIDING METHOD IN COMPRESSIBLE ENCRYPTED IMAGES
Motonaka, Kimiko
pg. 259
OD-B-WE2.5 - STATISTICAL-MECHANICAL ANALYSIS OF ADAPTIVE VOLTERRA FILTER FOR TIME-VARYING UNKNOWN SYSTEM
Mrak, Marta
pg. 1422
OD-B-TH1.12 - SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING
Munakata, Hokuto
pg. 961
OD-A-FR2.5 - MULTIPLE-EMBEDDING SEPARATION NETWORKS: SOUND CLASS-SPECIFIC FEATURE EXTRACTION FOR UNIVERSAL SOUND SEPARATION
Murakami, Takahiro
pg. 205
LS-A-FR2.2 - LEARNING THE STATISTICAL MODEL OF THE NMF USING THE DEEP MULTIPLICATIVE UPDATE ALGORITHM WITH APPLICATIONS
MURAMATSU, Shogo
pg. 1528
LS-A-WE1.5 - ACCELERATION OF PDS–BASED HIGH–DIMENSIONAL SIGNAL RESTORATION
Muramatsu, Shogo
pg. 1499
LS-A-WE1.1 - AN EFFICIENT IMAGE PROCESSING AND MACHINE LEARNING BASED TECHNIQUE FOR SKIN LESION SEGMENTATION AND CLASSIFICATION
pg. 1687
LS-A-FR3.2 - PROXIMAL GRADIENT-BASED LOOP UNROLLING WITH INTERSCALE THRESHOLDING
Muraoka, Naoyuki
pg. 1653
LS-C-FR1.6 - AN ACOUSTIC COMMUNICATION TECHNIQUE BASED ON AUDIO DATA HIDING UTILIZING ARTIFICIAL FLOWING WATER SOUNDS
Mussabayeva, Ayana
pg. 410
LS-C-FR3.1 - EVENT-RELATED SPECTROGRAM REPRESENTATION OF EEG FOR CNN-BASED P300 SPELLER
Möller, Ralf
pg. 240
OD-B-WE2.2 - ORDERING PRINCIPAL COMPONENTS OF MULTIVARIATE FRACTIONAL BROWNIAN MOTION FOR SOLVING INVERSE PROBLEMS
N
N, Narendra
pg. 756
OD-A-TH2.11 - FILTERS KNOW HOW YOU FEEL: EXPLAINING INTERMEDIATE SPEECH EMOTION CLASSIFICATION REPRESENTATIONS
Nada, Kayo
pg. 1156
LS-A-TH2.4 - MULTITASK LEARNING OF ACOUSTIC SCENES AND EVENTS USING DYNAMIC WEIGHT ADAPTATION BASED ON MULTI-FOCAL LOSS
Nagamori, Shunta
pg. 406
LS-C-FR2.4 - SPEECH ENHANCEMENT NETWORK WITH UNSUPERVISED ATTENTION USING INVARIANT INFORMATION CLUSTERING
Nagase, Ryotaro
pg. 725
OD-A-TH2.6 - SPEECH EMOTION RECOGNITION WITH FUSION OF ACOUSTIC- AND LINGUISTIC-FEATURE-BASED DECISIONS
NAGATA, Noriko
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Nagata, Noriko
pg. 359
LS-B-TH3.1 - MODELING THE DYNAMICS OF OBSERVATIONAL BEHAVIORS BASE ON OBSERVERS’ PERSONALITY TRAITS USING HIDDEN MARKOV MODELS
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Nakadai, Kazuhiro
pg. 248
OD-B-WE2.3 - SPATIAL NORMALIZATION TO REDUCE POSITIONAL COMPLEXITY IN DIRECTION-AIDED SUPERVISED BINAURAL SOUND SOURCE SEPARATION
Nakai, Koki
pg. 1865
LS-B-FR3.3 - A STUDY OF PRIVACY PROTECTION OF PHOTOS TAKEN BY A WIDE-ANGLE SURVEILLANCE CAMERA
Nakamura, Fuga
pg. 1536
LS-A-WE1.6 - PRODUCT QUANTIZATION TO REDUCE ENTROPY OF LABELS FOR FAST AND ACCURATE IMAGE RETRIEVAL
Nakamura, Kazuaki
pg. 1800
LS-B-TH1.3 - MODEL INVERSION ATTACK AGAINST A FACE RECOGNITION SYSTEM IN A BLACK-BOX SETTING
Nakamura, Shun
pg. 1294
OD-B-FR2.6 - TOWARD ESTIMATION OF ABNORMAL BRAKE IN AUTONOMOUS VEHICLES FROM ELECTROENCEPHALOGRAM AND HEART RATE INTERVAL
Nakamura, Tomohiko
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
pg. 651
OD-A-TH1.8 - PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
Nakaoka, Sotaro
pg. 1210
LS-B-FR1.2 - REDUCING ALGORITHMIC DELAY USING LOW-OVERLAP WINDOW FOR ONLINE WAVE-U-NET
Nakashima, Hidetoshi
pg. 9
OD-B-WE1.2 - ADAPTIVE FEEDBACK CANCELLATION BASED ON PREDICTION ERROR METHOD USING INTERAURAL LEVEL DIFFERENCES IN HEARING DEVICE
Nakashima, Taishi
pg. 1016
OD-A-FR2.13 - SELF-ROTATION ANGLE ESTIMATION OF CIRCULAR MICROPHONE ARRAY BASED ON SOUND FIELD INTERPOLATION
NAKAYAMA, Masato
pg. 968
OD-A-FR2.6 - NARROW-EDGED BEAMFORMING USING MASKED PARAMETRIC ARRAY LOUDSPEAKERS
Nakayama, Masato
pg. 30
OD-B-WE1.5 - MOVING SOUND SOURCE TRACKING IN WIDE SPACE BY MULTIPLE MICROPHONE ARRAYS
pg. 1000
OD-A-FR2.11 - SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT
pg. 1008
OD-A-FR2.12 - VIRTUAL SOUND SOURCE RENDERING BASED ON DISTANCE CONTROL TO PENETRATE LISTENERS USING SURROUND PARAMETRIC-ARRAY AND ELECTRODYNAMIC LOUDSPEAKERS
Nakazawa, Kazushi
pg. 608
OD-A-WE2.15 - IMPROVEMENTS TO NON-INTRUSIVE INTELLIGIBILITY PREDICTION FOR REVERBERANT SPEECH
Nam, Juhan
pg. 890
OD-A-FR1.3 - INVESTIGATING TIME-FREQUENCY REPRESENTATIONS FOR AUDIO FEATURE EXTRACTION IN SINGING TECHNIQUE CLASSIFICATION
Narieda, Shusuke
pg. 1949
LS-A-FR1.1 - MEASUREMENT OF CO2 IN OUTDOOR ENVIRONMENTS USING LPWAN BASED WSN AND ITS TIME CORRELATION CHARACTERISTICS
Naroju, Meher Dinesh
pg. 511
OD-A-WE1.14 - CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION
Naruse, Hiroshi
pg. 1949
LS-A-FR1.1 - MEASUREMENT OF CO2 IN OUTDOOR ENVIRONMENTS USING LPWAN BASED WSN AND ITS TIME CORRELATION CHARACTERISTICS
Natsuaki, Ryo
pg. 174
LS-A-TH3.1 - GENERALIZATION CHARACTERISTICS OF COMPLEX-VALUED RESERVOIR COMPUTING FOR INTERFEROMETRIC SYNTHETIC APERTURE RADAR APPLICATIONS
Naumova, Valeriya
pg. 2072
LS-C-TH1.1 - GRAPH KERNEL RECURSIVE LEAST-SQUARES ALGORITHMS
Nayak, Shekhar
pg. 491
OD-A-WE1.11 - TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Naylor, Patrick
pg. 705
OD-A-TH2.3 - A STUDY OF SALIENT MODULATION DOMAIN FEATURES FOR SPEAKER IDENTIFICATION
Negi, Shubham
pg. 756
OD-A-TH2.11 - FILTERS KNOW HOW YOU FEEL: EXPLAINING INTERMEDIATE SPEECH EMOTION CLASSIFICATION REPRESENTATIONS
Nemati, Mahyar
pg. 1756
OD-B-TH2.9 - ANOMALY DETECTION FOR WIRELESS COMMUNICATION LINKS VIA DATA INTEGRITY MODELING
Neo, Vincent
pg. 705
OD-A-TH2.3 - A STUDY OF SALIENT MODULATION DOMAIN FEATURES FOR SPEAKER IDENTIFICATION
Ng, Koi Yee
pg. 1877
LS-B-FR3.5 - RELABEL, SCRAMBLE, SYNTHESIZE: A NOVEL COVERLESS STEGANOGRAPHY APPROACH VIA COLLAGE IMAGE
Ng, Kok Yew
pg. 1269
OD-B-FR2.2 - SEIZURE CLASSIFICATION OF EEG BASED ON WAVELET SIGNAL DENOISING USING A NOVEL CHANNEL SELECTION ALGORITHM
Ng, Xin-Lei
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Nguyen, Binh Thien
pg. 995
OD-A-FR2.10 - TWO-STAGE PHASE RECONSTRUCTION USING DNN AND VON MISES DISTRIBUTION-BASED MAXIMUM LIKELIHOOD
Nguyen, Duc-Chien
pg. 1149
LS-A-TH2.3 - SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION
Nguyen, Duy Hai
pg. 1180
LS-C-TH3.2 - DESIGN AND EVALUATION OF ACTIVE NOISE CONTROL ON MACHINERY NOISE
Nguyen, Hong-Son
pg. 1149
LS-A-TH2.3 - SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION
Nguyen, Manh Hung
pg. 1942
LS-D-WE2.6 - AN ENTROPY-BASED DDOS ATTACK DETECTION AND CLASSIFICATION WITH HIERARCHICAL TEMPORAL MEMORY
Nguyen, Phi-Le
pg. 1149
LS-A-TH2.3 - SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION
Nguyen, Thanh Binh
pg. 619
OD-A-TH1.2 - IMPROVEMENT OF SPATIAL AMBIGUITY IN MULTI-CHANNEL SPEECH SEPARATION USING CHANNEL ATTENTION
Nguyen, Tin Cong
pg. 170
LS-D-TH2.6 - FACE ANTI-SPOOFING USING MULTI-BRANCH CNN
Ni, Tianyi
pg. 926
OD-A-FR1.9 - SVM-BASED EVALUATION OF THAI TONE IMITATIONS BY THAI-NAÏVE MANDARIN AND VIETNAMESE SPEAKERS
Nishida, Naoki
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Nishigaki, Masakatsu
pg. 1775
LS-B-WE2.3 - EXAMINING OF SHALLOW AUTOENCODER ON BLACK-BOX ATTACK AGAINST FACE RECOGNITION
Nishikawa, Daichi
pg. 1703
LS-A-FR3.5 - MULTI-VIEW VARIATIONAL AUTOENCODER FOR ROBUST CLASSIFICATION AGAINST IRRELEVANT DATA
Nishikawa, Kiyoshi
pg. 392
LS-C-FR2.2 - ON IMPROVING THE ACCURACY OF OBJECT DETECTION FOR HIGH RESOLUTION IMAGES BASED ON SSD
Nishimura, Ryota
pg. 849
OD-A-TH3.9 - MULTI-SPEAKER TTS SYSTEM FOR LOW-RESOURCE LANGUAGE USING CROSS-LINGUAL TRANSFER LEARNING AND DATA AUGMENTATION
pg. 1077
OD-A-FR3.11 - END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
pg. 503
OD-A-WE1.13 - ADVANCED LANGUAGE MODEL FUSION METHOD FOR ENCODER-DECODER MODEL IN JAPANESE SPEECH RECOGNITION
Nishimura, Tazuko
pg. 821
OD-A-TH3.5 - ACOUSTIC SIMULATION OF BODY-CONDUCTED SPEECH AND ITS USE TO CONVERT ONE'S RECORDED VOICES TO ONE'S OWN VOICES
Nishino, Masahiro
pg. 1067
OD-A-FR3.9 - MULTIPLE DEEP LEARNING MODELS AND ARCHITECTURES WITH DIFFERENT NUMBERS OF STATES USED TO IMPROVE RETRIEVAL ACCURACY OF QUERY-BY-EXAMPLE
NISHIURA, Takanobu
pg. 968
OD-A-FR2.6 - NARROW-EDGED BEAMFORMING USING MASKED PARAMETRIC ARRAY LOUDSPEAKERS
Nishiura, Takanobu
pg. 1173
LS-C-TH3.1 - A STUDY ON OPTIMAL FILTER OF FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM BASED ON ANALYSIS OF FREQUENCY RESPONSE
pg. 989
OD-A-FR2.9 - FORMULATION OF MULTIDIMENSIONAL FREQUENCY CHARACTERISTICS OF SECOND-ORDER NONLINEAR IIR FILTER
pg. 995
OD-A-FR2.10 - TWO-STAGE PHASE RECONSTRUCTION USING DNN AND VON MISES DISTRIBUTION-BASED MAXIMUM LIKELIHOOD
pg. 1000
OD-A-FR2.11 - SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT
Nitta, Naoko
pg. 1800
LS-B-TH1.3 - MODEL INVERSION ATTACK AGAINST A FACE RECOGNITION SYSTEM IN A BLACK-BOX SETTING
Nitta, Tohru
pg. 187
LS-A-TH3.3 - LEARNING PROPERTIES OF FEEDFORWARD NEURAL NETWORKS USING DUAL NUMBERS
Niu, Haijun
pg. 546
OD-A-WE2.5 - MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL
Niu, Shu-Tong
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Nobukawa, Sou
pg. 1289
OD-B-FR2.5 - EFFECT OF VISUAL ATTENTION AND DRIVING EXPERIENCES ON THE EVENT-RELATED POTENTIAL P300 IN THE PERCEPTION OF TRAFFIC SCENES
Noguchi, Aoi
pg. 1338
LS-A-TH1.1 - REAL-TIME MONITORING SYSTEM TO EVALUATE EXERCISE LOAD, HYPOXIC LOAD, AND SAFETY IN A NORMOBARIC HYPOXIC ROOM
pg. 1348
LS-A-TH1.3 - PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE
Nohara, Kanji
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
Nomo Sudro, Protima
pg. 571
OD-A-WE2.9 - PROCESSING PHONEME SPECIFIC SEGMENTS FOR CLEFT LIP AND PALATE SPEECH ENHANCEMENT
pg. 484
OD-A-WE1.10 - SIGNIFICANCE OF DATA AUGMENTATION FOR IMPROVING CLEFT LIP AND PALATE SPEECH RECOGNITION
Nomura, Sadahiro
pg. 1343
LS-A-TH1.2 - PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE
Nozaki, Kazunori
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
O
OCHI, Keiko
pg. 428
LS-C-FR3.4 - PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM
Ogawa, Atsunori
pg. 1077
OD-A-FR3.11 - END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
pg. 503
OD-A-WE1.13 - ADVANCED LANGUAGE MODEL FUSION METHOD FOR ENCODER-DECODER MODEL IN JAPANESE SPEECH RECOGNITION
Ogawa, Tetsuji
pg. 477
OD-A-WE1.9 - AN INVESTIGATION OF ENHANCING CTC MODEL FOR TRIGGERED ATTENTION-BASED STREAMING ASR
pg. 603
OD-A-WE2.14 - COMPARATIVE STUDY ON DNN-BASED MINIMUM VARIANCE BEAMFORMING ROBUST TO SMALL MOVEMENTS OF SOUND SOURCES
Oh, Youngjin
pg. 151
LS-D-TH2.3 - RESIDUAL DILATED U-NET WITH SPATIALLY ADAPTIVE NORMALIZATION FOR THE RESTORATION OF UNDER DISPLAY CAMERA IMAGES
Ohki, Tetsushi
pg. 1775
LS-B-WE2.3 - EXAMINING OF SHALLOW AUTOENCODER ON BLACK-BOX ATTACK AGAINST FACE RECOGNITION
Ohno, Shuichi
pg. 295
OD-B-WE2.11 - FEEDBACK QUANTIZATION AND BIT ALLOCATION FOR NETWORKED CONTROL SYSTEMS WITH RATE LIMITED CHANNELS
Ohta, Kengo
pg. 849
OD-A-TH3.9 - MULTI-SPEAKER TTS SYSTEM FOR LOW-RESOURCE LANGUAGE USING CROSS-LINGUAL TRANSFER LEARNING AND DATA AUGMENTATION
pg. 1077
OD-A-FR3.11 - END-TO-END SPONTANEOUS SPEECH RECOGNITION USING HESITATION LABELING
pg. 503
OD-A-WE1.13 - ADVANCED LANGUAGE MODEL FUSION METHOD FOR ENCODER-DECODER MODEL IN JAPANESE SPEECH RECOGNITION
Ohta, Mai
pg. 1958
LS-A-FR1.3 - INTRA-SYSTEM INTERFERENCE AVOIDANCE FOR PACKET-LEVEL INDEX MODULATION IN INTERNET OF THINGS
Oikawa, Yasuhiro
pg. 254
OD-B-WE2.4 - PHASE-AWARE AUDIO INPAINTING BASED ON INSTANTANEOUS FREQUENCY
OKANO, Ai
OD-B-FR3.2 - PHASE CONTROL OF PARAMETRIC AEEAY LOUNDSPEAKER BY OPTIMIZING THE SIDEBAND WEIGHTING FUNCTIONS
Okawa, Yuto
pg. 187
LS-A-TH3.3 - LEARNING PROPERTIES OF FEEDFORWARD NEURAL NETWORKS USING DUAL NUMBERS
Okhassov, Timur
pg. 416
LS-C-FR3.2 - COST-EFFECTIVE PROPORTIONATE AFFINE PROJECTION ALGORITHM WITH VARIABLE PARAMETERS FOR ACOUSTIC FEEDBACK CANCELLATION
Okuda, Ippei
pg. 80
LS-A-WE2.4 - HISUI: AN IMAGE AND VIDEO PROCESSING FRAMEWORK WITH AUTO-OPTIMIZER
Okuda, Masahiro
OD-B-FR3.3 - STUDY ON GENERALIZATION PERFORMANCE OF DEEP IMAGE RESTORATION WITH UNFOLDING ON SMALL DATASETS
pg. 1395
OD-B-TH1.7 - DENOISING HYPERSPECTRAL IMAGES USING INTERBAND CORRELATION
Olalere, Feyisayo
pg. 1563
LS-C-WE2.5 - VIDEO-BASED SPORTS ACTIVITY RECOGNITION FOR CHILDREN
Ong, Simying
pg. 1877
LS-B-FR3.5 - RELABEL, SCRAMBLE, SYNTHESIZE: A NOVEL COVERLESS STEGANOGRAPHY APPROACH VIA COLLAGE IMAGE
Ong, Zhen-Ting
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Onishi, Kotaro
pg. 808
OD-A-TH3.3 - CONDITIONAL DEEP HIERARCHICAL VARIATIONAL AUTOENCODER FOR VOICE CONVERSION
Ono, Nobutaka
pg. 1215
LS-B-FR1.3 - FRAMEWISE FINITE IMPULSE RESPONSE FILTERING BASED ON TIME-FREQUENCY MASK FOR LOW-LATENCY SPEECH ENHANCEMENT
pg. 428
LS-C-FR3.4 - PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM
pg. 1161
LS-A-TH2.5 - INVESTIGATION ON SPATIAL AND FREQUENCY-BASED FEATURES FOR ASYNCHRONOUS ACOUSTIC SCENE ANALYSIS
pg. 1167
LS-A-TH2.6 - ANALYSIS ON ROLES OF DNNS IN END-TO-END ACOUSTIC SCENE ANALYSIS FRAMEWORK WITH DISTRIBUTED SOUND-TO-LIGHT CONVERSION DEVICES
pg. 585
OD-A-WE2.11 - CAUSAL DISTORTIONLESS RESPONSE BEAMFORMING BY ALTERNATING DIRECTION METHOD OF MULTIPLIERS
pg. 1016
OD-A-FR2.13 - SELF-ROTATION ANGLE ESTIMATION OF CIRCULAR MICROPHONE ARRAY BASED ON SOUND FIELD INTERPOLATION
Ono, Shunsuke
pg. 1687
LS-A-FR3.2 - PROXIMAL GRADIENT-BASED LOOP UNROLLING WITH INTERSCALE THRESHOLDING
pg. 330
LS-B-TH2.2 - RECOVERY OF TIME SERIES OF GRAPH SIGNALS OVER DYNAMIC TOPOLOGY
Ooi, Kenneth
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Ortega, Antonio
pg. 351
LS-B-TH2.5 - CHANNEL-WISE EARLY STOPPING WITHOUT A VALIDATION SET VIA NNK POLYTOPE INTERPOLATION
Ota, Koshi
pg. 1348
LS-A-TH1.3 - PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE
Owada, Keiho
pg. 428
LS-C-FR3.4 - PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM
Ozaki, Ryo
pg. 836
OD-A-TH3.7 - STARGAN-BASED EMOTIONAL VOICE CONVERSION FOR JAPANESE PHRASES
Ozamoto, Kohei
pg. 624
OD-A-TH1.3 - NOISE-TOLERANT TIME-DOMAIN SPEECH SEPARATION WITH NOISE BASES
Ozawa, Keisuke
pg. 1367
OD-B-TH1.2 - SNAPSHOT MULTISPECTRAL IMAGE COMPLETION AND UNMIXING WITH TOTAL VARIATION REGULARIZATION ON ABUNDANCE MAPS
P
Pabbisetty, Gurusanthosh
pg. 1748
OD-B-TH2.8 - RECEIVED SIGNAL POWER BASED SENSOR ZONE ESTIMATION WITH MAXIMUM LIKELIHOOD APPROACH
Pan, Jen-Yi
pg. 1912
LS-D-WE2.1 - AN ADAPTIVE RANK SELECTION METHOD IN 3GPP 5G NR SYSTEMS
pg. 1889
LS-B-WE1.2 - A THRESHOLD-BASED SCHEDULING AND POWER CONTROL DESIGN ON IMT-2020 EVALUATION
pg. 1917
LS-D-WE2.2 - A LOW COMPLEXITY PMI SELECTION SCHEME FOR 3GPP 5G NR FDD SYSTEMS
Pang, LieLin
pg. 1416
OD-B-TH1.11 - MOVING OBJECT DETECTION IN HEVC VIDEO
Pang, Yik Siang
pg. 1872
LS-B-FR3.4 - A PILOT EXPLORATION OF INDUSTRIAL VIDEO SCENE DATA EMBEDDING USING REAL-TIME MV-HEVC
Pao, Wei-Chen
pg. 1912
LS-D-WE2.1 - AN ADAPTIVE RANK SELECTION METHOD IN 3GPP 5G NR SYSTEMS
pg. 1889
LS-B-WE1.2 - A THRESHOLD-BASED SCHEDULING AND POWER CONTROL DESIGN ON IMT-2020 EVALUATION
pg. 1917
LS-D-WE2.2 - A LOW COMPLEXITY PMI SELECTION SCHEME FOR 3GPP 5G NR FDD SYSTEMS
Park, Gu Yong
pg. 151
LS-D-TH2.3 - RESIDUAL DILATED U-NET WITH SPATIALLY ADAPTIVE NORMALIZATION FOR THE RESTORATION OF UNDER DISPLAY CAMERA IMAGES
Park, Hyunkook
pg. 146
LS-D-TH2.2 - UNPAIRED IMAGE DEMOIRÉING BASED ON CYCLIC MOIRÉ LEARNING
Park, Jihong
pg. 1756
OD-B-TH2.9 - ANOMALY DETECTION FOR WIRELESS COMMUNICATION LINKS VIA DATA INTEGRITY MODELING
Park, Junheum
pg. 164
LS-D-TH2.5 - FACIAL VIDEO FRAME INTERPOLATION COMBINING SYMMETRIC AND ASYMMETRIC MOTIONS
Park, Min-Je
pg. 1682
LS-A-FR3.1 - MULTI-BAND NIR COLORIZATION USING STRUCTURE-AWARE NETWORK
Park, Yeseung
pg. 1615
LS-D-TH3.6 - ENVIRONMENT ADAPTIVE 3D POSE ESTIMATION MODEL AND LEARNING STRATEGY
PATHROSE, NIMMY
pg. 1987
OD-B-TH3.3 - CONVOLUTIONAL AUTOENCODER BASED DEEP LEARNING MODEL FOR IDENTIFICATION OF RED PALM WEEVIL SIGNALS
Patil, Ankur T.
pg. 775
OD-A-TH2.14 - DEEP CONVOLUTIONAL NEURAL NETWORK FOR VOICE LIVENESS DETECTION
Patil, Hemant
pg. 491
OD-A-WE1.11 - TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Patil, Hemant A.
pg. 775
OD-A-TH2.14 - DEEP CONVOLUTIONAL NEURAL NETWORK FOR VOICE LIVENESS DETECTION
Peksi, Santi
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Peng, Anjie
pg. 1716
OD-B-TH2.2 - JOINT ESTIMATION OF IMAGE ROTATION ANGLE AND SCALING FACTOR
Peng, Yizhou
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
Peng, Yu-Huai
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
pg. 719
OD-A-TH2.5 - GENERATION OF SPEAKER REPRESENTATIONS USING HETEROGENEOUS TRAINING BATCH ASSEMBLY
Pham, Bach-Tung
pg. 170
LS-D-TH2.6 - FACE ANTI-SPOOFING USING MULTI-BRANCH CNN
Pham, Van Tung
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
pg. 497
OD-A-WE1.12 - MULTITASK-BASED JOINT LEARNING APPROACH TO ROBUST ASR FOR RADIO COMMUNICATION SPEECH
pg. 786
OD-A-TH2.16 - END-TO-END SPEAKER AGE AND HEIGHT ESTIMATION USING ATTENTION MECHANISM AND TRIPLET LOSS
Phan, Huy
pg. 1149
LS-A-TH2.3 - SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION
Pineda, Riza Rae
pg. 1357
LS-A-TH1.5 - EVALUATION OF THE EFFECT OF TRANSFER LEARNING TO MULTI-INSTANCE DETECTION OF MONKEYS
Poppe, Ronald
pg. 1563
LS-C-WE2.5 - VIDEO-BASED SPORTS ACTIVITY RECOGNITION FOR CHILDREN
Prasanna, S R Mahadeva
pg. 571
OD-A-WE2.9 - PROCESSING PHONEME SPECIFIC SEGMENTS FOR CLEFT LIP AND PALATE SPEECH ENHANCEMENT
pg. 484
OD-A-WE1.10 - SIGNIFICANCE OF DATA AUGMENTATION FOR IMPROVING CLEFT LIP AND PALATE SPEECH RECOGNITION
Priyambodo, Tri Kuntoro
pg. 1611
LS-D-TH3.5 - PARTIAL FINGERPRINT ON COMBINED EVALUATION USING DEEP LEARNING AND FEATURE DESCRIPTOR
Pu, Yen-Yu
pg. 1483
OD-B-FR1.10 - REAL-TIME EDGE ATTENTION-BASED LEARNING FOR LOW-LIGHT ONE-STAGE OBJECT DETECTION
Q
Q. K. Duong, Ngoc
pg. 1149
LS-A-TH2.3 - SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION
Qian, Kai
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
Qian, Sichong
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Qian, Zhaopeng
pg. 546
OD-A-WE2.5 - MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL
Qin, Jiayi
pg. 1511
LS-A-WE1.3 - MULTI-RESIDUAL FEATURE FUSION NETWORK FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
Qiu, Yicheng
pg. 392
LS-C-FR2.2 - ON IMPROVING THE ACCURACY OF OBJECT DETECTION FOR HIGH RESOLUTION IMAGES BASED ON SSD
Qu, Zhenhua
pg. 1858
LS-B-FR3.2 - DERIVING A COMPACT ANALYTICAL MODEL FOR CAMERA RESPONSE FUNCTIONS WITH APPLICATION TO CHARTLESS RADIOMETRIC CALIBRATION
Qureshi, Amna
pg. 1786
LS-B-TH1.1 - DETECTING DEEPFAKE VIDEOS USING DIGITAL WATERMARKING
R
Rahardja, Susanto
pg. 635
OD-A-TH1.5 - A COMPARISON OF HANDCRAFTED, PARAMETERIZED, AND LEARNABLE FEATURES FOR SPEECH SEPARATION
Rao, Arjun Ashok
pg. 337
LS-B-TH2.3 - AN EMPIRICAL STUDY ON COMPRESSED DECENTRALIZED STOCHASTIC GRADIENT ALGORITHMS WITH OVERPARAMETERIZED MODELS
Raswa, Farchan Hakim
pg. 1602
LS-D-TH3.3 - A FUSION METHODOLOGY OF AKAZE AND NEURAL NETWORK FOR FINGERPRINT RECOGNITION
pg. 1611
LS-D-TH3.5 - PARTIAL FINGERPRINT ON COMBINED EVALUATION USING DEEP LEARNING AND FEATURE DESCRIPTOR
Rath, Shakti P.
pg. 491
OD-A-WE1.11 - TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Retta, Ephrem Afele
pg. 1519
LS-A-WE1.4 - AUTOMOTIVE ENGINE CYLINDER HEAD CRACK DETECTION: CANNY EDGE DETECTION WITH MORPHOLOGICAL DILATION
Rhee, Hochang
pg. 158
LS-D-TH2.4 - LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS
pg. 2049
OD-B-TH3.13 - NETWORK INTRUSION DETECTION WITH IMPROVED FEATURE REPRESENTATION
Ringhofer, Monamie
pg. 1353
LS-A-TH1.4 - MATHEMATICAL MODEL OF A HORSE AND THE RIDER DURING A JUMP
Ritz, Christian
pg. 1202
LS-B-FR1.1 - DEVELOPMENT OF A SYNTHETIC DATABASE FOR COMPACT NEURAL NETWORK CLASSIFICATION OF ACOUSTIC SCENES IN DEMENTIA CARE ENVIRONMENTS
pg. 974
OD-A-FR2.7 - COPRIME MICROPHONE ARRAYS FOR ESTIMATING SPEECH DIRECTION OF ARRIVAL USING DEEP LEARNING
Ruiz-Hidalgo, Javier
pg. 351
LS-B-TH2.5 - CHANNEL-WISE EARLY STOPPING WITHOUT A VALIDATION SET VIA NNK POLYTOPE INTERPOLATION
S
S. R., PARVATHY
pg. 1987
OD-B-TH3.3 - CONVOLUTIONAL AUTOENCODER BASED DEEP LEARNING MODEL FOR IDENTIFICATION OF RED PALM WEEVIL SIGNALS
Sagayama, Shigeki
pg. 428
LS-C-FR3.4 - PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM
Saidnassim, Nurbek
pg. 423
LS-C-FR3.3 - SELF-SUPERVISED VISUAL TRANSFORMERS FOR BREAST CANCER DIAGNOSIS
Saijo, Kohei
pg. 603
OD-A-WE2.14 - COMPARATIVE STUDY ON DNN-BASED MINIMUM VARIANCE BEAMFORMING ROBUST TO SMALL MOVEMENTS OF SOUND SOURCES
Saiko, Masahiro
pg. 1781
LS-B-WE2.4 - COMPARATIVE STUDY OF FEATURE EXTRACTION METHOD FOR EMOTIONAL CLASSIFICATION BY MICRO-EXPRESSIONS
Saito, Daisuke
pg. 821
OD-A-TH3.5 - ACOUSTIC SIMULATION OF BODY-CONDUCTED SPEECH AND ITS USE TO CONVERT ONE'S RECORDED VOICES TO ONE'S OWN VOICES
Saito, Yuki
pg. 794
OD-A-TH3.1 - EMOTION-CONTROLLABLE SPEECH SYNTHESIS USING EMOTION SOFT LABELS AND FINE-GRAINED PROSODY FACTORS
Sakai, Shinsuke
pg. 465
OD-A-WE1.7 - AN END-TO-END MODEL FROM SPEECH TO CLEAN TRANSCRIPT FOR PARLIAMENTARY MEETINGS
Sakakibara, Ken-Ichi
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
Sakamoto, Shoki
pg. 836
OD-A-TH3.7 - STARGAN-BASED EMOTIONAL VOICE CONVERSION FOR JAPANESE PHRASES
Sakaue, Fumihiko
pg. 288
OD-B-WE2.10 - DENSE DEPTHMAP PREDICTION FROM ULTRASONIC SENSORS
Salah, Albert Ali
pg. 1563
LS-C-WE2.5 - VIDEO-BASED SPORTS ACTIVITY RECOGNITION FOR CHILDREN
Salakhutdinov, Ruslan
pg. 841
OD-A-TH3.8 - UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS
Saruwatari, Hiroshi
pg. 794
OD-A-TH3.1 - EMOTION-CONTROLLABLE SPEECH SYNTHESIS USING EMOTION SOFT LABELS AND FINE-GRAINED PROSODY FACTORS
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
pg. 651
OD-A-TH1.8 - PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
Sasaki, Takayuki
pg. 1
OD-B-WE1.1 - FAST-PARALLEL SINGULAR VALUE THRESHOLDING FOR MANY SMALL MATRICES BASED ON GEOMETRIC FEATURE OF SINGULAR VALUES
Sasou, Akira
pg. 731
OD-A-TH2.7 - AUTOMATIC NATURALNESS RECOGNITION FROM ACTED SPEECH USING NEURAL NETWORKS
Sato, Jun
pg. 288
OD-B-WE2.10 - DENSE DEPTHMAP PREDICTION FROM ULTRASONIC SENSORS
Scheibler, Robin
pg. 1139
LS-A-TH2.1 - COMPARISON OF LOW COMPLEXITY SELF-ATTENTION MECHANISMS FOR ACOUSTIC EVENT DETECTION
pg. 640
OD-A-TH1.6 - OVER-DETERMINED SEMI-BLIND SPEECH SOURCE SEPARATION
Segawa, Hanako
pg. 597
OD-A-WE2.13 - EXTENSION OF VIRTUAL MICROPHONE TECHNIQUE TO MULTIPLE REAL MICROPHONES AND INVESTIGATION OF THE IMPACT OF PHASE AND AMPLITUDE INTERPOLATION ON SPEECH ENHANCEMENT
Sekiguchi, Erika
pg. 1294
OD-B-FR2.6 - TOWARD ESTIMATION OF ABNORMAL BRAKE IN AUTONOMOUS VEHICLES FROM ELECTROENCEPHALOGRAM AND HEART RATE INTERVAL
Seshathiri, Sankarasrinivasan
pg. 1670
LS-D-FR2.3 - DIGITAL MULTITONE IMAGE RECONSTRUCTION USING DEEP GENERATIVE ADVERSARIAL NETS
pg. 1580
LS-C-TH2.3 - DIGITAL HALFTONE CLASSIFICATION USING SIMPLIFIED CNN AND STOCHASTIC STATISTICS
Shafi, Rabia
pg. 1458
OD-B-FR1.6 - HEAD MOVEMENT PREDICTION USING FCNN
Shaik, M. Ali Basha
pg. 491
OD-A-WE1.11 - TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Shao, Qijie
pg. 672
OD-A-TH1.11 - TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING
Shekkizhar, Sarath
pg. 351
LS-B-TH2.5 - CHANNEL-WISE EARLY STOPPING WITHOUT A VALIDATION SET VIA NNK POLYTOPE INTERPOLATION
Shen, Peng
pg. 769
OD-A-TH2.13 - SIAMESE NEURAL NETWORK WITH JOINT BAYESIAN MODEL STRUCTURE FOR SPEAKER VERIFICATION
Shen, Xubang
pg. 106
LS-D-TH1.2 - AN IMPLEMENTATION METHOD OF HEVC DATAFLOW GRAPH BASED ON RECONFIGURABLE PROCESSER
Sheng, Zhi-chao
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Sheu, Ji-Tian
pg. 1251
LS-D-FR3.4 - PREDICTING PATIENT'S CHOICES OF HOSPITAL LEVELS USING DEEP LEARNING AND REPRESENTATION IMPROVEMENTS
Shi, Chuang
pg. 1192
LS-C-TH3.4 - A TRUE DIGITAL FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM WITH NO ANALOG-TO-DIGITAL AND DIGITAL-TO-ANALOG CONVERTERS
Shi, Daimin
pg. 1317
LS-C-WE1.3 - DEPRESSION SEVERITY LEVEL CLASSIFICATION USING MULTITASK LEARNING OF GENDER RECOGNITION
Shi, Hao
pg. 438
OD-A-WE1.2 - SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION
Shi, Jiatong
pg. 841
OD-A-TH3.8 - UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS
Shi, Yihui
pg. 2066
OD-B-TH3.16 - IMAGE CAPTIONING BASED ON AN IMPROVED TRANSFORMER WITH IOU POSITION ENCODING
SHIBUTA, Kazuo
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Shim, Jae Hoon
pg. 158
LS-D-TH2.4 - LOSSLESS IMAGE COMPRESSION BASED ON IMAGE DECOMPOSITION AND PROGRESSIVE PREDICTION USING CONVOLUTIONAL NEURAL NETWORKS
pg. 2049
OD-B-TH3.13 - NETWORK INTRUSION DETECTION WITH IMPROVED FEATURE REPRESENTATION
Shimada, Masaki
pg. 1357
LS-A-TH1.5 - EVALUATION OF THE EFFECT OF TRANSFER LEARNING TO MULTI-INSTANCE DETECTION OF MONKEYS
Shimada, Naoto
pg. 1000
OD-A-FR2.11 - SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT
Shimamura, Tetsuya
pg. 406
LS-C-FR2.4 - SPEECH ENHANCEMENT NETWORK WITH UNSUPERVISED ATTENTION USING INVARIANT INFORMATION CLUSTERING
Shimomura, Soshi
pg. 193
LS-A-TH3.4 - ADAPTIVE SUBSURFACE IMAGING BASED ON PEAK PHASE-PROFILE: THE SIGNIFICANCE IN SEPARATION OF SCATTERING PHASE FROM PROPAGATION PHASE
Shinoda, Koichi
pg. 624
OD-A-TH1.3 - NOISE-TOLERANT TIME-DOMAIN SPEECH SEPARATION WITH NOISE BASES
Shinozaki, Takahiro
pg. 859
OD-A-TH3.11 - LOW-RESOURCE MANDARIN PROSODIC STRUCTURE PREDICTION USING SELF-TRAINING
pg. 1082
OD-A-FR3.12 - UNSUPERVISED SPOKEN TERM DISCOVERY USING WAV2VEC 2.0
Shiota, Sayaka
pg. 1161
LS-A-TH2.5 - INVESTIGATION ON SPATIAL AND FREQUENCY-BASED FEATURES FOR ASYNCHRONOUS ACOUSTIC SCENE ANALYSIS
Shiroma, Yuki
pg. 1161
LS-A-TH2.5 - INVESTIGATION ON SPATIAL AND FREQUENCY-BASED FEATURES FOR ASYNCHRONOUS ACOUSTIC SCENE ANALYSIS
Shouno, Osamu
pg. 1363
LS-A-TH1.6 - SEMI-SUPERVISED ESTIMATION OF DRIVING BEHAVIORS USING ROBUST TIME-CONTRASTIVE LEARNING
Shuai, Wan
pg. 1458
OD-B-FR1.6 - HEAD MOVEMENT PREDICTION USING FCNN
Sim, Jae-Young
pg. 1607
LS-D-TH3.4 - CONTEXT-BASED MATCHING REFINEMENT FOR PERSON SEARCH
Sinha, Pawan
pg. 1281
OD-B-FR2.4 - UNDERSTANDING STRUCTURE INDUCED FUNCTIONAL CONNECTIVITY IN BRAIN USING EEG
Sinha, Rohit
pg. 571
OD-A-WE2.9 - PROCESSING PHONEME SPECIFIC SEGMENTS FOR CLEFT LIP AND PALATE SPEECH ENHANCEMENT
pg. 484
OD-A-WE1.10 - SIGNIFICANCE OF DATA AUGMENTATION FOR IMPROVING CLEFT LIP AND PALATE SPEECH RECOGNITION
Siu, Wan-Chi
pg. 1450
OD-B-FR1.5 - LEARN TO SKETCH: A FAST APPROACH FOR UNIVERSAL PHOTO SKETCH
Soky, Kak
pg. 433
OD-A-WE1.1 - ON THE USE OF SPEAKER INFORMATION FOR AUTOMATIC SPEECH RECOGNITION IN SPEAKER-IMBALANCED CORPORA
Sole-Casals, Jordi
pg. 1323
LS-C-WE1.4 - MULTI-FEATURE FUSION FOR EPILEPTIC FOCUS LOCALIZATION BASED ON TENSOR REPRESENTATION
Song, Hyewon
pg. 1428
OD-B-FR1.1 - HIGH-QUALITY SINGLE IMAGE 3D FACIAL SHAPE RECONSTRUCTION VIA ROBUST ALBEDO ESTIMATION
Song, Liming
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Song, Yicheng
pg. 200
LS-A-TH3.5 - DISCUSSION ON THE ORIGIN OF THE STRENGTH OF PHASOR QUATERNION SELF-ORGANIZING MAP
Stankovic, Lina
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
Stankovic, Vladimir
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
Su, Borching
pg. 1245
LS-D-FR3.3 - MIMO SPEECH COMPRESSION AND ENHANCEMENT BASED ON CONVOLUTIONAL DENOISING AUTOENCODER
Su, Dan
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Su, Li
pg. 17
OD-B-WE1.3 - DUAL-CHANNEL DRUM SEPARATION FOR LOW-COST DRUM RECORDING USING NON-NEGATIVE MATRIX FACTORIZATION
Su, Po-Chyi
pg. 1494
OD-B-FR1.12 - STRATEGIES OF TRADITIONAL CHINESE CHARACTER RECOGNITION IN STREETSCAPE BASED ON DEEP LEARNING NETWORKS
Su, Yu-Hui
pg. 17
OD-B-WE1.3 - DUAL-CHANNEL DRUM SEPARATION FOR LOW-COST DRUM RECORDING USING NON-NEGATIVE MATRIX FACTORIZATION
Suematsu, Noriharu
pg. 1953
LS-A-FR1.2 - FUNDAMENTAL INVESTIGATION OF BACKOFF CONTROL METHOD FOR FAIR COMMUNICATION OPPORTUNITY OF MMW WBAN IN OVERCROWDED ENVIRONMENT
SUGIMOTO, Masashi
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Sugimoto, Ryota
pg. 1969
LS-A-FR1.5 - HIGHLY EFFICIENT DATA GATHERING WITH TENDENCY PREDICTION BASED ON POSITION INFORMATION OF EVENT IN WIRELESS SENSOR NETWORKS
Sugiura, Yosuke
pg. 406
LS-C-FR2.4 - SPEECH ENHANCEMENT NETWORK WITH UNSUPERVISED ATTENTION USING INVARIANT INFORMATION CLUSTERING
Sugiyama, Chihiro
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
Sumiyoshi, Shinichi
pg. 1367
OD-B-TH1.2 - SNAPSHOT MULTISPECTRAL IMAGE COMPLETION AND UNMIXING WITH TOTAL VARIATION REGULARIZATION ON ABUNDANCE MAPS
Sun, Haoran
pg. 780
OD-A-TH2.15 - HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION
SUN, Jingtao
pg. 121
LS-D-TH1.4 - AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR
Sun, Lei
pg. 667
OD-A-TH1.10 - A DEEP ANALYSIS OF SPEECH SEPARATION GUIDED DIARIZATION UNDER REALISTIC CONDITIONS
Sun, Songlin
pg. 1585
LS-C-TH2.4 - IMPLEMENTATION OF AVS3 MULTICAST SYSTEM BASED ON EMBMS
Sung, Yao-Ting
pg. 1049
OD-A-FR3.6 - IMPROVING END-TO-END MODELING FOR MISPRONUNCIATION DETECTION WITH EFFECTIVE AUGMENTATION MECHANISMS
Suyama, Kenji
pg. 212
LS-A-FR2.3 - AN IMPROVED PARAMETER FREE GENETIC ALGORITHM FOR CSD-FIR FILTER DESIGN
pg. 218
LS-A-FR2.4 - A PROPOSAL TOWARD STANDARDIZATION OF DESIGN EXAMPLES FOR IIR FILTER DESIGN METHODS
Suzuki, Michiyasu
pg. 1343
LS-A-TH1.2 - PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE
Syu, Sin-Wun
pg. 1494
OD-B-FR1.12 - STRATEGIES OF TRADITIONAL CHINESE CHARACTER RECOGNITION IN STREETSCAPE BASED ON DEEP LEARNING NETWORKS
T
Tabei, Gen
pg. 1899
LS-B-WE1.4 - MULTI-ARMED BANDIT-BASED ROUTING METHOD FOR IN-NETWORK CACHING
Tachibana, Takuji
pg. 1895
LS-B-WE1.3 - IMPLEMENTATION OF A FAST FAILURE RECOVERY METHOD CONSIDERING LOAD DISTRIBUTION FOR NETWORK SLICING
Tachioka, Yuki
pg. 1367
OD-B-TH1.2 - SNAPSHOT MULTISPECTRAL IMAGE COMPLETION AND UNMIXING WITH TOTAL VARIATION REGULARIZATION ON ABUNDANCE MAPS
Tachioka, Yuuki
pg. 694
OD-A-TH2.1 - INTEGRATION OF ANNOTATOR-WISE ESTIMATIONS FOR EMOTION RECOGNITION BY USING GROUP SOFTMAX
Tai, Tzu-Chiang
pg. 170
LS-D-TH2.6 - FACE ANTI-SPOOFING USING MULTI-BRANCH CNN
Takagi, Hiroyasu
pg. 63
LS-A-WE2.1 - DOMAIN SPECIFIC DESCRIPTION IN HALIDE FOR RANDOMIZED IMAGE CONVOLUTION
Takahashi, Toru
pg. 30
OD-B-WE1.5 - MOVING SOUND SOURCE TRACKING IN WIDE SPACE BY MULTIPLE MICROPHONE ARRAYS
pg. 1008
OD-A-FR2.12 - VIRTUAL SOUND SOURCE RENDERING BASED ON DISTANCE CONTROL TO PENETRATE LISTENERS USING SURROUND PARAMETRIC-ARRAY AND ELECTRODYNAMIC LOUDSPEAKERS
Takahashi, Yu
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
pg. 651
OD-A-TH1.8 - PRIOR DISTRIBUTION DESIGN FOR MUSIC BLEEDING-SOUND REDUCTION BASED ON NONNEGATIVE MATRIX FACTORIZATION
Takamichi, Shinnosuke
pg. 794
OD-A-TH3.1 - EMOTION-CONTROLLABLE SPEECH SYNTHESIS USING EMOTION SOFT LABELS AND FINE-GRAINED PROSODY FACTORS
Takamune, Norihiro
pg. 1226
LS-B-FR1.5 - MULTICHANNEL AUDIO SOURCE SEPARATION WITH INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS USING PRODUCT OF SOURCE MODELS
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
Takanashi, Mizuki
pg. 1400
OD-B-TH1.8 - A CONSENSUS FRAMEWORK FOR CONVOLUTIONAL DICTIONARY LEARNING BASED ON L1 NORM ERROR
Takano, Hironobu
pg. 1781
LS-B-WE2.4 - COMPARATIVE STUDY OF FEATURE EXTRACTION METHOD FOR EMOTIONAL CLASSIFICATION BY MICRO-EXPRESSIONS
Takaoka, Masahiro
pg. 80
LS-A-WE2.4 - HISUI: AN IMAGE AND VIDEO PROCESSING FRAMEWORK WITH AUTO-OPTIMIZER
Takashima, Ryoichi
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
Takata, Shogo
pg. 1546
LS-C-WE2.2 - INFANT POSTURE ASSESSMENT BASED ON ROTATIONAL KEYPOINT DETECTION
Takeda, Ryu
pg. 248
OD-B-WE2.3 - SPATIAL NORMALIZATION TO REDUCE POSITIONAL COMPLEXITY IN DIRECTION-AIDED SUPERVISED BINAURAL SOUND SOURCE SEPARATION
pg. 961
OD-A-FR2.5 - MULTIPLE-EMBEDDING SEPARATION NETWORKS: SOUND CLASS-SPECIFIC FEATURE EXTRACTION FOR UNIVERSAL SOUND SEPARATION
Takehisa, Shuhei
pg. 1395
OD-B-TH1.7 - DENOISING HYPERSPECTRAL IMAGES USING INTERBAND CORRELATION
Takeshita, Daichi
pg. 1808
LS-B-TH1.4 - FEATURE EXTRACTION SUITABLE FOR DOUBLE JPEG COMPRESSION ANALYSIS BASED ON STATISTICAL BIAS OBSERVATION OF DCT COEFFICIENTS
TAKEZAWA, Tomomi
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Takiguchi, Keisuke
pg. 808
OD-A-TH3.3 - CONDITIONAL DEEP HIERARCHICAL VARIATIONAL AUTOENCODER FOR VOICE CONVERSION
Takiguchi, Tetsuya
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
Takizawa, Masaaki
pg. 2077
LS-C-TH1.2 - A HILBERTIAN PROJECTION APPROACH WITH DICTIONARY DIVIDING STRATEGY: ACCELERATING NONLINEAR ESTIMATION ALGORITHM WITH MULTISCALE GAUSSIANS
Takyu, Osamu
pg. 1969
LS-A-FR1.5 - HIGHLY EFFICIENT DATA GATHERING WITH TENDENCY PREDICTION BASED ON POSITION INFORMATION OF EVENT IN WIRELESS SENSOR NETWORKS
Tamai, Yuichiro
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Tamura, Satoshi
pg. 1092
OD-A-FR3.14 - MULTI-VIEW CONVOLUTION FOR LIPREADING
Tan, Xu
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 1127
LS-D-WE1.4 - MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION
Tanabe, Nari
pg. 275
OD-B-WE2.8 - NONLINEAR SVM-TYPE AUTOMATIC DICISION ALGORITHM IN NOISY ENVIRONMENT FOR HAMMERING TEST SYSTEM
Tanaka, Nobukazu
pg. 471
OD-A-WE1.8 - DATA AUGMENTATION BASED ON FREQUENCY WARPING FOR RECOGNITION OF CLEFT PALATE SPEECH
Tanaka, Tomoro
pg. 254
OD-B-WE2.4 - PHASE-AWARE AUDIO INPAINTING BASED ON INSTANTANEOUS FREQUENCY
Tanaka, Toshihisa
pg. 1546
LS-C-WE2.2 - INFANT POSTURE ASSESSMENT BASED ON ROTATIONAL KEYPOINT DETECTION
pg. 400
LS-C-FR2.3 - DETECTION OF NOTE ONSETS FROM EEG WHILE LISTENING TO MUSIC
pg. 1323
LS-C-WE1.4 - MULTI-FEATURE FUSION FOR EPILEPTIC FOCUS LOCALIZATION BASED ON TENSOR REPRESENTATION
pg. 1294
OD-B-FR2.6 - TOWARD ESTIMATION OF ABNORMAL BRAKE IN AUTONOMOUS VEHICLES FROM ELECTROENCEPHALOGRAM AND HEART RATE INTERVAL
Tanaka, Yuichi
pg. 324
LS-B-TH2.1 - NODE CLUSTERING OF TIME-VARYING GRAPHS BASED ON TEMPORAL LABEL SMOOTHNESS
Tang, Ching-Tung
pg. 2055
OD-B-TH3.14 - 3D LANDMARK-BASED FACE DETECTION AND RECOGNITION SYSTEM FOR LARGE POSES
Tang, Jiyang
pg. 878
OD-A-FR1.1 - END-TO-END MANDARIN TONE CLASSIFICATION WITH SHORT TERM CONTEXT INFORMATION
Tang, Tiantian
pg. 939
OD-A-FR2.1 - CNN-BASED DISCRIMINATIVE TRAINING FOR DOMAIN COMPENSATION IN ACOUSTIC EVENT DETECTION WITH FRAME-WISE CLASSIFIER
Tang, Tong
pg. 1386
OD-B-TH1.5 - INTRA CODING TOOL PRUNING FOR REDUCING COMPLEXITY OF VVC SCREEN CONTENT CODING
Tang, Yibin
pg. 1328
LS-C-WE1.5 - ADHD CLASSIFICATION VIA AUTO-ENCODING NETWORK WITH NON-IMAGING DATA FUSION
Tanida, Ryuichi
pg. 1
OD-B-WE1.1 - FAST-PARALLEL SINGULAR VALUE THRESHOLDING FOR MANY SMALL MATRICES BASED ON GEOMETRIC FEATURE OF SINGULAR VALUES
Taniguchi, Tadahiro
pg. 836
OD-A-TH3.7 - STARGAN-BASED EMOTIONAL VOICE CONVERSION FOR JAPANESE PHRASES
Tanji, Hiroki
pg. 205
LS-A-FR2.2 - LEARNING THE STATISTICAL MODEL OF THE NMF USING THE DEEP MULTIPLICATIVE UPDATE ALGORITHM WITH APPLICATIONS
Tao, Jianhua
pg. 454
OD-A-WE1.5 - ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION
Terada, Takamichi
pg. 1775
LS-B-WE2.3 - EXAMINING OF SHALLOW AUTOENCODER ON BLACK-BOX ATTACK AGAINST FACE RECOGNITION
Terasawa, Hiroko
pg. 890
OD-A-FR1.3 - INVESTIGATING TIME-FREQUENCY REPRESENTATIONS FOR AUDIO FEATURE EXTRACTION IN SINGING TECHNIQUE CLASSIFICATION
Tew, Yiqi
pg. 1872
LS-B-FR3.4 - A PILOT EXPLORATION OF INDUSTRIAL VIDEO SCENE DATA EMBEDDING USING REAL-TIME MV-HEVC
The Anh, Tran
pg. 786
OD-A-TH2.16 - END-TO-END SPEAKER AGE AND HEIGHT ESTIMATION USING ATTENTION MECHANISM AND TRIPLET LOSS
Thi-Hien Duong, Thanh
pg. 1149
LS-A-TH2.3 - SPEAKER COUNT: A NEW BUILDING BLOCK FOR SPEAKER DIARIZATION
Tian, Zhengkun
pg. 454
OD-A-WE1.5 - ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION
Ting, Jiang
pg. 646
OD-A-TH1.7 - GROUP MULTI-SCALE CONVOLUTIONAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT IN TIME-DOMAIN
Ting, Kuan-Chung
pg. 1258
LS-D-FR3.5 - INSTRUMENTED ROMBERG TEST OF POSTURAL STABILITY IN PATIENTS WITH VESTIBULAR DISORDERS USING INERTIAL MEASUREMENT UNITS
Toda, Tomoki
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
pg. 814
OD-A-TH3.4 - NOISY-TO-NOISY VOICE CONVERSION FRAMEWORK WITH DENOISING MODEL
pg. 546
OD-A-WE2.5 - MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL
pg. 870
OD-A-TH3.13 - INVESTIGATION OF TEXT-TO-SPEECH-BASED SYNTHETIC PARALLEL DATA FOR SEQUENCE-TO-SEQUENCE NON-PARALLEL VOICE CONVERSION
Togami, Masahito
pg. 640
OD-A-TH1.6 - OVER-DETERMINED SEMI-BLIND SPEECH SOURCE SEPARATION
Trapp, Arvid
pg. 313
OD-B-WE2.14 - INTEGRATED SPECTRAL KURTOSIS ANALYSIS
Tsai, Chun-Chia
pg. 1912
LS-D-WE2.1 - AN ADAPTIVE RANK SELECTION METHOD IN 3GPP 5G NR SYSTEMS
pg. 1917
LS-D-WE2.2 - A LOW COMPLEXITY PMI SELECTION SCHEME FOR 3GPP 5G NR FDD SYSTEMS
pg. 1889
LS-B-WE1.2 - A THRESHOLD-BASED SCHEDULING AND POWER CONTROL DESIGN ON IMT-2020 EVALUATION
Tsai, Shu-Wei
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
Tsai, Yi-Ta
OD-B-TH1.1 - COMPUTATION REDUCTION FOR HEVC INTER PREDICTION
Tsao, Yu
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
pg. 1239
LS-D-FR3.2 - ESTIMATION AND CORRECTION OF RELATIVE TRANSFER FUNCTION FOR BINAURAL SPEECH SEPARATION NETWORKS TO PRESERVE SPATIAL CUES
pg. 1245
LS-D-FR3.3 - MIMO SPEECH COMPRESSION AND ENHANCEMENT BASED ON CONVOLUTIONAL DENOISING AUTOENCODER
pg. 659
OD-A-TH1.9 - A STUDY ON SPEECH ENHANCEMENT BASED ON DIFFUSION PROBABILISTIC MODEL
pg. 769
OD-A-TH2.13 - SIAMESE NEURAL NETWORK WITH JOINT BAYESIAN MODEL STRUCTURE FOR SPEAKER VERIFICATION
Tseng, Wan-Ting
pg. 2006
OD-B-TH3.6 - FAQ RETRIEVAL USING QUESTION-AWARE GRAPH CONVOLUTIONAL NETWORK AND CONTEXTUALIZED LANGUAGE MODEL
Tsuchiya, Takao
pg. 1156
LS-A-TH2.4 - MULTITASK LEARNING OF ACOUSTIC SCENES AND EVENTS USING DYNAMIC WEIGHT ADAPTATION BASED ON MULTI-FOCAL LOSS
Tsumura, Tomoaki
pg. 80
LS-A-WE2.4 - HISUI: AN IMAGE AND VIDEO PROCESSING FRAMEWORK WITH AUTO-OPTIMIZER
Tsuruo, Asahi
pg. 1353
LS-A-TH1.4 - MATHEMATICAL MODEL OF A HORSE AND THE RIDER DURING A JUMP
Tsuzaki, Minoru
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
Tugnait, Jitendra
pg. 232
OD-B-WE2.1 - ON SPARSE GRAPH ESTIMATION UNDER STATISTICAL AND LAPLACIAN CONSTRAINTS
Tuo, Deyi
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
U
Udaya, Parampalli
pg. 1999
OD-B-TH3.5 - DEEP LEARNING EVALUATION OF A STEGANOGRAPHIC ALGORITHM
Ueda, Minagi
pg. 1821
LS-B-FR2.1 - AN EXTENDED REVERSIBLE DATA HIDING METHOD FOR HDR IMAGES USING EDGE ESTIMATION
Ueda, Yuto
pg. 9
OD-B-WE1.2 - ADAPTIVE FEEDBACK CANCELLATION BASED ON PREDICTION ERROR METHOD USING INTERAURAL LEVEL DIFFERENCES IN HEARING DEVICE
Une, Masakazu
pg. 578
OD-A-WE2.10 - SPEECH ENHANCEMENT BY NOISE SELF-SUPERVISED RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION VIA INDEPENDENT DEEPLY LEARNED MATRIX ANALYSIS
Unno, Kyohei
pg. 70
LS-A-WE2.2 - FAST STILL PICTURE CODING FOR VVC
Unoki, Masashi
pg. 1621
LS-C-FR1.1 - TAMPERING DETECTION FOR SPEECH SIGNALS USING SYNCHRONIZATION CODE AND LSF-BASED WATERMARKS
pg. 1627
LS-C-FR1.2 - IMPROVING SECURITY IN MCADAMS COEFFICIENT-BASED SPEAKER ANONYMIZATION BY WATERMARKING METHOD
pg. 1634
LS-C-FR1.3 - HYBRIDIZATION OF SPEECH INFORMATION HIDING AND ENCRYPTION FOR DOUBLE-LAYER SECURITY IN SPEECH COMMUNICATION
pg. 36
OD-B-WE1.6 - STUDY ON SIMULTANEOUS ESTIMATION OF GLOTTAL SOURCE AND VOCAL TRACT PARAMETERS BY ARMAX-LF MODEL FOR SPEECH ANALYSIS/SYNTHESIS
Uto, Kuniaki
pg. 624
OD-A-TH1.3 - NOISE-TOLERANT TIME-DOMAIN SPEECH SEPARATION WITH NOISE BASES
V
V, Vishnu Vidyadhara Raju
pg. 511
OD-A-WE1.14 - CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION
van den Brink, Arvid
pg. 299
OD-B-WE2.12 - ENHANCED LOOP-WEAKENED BELIEF PROPAGATION ALGORITHM FOR PERFORMANCE ENHANCED POLAR CODE DECODERS
pg. 318
OD-B-WE2.15 - COMPUTATIONAL COMPLEXITY REDUCED BELIEF PROPAGATION ALGORITHM FOR POLAR CODE DECODERS
Vien, An Gia
pg. 146
LS-D-TH2.2 - UNPAIRED IMAGE DEMOIRÉING BASED ON CYCLIC MOIRÉ LEARNING
Vij, Vikram
pg. 491
OD-A-WE1.11 - TEAGER ENERGY SUBBAND FILTERED FEATURES FOR NEAR AND FAR-FIELD AUTOMATIC SPEECH RECOGNITION
Vo, Ngoc Khoi Nguyen
pg. 1775
LS-B-WE2.3 - EXAMINING OF SHALLOW AUTOENCODER ON BLACK-BOX ATTACK AGAINST FACE RECOGNITION
Vuppala, Anil Kumar
pg. 737
OD-A-TH2.8 - COMPARATIVE STUDY OF FILTER BANKS TO IMPROVE THE PERFORMANCE OF VOICE DISORDER ASSESSMENT SYSTEMS USING LTAS FEATURES
pg. 761
OD-A-TH2.12 - DETECTING MULTIPLE DISFLUENCIES FROM SPEECH USING PRE-LINGUISTIC AUTOMATIC SYLLABIFICATION WITH ACOUSTIC AND PROSODY FEATURES
pg. 511
OD-A-WE1.14 - CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION
W
Wagatsuma, Nobuhiko
pg. 1289
OD-B-FR2.5 - EFFECT OF VISUAL ATTENTION AND DRIVING EXPERIENCES ON THE EVENT-RELATED POTENTIAL P300 IN THE PERCEPTION OF TRAFFIC SCENES
Wai, Hoi-To
pg. 337
LS-B-TH2.3 - AN EMPIRICAL STUDY ON COMPRESSED DECENTRALIZED STOCHASTIC GRADIENT ALGORITHMS WITH OVERPARAMETERIZED MODELS
Wakabayashi, Yukoh
pg. 995
OD-A-FR2.10 - TWO-STAGE PHASE RECONSTRUCTION USING DNN AND VON MISES DISTRIBUTION-BASED MAXIMUM LIKELIHOOD
pg. 1016
OD-A-FR2.13 - SELF-ROTATION ANGLE ESTIMATION OF CIRCULAR MICROPHONE ARRAY BASED ON SOUND FIELD INTERPOLATION
Wakuya, Manami
pg. 1343
LS-A-TH1.2 - PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE
Wan, Shuai
pg. 1422
OD-B-TH1.12 - SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING
Wang, Binling
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
WANG, CHUNZHI
pg. 1541
LS-C-WE2.1 - DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION
Wang, Di
pg. 713
OD-A-TH2.4 - A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS
Wang, Dong
pg. 1121
LS-D-WE1.3 - AN MAP ESTIMATION FOR BETWEEN-CLASS VARIANCE
pg. 713
OD-A-TH2.4 - A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
pg. 780
OD-A-TH2.15 - HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION
Wang, Fen
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
Wang, Feng
pg. 1087
OD-A-FR3.13 - EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS
WANG, Haonan
pg. 968
OD-A-FR2.6 - NARROW-EDGED BEAMFORMING USING MASKED PARAMETRIC ARRAY LOUDSPEAKERS
Wang, Haonan
pg. 1000
OD-A-FR2.11 - SHARP-SOUND-IMAGE CONSTRUCTION METHOD USING MULTICHANNEL SOUND SYSTEM WITH OPTIMAL PARAMETRIC LOUDSPEAKER ARRANGEMENT
Wang, Hsin-Min
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
pg. 619
OD-A-TH1.2 - IMPROVEMENT OF SPATIAL AMBIGUITY IN MULTI-CHANNEL SPEECH SEPARATION USING CHANNEL ATTENTION
pg. 719
OD-A-TH2.5 - GENERATION OF SPEAKER REPRESENTATIONS USING HETEROGENEOUS TRAINING BATCH ASSEMBLY
Wang, Hui
pg. 523
OD-A-WE2.1 - CYCLEGAN-BASED NON-PARALLEL SPEECH ENHANCEMENT WITH AN ADAPTIVE ATTENTION-IN-ATTENTION MECHANISM
Wang, Jia-Ching
pg. 1602
LS-D-TH3.3 - A FUSION METHODOLOGY OF AKAZE AND NEURAL NETWORK FOR FINGERPRINT RECOGNITION
pg. 1611
LS-D-TH3.5 - PARTIAL FINGERPRINT ON COMBINED EVALUATION USING DEEP LEARNING AND FEATURE DESCRIPTOR
pg. 170
LS-D-TH2.6 - FACE ANTI-SPOOFING USING MULTI-BRANCH CNN
Wang, Jianming
pg. 1621
LS-C-FR1.1 - TAMPERING DETECTION FOR SPEECH SIGNALS USING SYNCHRONIZATION CODE AND LSF-BASED WATERMARKS
Wang, Jianyu
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 630
OD-A-TH1.4 - MINIMUM-VOLUME REGULARIZED ILRMA FOR BLIND AUDIO SOURCE SEPARATION
Wang, Jing
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
pg. 945
OD-A-FR2.2 - FREQUENCY AXIS POOLING METHOD FOR WEAKLY LABELED SOUND EVENT DETECTION AND CLASSIFICATION
pg. 1072
OD-A-FR3.10 - SEPARABLE TEMPORAL CONVOLUTION PLUS TEMPORALLY POOLED ATTENTION FOR LIGHTWEIGHT HIGH-PERFORMANCE KEYWORD SPOTTING
Wang, Junjie
pg. 864
OD-A-TH3.12 - SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL
pg. 689
OD-A-TH1.14 - SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS
Wang, Li
pg. 546
OD-A-WE2.5 - MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL
Wang, Liyuan
pg. 1031
OD-A-FR3.3 - UNCERTAINTY ESTIMATION IN AUTOMATIC PRONUNCIATION ASSESSMENT WITH PSEUDO SAMPLES BASED ON DEEP KERNEL LEARNING
Wang, Longbiao
pg. 438
OD-A-WE1.2 - SPECTROGRAMS FUSION-BASED END-TO-END ROBUST AUTOMATIC SPEECH RECOGNITION
Wang, Mingjiang
pg. 553
OD-A-WE2.6 - INCORPORATING MULTI-TARGET IN MULTI-STAGE SPEECH ENHANCEMENT MODEL FOR BETTER GENERALIZATION
Wang, Miqing
pg. 1180
LS-C-TH3.2 - DESIGN AND EVALUATION OF ACTIVE NOISE CONTROL ON MACHINERY NOISE
Wang, Mou
pg. 1144
LS-A-TH2.2 - DUAL-PATH TRANSFORMER FOR MACHINE CONDITION MONITORING
pg. 635
OD-A-TH1.5 - A COMPARISON OF HANDCRAFTED, PARAMETERIZED, AND LEARNABLE FEATURES FOR SPEECH SEPARATION
Wang, Qing
pg. 672
OD-A-TH1.11 - TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING
Wang, Quandong
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Wang, Shengbei
pg. 1621
LS-C-FR1.1 - TAMPERING DETECTION FOR SPEECH SIGNALS USING SYNCHRONIZATION CODE AND LSF-BASED WATERMARKS
Wang, Shengjin
pg. 1127
LS-D-WE1.4 - MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION
Wang, Syu-Siang
pg. 1245
LS-D-FR3.3 - MIMO SPEECH COMPRESSION AND ENHANCEMENT BASED ON CONVOLUTIONAL DENOISING AUTOENCODER
Wang, Xiaorui
pg. 884
OD-A-FR1.2 - RETHINKING SINGING VOICE SEPARATION WITH SPECTRAL-TEMPORAL TRANSFORMER
Wang, Xingrui
pg. 859
OD-A-TH3.11 - LOW-RESOURCE MANDARIN PROSODIC STRUCTURE PREDICTION USING SELF-TRAINING
Wang, Xuehan
pg. 49
OD-B-WE1.8 - KRONECKER PRODUCT ADAPTIVE BEAMFORMING FOR MICROPHONE ARRAYS
Wang, Xuyang
pg. 689
OD-A-TH1.14 - SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS
Wang, XuYang
pg. 864
OD-A-TH3.12 - SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL
Wang, Yangguang
pg. 1722
OD-B-TH2.3 - UNDETECTABLE JPEG IMAGE BATCH REVERSIBLE DATA HIDING WITH CONTENT-ADAPTIVE PAYLOAD ALLOCATION
Wang, Yih-Wen
pg. 269
OD-B-WE2.7 - SEMI-SUPERVISED SOUND EVENT DETECTION USING SELF-ATTENTION AND MULTIPLE TECHNIQUES OF CONSISTENCY TRAINING
Wang, Yu
pg. 100
LS-D-TH1.1 - IMPROVED FRUIT FLY OPTIMIZATION ALGORITHM BASED ON SIMULATED ANNEALING IN NEURAL NETWORK
Wang, Yue
pg. 1192
LS-C-TH3.4 - A TRUE DIGITAL FEEDFORWARD ACTIVE NOISE CONTROL SYSTEM WITH NO ANALOG-TO-DIGITAL AND DIGITAL-TO-ANALOG CONVERTERS
Wang, Yujun
pg. 945
OD-A-FR2.2 - FREQUENCY AXIS POOLING METHOD FOR WEAKLY LABELED SOUND EVENT DETECTION AND CLASSIFICATION
OD-A-FR1.7 - NOISE ROBUST SINGING VOICE SYNTHESIS USING GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
pg. 1072
OD-A-FR3.10 - SEPARABLE TEMPORAL CONVOLUTION PLUS TEMPORALLY POOLED ATTENTION FOR LIGHTWEIGHT HIGH-PERFORMANCE KEYWORD SPOTTING
Wang, Yutian
pg. 523
OD-A-WE2.1 - CYCLEGAN-BASED NON-PARALLEL SPEECH ENHANCEMENT WITH AN ADAPTIVE ATTENTION-IN-ATTENTION MECHANISM
Wang, Zeyuan
pg. 2060
OD-B-TH3.15 - ENTAILMENT METHOD BASED ON TEMPLATE SELECTION FOR CHINESE TEXT FEW-SHOT LEARNING
Wang, Zirui
pg. 1438
OD-B-FR1.3 - HMM-BASED LIP READING WITH STINGY RESIDUAL 3D CONVOLUTION
Watanabe, Shinji
pg. 841
OD-A-TH3.8 - UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS
pg. 659
OD-A-TH1.9 - A STUDY ON SPEECH ENHANCEMENT BASED ON DIFFUSION PROBABILISTIC MODEL
Watanabe, Yuka
pg. 1769
LS-B-WE2.2 - CONTINUOUS BIOMETRIC AUTHENTICATION FOR SMARTPHONES CONSIDERING USAGE ENVIRONMENTS
Watanabe, Yuta
pg. 386
LS-C-FR2.1 - INTERNAL STATE ESTIMATION BY THERMAL IMAGE AND IDENTIFICATION OF FACE AND NOSE POSITION
Watcharasupat, Karn
pg. 982
OD-A-FR2.8 - A STRONGLY-LABELLED POLYPHONIC DATASET OF URBAN SOUNDS WITH SPATIOTEMPORAL CONTEXT
Wei, Yu-Jen
pg. 1391
OD-B-TH1.6 - IMAGE COMPRESSION ARCHITECTURE WITH BUILT-IN LIGHTWEIGHT MODEL
Wei, Zhiyu
pg. 2060
OD-B-TH3.15 - ENTAILMENT METHOD BASED ON TEMPLATE SELECTION FOR CHINESE TEXT FEW-SHOT LEARNING
Wen, Shulin
pg. 1180
LS-C-TH3.2 - DESIGN AND EVALUATION OF ACTIVE NOISE CONTROL ON MACHINERY NOISE
Wen, Zhengqi
pg. 454
OD-A-WE1.5 - ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION
Weng, Shi-Yan
pg. 518
OD-A-WE1.15 - AN EMPIRICAL STUDY ON TRANSFORMER-BASED END-TO-END SPEECH RECOGNITION WITH NOVEL DECODER MASKING
Werner, Stefan
pg. 2072
LS-C-TH1.1 - GRAPH KERNEL RECURSIVE LEAST-SQUARES ALGORITHMS
Wolfsteiner, Peter
pg. 313
OD-B-WE2.14 - INTEGRATED SPECTRAL KURTOSIS ANALYSIS
Wong, Ka Ho
pg. 1299
OD-B-FR2.7 - SPEAKER TURN AWARE SIMILARITY SCORING FOR DIARIZATION OF SPEECH-BASED COGNITIVE ASSESSMENTS
Wong, KokSheik
pg. 1821
LS-B-FR2.1 - AN EXTENDED REVERSIBLE DATA HIDING METHOD FOR HDR IMAGES USING EDGE ESTIMATION
pg. 1828
LS-B-FR2.2 - IMAGE WATERMARKING BASED ON NON-NEWTONIAN EFFECT AND INTERPOLATED SWT-DWT
pg. 1416
OD-B-TH1.11 - MOVING OBJECT DETECTION IN HEVC VIDEO
Woo, Sung-Min
pg. 1698
LS-A-FR3.4 - SUPER-RESOLUTION IMAGING USING A FOCUS PIXEL SENSOR
Wu, Chaoyan
pg. 1305
LS-C-WE1.1 - MICROPHONE ARRAY SPEECH SEPARATION ALGORITHM BASED ON DNN
Wu, Chin-Ying
pg. 2006
OD-B-TH3.6 - FAQ RETRIEVAL USING QUESTION-AWARE GRAPH CONVOLUTIONAL NETWORK AND CONTEXTUALIZED LANGUAGE MODEL
Wu, Chung-Hsien
pg. 619
OD-A-TH1.2 - IMPROVEMENT OF SPATIAL AMBIGUITY IN MULTI-CHANNEL SPEECH SEPARATION USING CHANNEL ATTENTION
pg. 1026
OD-A-FR3.2 - ENSEMBLE OF ONE MODEL: CREATING MODEL VARIATIONS FOR TRANSFORMER WITH LAYER PERMUTATION
pg. 1982
OD-B-TH3.2 - TASK-AWARE BERT-BASED SENTIMENT ANALYSIS FROM MULTIPLE ESSENCES OF THE TEXT
pg. 536
OD-A-WE2.3 - SPEECH ENHANCEMENT BASED ON MASKING APPROACH CONSIDERING SPEECH QUALITY AND ACOUSTIC CONFIDENCE FOR NOISY SPEECH RECOGNITION
Wu, Da-Yi
pg. 1975
OD-B-TH3.1 - MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL
Wu, Hao
pg. 305
OD-B-WE2.13 - POSITIONAL-SPECTRAL-TEMPORAL ATTENTION IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR EEG EMOTION RECOGNITION
Wu, Jie
OD-A-FR1.7 - NOISE ROBUST SINGING VOICE SYNTHESIS USING GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER
Wu, Jin
pg. 100
LS-D-TH1.1 - IMPROVED FRUIT FLY OPTIMIZATION ALGORITHM BASED ON SIMULATED ANNEALING IN NEURAL NETWORK
WU, JUN
pg. 1541
LS-C-WE2.1 - DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION
Wu, Junnan
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Wu, Peter
pg. 841
OD-A-TH3.8 - UNDERSTANDING THE TRADEOFFS IN CLIENT-SIDE PRIVACY FOR DOWNSTREAM SPEECH TASKS
Wu, Shan-Hung
pg. 1975
OD-B-TH3.1 - MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL
Wu, Shu-Yun
pg. 1483
OD-B-FR1.10 - REAL-TIME EDGE ATTENTION-BASED LEARNING FOR LOW-LIGHT ONE-STAGE OBJECT DETECTION
Wu, Yi-Chiao
pg. 814
OD-A-TH3.4 - NOISY-TO-NOISY VOICE CONVERSION FRAMEWORK WITH DENOISING MODEL
Wu, Zhiyong
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Wumaier, Aishan
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
X
Xiang, Fei
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
Xiao, Ruitong
pg. 800
OD-A-TH3.2 - CA-VC: A NOVEL ZERO-SHOT VOICE CONVERSION METHOD WITH CHANNEL ATTENTION
Xiao, Yatong
pg. 1127
LS-D-WE1.4 - MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION
Xie, Chao
pg. 814
OD-A-TH3.4 - NOISY-TO-NOISY VOICE CONVERSION FRAMEWORK WITH DENOISING MODEL
Xie, Lei
OD-A-FR1.7 - NOISE ROBUST SINGING VOICE SYNTHESIS USING GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER
pg. 672
OD-A-TH1.11 - TARGET SPEAKER EXTRACTION FOR CUSTOMIZABLE QUERY-BY-EXAMPLE KEYWORD SPOTTING
Xie, Luyuan
pg. 689
OD-A-TH1.14 - SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS
XIE, Xiaoyan
pg. 121
LS-D-TH1.4 - AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR
Xie, Xiaoyan
pg. 127
LS-D-TH1.5 - A RECONFIGURABLE PARALLELIZATION OF GENERATIVE ADVERSARIAL NETWORKS BASED ON ARRAY PROCESSOR
OD-B-TH2.6 - CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY
Xing, Xiaofen
pg. 800
OD-A-TH3.2 - CA-VC: A NOVEL ZERO-SHOT VOICE CONVERSION METHOD WITH CHANNEL ATTENTION
pg. 854
OD-A-TH3.10 - TOWARDS UNSEEN SPEAKERS ZERO-SHOT VOICE CONVERSION WITH GENERATIVE ADVERSARIAL NETWORKS
Xiong, Chuxi
OD-B-TH2.6 - CLUSTER-TRNET: JOINTED MODEL FOR REAL-TIME TRAFFIC IDENTIFICATION WITH HIGH ACCURACY
Xu, Haihua
pg. 1021
OD-A-FR3.1 - ENRICHING UNDER-REPRESENTED NAMED ENTITIES FOR IMPROVED SPEECH RECOGNITION
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
pg. 497
OD-A-WE1.12 - MULTITASK-BASED JOINT LEARNING APPROACH TO ROBUST ASR FOR RADIO COMMUNICATION SPEECH
Xu, Kuangzhe
pg. 359
LS-B-TH3.1 - MODELING THE DYNAMICS OF OBSERVATIONAL BEHAVIORS BASE ON OBSERVERS’ PERSONALITY TRAITS USING HIDDEN MARKOV MODELS
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
XU, Kuangzhe
pg. 373
LS-B-TH3.3 - MEASURING ATTRACTIVENESS OF TOURISM RESOURCES BY FOCUSING ON KANSEI VALUE STRUCTURE: POSSIBILITY OF INVITING VISITORS USING THE JAPANESE HERITAGE “AKO SALT.”
Xu, Menglong
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 443
OD-A-WE1.3 - CONFORMER-BASED END-TO-END SPEECH RECOGNITION WITH ROTARY POSITION EMBEDDING
pg. 448
OD-A-WE1.4 - EFFICIENT CONFORMER-BASED SPEECH RECOGNITION WITH LINEAR ATTENTION
Xu, Na
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
Xu, Sean Shensheng
pg. 1299
OD-B-FR2.7 - SPEAKER TURN AWARE SIMILARITY SCORING FOR DIARIZATION OF SPEECH-BASED COGNITIVE ASSESSMENTS
Xu, Xiangmin
pg. 800
OD-A-TH3.2 - CA-VC: A NOVEL ZERO-SHOT VOICE CONVERSION METHOD WITH CHANNEL ATTENTION
pg. 854
OD-A-TH3.10 - TOWARDS UNSEEN SPEAKERS ZERO-SHOT VOICE CONVERSION WITH GENERATIVE ADVERSARIAL NETWORKS
Xu, Xinkang
pg. 700
OD-A-TH2.2 - HIERARCHICAL PROSODY ANALYSIS IMPROVES CATEGORICAL AND DIMENSIONAL EMOTION RECOGNITION
Xue, Heyang
OD-A-FR1.7 - NOISE ROBUST SINGING VOICE SYNTHESIS USING GAUSSIAN MIXTURE VARIATIONAL AUTOENCODER
Y
Yalla, Prakash
pg. 511
OD-A-WE1.14 - CSTD-TELUGU CORPUS: CROWD-SOURCED APPROACH FOR LARGE-SCALE SPEECH DATA COLLECTION
Yamada, Isao
pg. 179
LS-A-TH3.2 - A HYPERCOMPLEX TENSOR-SVD AND ITS APPLICATION
Yamada, Koki
pg. 324
LS-B-TH2.1 - NODE CLUSTERING OF TIME-VARYING GRAPHS BASED ON TEMPORAL LABEL SMOOTHNESS
Yamada, Takeshi
pg. 1210
LS-B-FR1.2 - REDUCING ALGORITHMIC DELAY USING LOW-OVERLAP WINDOW FOR ONLINE WAVE-U-NET
pg. 597
OD-A-WE2.13 - EXTENSION OF VIRTUAL MICROPHONE TECHNIQUE TO MULTIPLE REAL MICROPHONES AND INVESTIGATION OF THE IMPACT OF PHASE AND AMPLITUDE INTERPOLATION ON SPEECH ENHANCEMENT
Yamagata, Eisuke
pg. 330
LS-B-TH2.2 - RECOVERY OF TIME SERIES OF GRAPH SIGNALS OVER DYNAMIC TOPOLOGY
Yamakawa, Toshitaka
pg. 1338
LS-A-TH1.1 - REAL-TIME MONITORING SYSTEM TO EVALUATE EXERCISE LOAD, HYPOXIC LOAD, AND SAFETY IN A NORMOBARIC HYPOXIC ROOM
pg. 1343
LS-A-TH1.2 - PREOPERATIVE MONITORING USING IMPLANTABLE, MULTIMODAL, MULTICHANNEL PROBE
pg. 1348
LS-A-TH1.3 - PRELIMINARY STUDY USING AUTOENCODER FOR EARLY DETECTION OF HEAT ILLNESS FROM HEART RATE VARIABILITY OBTAINED WITH WEARABLE DEVICE
YAMAMOTO, Gai
pg. 1528
LS-A-WE1.5 - ACCELERATION OF PDS–BASED HIGH–DIMENSIONAL SIGNAL RESTORATION
Yamamoto, Kota
pg. 1289
OD-B-FR2.5 - EFFECT OF VISUAL ATTENTION AND DRIVING EXPERIENCES ON THE EVENT-RELATED POTENTIAL P300 IN THE PERCEPTION OF TRAFFIC SCENES
Yamamoto, Ryuki
OD-B-FR3.1 - FAST ALGORITHM FOR LOW-RANK TENSOR COMPLETION IN DELAY EMBEDDED SPACE
Yamamoto, Shinya
pg. 1353
LS-A-TH1.4 - MATHEMATICAL MODEL OF A HORSE AND THE RIDER DURING A JUMP
Yamamoto, Yuya
pg. 890
OD-A-FR1.3 - INVESTIGATING TIME-FREQUENCY REPRESENTATIONS FOR AUDIO FEATURE EXTRACTION IN SINGING TECHNIQUE CLASSIFICATION
Yamanouchi, Satoshi
pg. 1187
LS-C-TH3.3 - A SUBBAND ACTIVE NOISE CONTROL SYSTEM WITH AUTOMATIC TAP ASSIGNMENT IN CONSIDERATION OF PSYCHOACOUSTIC PROPERTIES
Yamaoka, Kouei
pg. 585
OD-A-WE2.11 - CAUSAL DISTORTIONLESS RESPONSE BEAMFORMING BY ALTERNATING DIRECTION METHOD OF MULTIPLIERS
Yamasaki, Yuma
pg. 1815
LS-B-TH1.5 - FEATURE EXTRACTION BASED ON DENOISING AUTO ENCODER FOR CLASSIFICATION OF ADVERSARIAL EXAMPLES
Yamashita, Naoki
pg. 1381
OD-B-TH1.4 - HIGH REFLECTION REMOVAL USING CNN WITH DETECTION AND ESTIMATION
Yamashita, Yoichi
pg. 725
OD-A-TH2.6 - SPEECH EMOTION RECOGNITION WITH FUSION OF ACOUSTIC- AND LINGUISTIC-FEATURE-BASED DECISIONS
Yamasue, Hidenori
pg. 428
LS-C-FR3.4 - PITCH AND VOLUME STABILITY IN THE COMMUNICATIVE RESPONSE OF ADULTS WITH AUTISM
Yamazaki, Yasushi
pg. 1769
LS-B-WE2.2 - CONTINUOUS BIOMETRIC AUTHENTICATION FOR SMARTPHONES CONSIDERING USAGE ENVIRONMENTS
Yamazaki, Yoichi
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Yan, Binyu
pg. 1511
LS-A-WE1.3 - MULTI-RESIDUAL FEATURE FUSION NETWORK FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
Yan, Jiabin
pg. 1708
OD-B-TH2.1 - CROSS-DOMAIN RECAPTURED DOCUMENT DETECTION WITH TEXTURE AND REFLECTANCE CHARACTERISTICS
Yan, Zhao
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Yang, Cheng
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Yang, Fu-Rong
pg. 1975
OD-B-TH3.1 - MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL
Yang, Jae-Won
pg. 1607
LS-D-TH3.4 - CONTEXT-BASED MATCHING REFINEMENT FOR PERSON SEARCH
Yang, Jichen
pg. 800
OD-A-TH3.2 - CA-VC: A NOVEL ZERO-SHOT VOICE CONVERSION METHOD WITH CHANNEL ATTENTION
Yang, Kun
pg. 127
LS-D-TH1.5 - A RECONFIGURABLE PARALLELIZATION OF GENERATIVE ADVERSARIAL NETWORKS BASED ON ARRAY PROCESSOR
Yang, Lidong
pg. 945
OD-A-FR2.2 - FREQUENCY AXIS POOLING METHOD FOR WEAKLY LABELED SOUND EVENT DETECTION AND CLASSIFICATION
Yang, Lin
pg. 864
OD-A-TH3.12 - SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL
pg. 689
OD-A-TH1.14 - SPARSELY OVERLAPPED SPEECH TRAINING IN THE TIME DOMAIN: JOINT LEARNING OF TARGET SPEECH SEPARATION AND PERSONAL VAD BENEFITS
Yang, Nan
pg. 1506
LS-A-WE1.2 - DISTRIBUTED ARITHMETIC CODING FOR SOURCES WITH HIDDEN MARKOV CORRELATION
Yang, Po-Yen
pg. 1678
LS-D-FR2.5 - AN ATTENTION BASED EXPERT INSPECTION SYSTEM FOR SMART SCALP
YANG, Rong
pg. 121
LS-D-TH1.4 - AN IDE FOR RECONFIGURABLE VIDEO ARRAY PROCESSOR
Yang, Rongsong
pg. 1716
OD-B-TH2.2 - JOINT ESTIMATION OF IMAGE ROTATION ANGLE AND SCALING FACTOR
Yang, Shu-Hsiang
pg. 2028
OD-B-TH3.10 - MODEL-BASED SOFT ACTOR-CRITIC
Yang, Tsung-Hsien
pg. 1982
OD-B-TH3.2 - TASK-AWARE BERT-BASED SENTIMENT ANALYSIS FROM MULTIPLE ESSENCES OF THE TEXT
Yang, Wenjing
pg. 614
OD-A-TH1.1 - A TARGET SPEAKER SEPARATION NEURAL NETWORK WITH JOINT-TRAINING
pg. 1072
OD-A-FR3.10 - SEPARABLE TEMPORAL CONVOLUTION PLUS TEMPORALLY POOLED ATTENTION FOR LIGHTWEIGHT HIGH-PERFORMANCE KEYWORD SPOTTING
Yang, Xiaomin
pg. 1511
LS-A-WE1.3 - MULTI-RESIDUAL FEATURE FUSION NETWORK FOR LIGHTWEIGHT SINGLE IMAGE SUPER-RESOLUTION
Yang, Yi-Hsuan
pg. 1975
OD-B-TH3.1 - MANDARIN SINGING VOICE SYNTHESIS WITH A PHONOLOGY-BASED DURATION MODEL
Yang, Yijing
pg. 1475
OD-B-FR1.9 - E-PIXELHOP: AN ENHANCED PIXELHOP METHOD FOR OBJECT CLASSIFICATION
Yang, Ziye
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
Yao, Yuanzhi
pg. 1722
OD-B-TH2.3 - UNDETECTABLE JPEG IMAGE BATCH REVERSIBLE DATA HIDING WITH CONTENT-ADAPTIVE PAYLOAD ALLOCATION
Yasui, Koki
pg. 288
OD-B-WE2.10 - DENSE DEPTHMAP PREDICTION FROM ULTRASONIC SENSORS
Yasutani, Ryoma
pg. 1949
LS-A-FR1.1 - MEASUREMENT OF CO2 IN OUTDOOR ENVIRONMENTS USING LPWAN BASED WSN AND ITS TIME CORRELATION CHARACTERISTICS
Yata, Noriko
pg. 386
LS-C-FR2.1 - INTERNAL STATE ESTIMATION BY THERMAL IMAGE AND IDENTIFICATION OF FACE AND NOSE POSITION
Yatabe, Kohei
pg. 897
OD-A-FR1.4 - IMPLEMENTATION OF INTERACTIVE TOOLS FOR INVESTIGATING FUNDAMENTAL FREQUENCY RESPONSE OF VOICED SOUNDS TO AUDITORY STIMULATION
pg. 254
OD-B-WE2.4 - PHASE-AWARE AUDIO INPAINTING BASED ON INSTANTANEOUS FREQUENCY
Ye, Minxiang
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
Ye, Zekun
pg. 133
LS-D-TH1.6 - PERFORMANCE CHARACTERIZATION OF RASTERIZATION ALGORITHMS FOR RECONFIGURABLE GRAPHICS PROCESSOR
Yeh, Ting-Yu
pg. 1912
LS-D-WE2.1 - AN ADAPTIVE RANK SELECTION METHOD IN 3GPP 5G NR SYSTEMS
pg. 1917
LS-D-WE2.2 - A LOW COMPLEXITY PMI SELECTION SCHEME FOR 3GPP 5G NR FDD SYSTEMS
pg. 1889
LS-B-WE1.2 - A THRESHOLD-BASED SCHEDULING AND POWER CONTROL DESIGN ON IMT-2020 EVALUATION
Yeh, Yang-Ming
pg. 2013
OD-B-TH3.7 - 3D-GFE: A THREE-DIMENSIONAL GEOMETRIC-FEATURE EXTRACTOR FOR POINT CLOUD DATA
pg. 2018
OD-B-TH3.8 - ATTENTION EDGECONV FOR 3D POINT CLOUD CLASSIFICATION
Yen, Ming-Chi
pg. 1234
LS-D-FR3.1 - TIME ALIGNMENT USING LIP IMAGES FOR FRAME-BASED ELECTROLARYNGEAL VOICE CONVERSION
Yi, Jiangyan
pg. 454
OD-A-WE1.5 - ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION
Yin, Shaorun
pg. 127
LS-D-TH1.5 - A RECONFIGURABLE PARALLELIZATION OF GENERATIVE ADVERSARIAL NETWORKS BASED ON ARRAY PROCESSOR
Yin, Zhiyang
pg. 1386
OD-B-TH1.5 - INTRA CODING TOOL PRUNING FOR REDUCING COMPLEXITY OF VVC SCREEN CONTENT CODING
Yokota, Akane
pg. 1640
LS-C-FR1.4 - BSS-BASED EXTRACTION FOR ADDITIVE VIDEO WATERMARKING
Yokota, Tatsuya
OD-B-FR3.1 - FAST ALGORITHM FOR LOW-RANK TENSOR COMPLETION IN DELAY EMBEDDED SPACE
Yoon, Hyunse
pg. 1488
OD-B-FR1.11 - CHECKERBOARD CORNER LOCALIZATION ACCELERATED WITH DEEP FALSE DETECTION FOR MULTI-CAMERA CALIBRATION
Yoshida, Noboru
pg. 1556
LS-C-WE2.4 - VIEW-INVARIANT FEATURE USING POSE INFORMATION AND FLEXIBLE MATCHING ALGORITHM FOR ACTION RETRIEVAL
Yoshida, Taichi
pg. 1381
OD-B-TH1.4 - HIGH REFLECTION REMOVAL USING CNN WITH DETECTION AND ESTIMATION
Yoshida, Takashi
pg. 226
LS-A-FR2.6 - LOW-PASS MAXIMALLY FLAT IIR DIGITAL DIFFERENTIATOR DESIGN WITH ARBITRARY FLATNESS DEGREE
Yoshimoto, Junichiro
pg. 1363
LS-A-TH1.6 - SEMI-SUPERVISED ESTIMATION OF DRIVING BEHAVIORS USING ROBUST TIME-CONTRASTIVE LEARNING
Yoshimura, Shunsuke
pg. 1800
LS-B-TH1.3 - MODEL INVERSION ATTACK AGAINST A FACE RECOGNITION SYSTEM IN A BLACK-BOX SETTING
Young, Shuenn-Tsong
pg. 829
OD-A-TH3.6 - SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR
Younus, Muhammad Usman
pg. 1458
OD-B-FR1.6 - HEAD MOVEMENT PREDICTION USING FCNN
YU, CHENGTIAN
pg. 1541
LS-C-WE2.1 - DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION
Yu, Dabin
pg. 1375
OD-B-TH1.3 - UNDERWATER IMAGE DEHAZING BASED ON DISPARITY ESTIMATION AND COLOR CONSTRAINT
Yu, Dong
pg. 1433
OD-B-FR1.2 - SPEAKER INDEPENDENT AND MULTILINGUAL/MIXLINGUAL SPEECH-DRIVEN TALKING HEAD GENERATION USING PHONETIC POSTERIORGRAMS
Yu, Guochen
pg. 523
OD-A-WE2.1 - CYCLEGAN-BASED NON-PARALLEL SPEECH ENHANCEMENT WITH AN ADAPTIVE ATTENTION-IN-ATTENTION MECHANISM
Yu, Hongzhi
pg. 713
OD-A-TH2.4 - A STUDY ON DECOUPLED PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS
Yu, Jiacheng
pg. 646
OD-A-TH1.7 - GROUP MULTI-SCALE CONVOLUTIONAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT IN TIME-DOMAIN
Yu, Juntao
pg. 646
OD-A-TH1.7 - GROUP MULTI-SCALE CONVOLUTIONAL NETWORK FOR MONAURAL SPEECH ENHANCEMENT IN TIME-DOMAIN
Yu, Kun
pg. 1716
OD-B-TH2.2 - JOINT ESTIMATION OF IMAGE ROTATION ANGLE AND SCALING FACTOR
Yu, Nenghai
pg. 1722
OD-B-TH2.3 - UNDETECTABLE JPEG IMAGE BATCH REVERSIBLE DATA HIDING WITH CONTENT-ADAPTIVE PAYLOAD ALLOCATION
Yu, Shuai
pg. 884
OD-A-FR1.2 - RETHINKING SINGING VOICE SEPARATION WITH SPECTRAL-TEMPORAL TRANSFORMER
Yu, Xumin
pg. 93
LS-A-WE2.6 - IMBALANCED SAMPLE FEATURE ENHANCEMENT OF HYPERSPECTRAL IMAGERY CLASSIFICATION
Yu, Ya-Ju
pg. 1883
LS-B-WE1.1 - DEEP REINFORCEMENT LEARNING FOR NPDCCH PERIOD ADJUSTMENT IN NB-IOT NETWORKS
Yu, Yameng
pg. 1087
OD-A-FR3.13 - EFFECT OF PERCEPTUAL TRAINING WITH NOISE ON CHINESE LEARNERS’ ENGLISH CONSONANT RECEPTION THRESHOLDS
Yuan, Jingyi
pg. 1317
LS-C-WE1.3 - DEPRESSION SEVERITY LEVEL CLASSIFICATION USING MULTITASK LEARNING OF GENDER RECOGNITION
Yuan, Weitao
pg. 1621
LS-C-FR1.1 - TAMPERING DETECTION FOR SPEECH SIGNALS USING SYNCHRONIZATION CODE AND LSF-BASED WATERMARKS
Yukawa, Masahiro
pg. 2077
LS-C-TH1.2 - A HILBERTIAN PROJECTION APPROACH WITH DICTIONARY DIVIDING STRATEGY: ACCELERATING NONLINEAR ESTIMATION ALGORITHM WITH MULTISCALE GAUSSIANS
Yuno, Yuuki
pg. 9
OD-B-WE1.2 - ADAPTIVE FEEDBACK CANCELLATION BASED ON PREDICTION ERROR METHOD USING INTERAURAL LEVEL DIFFERENCES IN HEARING DEVICE
Z
Zangl, Hubert
pg. 264
OD-B-WE2.6 - HIGH-ACCURACY RECONSTRUCTION OF PERIODIC SIGNALS BASED ON COMPRESSIVE SENSING
Zavialov, Igor
OD-B-FR3.4 - STABILITY OF A FINANCIAL SYSTEM VIA FINDING SYSTEMICALLY IMPORTANT FINANCIAL INSTITUTIONS
Zeng, Hui
pg. 1716
OD-B-TH2.2 - JOINT ESTIMATION OF IMAGE ROTATION ANGLE AND SCALING FACTOR
Zeng, Qifeng
pg. 1438
OD-B-FR1.3 - HMM-BASED LIP READING WITH STINGY RESIDUAL 3D CONVOLUTION
Zhai, Guangtao
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
Zhang, Bowen
pg. 859
OD-A-TH3.11 - LOW-RESOURCE MANDARIN PROSODIC STRUCTURE PREDICTION USING SELF-TRAINING
Zhang, Fan
pg. 366
LS-B-TH3.2 - ESTIMATING BEVERAGE PREFERENCE BASED ON SUBJECTIVE EMOTIONAL REACTIONS AND EEG ACTIVITY
Zhang, Haobo
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
Zhang, Jicheng
pg. 1043
OD-A-FR3.5 - MULTILINGUAL APPROACH TO JOINT SPEECH AND ACCENT RECOGNITION WITH DNN-HMM FRAMEWORK
Zhang, Lihui
pg. 2060
OD-B-TH3.15 - ENTAILMENT METHOD BASED ON TEMPLATE SELECTION FOR CHINESE TEXT FEW-SHOT LEARNING
Zhang, Lu
pg. 553
OD-A-WE2.6 - INCORPORATING MULTI-TARGET IN MULTI-STAGE SPEECH ENHANCEMENT MODEL FOR BETTER GENERALIZATION
Zhang, Qin
pg. 523
OD-A-WE2.1 - CYCLEGAN-BASED NON-PARALLEL SPEECH ENHANCEMENT WITH AN ADAPTIVE ATTENTION-IN-ATTENTION MECHANISM
Zhang, Rong
pg. 1590
LS-D-TH3.1 - ROBUSTNESS AGAINST ADVERSARY MODELS ON MNIST BY DEEP-Q REINFORCEMENT LEARNING BASED PARALLEL-GANS
Zhang, Shaochuan
pg. 546
OD-A-WE2.5 - MANDARIN ELECTRO-LARYNGEAL SPEECH ENHANCEMENT BASED ON STATISTICAL VOICE CONVERSION AND MANUAL TONE CONTROL
Zhang, Shuai
pg. 454
OD-A-WE1.5 - ONE IN A HUNDRED: SELECTING THE BEST PREDICTED SEQUENCE FROM NUMEROUS CANDIDATES FOR SPEECH RECOGNITION
Zhang, Wancheng
pg. 541
OD-A-WE2.4 - DNN-BASED LINEAR PREDICTION RESIDUAL ENHANCEMENT FOR SPEECH DEREVERBERATION
Zhang, Wei-Qiang
pg. 1127
LS-D-WE1.4 - MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION
pg. 750
OD-A-TH2.10 - A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION
Zhang, Weibin
pg. 854
OD-A-TH3.10 - TOWARDS UNSEEN SPEAKERS ZERO-SHOT VOICE CONVERSION WITH GENERATIVE ADVERSARIAL NETWORKS
Zhang, Wen
pg. 1221
LS-B-FR1.4 - CONSTRAINED MAXIMUM DIRECTIVITY BEAMFORMERS BASED ON UNIFORM LINEAR ACOUSTIC VECTOR SENSOR ARRAYS
Zhang, Xiao-Lei
pg. 1111
LS-D-WE1.1 - ATTENTION-BASED MULTI-CHANNEL SPEAKER VERIFICATION WITH AD-HOC MICROPHONE ARRAYS
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 443
OD-A-WE1.3 - CONFORMER-BASED END-TO-END SPEECH RECOGNITION WITH ROTARY POSITION EMBEDDING
pg. 630
OD-A-TH1.4 - MINIMUM-VOLUME REGULARIZED ILRMA FOR BLIND AUDIO SOURCE SEPARATION
pg. 448
OD-A-WE1.4 - EFFICIENT CONFORMER-BASED SPEECH RECOGNITION WITH LINEAR ATTENTION
pg. 635
OD-A-TH1.5 - A COMPARISON OF HANDCRAFTED, PARAMETERIZED, AND LEARNABLE FEATURES FOR SPEECH SEPARATION
Zhang, Xiao-Ping
pg. 344
LS-B-TH2.4 - MODEL SELECTION-INSPIRED COEFFICIENTS OPTIMIZATION FOR POLYNOMIAL-KERNEL GRAPH LEARNING
Zhang, Xiaohui
pg. 750
OD-A-TH2.10 - A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION
Zhang, Yan
pg. 1506
LS-A-WE1.2 - DISTRIBUTED ARITHMETIC CODING FOR SOURCES WITH HIDDEN MARKOV CORRELATION
pg. 541
OD-A-WE2.4 - DNN-BASED LINEAR PREDICTION RESIDUAL ENHANCEMENT FOR SPEECH DEREVERBERATION
Zhang, Yuxin
pg. 1127
LS-D-WE1.4 - MIXING OR EXTRACTING? FURTHER EXPLORING NECESSITY OF MUSIC SEPARATION FOR SINGER IDENTIFICATION
Zhang, Zehua
pg. 553
OD-A-WE2.6 - INCORPORATING MULTI-TARGET IN MULTI-STAGE SPEECH ENHANCEMENT MODEL FOR BETTER GENERALIZATION
Zhang, Zhaohang
pg. 750
OD-A-TH2.10 - A MULTILINGUAL FRAMEWORK BASED ON PRETRAINING MODEL FOR SPEECH EMOTION RECOGNITION
Zhang, Zhen
pg. 1621
LS-C-FR1.1 - TAMPERING DETECTION FOR SPEECH SIGNALS USING SYNCHRONIZATION CODE AND LSF-BASED WATERMARKS
Zhao, Bo
pg. 100
LS-D-TH1.1 - IMPROVED FRUIT FLY OPTIMIZATION ALGORITHM BASED ON SIMULATED ANNEALING IN NEURAL NETWORK
Zhao, Huaibo
pg. 477
OD-A-WE1.9 - AN INVESTIGATION OF ENHANCING CTC MODEL FOR TRIGGERED ATTENTION-BASED STREAMING ASR
Zhao, Jiahong
pg. 974
OD-A-FR2.7 - COPRIME MICROPHONE ARRAYS FOR ESTIMATING SPEECH DIRECTION OF ARRIVAL USING DEEP LEARNING
Zhao, Qibin
pg. 1323
LS-C-WE1.4 - MULTI-FEATURE FUSION FOR EPILEPTIC FOCUS LOCALIZATION BASED ON TENSOR REPRESENTATION
Zhao, Xuyang
pg. 1546
LS-C-WE2.2 - INFANT POSTURE ASSESSMENT BASED ON ROTATIONAL KEYPOINT DETECTION
pg. 1323
LS-C-WE1.4 - MULTI-FEATURE FUSION FOR EPILEPTIC FOCUS LOCALIZATION BASED ON TENSOR REPRESENTATION
Zhao, Yanxi
pg. 305
OD-B-WE2.13 - POSITIONAL-SPECTRAL-TEMPORAL ATTENTION IN 3D CONVOLUTIONAL NEURAL NETWORKS FOR EEG EMOTION RECOGNITION
Zhao, Zeqing
pg. 864
OD-A-TH3.12 - SPTTS: PARALLEL SPEECH SYNTHESIS WITHOUT EXTRA ALIGNER MODEL
Zheng, Chengshi
pg. 523
OD-A-WE2.1 - CYCLEGAN-BASED NON-PARALLEL SPEECH ENHANCEMENT WITH AN ADAPTIVE ATTENTION-IN-ATTENTION MECHANISM
pg. 530
OD-A-WE2.2 - A ROBUST MAXIMUM LIKELIHOOD DISTORTIONLESS RESPONSE BEAMFORMER BASED ON A COMPLEX GENERALIZED GAUSSIAN DISTRIBUTION
Zheng, Thomas Fang
pg. 780
OD-A-TH2.15 - HOW SPEECH IS RECOGNIZED TO BE EMOTIONAL - A STUDY BASED ON INFORMATION DECOMPOSITION
Zheng, Wei-Zhong
pg. 829
OD-A-TH3.6 - SPEECH RECONSTRUCTION FROM THE LARYNX VIBRATION FEATURE CAPTURED BY LASER-DOPPLER VIBROMETER SENSOR
Zhi, Yiming
pg. 1097
OD-A-FR3.15 - OLR 2021 CHALLENGE: DATASETS, RULES AND BASELINES
Zhong, Lifei
pg. 1410
OD-B-TH1.10 - NEW END-TO-END NETWORK FOR STEREO HIGH DYNAMIC RANGE IMAGING
Zhou, Jiantao
pg. 1410
OD-B-TH1.10 - NEW END-TO-END NETWORK FOR STEREO HIGH DYNAMIC RANGE IMAGING
Zhou, Lin
pg. 1305
LS-C-WE1.1 - MICROPHONE ARRAY SPEECH SEPARATION ALGORITHM BASED ON DNN
Zhou, Xianjing
pg. 1541
LS-C-WE2.1 - DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION
Zhou, Xinyuan
pg. 939
OD-A-FR2.1 - CNN-BASED DISCRIMINATIVE TRAINING FOR DOMAIN COMPENSATION IN ACOUSTIC EVENT DETECTION WITH FRAME-WISE CLASSIFIER
Zhou, Yan
pg. 1375
OD-B-TH1.3 - UNDERWATER IMAGE DEHAZING BASED ON DISPARITY ESTIMATION AND COLOR CONSTRAINT
Zhu, Tianliang
pg. 1541
LS-C-WE2.1 - DEEP LEARNING ANALYSIS MODELS FOR SPEECH AND EMOTIONAL RECOGNITION
Zhu, Wenbo
pg. 1116
LS-D-WE1.2 - LIBRI-ADHOC40: A DATASET COLLECTED FROM SYNCHRONIZED AD-HOC MICROPHONE ARRAYS
pg. 635
OD-A-TH1.5 - A COMPARISON OF HANDCRAFTED, PARAMETERIZED, AND LEARNABLE FEATURES FOR SPEECH SEPARATION
Zhu, Yanping
pg. 1333
LS-C-WE1.6 - ARRHYTHMIA CLASSIFICATION ALGORITHM BASED ON SPARSE AUTOENCODER
Zhu, Yun
pg. 106
LS-D-TH1.2 - AN IMPLEMENTATION METHOD OF HEVC DATAFLOW GRAPH BASED ON RECONFIGURABLE PROCESSER
Zhuang, Weiji
pg. 564
OD-A-WE2.8 - MULTI-CHANNEL SPEECH ENHANCEMENT WITH 2-D CONVOLUTIONAL TIME-FREQUENCY DOMAIN FEATURES AND A PRE-TRAINED ACOUSTIC MODEL
Zhuang, Xuyi
pg. 553
OD-A-WE2.6 - INCORPORATING MULTI-TARGET IN MULTI-STAGE SPEECH ENHANCEMENT MODEL FOR BETTER GENERALIZATION
Zou, Chengyi
pg. 1422
OD-B-TH1.12 - SPATIAL INFORMATION REFINEMENT FOR CHROMA INTRA PREDICTION IN VIDEO CODING
Main Menu