List of Accepted Papers
Critical Information
To guarantee inclusion in the conference program, at least one author of each paper must complete a full registration by the pre-registration deadline: 30 September 2017, 23:59 (GMT +0800).
Please also submit your IEEE copyright form and final paper by 30 September 2017, 23:59 (GMT +0800), following the instructions at /apsipa2017/submit_final_camera_ready/.
List of Accepted Papers
| Paper ID | Paper Title |
| --- | --- |
| 1 | Data Embedding in Scalable Coded Video |
| 2 | Multiple Source Localization by using Energy Weighted Single Source Zone Detection |
| 3 | Parking of Nonholonomic Mobile Robots in the Discrete Time Domain |
| 5 | Sentiment Analysis in Indonesian and French by SentiSAIL |
| 7 | Understanding CNN via Deep Features Analysis |
| 9 | Image Fusion Algorithm Based on Gradient Similarity Filter |
| 12 | A Deep Learning Approach to Drone Monitoring |
| 13 | A Combined Variable Step Size Strategy for Two Microphones Acoustic Feedback Cancellation using Proportionate Algorithms |
| 14 | Delta-Modulated Cross-Correlation method for Delay Estimation on Source Localization |
| 15 | Maximum A Posteriori Adjustment of Adaptive Transversal Filters in Active Noise Control |
| 16 | Techniques for Overheating Detection and Sensor Allocation in Real Processors |
| 18 | Stop Line Detection and Distance Measurement for Road Intersection based on Deep Learning Neural Network |
| 19 | Spatial Multi-Channel Linear Prediction for Dereverberation of Ad-Hoc Microphones |
| 20 | Signal Power Estimation Based on Convex Optimization for Speech Enhancement |
| 22 | Detection of Sculpted Faces on Building Facades |
| 23 | Age/Gender Classification with Whole-Component Convolutional Neural Networks (WC-CNN) |
| 26 | Speech Enhancement based on Binaural Cues Derived from DNN |
| 27 | Pose-Invariant Kinematic Features for Action Recognition |
| 28 | Reduced Contact Lifting of Latent Fingerprint |
| 30 | Collagen Image Compression Using the JPEG-based Predictive Lossless Coding Scheme |
| 31 | A Fast Affine Projection Algorithm Based on a Modified Toeplitz Matrix |
| 33 | Automatic Object Searching by a Mobile Robot with Single RGB-D Camera |
| 40 | Blind Speaker Counting in Highly Reverberant Environments by Clustering Coherence Features |
| 41 | An Investigation of How to Design Control Parameters for Statistical Voice Timbre Control |
| 44 | Codebook-driven speech enhancement using DNN and harmonic emphasis |
| 46 | A Frequency-Domain Adaptive Feedback Cancellation Algorithm Based on Convex Combination |
| 47 | Enhanced F0 generation for GPR-based speech synthesis considering syllable-based prosodic features |
| 48 | On the Convergence of INCA Algorithm |
| 49 | A Multiple-Lane Vehicle Tracking Method for Forward Collision Warning System Applications |
| 50 | Understanding Multiple-Input Multiple-Output Active Noise Control from a Perspective of Sampling and Reconstruction |
| 51 | Understanding Multi-layer Perceptrons on Spatial Image Steganalysis Features |
| 52 | Salient Object Detection Using Array Images |
| 54 | Micro-expression Recognition using Apex Frame with Phase Information |
| 56 | Tibetan-Mandarin Bilingual Speech Recognition Based on End-to-End Framework |
| 59 | Learning the Number of Nodes in DNNs with Activation Mask |
| 60 | Unity-Bounded Functions for Designing Stable Variable Digital Filters |
| 63 | Combining Evidences from Detection Sources for Query-by-Example Spoken Term Detection |
| 64 | Low-Resource Spoken Keyword Search Strategies in Georgian Inspired by Distinctive Feature Theory |
| 65 | Sub-Nyquist Non-Uniform Sampling for Low-Cost Sound Monitoring |
| 66 | Investigation on the Roles of Human and Robot in Collaborative Storytelling |
| 67 | Topic Classification Based on Distributed Document Representation and Latent Topic Information |
| 68 | Compressed High Dimensional Features for Speaker Spoofing Detection |
| 69 | Binary Vector Reconstruction via Discreteness-Aware Approximate Message Passing |
| 70 | Spread Spectrum Compressed Sensing Magnetic Resonance Imaging via Fractional Fourier Transform |
| 71 | A Parallel Computation Algorithm for Super-Resolution Methods Using Convolutional Neural Networks |
| 72 | On the Selection and Design of Filter Banks in Normalised Subband Adaptive Filters (NSAF) |
| 73 | Localization of Harmonic Source Using a Single Moving Sensor of Known Trajectory |
| 74 | Grid-free Compressive Beamforming Using a Single Moving Sensor of Known Trajectory |
| 75 | A Novel Filtering-based F0 Estimation Algorithm with an Application to Voice Conversion |
| 76 | Functional Verification and Performance Testing for OpenAirInterface (OAI) eNodeB |
| 78 | Optimal Kernel in Kernel Regression Problems with Autocorrelation Prior |
| 81 | Portable Vision Screenings System |
| 83 | Detection of various image operations based on CNN |
| 84 | Exploiting Imbalanced Textual and Acoustic Data for Training Prosodically-enhanced RNNLMs |
| 85 | Impulsive Noise Suppression Using Interpolated Zero Phase Signal |
| 86 | Multi-Query Image Retrieval using CNN and SIFT Features |
| 87 | Synchronized Amplitude-and-Frequency Modulation for a Parametric Loudspeaker |
| 88 | Min-Max IIR Filter Design for Feedback Quantizers |
| 89 | Personality Trait Perception from Speech Signals Using Multiresolution Analysis and Convolutional Neural Networks |
| 90 | Mood Disorder Identification Using Deep Bottleneck Features of Elicited Speech |
| 91 | Cube-based Encryption Connected Prior to Motion JPEG Standard |
| 92 | Accurate subset selection for pose estimation from uncertain points and lines |
| 93 | Towards A Human-Robot Teaming System for Exploration of Environment |
| 94 | Distinction between Healthy Individuals and Patients with Confident Abnormal Respiration |
| 95 | Perceptual roles of temporal and segmentation cues in single-channel noise reduction processing |
| 96 | Transformation of Prosody in Voice Conversion |
| 97 | A New Data-driven Band-weighting Function for Predicting the Intelligibility of Noise-suppressed Speech |
| 98 | Importance of Non-Uniform Prosody Modification for Speech Recognition in Emotion Conditions |
| 101 | Fast electromagnetic tracking algorithms with coplanar transmitting coil array |
| 103 | Comparison Analysis of ICA versus MCA-KSVD Blind Source Separation on Task-related fMRI Data |
| 105 | Detection of Meaningful Line Segment Configurations |
| 106 | Defence Mechanisms Evaluation against RA Flood Attacks for Linux-Victim Node |
| 107 | Nonuniform sampling theorems for random signals in the offset linear canonical transform domain |
| 108 | Data Augmentation and Feature Extraction using Variational Autoencoder for Acoustic Modeling |
| 109 | An Integrated Framework for Multimodal Human-Robot Interaction |
| 110 | Perceptual Evaluation of Singing Quality |
| 111 | Compressing Population DNA Sequences using Multiple Reference Sequences |
| 113 | Between Homomorphic Signal Processing and Deep Neural Networks: Constructing Deep Algorithms for Polyphonic Music Transcription |
| 114 | Feedforward Sequential Memory Networks based Encoder-Decoder Model for Machine Translation |
| 115 | Signal Detection with Belief Propagation in Faster-than-Nyquist Signaling |
| 118 | End-to-end Speech Recognition for Languages with Ideographic Characters |
| 119 | Analysis of Efficient Multimodal Features for Estimating User’s Willingness to Talk: Comparison of Human-Machine and Human-Human Dialog |
| 120 | An Investigation of Application-aware Mobile Edge Computing in 5G Networks |
| 121 | Convolution theorem and Wigner-Ville distribution |
| 122 | Time-Domain Subsampling and Reconstruction for Microphone Array |
| 123 | Speech Emotion Recognition via Ensembling Neural Networks |
| 124 | Binaural Beamforming with Spatial Cues Preservation for Hearing Aids in Real-life Complex Acoustic Environments |
| 125 | Enhanced Array Manifold Matrices for L-Shaped Microphone Array-based 2-D DOA Estimation |
| 127 | An Application of Noise-Robust Speech Translation Using Asynchronous Smart Devices |
| 128 | Multimodal Speech Recognition Using Mouth Images from Depth Camera |
| 129 | Multi-Resolution for Disparity Estimation with Convolutional Neural Networks |
| 130 | Future Trend of Deep Learning Frameworks – From the perspective of Big Data analytics and HPC – |
| 131 | Gain Readjustment Function to Reduce Measurement Errors in Long-Term HRV Monitoring with Wearable Telemeter |
| 132 | Distributed Video Coding Based on Compressive Sensing and Intra-Predictive Coding |
| 133 | Deep Learning-Based Speaking Rate-Dependent Hierarchical Prosodic Model for Mandarin TTS |
| 134 | Disparity Map Refinement Method Using Coarse-to-Fine Image Segmentation |
| 135 | Resource Allocation and Minimum Rate for Precoded Non-orthogonal Multiple Access |
| 136 | A Dual Alignment Scheme for Improved Speech-to-Singing Voice Conversion |
| 137 | Experimental study on source-filter interaction using physical model of the vocal folds |
| 138 | Accurate estimation of fo and aperiodicity based on periodicity detector residuals and deviations of phase derivatives |
| 141 | Statistical-mechanical analysis of the FXLMS algorithm for multiple-channel active noise control |
| 142 | Free Linguistic and Speech Resources for Tibetan |
| 143 | Utilizing Neural Network and Critical Band Processing for Speech Enhancement |
| 144 | Acceleration for Query-by-Example Using Posteriorgram of Deep Neural Network |
| 145 | Secure Data Management System with Traceability against Internal Leakage |
| 146 | Residual Drum Sound Estimation for RPCA Singing Voice Extraction |
| 148 | Exemplar-Based Image Inpainting Based on Pixel Inhomogeneity Factor |
| 149 | A New Active Contour Model Based on Complexity of Textures for Segmentation of Natural Image |
| 150 | Weakly Labeled Acoustic Event Detection using Local Detector and Global Classifier |
| 151 | Automatic Identification of Pathological Voice Quality Based on the GRBAS Categorization |
| 152 | Sparse-based Disturbance Cancellation Approaches for Passive Radar |
| 153 | When industrial robots become social: on the design and evaluation of a multimodal interface for welding robots |
| 154 | An Efficient Method for Adapting Step-size Parameters of Primal-dual Hybrid Gradient Method in Application to Total Variation Regularization |
| 155 | A Segmental DNN/i-vector Approach for Digit-Prompted Speaker Verification |
| 156 | Adaptive Feedback Control using Improved Variable Step-Size Affine Projection Algorithm for Hearing Aids |
| 157 | A perception system for robot arms to convey objects to in-car passengers |
| 159 | Handling small motion without differential approximation |
| 160 | A unified network for multi-speaker speech recognition with multi-channel recordings |
| 161 | Sound sensing using smartphones as a crowdsourcing approach |
| 163 | Lung Sound Classification based on Hilbert-Huang Transform Features and multilayer perceptron network |
| 164 | Automatic Detection of Circulating Tumor Cells based on Microscopic Images |
| 165 | Robust Children and Adults Speech Identification and Confidence Measure Based on DNN Posteriorgram |
| 166 | Understanding the Effect of Cannabis Abuse on the ANS and Cardiac Physiology of the Indian Women Paddy-field Workers Using RR Interval and ECG Signal Analyses |
| 168 | Two-Dimensional Winner-Takes-All Hashing in Template Protection based on Fingerprint and Voice Feature Level Fusion |
| 170 | QR-Decomposed Generalized Belief Propagation with Smart Message Reduction for Low-Complexity MIMO Signal Detection |
| 173 | Searchable Encryption of Image based on Secret Sharing Scheme |
| 174 | Modulation spectrum-based speech parameter trajectory smoothing for DNN-based speech synthesis using FFT spectra |
| 177 | A Multilingual Language Processing Tool for Uyghur, Kazak and Kirghiz |
| 178 | Raw Waveform-based Speech Enhancement by Fully Convolutional Networks |
| 182 | Classification of Spectral Compressive Hyperspectral Images Using Morphological Profiles |
| 183 | Subpixel Mapping of Hyperspectral Images with Hybrid Endmember Library and Optimized Abundances |
| 184 | Hyperspectral and Multispectral Image Fusion Using Local Spatial-Spectral Dictionary Pair |
| 185 | Fast Locally Linear Embedding Algorithm for Exemplar-based Voice Conversion |
| 187 | Improving speech intelligibility for bilateral cochlear implant users using Wiener filters and its impact on listening effort |
| 189 | Music Chord Recognition From Audio Data Using Bidirectional Encoder-decoder LSTMs |
| 190 | A Fast Non-Convex Regularizer for Low Rank Matrix Completion |
| 193 | Emotion Recognition by Combining Prosody and Sentiment Analysis for Expressing Reactive Emotion by Humanoid Robot |
| 194 | Stereo Matching Using Relative Total Variation and Entropy |
| 195 | Identifying and Filling Occlusion Holes on Planar Surfaces for 3-D Scene Editing |
| 196 | Selecting Type of Response for Chat-like Spoken Dialogue Systems Based on Acoustic Features of User Utterances |
| 197 | A Computer Aided Diagnosis System for Indocyanine Green Angiography Using Multi-Scale Convolutional Neural Networks |
| 198 | Motion Planning of a 6-Dofs Robot Arm for Bandaging Nursing Task |
| 199 | Local Patch Descriptor Using Deep Convolutional Generative Adversarial Network for Loop Closure Detection in SLAM |
| 200 | Reversible Data Hiding for Compression-Friendly Image Encryption Method |
| 201 | A new detector for JPEG decompressed bitmap identification |
| 202 | An Image Compression Algorithm Based on the Karhunen Loeve Transform |
| 204 | An Investigation to Transplant Emotional Expressions in DNN-based TTS Synthesis |
| 205 | Wide Angle Virtual View Synthesis Using Two-by-Two Kinect V2 |
| 206 | Speech Emotion Recognition Based on Multi-Task Learning Using a Convolutional Neural Network |
| 207 | An Investigation of Spectral Feature Partitioning for Replay Attacks Detection |
| 208 | Patterns of Vowels in Uyghur Tri-syllabic Words |
| 209 | A Maximum Likelihood Approach to Deep Neural Network Based Speech Dereverberation |
| 210 | Toward effective noise reduction for sub-Nyquist high-frame-rate MRI techniques with deep learning |
| 211 | I2R-NUS Submission to Oriental Language Recognition AP16-OL7 Challenge |
| 212 | Array shape calibration using near field pilot sources with unknown distance |
| 213 | Representing Raw Linguistic Information in Chinese Text-to-Speech System |
| 214 | An Empirical Study on Performance Optimization at District Cooling Plant of Universiti Teknologi PETRONAS |
| 215 | Diffusion LMS Using Consensus Propagation |
| 216 | A robust PET Image reconstruction using constrained non-negative matrix factorization |
| 218 | F0 Estimation Using Empirical Mode Decomposition and Complex Cepstrum Analysis in Reverberant Environments |
| 219 | Performance Evaluation of Acoustic Scene Classification Using DNN-GMM and Frame-Concatenated Acoustic Features |
| 221 | Learning a Robust DOA Estimation Model with Acoustic Vector Sensor Cues |
| 222 | Efficient Edge-Oriented Based Image Interpolation Algorithm for Non-Integer Scaling Factor |
| 223 | Speech Emotion Recognition using Convolutional Long Short-Term Memory Neural Network and Support Vector Machines |
| 224 | Generalized Atom and Dictionary Design and Compressive Sensing for Vocal Signal Expansion |
| 226 | Density-based Multi-Manifold ISOMAP for Data Classification |
| 227 | SCFT: Sector-based Cancelable Fingerprint Template |
| 228 | Initial Depth Estimation using EPIs and Structure Tensor |
| 229 | Electrically-evoked frequency following responses (EFFRs) and electrically-evoked auditory brainstem responses (EABRs) in guinea pigs |
| 230 | Language Resource Construction for Mongolian |
| 231 | Development of a multi-modal personal authentication interface |
| 232 | A Comparison Study of Information Contributions of Phonemic Contrasts in Mandarin |
| 233 | Voice conversion based on deep neural networks for time-variant linear transformations |
| 234 | A Rail Detection Algorithm Based on Pair Particles Filtering |
| 236 | Swallowing function evaluation using deep-learning-based acoustic signal processing |
| 237 | Four-Dimensional Image Compression with Region of Interest Based on Non-separable Double Lifting Integer Wavelet Transform |
| 239 | Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations |
| 241 | Word Level Prosody Prediction Using Large Audiobook Dataset |
| 243 | PGT: Proposal-Guided Object Tracking |
| 244 | A Study of High Level Tone in Standard Chinese Produced by Prelingual Deaf Adults |
| 245 | Robust Voice Activity Detection Based on LSTM Recurrent Neural Networks and Modulation Spectrum |
| 246 | Speech Emotion Recognition Using MPCRNN based on Gammatone auditory filterbank |
| 247 | Online Sound Structure Analysis Based on Generative Model of Acoustic Feature Sequences |
| 248 | New Approach for Image Segmentation with Shape Priors Based on the Potts Model |
| 250 | An improved orthogonal matching pursuit based on randomly enhanced adaptive subspace pursuit |
| 251 | Enhanced Neural Machine Translation by Learning from Draft |
| 253 | Tracking objects using 3D object proposals |
| 254 | Locomotion Control of a Serpentine Crawling Robot Inspired by Central Pattern Generators |
| 255 | Energy Distribution Analysis and Nonlinear Dynamical Analysis of Phonation in Patients with Parkinson’s Disease |
| 256 | A Free Kazakh Speech Database and a Speech Recognition Baseline |
| 257 | Memory-augmented Chinese-Uyghur Neural Machine Translation |
| 258 | DNN-based Feature Transformation for Speech Recognition Using Throat Microphone |
| 259 | Joint Unsupervised Adaptation of N-gram and RNN Language Models via LDA-based Hybrid Mixture Modeling |
| 260 | A Summary of Blind Guiding Methods Converting Images to Sounds |
| 261 | Millimeter Wave Radar Image Denoising with Complex Nonseparable Oversampled Lapped Transform |
| 262 | Robust i-vector extraction tightly coupled with voice activity detection using deep neural networks |
| 263 | Sound Source Localization Using Binaural Difference for Hose-Shaped Rescue Robot |
| 266 | Application of Mean-shift Clustering for Removing Flux Trapping Noise from Geomagnetic Field Signals Measured using HTS-SQUID Magnetometers |
| 267 | Doubly Adaptive Kernel Filtering |
| 268 | Active Enumeration of Local Minima for IIR Filter Design Using PSO |
| 269 | Correlation Between Different DNA Period-3 Signals: An Analytical Study for Exons Prediction |
| 270 | Dynamic semantic boundary detection for speech translation |
| 272 | An Accelerated Adjustment Method for Microphone Array Directivity |
| 273 | Plastic multi-resolution auditory model based neural network for speech enhancement |
| 276 | The Fractional Fourier Transform on Graphs |
| 277 | Epileptic Focus Localization Based on Bivariate Empirical Mode Decomposition and Entropy |
| 278 | Multi-Scale Salient Object Detection with Pyramid Spatial Pooling |
| 279 | Frequency-Invariant Differential Microphone Array Design in the STFT Domain |
| 280 | Natural Scene Statistics Based Publication Classification Algorithm Using Convolutional Neural Network |
| 281 | Identifying Computer-Generated Text Using Statistical Analysis |
| 282 | Design of Adaptively Scaled Belief in Large MIMO Detection for Higher-Order Modulation |
| 283 | AP17-OLR Challenge: Data, Plan, and Baseline |
| 284 | An End-to-End Neural Network Approach to Story Segmentation |
| 285 | Multiband Hierarchical Ad Hoc Network with Wireless Environment Recognition |
| 286 | A Markerless Visual-motor Tracking System for Behavior Monitoring in DCD Assessment |
| 287 | Speech Separation By Cost-Sensitive Deep Learning |
| 288 | Improving N-gram Language Modeling for Code-switching Speech Recognition |
| 289 | Single-shot High Dynamic Range Imaging via Deep Convolutional Neural Network |
| 290 | An Effective Segmentation Algorithm of Apple Watercore Disease Region Using Fully Convolutional Neural Networks |
| 292 | Investigation of Effectiveness on Recurrent Neural Network for Daily Activity Recognition using Multi-modal Signals |
| 294 | Modeling and Measuring a Moog Voltage-Controlled Filter |
| 295 | MSE-Optimized CP-Based CFO Estimation in OFDM Systems over Multipath Channels |
| 296 | Deep Acoustic-to-Articulatory Inversion Mapping with Latent Trajectory Modeling |
| 298 | Speech Watermarking Scheme Based on Singular-Spectrum Analysis for Tampering Detection and Identification |
| 300 | Integrating Online i-vector into GMM-UBM for Text-dependent Speaker Verification |
| 301 | On the Performance Impact of Virtual Link Types to 5G Networking |
| 303 | A New Pool Control Method for Boolean Compressed Sensing Based Adaptive Group Testing |
| 305 | A study on Quantitative Computation for Prosodic Strength of Mandarin Speech |
| 306 | Integrated Algorithm for Block-Permutation-Based Encryption with Reversible Data Hiding |
| 307 | Topic Embedding of Sentences for Story Segmentation |
| 308 | Multiscale Directional Transforms based on Cosine-Sine Modulated Filter Banks for Sparse Directional Image Representation |
| 309 | SIMD Acceleration for HEVC Encoding on DSP |
| 310 | Gradient-based Contrast Enhancement and Color Correction for Underwater Images |
| 311 | Speaker Recognition with Cough, Laugh and “Wei” |
| 312 | Successive MMSE Group Decoding and Max-Min Power Control for Uplink Multicell NOMA Systems under Pilot Contamination |
| 313 | Sequential Decomposition of 2D Apparent Motion Field Based on Low-Rank and Sparse Approximation |
| 317 | A Drag-and-Drop Type Human Computer Interaction Technique Based on Electrooculogram |
| 318 | Deep Speaker Verification: Do We Need End to End? |
| 319 | Visual Attention Guided Eye Movements for 360 Degree Images |
| 321 | Classifying Road Surface Conditions Using Vibration Signals |
| 323 | Panchromatic and Multi-spectral Image Fusion Method Based on Two-step Sparse Representation and Wavelet Transform |
| 326 | Infrared and Visible Image Fusion Based on Innovation Feature Simultaneous Decomposition |
| 328 | Block-DCT Based Alterable-Coding Restorable Fragile Watermarking Scheme with Superior Localization |
| 329 | Image Origin Identification for Online Social Networks (OSNs) |
| 332 | Abnormal sound detection by two microphones using virtual microphone technique |
| 333 | Highly-Distributed Sensor Processing using IoT for Critical Infrastructure Monitoring |
| 334 | Cross-lingual Speaker Verification with Deep Feature Learning |
| 335 | Music Thumbnailing via Neural Attention Modeling of Music Emotion |
| 339 | Electrolaryngeal Speech Modification towards Singing Aid System for Laryngectomees |
| 340 | Objective neurophysiological assessment for sound quality perception by hearing-impaired listeners |
| 341 | Deep Learning Noise Reduction Approach to Improve Speech Intelligibility for Cochlear Implant Recipients in the Presence of Competing Speech Noise |
| 342 | Automatic Vehicle Classification Using Center Strengthened Convolutional Neural Network |
| 343 | Acoustic Scene Classification Using Self-Determination Convolutional Neural Network |
| 345 | Mandarin Electrolaryngeal Voice Conversion with Combination of Gaussian Mixture Model and Non-negative Matrix Factorization |
| 346 | Construction of Semi-Markov Ergodic Maps with Selectable Spectral Characteristics via the Solution of the Inverse Eigenvalue Problem |
| 347 | Optimized Human Detection on the Embedded Computer Vision System |
| 348 | Investigating the use of Scattering Coefficients for Replay Attack Detection |
| 349 | Fast High-Quality Three-Dimensional Reconstruction from Compressive Observation of Phased Array Weather Radar |
| 350 | Trellis Coded Generalized Spatial Modulation with Spatial Multiplexing |
| 351 | Teaching and Learning Next Generation Mobile Communication Networks through Open Source OpenAirInterface Testbeds |
| 352 | An Acoustic Monitoring System and Its Field Trials |
| 353 | Non-native speech conversion with consistency-aware recursive network and generative adversarial network |
| 354 | Overlapping Acoustic Event Classification Based on Joint Training with Source Separation |
| 357 | A Study on Landmark Detection Based on CTC and Its Application to Pronunciation Error Detection |
| 358 | LSTM-Based Iterative Mask Estimation and Post-Processing for Multi-Channel Speech Enhancement |
| 359 | Convolutional Neural Network with Multi-Task Learning Scheme for Acoustic Scene Classification |
| 360 | Prediction Techniques for Wavelet Based 1-D Signal Compression |
| 361 | Random Aliasing Modulation with Decision-Directed Demodulation |
| 362 | FasterMDNet: Learning Model Adaptation by RNN in Tracking-by-Detection based Visual Tracking |
| 363 | Towards Event Based MCTS for Autonomous Cars |
| 365 | Interchannel Phase Difference Clustering Based on Phase Replication for Multiple Sound Sources Localization |
| 366 | Using Optimal Ratio Mask as Training Target for Supervised Speech Separation |
| 367 | Voichap: a standalone voice change application on iOS platform |
| 368 | Exploring Confusing Scene Classes for the Places Dataset: Insights and Solutions |
| 369 | Active Noise Control for Muffler |
| 371 | Study on method for protecting speech privacy by actively controlling speech transmission index in simulated room |
| 372 | End-to-End Speech Emotion Recognition Using Multi-Scale Convolution Networks |
| 373 | Sliced Voxel Representations with LSTM and CNN for 3D Shape Recognition |
| 374 | Wavelet Scattering Transform for Variability Reduction in Cortical Potentials Evoked by Pitch Matched Electroacoustic Stimulation in Unilateral Cochlear Implant Patients |
| 375 | Low-Complexity Zero-Forcing Detector for large-scale MIMO-OFDM systems |
| 377 | Compressed Sensing Reconstruction of MR Phase-varied Images using Multi-scale Complex Sparsifying Transform |
| 378 | Generalization of Thai Tone Contour in HMM-Based Speech Synthesis |
| 379 | Robust Image Identification without any Visible Information for Double-Compressed JPEG Images |
| 380 | Pseudo Multi-Exposure Fusion Using a Single Image |
| 381 | A Fairness aware and Resource Reuse Algorithm for LTE Layered Video Multicast Service |
| 382 | A Deep Learning Architecture for Classifying Medical Images of Anatomy Object |
| 383 | Image Manipulation on Social Media for Encryption-then-Compression Systems |
| 384 | Image Super-Resolution Based on Error Compensation with Convolutional Neural Network |
| 385 | Ensemble of Binary Tree Structured Deep Convolutional Network for Image Classification |
| 386 | Graph Reduction Method Using Localization Operator and Its Application to Pyramid Transform |
| 388 | On the Construction of more Human-like Chatbots: Affect and Emotion Analysis of Movie Dialogue Data |
| 389 | A Study of Monitoring System for Radio Leak with Massive Radio Sensors |
| 391 | Enhancing Wedgelet-based Depth Modeling in 3D-HEVC |
| 392 | Joint Bilateral based Image Denoising using Multi-sized 2D Hard Threshold |
| 393 | Multi-Channel Neural Network for Steganalysis |
| 394 | CNN-based Bottleneck Feature for Noise Robust Query-by-Example Spoken Term Detection |
| 395 | Multi-layer Background Sprite Model for 2D-to-3D Video Conversion |
| 396 | Joint Estimation of Signal and Mutual Coupling Parameters Based on Spatially Spread Polarization Sensitive Array |
| 397 | Super resolution based side information creation algorithm for distributed scalable video coding |
| 398 | Image ordinal estimation: classification and regression benefit each other |
| 399 | Real time digitized Neural-spike storage scheme in multiple channels for biomedical applications |
| 401 | Psychoacoustic subband active noise control algorithm |
| 402 | Robust Template Matching Using Scale-Adaptive Deep Convolutional Features |
| 406 | The longitudinal Development of Focus Duration for Korean Chinese Learners |
| 408 | Multi-focus Image Fusion Using Gaussian Filter and Dynamic Programming |
| 411 | Redesigning Data Hiding: Interpolation-Based Joint Scrambling-Embedding Method |
| 412 | Investigating Siamese LSTM Networks for Text Categorization |
| 415 | A Study on Enhanced Educational Platform with Adaptive Sensing Devices using IoT Features |
| 416 | Real Time Image Processing Based Obstacle Avoidance and Navigation System for Autonomous Wheelchair Application |
| 417 | Light Field Scene Flow With Occlusion Regularization |
| 419 | The Acoustic Characteristics of Tone 3 in Standard Chinese Produced by Prelingually Deaf Adults |
| 421 | Digital Computation of Fractional Fourier and Linear Canonical Transforms and Sparse Image Representation |
| 424 | Facial Action Recognition using Very Deep Networks for Highly Imbalanced Class Distribution |
| 425 | A Study of Automatic Annotation of PETs with Articulatory Features |
| 429 | Fuzzy Qualitative Approach for Micro-Expression Recognition |
| 430 | Zynq-based Full HD Around View Monitor System for Intelligent Vehicle |
| 432 | Background Subtraction via Truncated Nuclear Norm Minimization |
| 434 | Dictionary design and disparity interpolation on distributed compressed sensing for light field image |
| 435 | A New Intra Prediction Method Based on Consistent Luminance Changes |
| 436 | Digital Hologram Data Representation Method |
| 437 | A Real Time Micro-expression Detection System with LBP-TOP on a Many-core Processor |
| 438 | LiDAR/Camera Sensor Fusion Technology for Pedestrian Detection |
| 440 | Detection and Classification of Malicious Patterns Using Benford’s Law |
| 443 | Emotional Statistical Parametric Speech Synthesis Using LSTM-RNNs |
| 444 | Accelerating Deep Learning by Binarized Hardware |
| 445 | Automated Classroom Monitoring With Connected Visioning System |
| 446 | Acoustic Modeling with Shared Phoneme Set for Multilingual Speech Recognition without Code-Switching |
| 448 | Hybrid EEG-NIRS Brain–Computer Interface Under Eyes-Closed Condition |
| 450 | A New Bilateral Filter for Post-removing the Noise of Synthesis View in 3D Video |
| 452 | Single Image Superresolution by Multiple Geometrical Regressors |
| 453 | Effectiveness of Headrest ANC System with Virtual Sensing Technique for Factory Noise |
| 454 | Simultaneous Biosignal Measurement System for Multiple Users – Development and Validation |
| 455 | Deep Recurrent Neural Network for Video Super-Resolution |
| 456 | Efficient Video Coding Using Rigid Object Tracking |
| 457 | A fast and energy efficient FPGA-based system for real-time object tracking |
| 458 | Efficient CTU-based Intra Frame Coding for HEVC Based on Deep Learning |
| 459 | Multi-accelerator Architecture for GoogLeNet |
| 460 | Augmented Visualization: Observing as Desired |
| 461 | Effect of the Audio Amplifier’s Distortion on Feedforward Active Noise Control |
| 462 | QoE-estimation models for video streaming services |
| 463 | Rewritable Data Insertion in Encrypted JPEG using Coefficient Prediction Method |
| 464 | Development of Under-Resourced Bahasa Indonesia Speech Corpus |
| 465 | Recent Advances in Biometric Security: A Case Study of Liveness Detection in Face Recognition |
| 467 | Multimodal Decomposition for Enhanced Subtle Emotion Recognition |
| 469 | Enhanced Block Truncation Coding Image using Digital Multitone Screen |
| 470 | Automatic Meeting Transcription System for the Japanese Parliament (Diet) |
| 471 | Common Visual Pattern Discovery and Search |
| 472 | Trends in efficient representation of 3D point clouds |
| 473 | Online Unsupervised Kernel Learning Algorithm |