Industrial Activities


- APSIPA Industrial Governance Board

- APSIPA Industrial Distinguished Leader Program
- Bios of Board Chair and 3 Committee Chairs
- Bios of Committee Members


APSIPA Industrial Governance Board

Chair - VP of Industrial Relations and Development
Shan Liu, MediaTek, USA (Chair)

Past-Chair - VP of Industrial Relations and Development
Haohong Wang, TCL Research America, USA

Industrial Publication Committee
Guan-Ming Su, Dolby Labs, USA (Chair)
Ning Xu, Snapchat Research, USA
Ming Xia, LinkedIn, USA
Yang Lei, HP, USA
Harshad Kadu, Dolby Labs, USA
I-Ming Pao, Adobe, USA
Yuebing Jiang, Apple, USA

Industrial Forum Committee
Ioannis Katsavounidis, Netflix, USA (Chair)
Tong Zhang, HP Labs, USA (Past-Chair)
Anne Aaron, Netflix, USA
Khaled El-Maleh, Qualcomm, USA
José Roberto Alvarez, Futurewei Technologies, USA
Jin-Gyeong Kim, LG, Korea
Anil Kokaram, YouTube/Google, USA

Industrial Membership Committee
Shinji Watanabe, MERL, USA (Chair)
Shan Liu, MediaTek, USA (Past-Chair)
Ying Li, IBM, USA
Masami Akamine, Toshiba, Japan
Takafumi Koshinaka, NEC, Japan
Jim Zhao, Huawei, USA
Yan Ye, InterDigital, USA
Hiroshi Sawada, NTT Communication Science Labs, Japan
Michiel Bacchiani, Google, USA



APSIPA Industrial Distinguished Leader Program

The APSIPA Industrial Distinguished Leader is a distinction reserved for selected APSIPA members (or potential members) whose extraordinary accomplishments in any of the fields within APSIPA's scope are deemed fitting of this prestigious recognition. The Distinguished Leader nomination and election process is conducted twice a year. For more details, please refer to the guideline and nomination form.

Year 2015 APSIPA Industrial Distinguished Leaders
Chung-Min Chen, VP, Telcordia, USA
Lawrance Loh, President, MediaTek USA Inc., USA
Nikhil Balram, President & CEO, Ricoh Innovations, USA
Yasunori Mochizuki, VP, NEC, Japan
Yi Hao, CEO, GWC, China

Year 2014 APSIPA Industrial Distinguished Leaders
Byung K. Yi, EVP & CTO, InterDigital, USA
C. C. Lee, SVP, Sony, USA
Li Deng, Partner Research Manager, Microsoft, USA (IEEE Fellow)
Kevin Jou, VP & CTO, MediaTek, Taiwan
Sherman (Xuemin) Chen, VP, Broadcom, USA (IEEE Fellow)
Steve Simske, HP Fellow and Chief Technologist, HP, USA
Shawmin Lei, VP, MediaTek, Taiwan (IEEE Fellow)
Tomohiko Taniguchi, Principal, Fujitsu, Japan (IEEE Fellow)



Bios of Board Chair and 3 Committee Chairs:

Shan Liu is Director of Technology Development at MediaTek USA Inc. Prior to joining MediaTek, she held various technical and management positions at Mitsubishi Electric Research Laboratories (MERL), Sony Electronics and Sony Computer Entertainment America, Samsung Electronics America, IBM T.J. Watson, Rockwell Science Center, InterVideo Inc. and AT&T Labs, Research. Dr. Liu is the inventor of more than 50 granted patents and patent applications, and author of more than 30 peer-reviewed journal and conference papers. She has also been an active participant in and contributor to ISO/IEC and ITU-T standards since 2006. By late 2015, she had published more than 200 standard contributions. Many of her inventions have been adopted by international standards such as ITU-T H.265 | ISO/IEC HEVC and built into communication and multimedia products. Dr. Liu has chaired and co-chaired a number of ad-hoc and technical groups through standard development and served as editor of the HEVC Screen Content Coding standard. She was the recipient of the MediaTek R&D Gold (highest honor) Award in 2013. Dr. Liu obtained her B.Eng. degree in Electronics Engineering from Tsinghua University, Beijing, China and the M.S. and Ph.D. degrees in Electrical Engineering from the University of Southern California, Los Angeles, USA.

Haohong Wang is the General Manager of TCL Research America in San Jose, California. Prior to joining TCL, he held technical and management positions at AT&T, Catapult, Qualcomm, Marvell and Cisco. He is the inventor of 50+ patents and pending applications, and co-author of 5 books and 50+ articles in journals and conferences. He is the Editor-in-Chief of the Journal of Communications, and has been a member of the Steering Committee of the IEEE Transactions on Multimedia. He co-chairs the IEEE Technical Committee on Human Perception and Multimedia Computing, and has chaired the IEEE Multimedia Communications Technical Committee. He chairs the Steering Committee of the ICNC conference, and has served as the General Chair of IEEE ICME 2011 and IEEE VCIP 2014, and as the TPC Chair of IEEE GLOBECOM 2010. He is the recipient of the IEEE MMTC Distinguished Service Award (2013), and of the Manager of the Year Award (2012), Distinguished Inventor Award (2013), and Technology Innovation Award (2014) from TCL Corporation. He received his Ph.D. from Northwestern University, Evanston, USA.

Guan-Ming Su is with Dolby Labs, Sunnyvale, CA. Prior to this, he was with the R&D Department, Qualcomm, Inc., San Diego, CA; ESS Technology, Fremont, CA; and Marvell Semiconductor, Inc., Santa Clara, CA. He is the inventor of 50+ U.S. patents and pending applications. He is the co-author of 3D Visual Communications (John Wiley & Sons, 2013). He has served as an associate editor of the Journal of Communications, and as Director of the review board and R-Letter in the IEEE Multimedia Communications Technical Committee. He has also served as Technical Program Track Co-Chair of ICCCN 2011, Theme Chair of ICME 2013, TPC Co-Chair of ICNC 2013, TPC Chair of ICNC 2014, Demo Chair of SMC 2014, General Chair of ICNC 2015, Area Co-Chair for Multimedia Applications of ISM 2015, and Demo Co-Chair of ISM 2016. He is a Senior Member of IEEE. He obtained his Ph.D. degree from the University of Maryland, College Park.

Ioannis Katsavounidis received his B.S. degree in Electrical Engineering from the Aristotle University of Thessaloniki in Greece and continued his graduate studies at the University of Southern California in Los Angeles, CA, USA, where he received the M.S., Engineer's and Ph.D. degrees, all in Electrical Engineering. Between 1996 and 2000, he worked in Italy as an engineer for the underground physics experiment MACRO, an international collaboration working on magnetic monopole, neutrino and rare-particle detection. In 2000, he joined InterVideo, the makers of WinDVD, eventually becoming Director of New Technologies. In 2007, he co-founded Cidana, a mobile-multimedia software company in Shanghai, China, where he served as Chief Technology Officer. In 2008, he joined the Department of Electrical and Computer Engineering at the University of Thessaly in Volos, Greece, as Associate Professor, teaching courses in Image Processing, Audio and Video Compression and Information Theory. Since 2015, he has been a member of the Video Algorithms group, a part of the Encoding Technologies team at Netflix. His research interests are in the areas of video compression, image processing, computer vision and information theory. Dr. Katsavounidis has over 100 publications, including 35 US and EPO patents.

Shinji Watanabe is a Senior Principal Research Scientist at Mitsubishi Electric Research Laboratories (MERL), Cambridge, MA. He received his Ph.D. from Waseda University, Tokyo, Japan, in 2006. From 2001 to 2011, he was a research scientist at NTT Communication Science Laboratories, Kyoto, Japan. From January to March 2009, he was a visiting scholar at the Georgia Institute of Technology, Atlanta, GA. His research interests include Bayesian machine learning and speech and spoken language processing. He has published more than 100 papers in journals and conferences, and received several awards including the Best Paper Award from IEICE in 2003. He has served as an Associate Editor of the IEEE Transactions on Audio, Speech, and Language Processing, and he serves as a member of several committees, including the IEEE Signal Processing Society Speech and Language Technical Committee and the APSIPA Speech, Language, and Audio Technical Committee.



Bios of Committee Members (in alphabetical order)

Anne Aaron manages the Video Algorithms team at Netflix, responsible for software development and research related to video analysis, processing and encoding in the Netflix cloud-based ingest and encoding pipeline. The team is tasked with generating the best quality video encodes for more than 62 million Netflix customers. Prior to Netflix, Anne had technical lead roles at Cisco, working on the software deployed with millions of Flip Video cameras; Dyyno, an early stage start-up which developed a real-time peer-to-peer video distribution system; and Modulus Video, a broadcast video encoder company.

During her Ph.D. studies at Stanford University, she was a member of the Image, Video and Multimedia Systems Laboratory, led by Prof. Bernd Girod. Her research was among the pioneering work in the sub-field of Distributed Video Coding, and the resulting publications have been widely cited by researchers in academia and industry. Anne is originally from Manila, Philippines. She holds B.S. degrees in Physics and Computer Engineering from Ateneo de Manila University and M.S. and Ph.D. degrees in Electrical Engineering from Stanford University.

Masami Akamine is a Senior Fellow at Toshiba Research and Consulting, working on research and development of speech processing algorithms and their applications, as well as being responsible for coordinating research programs among research groups in Kawasaki, Japan; Cambridge, UK; and Beijing, China. He received his Ph.D. degree in electrical engineering from Tohoku University in 1985. Since 1985 he has been with Toshiba Corporate Research and Development Center. He has worked on speech coding, speech synthesis, automatic speech recognition, spoken dialogue and their applications. He has published more than 60 papers in journals and conferences, and has more than 100 granted patents. He was recognized as an outstanding researcher by the Minister of Education, Science and Technology of Japan in 2001. He received the Technology Development Award from the Acoustical Society of Japan in 2002, the Society Best Paper Award from IEICE Japan in 2003, the Prime Minister's Prize from the Japan Institute of Invention and Innovation in 2008, and the Achievement Award from IEICE Japan in 2012. He was also honoured to receive the Purple Ribbon Medal from the Emperor of Japan in 2013. He is a senior member of IEEE and served as a member of the Speech and Language Technical Committee for two years from 2012.

José Roberto Alvarez is Director of Strategy and Planning at Futurewei Technologies in Santa Clara, California, where he is responsible for strategic development of media technologies for Huawei R&D worldwide. He started his career at Philips Laboratories and has been involved in architecting, designing and implementing media technology products for a variety of industries including broadcast, consumer, post-production and computer graphics for companies including Philips, Broadcom, Maxim, Xilinx, and three successful start-ups in Silicon Valley. He has actively participated in major industry inflection points in the last 20 years, most notably in the development of H.265, H.264, MPEG-4, MPEG-2, HDTV, desktop video graphics, and Extensible Processing FPGA platforms. Mr. Alvarez earned Bachelor's and Master's degrees in Electrical Engineering with distinction from The City University of New York. His work has been granted 45 US and 6 EU patents.

Michiel Bacchiani has worked in various areas of speech recognition research for more than 20 years with an emphasis on acoustic modeling. He currently manages the acoustic modeling team at Google responsible for developing the technology backing all Google speech applications. At Google, he previously led the efforts around voicemail transcription and YouTube automatic captioning.

Before joining Google, Michiel Bacchiani worked as a member of technical staff at IBM Research. Before that he worked at AT&T Research Labs and ATR International in Kyoto, Japan. In all these assignments he focused on various aspects of speech recognition algorithm research. Michiel Bacchiani received the "ingenieur" (ir.) degree from the Technical University of Eindhoven, The Netherlands and the Ph.D. degree from Boston University. He has authored numerous scientific publications. He has been elected chair of the IEEE Speech and Language Processing Technical Committee. He is a board member and subject editor of Speech Communication. He has served on various conference and workshop technical committees and served as area chair for major international conferences (ICASSP, Interspeech).

Khaled El-Maleh is a Senior Director of Technology in the IP Department of Qualcomm, leading the Multimedia Technology team and related Multimedia and User Experience IP Strategy areas. Dr. El-Maleh's areas of expertise and interests include design, implementation and quality evaluation of mobile multimedia systems, data mining/analytics, human-computer interfaces, computer vision applications, innovation and industry-university technology transfer. He is a technologist and strategist with a focus on entrepreneurship and innovation.

Khaled joined Qualcomm in 2000 as a Senior Engineer working on multimedia technology in Qualcomm Chip Business (QCT). Prior to joining the IP Department in 2007, Khaled contributed to the early design of Qualcomm voice and video products. Khaled received double-major Bachelor's degrees in Electrical and Computer Engineering and in Applied Math from King Fahd University of Petroleum and Minerals, Saudi Arabia, and M.Eng. and Ph.D. degrees in Electrical and Computer Engineering from McGill University, Canada. He is an accomplished inventor with more than 200 US and international patents. He was awarded the Qualcomm Career Thought Leadership Award in 2009, and the IP Department 2013 Distinguished Contributor Award.

Dr. Yuebing Jiang is with Apple Inc. He received the B.S. degree in microelectronics from Xi'an Jiaotong University, Xi'an, China in 2008, and the Ph.D. degree in Computer Engineering from the University of New Mexico, Albuquerque, New Mexico in 2014. Prior to joining Apple, he worked for Real Communications Inc., where he was in charge of video engine modeling and verification. His research interests include FPGA architecture for image and video applications, dynamic partial reconfiguration in video compression, and video/image compression algorithms and standards. He has published papers in related areas and has been a reviewer for conferences and journals such as ICIP and IEEE Transactions on Image Processing.

Harshad Kadu is with Dolby Laboratories Inc., working on image and video processing applications. Prior to Dolby, he worked at Conversant Media Inc. as a Computer Vision and Machine Learning Research Scientist. He received his Ph.D. degree in Electrical Engineering from the University of Southern California. His research interests include image processing, video compression, 3D graphics, computer vision and machine learning.

Jin-Gyeong Kim is a Research Fellow at LG Electronics Inc. He currently leads the IP development team in the System IC R&D Center, where he is responsible for the development of various IPs, including Analog Mixed Signal IP and Digital IP for DTV and Mobile SoC. He joined Goldstar, the former name of LG Electronics, in 1992. He was involved in the Digital HDTV project, where his role was MPEG-1 video decoder development in an FPGA prototype system. He also developed an MPEG-2 video decoder chip for the ATSC standard. (During this period, he played various roles in HDTV system architecture design and in setting up the development environment and methodology for FPGA prototype and ASIC development.) From 2003 he was a project leader for SoC development for the Korea Digital Mobile Broadcasting system, in which H.264 was adopted, and the SoC was successfully adopted in mobile phones. He led several SoC projects, including world mobile broadcasting systems (T-DMB, DVB-H, ISDB-T 1seg) for mobile phone and automotive, a DTV SoC for digital-to-analog converter boxes, and a PVR (Personal Video Recorder) SoC. From 2008 he led DTV SoC projects for Smart TV, and the SoCs were adopted in Google TV and LG Smart TV products. Since 2010 he has been the leader of the digital IP development team for multimedia IP, CPU/GPU, and picture quality IPs. (4K HEVC decoder, Super Resolution, and 4K/8K Frame Rate Converter IP enabled LG 4K TV early in the market.) In 2015 his team's scope grew to include Analog/Mixed signal IP. From 2013 to 2015 he also developed vision processing IP for ADAS. He received a BS in electrical engineering from Yonsei University and an MS in electrical engineering from KAIST in 1990 and 1992, respectively. He received his Ph.D. degree in signal processing from the University of Southern California in 2003.

Anil Kokaram is a Tech Lead in the Transcoding Group at YouTube/Google. He leads a team responsible for video quality and develops video processing algorithms for quality improvement in various pipelines. He is also a Professor at Trinity College Dublin, Ireland and continues to supervise students at www.sigmedia.tv in the EE Dept there. His expertise is in the broad areas of DSP for Video Processing, Bayesian Inference and motion estimation. He has published over 100 refereed papers in these areas. In 2007 he was awarded a Science and Engineering Academy Award for his work in video processing for post-production applications. He was founder of a company (GreenParrotPictures) producing video enhancement software that was acquired by Google in 2011. He was formerly Associate Editor of the IEEE Transactions on Circuits and Systems for Video Technology and the IEEE Transactions on Image Processing.

Takafumi Koshinaka is a Senior Principal Researcher at the Data Science Research Laboratories, NEC Corporation, Japan. He is currently leading the research and development for speech and audio processing solutions. His research interests include statistical pattern recognition and machine learning for a wide range of media information processing. He received his B.E. and M.E. in aeronautical engineering from Kyoto University in 1991 and 1993, respectively, as well as his Ph.D. in computer science from Tokyo Institute of Technology in 2013. He received the Young Researcher's Award from the Institute of Electronics, Information, and Communication Engineers (IEICE) in 2000. He is a member of IEEE, the Acoustical Society of Japan (ASJ), and IEICE.

Ying Li received the B.S. and M.S. degrees in Computer Science and Engineering from Wuhan University, China and the Ph.D. degree in Electrical Engineering from the University of Southern California (USC), Los Angeles, in 1993, 1996, and 2003, respectively. Since March 2003, Dr. Li has been with IBM T. J. Watson Research Center as a Research Staff Member. Dr. Li's research interests include digital image processing, multimedia content analysis and management, pattern recognition, computer vision, business analytics, statistical modeling and computational creativity. Dr. Li has authored or co-authored around 70 peer-reviewed conference and journal papers including ACM MM, ICMR, ICME, ICPR, SCC, CSVT, IEEE Multimedia and PRL, as well as 7 books and book chapters on various multimedia and computer vision related topics. She currently holds 50+ US patents. Dr. Li has been serving on the Technical Program Committee of dozens of IEEE and ACM conferences such as ICME, ICIP, ICASSP, SCC and ACM Multimedia, as well as on the Program Committee of various IEEE conferences and workshops. Dr. Li is on the editorial board of the Journal of Visual Communication and Image Representation (JVCI) and the International Journal of Multimedia Information Retrieval (JMIR). She was the leading guest editor for the JVCI Special Issue on Multimedia Database Management, 2004, and Special Issue on Emerging Technique for Multimedia Content Sharing, Search and Understanding, 2008. Dr. Li was (or is) the Publicity Chair of MMSP 2007, Finance Chair of ICME 2009, VCIP 2012, ICME 2014 and 2016, Area Chair of ICPR 2012, ISCAS 2014-2015, BigMM 2015 and ICME 2015, Industry Program Chair of ISM 2009 and 2011, Panel Co-Chair of ISM 2013 and ISM 2015, and co-chair of various special sessions at ICME and SPIE. Dr. Li was on the NSF panel of the Video Analysis Track in 2005. She is an IEEE Senior Member.

Yang Lei has been a Research Scientist at HP Labs, Palo Alto, California since 2014. She received her Ph.D. in Electrical and Computer Engineering from Purdue University, West Lafayette, under the supervision of Prof. Jan P. Allebach. Her research interests are in the areas of image recognition and understanding, computer vision, 3D scanning systems and point cloud processing. She has published multiple articles in IEEE conferences and the Journal of Imaging Science and Technology. She also served on the Technical Program Committee of IEEE BigMM 2015 and is the Publication Chair of IEEE ISM 2016.

I-Ming Pao is a Scientist at Adobe and previously held engineering and management positions at Real Communications Inc, SONY, and Freescale. He received his Ph.D. from University of Washington, Seattle, USA.

Hiroshi Sawada is an Executive Manager at the NTT Communication Science Laboratories, Kyoto, Japan. His research interests include statistical signal processing, audio source separation, array signal processing, machine learning, latent variable models, graph-based data structures, and computer architecture. He received the Best Paper Award of the IEEE Circuits and Systems Society in 2000, the SPIE ICA Unsupervised Learning Pioneer Award in 2013, and the Best Paper Award of the IEEE Signal Processing Society in 2014.

Ming Xia is with LinkedIn. Prior to LinkedIn, he was a researcher at Ericsson Research Silicon Valley in San Jose, California, and an expert researcher at the National Institute of Information and Communications Technology (NiCT), Tokyo, Japan. He is the inventor of 20+ patents and pending applications, and co-author of 60+ articles in journals and conferences. He is on the editorial board of the Journal of Telecom Systems, and is a guest editor of the Journal of Computers & Electrical Engineering. He co-chaired the IEEE ANTS'13 technical committee, the IEEE Globecom'13 workshop on SDN and Optics, and the ICNC'14 Optical and Grid Networking symposium. He won the best paper award at IEEE ANTS'10. Ming Xia obtained his Ph.D. degree in computer science from the University of California, Davis, USA.

Dr. Ning Xu is currently with Snapchat Research. Prior to joining Snapchat, he worked at Dolby Laboratories, Inc. and Samsung Research America. Dr. Xu obtained his Ph.D. degree in Electrical Engineering from the University of Illinois at Urbana-Champaign and his Master's and Bachelor's degrees from the University of Science and Technology of China. He has co-authored 70+ journal and conference papers, patents and patent applications. His research interests include computer vision, multimedia signal processing, machine learning and other related areas. He is a senior member of IEEE.

Yan Ye is a Director of Video Standards and Platforms at InterDigital Communications Inc, where she manages the video coding and delivery project. Her work at InterDigital focuses on video standardization, as well as innovation and prototyping of video coding, processing, and delivery technologies. Prior to joining InterDigital, she was with the Image Technology Research Department at Dolby Labs, the Video R&D and Standards team at Qualcomm, and CineForm (now GoPro). Dr. Ye has been involved in the development of various video coding standards, including HEVC and its scalable and screen content coding extensions, High Dynamic Range and Wide Color Gamut video coding, the Key Technology Areas (KTA) work of the ITU-T Video Coding Experts Group (VCEG), the scalable extensions of H.264/AVC, ATSC 3.0, and 3GPP SA4. Dr. Ye holds 47 granted patents, and is a co-inventor of more than 100 patent applications in video coding, processing, and delivery. Dr. Ye received her Ph.D. from the Electrical and Computer Engineering Department at the University of California San Diego in 2002.

Tong Zhang joined HP Labs in 2000, after receiving her Ph.D. degree in Electrical Engineering from the University of Southern California. She is now a principal scientist, leading projects related to multimedia analytics. Her expertise includes image, video and audio signal processing, computer vision and pattern recognition. She has 30 granted US patents and numerous publications in this field. She was co-chair of the SPIE Conference on Internet Multimedia Management Systems from 2002 to 2005, and an associate editor of IEEE Transactions on Circuits and Systems for Video Technology from 2008 to 2014. She is vice chair of the ACM SIGMM Bay Area Chapter, and co-organizer of the Bay Area Multimedia Forum.

Jim Zhao is the Senior Director and head of the SPO Business Unit at Huawei USA R&D Center, where he leads several innovative projects including Telco Cloud transformation and a disruptive cross-domain network orchestrator. Prior to Huawei, he worked at Cisco for 12 years, where he led the Cisco IP/TV, Web Cache, and WAN Optimization product development. Prior to Cisco, he held various technical positions at Netscape and Netcom Communications, where he worked primarily on electronic messaging systems. Jim holds a Master of Science in Computer Science from the University of Southern California, and a Bachelor of Science in Computer Science from Jilin University.
