Publications and Patents
See Google Scholar for complete listing of papers and patents.
Publications
- PhaseCoder: Microphone Geometry-Agnostic Spatial Audio Understanding for Multimodal LLMs Artem Dementyev, Wazeer Zulfikar, Sinan Hersek, Pascal Getreuer, Anurag Kumar, Vivek Kumar, International Conference on Machine Learning (ICML), 2026 [pdf]
- SpeechCompass: Enhancing Mobile Captioning with Diarization and Directional Guidance via Multi-Microphone Localization Artem Dementyev, Wazeer Zulfikar, Sinan Hersek, Vivek Kumar, arXiv preprint, 2025 [pdf]
- SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs Lijun Yu, Yong Cheng, Zhiruo Wang, Vivek Kumar, Wolfgang Macherey, Yanping Huang, David A. Ross, Irfan Essa, Yonatan Bisk, Ming-Hsuan Yang, Kevin P. Murphy, Alexander G. Hauptmann, Lu Jiang, Advances in Neural Information Processing Systems (NeurIPS), 2023 [pdf]
- Voice conversion with conditional SampleRNN Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, September 2018 2018 [pdf]
- Transform-domain decorrelation in Dolby Digital Plus,2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2014 [pdf]
- Pseudo-Reliable Code Development, Embedded Design, March 2000 [pdf]
Patents
- Systems and methods for adapting human speaker embeddings in speech synthesis US Patent 11,929,058, March 12, 2024
- Speech style transfer US Patent 11,538,455, December 27, 2022
- Audio capture for aerial devices US Patent 10,979,613, April 13, 2021
- Low bit rate parametric encoding and transport of haptic-tactile signals US Patent application WO2017024001A1, Feburary 9, 2017
- Adaptive quantization US Patent application WO2017024001A1, August 3, 2017
- Time-varying filters for generating decorrelation signals, US Patent application 2014126684, August 21, 2014
- Signal decorrelation in an audio processing system, US Patent application US201443877 , November 16, 2014
- Bit error concealment for audio coding system, US Patent US8301440B2, October 30, 2012
- Real time monitoring & control for audio device, US Patent US7778829B2, August 17, 2010
- Sampling rate mismatch solution, US Patent US7778373B2 August 17, 2010
- Bit error management methods for wireless audio communication channel, US Patent US8578247B2, November 5, 2013
- Method and apparatus for sharing a bluetooth module with two computing devices US Patent 7,263,331, August 28, 2007
