Contact | Education | Research and publications | Thesis | Professional | Teaching | Other skills | PDFPDF 

Research and publications

Publications

  1. Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
  2. Efficient Cascaded Streaming ASR System via Frame Rate Reduction
  3. Partial Rewriting for Multi-Stage ASR
  4. Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
  5. Flickering reduction with partial hypothesis reranking for streaming ASR
  6. Neural-FST Class Language Model for End-to-End Speech Recognition
  7. Improved pronunciation prediction accuracy using morphology
  8. Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
  9. Anti-aliasing regularization in stacking layers
  10. Algorithmic Exploration of American English Dialects
  11. A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency
  12. Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
  13. On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
  14. Better Morphology Prediction for Better Speech Systems
  15. Phoebe: Pronunciation-aware Contextualization for End-to-end Speech Recognition
  16. Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
  17. Dictionary Augmented Sequence-to-Sequence Neural Network for Grapheme to Phoneme prediction
  18. Sequence-to-Sequence Neural Network Model with 2D Attention for Learning Japanese Pitch Accents
  19. Pronunciation learning with RNN-transducers
  20. Learning Personalized Pronunciations for Contact Name Recognition
  21. NN-grams: Unifying neural network and n-gram language models for speech recognition
  22. On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition
  23. Optimizing OPC data sampling based on orthogonal vector space
  24. Exploring the Nature of Trader Intuition
  25. Model-based scanner tuning in a manufacturing environment
  26. Investigating signal integration with canonical correlation analysis of fMRI brain activation data
  27. Human imagination in financial markets with insiders
  28. A Mind for the Market: an fMRI Study of Attribution of Mental States to Financial Markets
  29. SCR Recording During fMRI Acquisition

Patents

  1. Two-pass End To End Speech Recognition
  2. Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models
  3. Contextual biasing for speech recognition using grapheme and phoneme data
  4. Compressed recurrent neural network models
  5. Date and/or time resolution
  6. Learning personalized entity pronunciations
  7. Information matrix creation and calibration test pattern selection based on computational lithography model parameter
  8. Calibration pattern selection based on noise sensitivity
  9. Harmonic resist model for use in a lithographic apparatus and a device manufacturing method

Note: Additional US patent applications under review (closed to the general public).