Research and publications

Publications

Efficient Cascaded Streaming ASR System via Frame Rate Reduction
- Xingyu Cai, David Qiu, Shaojin Ding, Dongseong Hwang, Weiran Wang, Antoine Bruguier, Rohit Prabhavalkar, Tara Sainath, Yanzhang He
- ASRU 2023
- Download
- Link
Partial Rewriting for Multi-Stage ASR
- Antoine Bruguier, David Qiu, Yanzhang He
- arXiv 2023
- Download
- Link
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models
- Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw
- ICASSP 2023
- Download
- Link
Flickering reduction with partial hypothesis reranking for streaming ASR
- Antoine Bruguier, David Qiu, Trevor Strohman, Yanzhang He
- SLT 2022
- Download
- Supplemental material
- Link
Neural-FST Class Language Model for End-to-End Speech Recognition
- Antoine Bruguier*, Duc Le*, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer (* equal participation)
- ICASSP 2022
- Download
- Link
Improved pronunciation prediction accuracy using morphology
- Dravyansh Sharma, Saumya Sahai, Neha Chaudhari, Antoine Bruguier
- SIGMORPHON 2021
- Download
- Link
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer
- Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael Seltzer, Duc Le
- ICASSP 2021
- Download
- Link
Anti-aliasing regularization in stacking layers
- Antoine Bruguier, Ananya Misra, Arun Narayanan, Rohit Prabhavalkar
- Interspeech 2020
- Download
- Link
Algorithmic Exploration of American English Dialects
- Alëna Aksënova, Antoine Bruguier, Amanda Ritchart-Scott, Uri Mendlovic
- ICASSP 2020
- Download
- Link
A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency
- Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alex Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirko Visontai, Yonghui Wu, Yu Zhang, Ding Zhao
- ICASSP 2020
- Download
- Link
Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
- Ke Hu*, Antoine Bruguier*, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak (* equal participation)
- Interspeech 2019
- Download
- Link
On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition
- Kazuki Irie, Rohit Prabhavalkar, Anjuli Kannan, Antoine Bruguier, David Rybach, Patrick Nguyen
- Interspeech 2019
- Download
- Link
Better Morphology Prediction for Better Speech Systems
- Dravyansh Sharma, Melissa Wilson, Antoine Bruguier
- Interspeech 2019
- Download
- Link
Phoebe: Pronunciation-aware Contextualization for End-to-end Speech Recognition
- Antoine Bruguier, Rohit Prabhavalkar, Golan Pundak, Tara N. Sainath
- ICASSP 2019
- Download
- Link
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
- Jonathan Shen, et al.
- arXiv 2019
- Download
- Link
- GitHub
Dictionary Augmented Sequence-to-Sequence Neural Network for Grapheme to Phoneme prediction
- Antoine Bruguier, Anton Bakhtin, Dravyansh Sharma
- Interspeech 2018
- Download
- Link
Sequence-to-Sequence Neural Network Model with 2D Attention for Learning Japanese Pitch Accents
- Antoine Bruguier, Heiga Zen, Arkady Arkhangorodsky
- Interspeech 2018
- Download
- Link
Pronunciation learning with RNN-transducers
- Antoine Bruguier, Danushen Gnanapragasam, Leif Johnson, Kanishka Rao, Francoise Beaufays
- Interspeech 2017
- Download
- Link
Learning Personalized Pronunciations for Contact Name Recognition
- Antoine Bruguier, Fuchun Peng, Francoise Beaufays
- Interspeech 2016
- Download
- Link
NN-grams: Unifying neural network and n-gram language models for speech recognition
- Babak Damavandi, Shankar Kumar, Noam Shazeer, Antoine Bruguier
- Interspeech 2016
- Download
- Link
On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition
- Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, Ian McGraw
- ICASSP 2016
- Download
- Link
Optimizing OPC data sampling based on orthogonal vector space
- GlobalFoundries authors: Yuyang Sun, Yee Mei Foong, Yingfang Wang, Jacky Cheng, Dongqing Zhang, Shaowen Gao, Nanshu Chen, Byoung Il Choi
- Brion authors: Antoine Bruguier, Mu Feng, Jianhong Qiu, Stefan Hunsche, Liang Liu, Wenjin Shao
- SPIE Advanced Lithography 2011
- Download
- Link
Exploring the Nature of Trader Intuition
- Antoine Bruguier, Steven Quartz, Peter Bossaerts
- Journal of Finance, 65 (2010), 1703-23
- Download
- Link
Model-based scanner tuning in a manufacturing environment
- C. Y. Shih, R. C. Peng, T. C. Chien, Y. W. Guo, J. Y. Lee, C. L. Chang, P. C. Huang, H. H. Liu, H. J. Lee, John Lin, K. W. Chang, C. P. Yeh, W. J. Shao, H. Cao, A. Bruguier, X. Xie, C. H. Chang, R. Aldana, Y. Cao, R. Goossens, S. Hsieh
- SPIE Advanced Lithography 2009
- Download
- Link
Investigating signal integration with canonical correlation analysis of fMRI brain activation data
- Antoine Bruguier*, Kerstin Preuschoff*, Steven Quartz, Peter Bossaerts (* equal participation)
- NeuroImage, Volume 41, Issue 1, 15 May 2008, Pages 35-44
- Download
- Download appendix
Human imagination in financial markets with insiders
- Peter Bossaerts, Antoine Bruguier, Steven Quartz
- Neuroscience Research, Volume 58, Supplement 1, 2007, Page S5
- Link
A Mind for the Market: an fMRI Study of Attribution of Mental States to Financial Markets
- Antoine Bruguier, Steven Quartz, Peter Bossaerts
- Poster presented at the HSD PI Meeting of the National Science Foundation in Washington, D.C. on September 15, 2006
- Download
SCR Recording During fMRI Acquisition
- Antoine Bruguier*, R. McKell Carter*, Christof Koch, Steven Quartz (* equal participation)
- Technical report, 2005
- Download

Patents

Two-pass end to end speech recognition
- Tara Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li
- US Patent number 12,073,824 (granted in 2024)
- Download
Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models
- Ke Hu, Antoine Bruguier, Tara Sainath, Rohit Prabhavalkar, Golan Pundak
- US Patent number 11,270,687 (granted in 2022)
- Download
Contextual biasing for speech recognition using grapheme and phoneme data
- Rohit Prabhavalkar, Golan Pundak, Tara Sainath, Antoine Bruguier
- US Patent number 11,217,231 (granted in 2022)
- Download
Compressed recurrent neural network models
- Ouais Alsharif, Rohit Prabhavalkar, Ian McGraw, Antoine Bruguier
- US Patent number 10,878,319 (granted in 2020)
- Download
Date and/or time resolution
- Bryan Horling, Ashutosh Shukla, Antoine Bruguier
- US Patent number 10,277,543 (granted in 2019)
- Download
Learning personalized entity pronunciations
- Antoine Bruguier, Fuchun Peng, Francoise Beaufays
- US Patent number 10,152,965 (granted in 2018)
- Download
Information matrix creation and calibration test pattern selection based on computational lithography model parameter
- Antoine Bruguier, Yu Cao, Jun Ye, Wenjin Shao
- US Patent number 9,588,439 (granted in 2017)
- Download
Calibration pattern selection based on noise sensitivity
- Antoine Bruguier, Wenjin Shao, Song Lan
- US Patent number 8,887,105 (granted in 2014)
- Download
Harmonic resist model for use in a lithographic apparatus and a device manufacturing method
- Antoine Bruguier, Yu Cao, Luoqi Chen, Wenjin Shao
- US Patent number 8,447,095 (granted in 2013)
- Download

Note: Additional US patent applications are under review and not yet open to the public.