


default search action
Vishrav Chaudhary
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c39]Giovanni Monea, Maxime Peyrard, Martin Josifoski, Vishrav Chaudhary, Jason Eisner, Emre Kiciman, Hamid Palangi, Barun Patra, Robert West:
A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia. ACL (1) 2024: 6828-6844 - [c38]Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios, Adam W. Harley, Gabriel Sarch, Kriti Aggarwal, Vishrav Chaudhary, Katerina Fragkiadaki:
ODIN: A Single Model for 2D and 3D Segmentation. CVPR 2024: 3564-3574 - [i44]Ayush Jain, Pushkal Katara, Nikolaos Gkanatsios, Adam W. Harley, Gabriel Sarch, Kriti Aggarwal, Vishrav Chaudhary, Katerina Fragkiadaki:
ODIN: A Single Model for 2D and 3D Perception. CoRR abs/2401.02416 (2024) - [i43]Marah I Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat S. Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, Ziyi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou:
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone. CoRR abs/2404.14219 (2024) - [i42]Millicent Ochieng, Varun Gumma, Sunayana Sitaram, Jindong Wang, Vishrav Chaudhary, Keshet Ronen, Kalika Bali, Jacki O'Neill:
Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios. CoRR abs/2406.00343 (2024) - [i41]Sanchit Ahuja, Kumar Tanmay, Hardik Hansrajbhai Chauhan, Barun Patra, Kriti Aggarwal, Luciano Del Corro, Arindam Mitra, Tejas Indulal Dhamecha, Ahmed Awadallah, Monojit Choudhary, Vishrav Chaudhary, Sunayana Sitaram:
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting. CoRR abs/2407.09879 (2024) - [i40]Kian Ahrabian, Xihui Lin, Barun Patra, Vishrav Chaudhary, Alon Benhaim, Jay Pujara, Xia Song:
The Hitchhiker's Guide to Human Alignment with *PO. CoRR abs/2407.15229 (2024) - [i39]Xihui Lin, Yunan Zhang, Suyu Ge, Barun Patra, Vishrav Chaudhary, Xia Song:
Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads. CoRR abs/2407.17678 (2024) - [i38]Liyuan Liu, Young Jin Kim
, Shuohang Wang, Chen Liang, Yelong Shen, Hao Cheng, Xiaodong Liu, Masahiro Tanaka, Xiaoxia Wu, Wenxiang Hu, Vishrav Chaudhary, Zeqi Lin, Chengruidong Zhang, Jilong Xue, Hany Awadalla, Jianfeng Gao, Weizhu Chen:
GRIN: GRadient-INformed MoE. CoRR abs/2409.12136 (2024) - [i37]Johan Bjorck, Alon Benhaim, Vishrav Chaudhary, Furu Wei, Xia Song:
Scaling Optimal LR Across Token Horizon. CoRR abs/2409.19913 (2024) - [i36]Yifei He, Alon Benhaim, Barun Patra, Praneetha Vaddamanu, Sanchit Ahuja, Parul Chopra, Vishrav Chaudhary, Han Zhao, Xia Song:
Scaling Laws for Multilingual Language Models. CoRR abs/2410.12883 (2024) - [i35]Batuhan K. Karaman, Ishmam Zabir, Alon Benhaim, Vishrav Chaudhary, Mert R. Sabuncu, Xia Song:
POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization. CoRR abs/2410.12999 (2024) - 2023
- [j5]Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel J. Orr, Lucia Zheng, Mert Yüksekgönül, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri S. Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda:
Holistic Evaluation of Language Models. Trans. Mach. Learn. Res. 2023 (2023) - [c37]Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
A Length-Extrapolatable Transformer. ACL (1) 2023: 14590-14604 - [c36]Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. ACL (1) 2023: 15354-15373 - [c35]Sunayana Sitaram, Monojit Choudhury, Barun Patra, Vishrav Chaudhary, Kabir Ahuja, Kalika Bali:
Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world. ACL (tutorial) 2023: 21-26 - [c34]Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kiciman, Boi Faltings:
Language Model Decoding as Likelihood-Utility Alignment. EACL (Findings) 2023: 1425-1440 - [c33]Aniket Vashishtha, S. Sai Prasad, Payal Bajaj, Vishrav Chaudhary, Kate Cook, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Performance and Risk Trade-offs for Multi-word Text Prediction at Scale. EACL (Findings) 2023: 2181-2197 - [c32]Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN: Visual Document Understanding By Language-Image Network. EMNLP (Industry Track) 2023: 693-706 - [c31]Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
Magneto: A Foundation Transformer. ICML 2023: 36077-36092 - [c30]Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Nils Johan Bertil Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei:
Language Is Not All You Need: Aligning Perception with Language Models. NeurIPS 2023 - [i34]Jessica Huynh, Cathy Jiao, Prakhar Gupta, Shikib Mehri, Payal Bajaj, Vishrav Chaudhary, Maxine Eskénazi:
Understanding the Effectiveness of Very Large Language Models on Dialog Evaluation. CoRR abs/2301.12004 (2023) - [i33]Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei:
Language Is Not All You Need: Aligning Perception with Language Models. CoRR abs/2302.14045 (2023) - [i32]Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN - Document Understanding By Language-Image Network. CoRR abs/2305.14218 (2023) - [i31]Yuntian Deng, Kiran Prasad, Roland Fernandez, Paul Smolensky, Vishrav Chaudhary, Stuart M. Shieber:
Implicit Chain of Thought Reasoning via Knowledge Distillation. CoRR abs/2311.01460 (2023) - [i30]Giovanni Monea, Maxime Peyrard, Martin Josifoski, Vishrav Chaudhary, Jason Eisner, Emre Kiciman, Hamid Palangi, Barun Patra, Robert West:
A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia. CoRR abs/2312.02073 (2023) - 2022
- [j4]Katharina Kann, Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, John E. Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo Alberto Giménez Lugo, Ricardo Ramos, Iván Vladimir Meza Ruíz
, Elisabeth Mager, Vishrav Chaudhary, Graham Neubig, Alexis Palmer
, Rolando Coto-Solano
, Ngoc Thang Vu:
AmericasNLI: Machine translation and natural language inference systems for Indigenous languages of the Americas. Frontiers Artif. Intell. 5 (2022) - [j3]Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzmán, Angela Fan:
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation. Trans. Assoc. Comput. Linguistics 10: 522-538 (2022) - [c29]Oana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán:
OCR Improves Machine Translation for Low-Resource Languages. ACL (Findings) 2022: 1164-1174 - [c28]Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán:
Alternative Input Signals Ease Transfer in Multilingual Machine Translation. ACL (1) 2022: 5291-5305 - [c27]Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John E. Ortega, Ricardo Ramos, Annette Rios, Iván Vladimir Meza Ruíz
, Gustavo Giménez Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano
, Ngoc Thang Vu, Katharina Kann:
AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages. ACL (1) 2022: 6279-6299 - [c26]Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzmán:
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? AMTA 2022: 97-116 - [c25]Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq R. Joty:
Data Selection Curriculum for Neural Machine Translation. EMNLP (Findings) 2022: 1569-1582 - [c24]Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona T. Diab, Veselin Stoyanov, Xian Li:
Few-shot Learning with Multilingual Generative Language Models. EMNLP 2022: 9019-9052 - [c23]Marina Fomicheva, Shuo Sun, Erick R. Fonseca, Chrysoula Zerva, Frédéric Blain, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins:
MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset. LREC 2022: 4963-4974 - [i29]Oana Ignat, Jean Maillard, Vishrav Chaudhary, Francisco Guzmán:
OCR Improves Machine Translation for Low-Resource Languages. CoRR abs/2202.13274 (2022) - [i28]Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq R. Joty:
Data Selection Curriculum for Neural Machine Translation. CoRR abs/2203.13867 (2022) - [i27]Shiyue Zhang, Vishrav Chaudhary, Naman Goyal, James Cross, Guillaume Wenzek, Mohit Bansal, Francisco Guzmán:
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? CoRR abs/2204.14268 (2022) - [i26]Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei:
Foundation Transformers. CoRR abs/2210.06423 (2022) - [i25]Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kiciman, Boi Faltings, Robert West:
Language Model Decoding as Likelihood-Utility Alignment. CoRR abs/2210.07228 (2022) - [i24]Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song:
Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning. CoRR abs/2210.14867 (2022) - [i23]Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel J. Orr, Lucia Zheng, Mert Yüksekgönül
, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri S. Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli
, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang
, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda:
Holistic Evaluation of Language Models. CoRR abs/2211.09110 (2022) - [i22]Shuming Ma, Hongyu Wang, Shaohan Huang, Wenhui Wang, Zewen Chi, Li Dong, Alon Benhaim, Barun Patra, Vishrav Chaudhary, Xia Song, Furu Wei:
TorchScale: Transformers at Scale. CoRR abs/2211.13184 (2022) - [i21]Yutao Sun, Li Dong, Barun Patra, Shuming Ma, Shaohan Huang, Alon Benhaim, Vishrav Chaudhary, Xia Song, Furu Wei:
A Length-Extrapolatable Transformer. CoRR abs/2212.10554 (2022) - 2021
- [j2]Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Michael Auli, Armand Joulin:
Beyond English-Centric Multilingual Machine Translation. J. Mach. Learn. Res. 22: 107:1-107:48 (2021) - [c22]Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona T. Diab:
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data. ACL/IJCNLP (1) 2021: 802-812 - [c21]Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan:
Multilingual Translation from Denoising Pre-Training. ACL/IJCNLP (Findings) 2021: 3450-3466 - [c20]Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia:
Quality Estimation without Human-labeled Data. EACL 2021: 619-625 - [c19]Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán:
WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia. EACL 2021: 1351-1361 - [c18]Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Lucia Specia, Francisco Guzmán:
Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications. EMNLP (1) 2021: 5865-5875 - [c17]Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Veselin Stoyanov, Alexis Conneau:
Self-training Improves Pre-training for Natural Language Understanding. NAACL-HLT 2021: 5408-5418 - [c16]Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska
, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri:
Findings of the 2021 Conference on Machine Translation (WMT21). WMT@EMNLP 2021: 1-88 - [c15]Guillaume Wenzek, Vishrav Chaudhary, Angela Fan, Sahir Gomez, Naman Goyal, Somya Jain, Douwe Kiela, Tristan Thrush, Francisco Guzmán:
Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation. WMT@EMNLP 2021: 89-99 - [c14]Lucia Specia, Frédéric Blain, Marina Fomicheva, Chrysoula Zerva, Zhenhao Li, Vishrav Chaudhary, André F. T. Martins:
Findings of the WMT 2021 Shared Task on Quality Estimation. WMT@EMNLP 2021: 684-725 - [i20]Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia:
Quality Estimation without Human-labeled Data. CoRR abs/2102.04020 (2021) - [i19]Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John E. Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir, Gustavo Alberto Giménez Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Ngoc Thang Vu, Katharina Kann:
AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages. CoRR abs/2104.08726 (2021) - [i18]Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona T. Diab:
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data. CoRR abs/2105.15071 (2021) - [i17]Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzmán, Angela Fan:
The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation. CoRR abs/2106.03193 (2021) - [i16]Hongyu Gong, Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán:
LAWDR: Language-Agnostic Weighted Document Representations from Pre-trained Models. CoRR abs/2106.03379 (2021) - [i15]Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Francisco Guzmán, Lucia Specia:
Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications. CoRR abs/2109.08627 (2021) - [i14]Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán:
Alternative Input Signals Ease Transfer in Multilingual Machine Translation. CoRR abs/2110.07804 (2021) - [i13]Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona T. Diab, Veselin Stoyanov, Xian Li:
Few-shot Learning with Multilingual Language Models. CoRR abs/2112.10668 (2021) - 2020
- [j1]Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia:
Unsupervised Quality Estimation for Neural Machine Translation. Trans. Assoc. Comput. Linguistics 8: 539-555 (2020) - [c13]Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov:
Unsupervised Cross-lingual Representation Learning at Scale. ACL 2020: 8440-8451 - [c12]Denise Díaz, James Cross, Vishrav Chaudhary, Ahmed El-Kishky, Philipp Koehn:
A Survey of Qualitative Error Analysis for Neural Machine Translation Systems. AMTA (2) 2020: 48-77 - [c11]Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn:
CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs. EMNLP (1) 2020: 5960-5969 - [c10]Shuo Sun, Marina Fomicheva, Frédéric Blain, Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia:
An Exploratory Study on Multilingual Quality Estimation. AACL/IJCNLP 2020: 366-377 - [c9]Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave:
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data. LREC 2020: 4003-4012 - [c8]Lucia Specia, Zhenhao Li, Juan Miguel Pino, Vishrav Chaudhary, Francisco Guzmán, Graham Neubig, Nadir Durrani, Yonatan Belinkov, Philipp Koehn, Hassan Sajjad, Paul Michel, Xian Li:
Findings of the WMT 2020 Shared Task on Machine Translation Robustness. WMT@EMNLP 2020: 76-91 - [c7]Philipp Koehn, Vishrav Chaudhary, Ahmed El-Kishky, Naman Goyal, Peng-Jen Chen, Francisco Guzmán:
Findings of the WMT 2020 Shared Task on Parallel Corpus Filtering and Alignment. WMT@EMNLP 2020: 726-742 - [c6]Lucia Specia, Frédéric Blain, Marina Fomicheva, Erick Rocha Fonseca, Vishrav Chaudhary, Francisco Guzmán, André F. T. Martins:
Findings of the WMT 2020 Shared Task on Quality Estimation. WMT@EMNLP 2020: 743-764 - [c5]Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Vishrav Chaudhary, Mark Fishel, Francisco Guzmán, Lucia Specia:
BERGAMOT-LATTE Submissions for the WMT20 Quality Estimation Shared Task. WMT@EMNLP 2020: 1010-1017 - [i12]Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain
, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia:
Unsupervised Quality Estimation for Neural Machine Translation. CoRR abs/2005.10608 (2020) - [i11]Yuqing Tang, Chau Tran, Xian Li, Peng-Jen Chen, Naman Goyal, Vishrav Chaudhary, Jiatao Gu, Angela Fan:
Multilingual Translation with Extensible Multilingual Pretraining and Finetuning. CoRR abs/2008.00401 (2020) - [i10]Jingfei Du, Edouard Grave, Beliz Gunel, Vishrav Chaudhary, Onur Celebi, Michael Auli, Ves Stoyanov, Alexis Conneau:
Self-training Improves Pre-training for Natural Language Understanding. CoRR abs/2010.02194 (2020) - [i9]Marina Fomicheva, Shuo Sun, Erick R. Fonseca, Frédéric Blain
, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins:
MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset. CoRR abs/2010.04480 (2020) - [i8]Angela Fan, Shruti Bhosale, Holger Schwenk, Zhiyi Ma, Ahmed El-Kishky, Siddharth Goyal, Mandeep Baines, Onur Celebi, Guillaume Wenzek, Vishrav Chaudhary, Naman Goyal, Tom Birch, Vitaliy Liptchinsky, Sergey Edunov, Edouard Grave, Michael Auli, Armand Joulin:
Beyond English-Centric Multilingual Machine Translation. CoRR abs/2010.11125 (2020)
2010 – 2019
- 2019
- [c4]Peng-Jen Chen, Jiajun Shen
, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato:
Facebook AI's WAT19 Myanmar-English Translation Task Submission. WAT@EMNLP-IJCNLP 2019: 112-122 - [c3]Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato:
The FLORES Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English. EMNLP/IJCNLP (1) 2019: 6097-6110 - [c2]Philipp Koehn, Francisco Guzmán, Vishrav Chaudhary, Juan Miguel Pino:
Findings of the WMT 2019 Shared Task on Parallel Corpus Filtering for Low-Resource Conditions. WMT (3) 2019: 54-72 - [c1]Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn:
Low-Resource Corpus Filtering Using Multilingual Sentence Embeddings. WMT (3) 2019: 261-266 - [i7]Francisco Guzmán, Peng-Jen Chen, Myle Ott, Juan Miguel Pino, Guillaume Lample, Philipp Koehn, Vishrav Chaudhary, Marc'Aurelio Ranzato:
Two New Evaluation Datasets for Low-Resource Machine Translation: Nepali-English and Sinhala-English. CoRR abs/1902.01382 (2019) - [i6]Vishrav Chaudhary, Yuqing Tang, Francisco Guzmán, Holger Schwenk, Philipp Koehn:
Low-Resource Corpus Filtering using Multilingual Sentence Embeddings. CoRR abs/1906.08885 (2019) - [i5]Holger Schwenk, Vishrav Chaudhary, Shuo Sun, Hongyu Gong, Francisco Guzmán:
WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia. CoRR abs/1907.05791 (2019) - [i4]Peng-Jen Chen, Jiajun Shen, Matt Le, Vishrav Chaudhary, Ahmed El-Kishky, Guillaume Wenzek, Myle Ott, Marc'Aurelio Ranzato:
Facebook AI's WAT19 Myanmar-English Translation Task Submission. CoRR abs/1910.06848 (2019) - [i3]Guillaume Wenzek, Marie-Anne Lachaux, Alexis Conneau, Vishrav Chaudhary, Francisco Guzmán, Armand Joulin, Edouard Grave:
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data. CoRR abs/1911.00359 (2019) - [i2]Alexis Conneau, Kartikay Khandelwal, Naman Goyal, Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov:
Unsupervised Cross-lingual Representation Learning at Scale. CoRR abs/1911.02116 (2019) - [i1]Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn:
A Massive Collection of Cross-Lingual Web-Document Pairs. CoRR abs/1911.06154 (2019)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-03-04 22:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint