I'm an Open and Collaborative Natural Language Processing researcher MIT & IBM, currently working on making large language model research more efficient, collaborative and achievable by anyone. I work a lot on evaluation (check out Unitxt), co-created model merging, ZipNN for compressing models (no not quantization, compression :-)). My work focuses on democratizing AI through open science initiatives like BabyLM challenge which I co-organize to promote sample-efficient language model training. I am passionate about collaborative and accessible research. My recent projects include ComPEFT for compressing fine-tuned models, ShareLM for sharing human-model conversations with the community, and tinyBenchmarks for efficient model evaluation. I've also worked extensively on model merging techniques like TIES-Merging and ColD Fusion to enable model recycling.
I believe some technologies are more beneficial to the world than others and that science can be fun.
My research emphasizes making AI systems more accessible to broader communities to use, build, tweak and understand.
Shivalika Singh, Angelika Romanou, Clémentine Fourrier, D. Adelani, Jian Gang Ngui, Daniel Vila-Suero, Peerat Limkonchotiwat, Kelly Marchisio, Wei Qi Leong, Yosephine Susanto, Raymond Ng, Shayne Longpre, Wei-Yin Ko, Madeline Smith, Antoine Bosselut, Alice Oh, Andre F. T. Martins, Leshem Choshen, Daphne Ippolito, Enzo Ferrante, Marzieh Fadaee, B. Ermiş, Sara Hooker
arXiv.org 2024
Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou, Iryna Gurevych
Transactions of the Association for Computational Linguistics 2024
Shachar Don-Yehiya, Ben Burtenshaw, Ramon Fernandez Astudillo, Cailean Osborne, Mimansa Jaiswal, Tzu-Sheng Kuo, Wenting Zhao, Idan Shenfeld, Andi Peng, Mikhail Yurochkin, Atoosa Kasirzadeh, Yangsibo Huang, Tatsunori Hashimoto, Yacine Jernite, Daniel Vila-Suero, Omri Abend, Jennifer Ding, Sara Hooker, Hannah Rose Kirk, Leshem Choshen
arXiv.org 2024
Oscar Sainz, Iker Garc'ia-Ferrero, Alon Jacovi, Jon Ander Campos, Yanai Elazar, Eneko Agirre, Yoav Goldberg, Wei-Lin Chen, Jenny Chim, Leshem Choshen, Luca D'Amico-Wong, Melissa Dell, Run-Ze Fan, Shahriar Golchin, Yucheng Li, Pengfei Liu, Bhavish Pahwa, Ameya Prabhu, Suryansh Sharma, Emily Silcock, Kateryna Solonko, David Stap, M. Surdeanu, Yu-Min Tseng, Vishaal Udandarao, Zengzhi Wang, Ruijie Xu, Jinglin Yang
CONDA 2024
Anna A. Ivanova, Aalok Sathe, Benjamin Lipkin, Unnathi Kumar, S. Radkani, T. H. Clark, Carina Kauf, Jennifer Hu, R. T. Pramod, Gabriel Grand, Vivian C. Paulun, Maria Ryskina, Ekin Akyürek, E. Wilcox, Nafisa Rashid, Leshem Choshen, Roger Levy, Evelina Fedorenko, Josh Tenenbaum, Jacob Andreas
arXiv.org 2024
Elron Bandel, Yotam Perlitz, Elad Venezian, Roni Friedman-Melamed, Ofir Arviv, Matan Orbach, Shachar Don-Yehiya, D. Sheinwald, Ariel Gera, Leshem Choshen, Michal Shmueli-Scheuer, Yoav Katz
North American Chapter of the Association for Computational Linguistics 2024
Eyal Shnarch, Alon Halfon, Ariel Gera, Marina Danilevsky, Yannis Katsis, Leshem Choshen, M. Cooper, Dina Epelboim, Zheng Zhang, Dakuo Wang, Lucy Yip, L. Ein-Dor, Lena Dankin, Ilya Shnayderman, R. Aharonov, Yunyao Li, Naftali Liberman, Philip Levin Slesarev, Gwilym Newton, Shila Ofek-Koifman, N. Slonim, Yoav Katz
Conference on Empirical Methods in Natural Language Processing 2022
N. Slonim, Yonatan Bilu, Carlos Alzate, Roy Bar-Haim, Ben Bogin, Francesca Bonin, Leshem Choshen, Edo Cohen-Karlik, Lena Dankin, Lilach Edelstein, L. Ein-Dor, Roni Friedman-Melamed, A. Gavron, Ariel Gera, Martin Gleize, Shai Gretz, Dan Gutfreund, Alon Halfon, Daniel Hershcovich, R. Hoory, Yufang Hou, S. Hummel, Michal Jacovi, Charles Jochim, Yoav Kantor, Yoav Katz, D. Konopnicki, Zvi Kons, Lili Kotlerman, Dalia Krieger, Dan Lahav, Tamar Lavee, Ran Levy, Naftali Liberman, Y. Mass, Amir Menczel, Shachar Mirkin, Guy Moshkowich, Shila Ofek-Koifman, Matan Orbach, Ella Rabinovich, Ruty Rinott, Slava Shechtman, D. Sheinwald, Eyal Shnarch, Ilya Shnayderman, A. Soffer, Artem Spector, B. Sznajder, Assaf Toledo, Orith Toledo-Ronen, Elad Venezian, R. Aharonov
Nature 2021
Leshem Choshen, Omri Abend
arXiv.org 2021
Guy Hacohen, Leshem Choshen, D. Weinshall
arXiv.org 2019
Alex Warstadt, Aaron Mueller, Leshem Choshen, E. Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjabe, Adina Williams, Tal Linzen, Ryan Cotterell
Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning 2023
Leshem Choshen, Lior Fox, Zohar Aizenbud, Omri Abend
Asaf Yehudai, Boaz Carmeli, Y. Mass, Ofir Arviv, Nathaniel Mills, Eyal Shnarch, Leshem Choshen
International Conference on Learning Representations 2024
Leshem Choshen, Ariel Gera, Yotam Perlitz, Michal Shmueli-Scheuer, Gabriel Stanovsky
International Conference on Language Resources and Evaluation 2024