Xi Ye
叢曦

xi.ye [at] princeton.edu   xiye17   scholar   xiye_nlp

I am a Postdoctral Researcher at Princeton Language and Intelligence.

My research is primarily in the field of Natural Language Processing, with an emphasis on improving the explainability of large language models and enhancing their reasoning capabilities. I also work on semantic parsing and program synthesis.

I obtained my PhD from the Department of Computer Science at UT Austin, advised by Greg Durrett. Prior to joining UT Austin, I obtained my Bachelor's degree from the School of Software, Tsinghua University, where I worked with Prof. Shixia Liu on Visual Analysis and Machine Learning.

πŸ“£ I will join the University of Alberta as an Assistant Professor at starting in July 2025, and I am planning to recruit PhD/MS students in the upcoming cycle. (To prospective students: please include "Fall 25 Prospective Student" in email subject line.)

Publications

(preprint) To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Zayne Sprague, Fangcong Yin, Juan Diego Rodriguez, Dongwei Jiang, Manya Wadhwa, Prasann Singhal, Xinyu Zhao, Xi Ye, Kyle Mahowald, and Greg Durrett. Arxiv 2024.

LoFiT: Localized Fine-tuning on LLM Representations code

Fangcong Yin, Xi Ye, and Greg Durrett. Proceedings of NeurIPS 2024.

(preprint) CodeUpdateArena: Benchmarking Knowledge Editing on API Updates code

Zeyu Leo Liu, Shrey Pandit, Xi Ye, Eunsol Choi, and Greg Durrett. ArXiv 2024.

(preprint) Unveiling the Impact of Coding Data Instruction Fine-Tuning on Large Language Models Reasoning

Xinlu Zhang, Zhiyu Chen, Xi Ye, Xianjun Yang, Lichang Chen, William Yang Wang, and Linda Ruth Petzold. ArXiv 2024.

AmbigDocs: Reasoning across Documents on Different Entities under the Same Name website

Yoonsang Lee, Xi Ye, and Eunsol Choi. Proceedings of COLM 2024.

Effective Large Language Model Adaptation for Improved Grounding and Citation Generation

Xi Ye, Ruoxi Sun, Sercan Γ–. Arik, and Tomas Pfister. Proceedings of NAACL 2024.

Crafting In-context Examples according to LMs' Parametric Knowledge

Yoonsang Lee*, Pranav Atreya*, Xi Ye, and Eunsol Choi. Findings of NAACL 2024.

MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning website

Zayne Sprague, Xi Ye, Kaj Bostrom, Swarat Chaudhuri, and Greg Durrett. Proceedings of ICLR 2024 (spotlight).

SatLM: Satisfiability-Aided Language Models Using Declarative Prompting code

Xi Ye, Qiaochu Chen, Isil Dillig, and Greg Durrett. Proceedings of NeurIPS 2023.

Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting code

Xi Ye, and Greg Durrett. Proceedings of EMNLP 2023.

Complementary Explanations for Effective In-Context Learning code

Xi Ye, Srinivasan Iyer, Asli Celikyilmaz, Ves Stoyanov, Greg Durrett, and Ramakanth Pasunuru. Findings of ACL, 2023.

EEL: Efficiently Encoding Lattices for Reranking code

Prasann Singhal, Jiacheng Xu, Xi Ye, and Greg Durrett. Proceedings of ACL, 2023.

Assessing Out-of-Domain Language Model Performance from Few Examples

Prasann Singhal*, Jarad Forristal*, Xi Ye, and Greg Durrett. Proceedings of EACL 2023.

The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning code

Xi Ye and Greg Durrett. Proceedings of NeurIPS 2022.

Can Explanations Be Useful for Calibrating Black Box Models? code

Xi Ye and Greg Durrett. Proceedings of ACL 2022.

RnG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question Answering code

Xi Ye, Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou, and Caiming Xiong. Proceedings of ACL 2022.

Diagnosing Ensemble Few-Shot Classifiers demo

Weikai Yang, Xi Ye, Xingxing Zhang, Lanxi Xiao, Jiazhi Xia, Zhongyuan Wang, Jun Zhu, Hanspeter Pfister, and Shixia Liu. Transactions of TVCG 2022.

Connecting Attributions and QA Model Behavior on Realistic Counterfactuals code

Xi Ye, Rohan Nair, and Greg Durrett. Proceedings of EMNLP 2021.

Optimal Neural Program Synthesis from Multimodal Specifications code

Xi Ye, Qiaochu Chen, Isil Dillig, and Greg Durrett. Findings of EMNLP 2021.

Benchmarking Multimodal Regex Synthesis with Complex Structures code data

Xi Ye, Qiaochu Chen, Isil Dillig, and Greg Durrett. Proceedings of ACL 2020.

Sketch-Driven Regular Expression Generation from Natural Language and Examples code

Xi Ye, Qiaochu Chen, Xinyu Wang, Isil Dillig, and Greg Durrett. Transactions of ACL 2020.

Multi-Modal Synthesis of Regular Expressions code

Qiaochu Chen, Xinyu Wang, Xi Ye , Greg Durrett, and Isil Dillig. Proceedings of PLDI 2020

Interactive Correction of Mislabeled Training Data video

Shouxing Xiang*, Xi Ye*, Jiazhi Xia, Jing Wu, Yang Chen, and Shixia Liu. Proceedings of VAST 2019

Talks

[11/2023] On Steering Textual Reasoning with Explanations @ CHAI group, Uchicago.

[10/2023] On Steering Textual Reasoning with Explanations @ Yale NLP Lab, Yale.

[03/2023] On Effective Use of Explanations in Prompting @ Student reading group, UMD. slides

[03/2023] On Effective Use of Explanations in Prompting @ Meetup of MLNLP Community.

[03/2023] On Effective Use of Explanations in Prompting @ DM2Lab, UND.

[11/2022] On Unreliability of Explanations in Prompting @ AI4LIFE group, Harvard.

[06/2022] On Calibration using Explanations @ NeuLab group, CMU.

Service

Area Chair: ACL (24), EMNLP (24)

Reviewer: ACL (23), ICML (23,24), NeurIPS (22, 23), ICLR (24), EMNLP (22), NAACL (22), ARR (22, 21), CONLL (21, 20), TL4NLP Workshop (22), SUKI Workshop (22), NLP4Prog Workshop (21).

Teaching

TA for CS378: Natural Language Processing (undergraduate). Fall 2022

TA for CS388: Natural Language Processing (graduate). Spring 2021

TA for CS429: Computer Architecture and Organization (undergraduate). Fall 2018, Spring 2019