2025

DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt

Yitong Zhang, Jia Li#, Liyi Cai, Ge Li (# corresponding author)

arXiv, 2025

DAVSP: Safety Alignment for Large Vision-Language Models via Deep Aligned Visual Safety Prompt

Yitong Zhang, Jia Li#, Liyi Cai, Ge Li (# corresponding author)

arXiv, 2025

Computational Thinking Reasoning in Large Language Models

Kechi Zhang, Ge Li, Jia Li (Female), Huangzhao Zhang, Jingjing Xu, Hao Zhu, Lecheng Wang, Jia Li, Yihong Dong, et al.

arXiv, 2025

Computational Thinking Reasoning in Large Language Models

Kechi Zhang, Ge Li, Jia Li (Female), Huangzhao Zhang, Jingjing Xu, Hao Zhu, Lecheng Wang, Jia Li, Yihong Dong, et al.

arXiv, 2025

SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning

Huanyu Liu, Jia Li, Hao Zhu, Yihong Dong, Kechi Zhang, Ge Li

arXiv, 2025

SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning

Huanyu Liu, Jia Li, Hao Zhu, Yihong Dong, Kechi Zhang, Ge Li

arXiv, 2025

LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding

Jia Li (Female), Xuyuan Guo, Lei Li, Kechi Zhang, Ge Li, Jia Li, et al.

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 2025

LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding

Jia Li (Female), Xuyuan Guo, Lei Li, Kechi Zhang, Ge Li, Jia Li, et al.

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 2025

Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points

Kechi Zhang, Ge Li, Jia Li, Yihong Dong, Jia Li (Female), Zhi Jin

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025 Findings), 2025

Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points

Kechi Zhang, Ge Li, Jia Li, Yihong Dong, Jia Li (Female), Zhi Jin

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025 Findings), 2025

Line-level Semantic Structure Learning for Code Vulnerability Detection

Ziliang Wang, Ge Li, Jia Li, Yihong Dong, Yingfei Xiong, Zhi Jin

The 16th International Conference on Internetware (Internetware 2025), 2025 Oral

Line-level Semantic Structure Learning for Code Vulnerability Detection

Ziliang Wang, Ge Li, Jia Li, Yihong Dong, Yingfei Xiong, Zhi Jin

The 16th International Conference on Internetware (Internetware 2025), 2025 Oral

CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation

Jia Li (Female), Xianjie Shi, Kechi Zhang, Lei Li, Ge Li, Zhengwei Tao, Jia Li, Fang Liu, et al.

arXiv, 2025

CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation

Jia Li (Female), Xianjie Shi, Kechi Zhang, Lei Li, Ge Li, Zhengwei Tao, Jia Li, Fang Liu, et al.

arXiv, 2025

FAN: Fourier Analysis Networks

Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, et al.

arXiv, 2025

FAN: Fourier Analysis Networks

Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, et al.

arXiv, 2025

aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion

Jia Li*, Hao Zhu*, Huanyu Liu*, Xianjie Shi, He Zong, Yihong Dong, Kechi Zhang, et al. (* equal contribution)

arXiv, 2025

aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion

Jia Li*, Hao Zhu*, Huanyu Liu*, Xianjie Shi, He Zong, Yihong Dong, Kechi Zhang, et al. (* equal contribution)

arXiv, 2025

Escalating LLM-based Code Translation Benchmarking into the Class-level Era

Pengyu Xue, Linhao Wu, Zhen Yang, Chengyi Wang, Xiang Li, Yuxiang Zhang, Jia Li, Some Other Name

The 34th International Symposium on Software Testing and Analysis (ISSTA 2025), 2025 Oral

Escalating LLM-based Code Translation Benchmarking into the Class-level Era

Pengyu Xue, Linhao Wu, Zhen Yang, Chengyi Wang, Xiang Li, Yuxiang Zhang, Jia Li, Some Other Name

The 34th International Symposium on Software Testing and Analysis (ISSTA 2025), 2025 Oral

FANformer: Improving Large Language Models Through Effective Periodicity Modeling

Yihong Dong, Ge Li, Xue Jiang, Yongding Tao, Kechi Zhang, Hao Zhu, Huanyu Liu, Jiazheng Ding, Jia Li, et al.

arXiv, 2025

FANformer: Improving Large Language Models Through Effective Periodicity Modeling

Yihong Dong, Ge Li, Xue Jiang, Yongding Tao, Kechi Zhang, Hao Zhu, Huanyu Liu, Jiazheng Ding, Jia Li, et al.

arXiv, 2025

Theoretical Proof that Generated Text in the Corpus Leads to the Collapse of Auto-regressive Language Models

Lecheng Wang, Xianjie Shi, Ge Li, Jia Li, Xuanming Zhang, Yihong Dong, et al.

arXiv, 2025

Theoretical Proof that Generated Text in the Corpus Leads to the Collapse of Auto-regressive Language Models

Lecheng Wang, Xianjie Shi, Ge Li, Jia Li, Xuanming Zhang, Yihong Dong, et al.

arXiv, 2025

Large Language Model-Aware In-Context Learning for Code Generation

Jia Li (Female), Ge Li, Chongyang Tao, Jia Li, Huangzhao Zhang, Fang Liu, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), 2025

Large Language Model-Aware In-Context Learning for Code Generation

Jia Li (Female), Ge Li, Chongyang Tao, Jia Li, Huangzhao Zhang, Fang Liu, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), 2025

2024

aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing

Siyuan Jiang*, Jia Li*, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Ge Li (* equal contribution)

The 47th International Conference on Software Engineering (ICSE 2025 SEIP Track), 2025 Oral

aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing

Siyuan Jiang*, Jia Li*, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Ge Li (* equal contribution)

The 47th International Conference on Software Engineering (ICSE 2025 SEIP Track), 2025 Oral

SCodeSearcher: Soft Contrastive Learning for Code Search

Jia Li (Female), Zheng Fang, Xianjie Shi, Zhi Jin, Fang Liu, Jia Li, Yunfei Zhao, Ge Li

Empirical Software Engineering (EMSE), Volume 30, Issue 3, 28 March 2025

SCodeSearcher: Soft Contrastive Learning for Code Search

Jia Li (Female), Zheng Fang, Xianjie Shi, Zhi Jin, Fang Liu, Jia Li, Yunfei Zhao, Ge Li

Empirical Software Engineering (EMSE), Volume 30, Issue 3, 28 March 2025

Generating Equivalent Representations of Code By A Self-Reflection Approach

Jia Li, Ge Li, Lecheng Wang, Hao Zhu, Zhi Jin

arXiv, 2024

Generating Equivalent Representations of Code By A Self-Reflection Approach

Jia Li, Ge Li, Lecheng Wang, Hao Zhu, Zhi Jin

arXiv, 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations

Jia Li, Ge Li, Xuanming Zhang, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

The 38th Conference on Neural Information Processing Systems (NeurIPS 2024 D&B Track), 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations

Jia Li, Ge Li, Xuanming Zhang, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

The 38th Conference on Neural Information Processing Systems (NeurIPS 2024 D&B Track), 2024

Deep Learning for Code Generation: A Survey

Huangzhao Zhang, Kechi Zhang, Zhuo Li, Jia Li (Female), Jia Li, Yongmin Li, et al.

Science China Information Sciences (SCIS), Volume 67, Number 191101, 20 August 2024

Deep Learning for Code Generation: A Survey

Huangzhao Zhang, Kechi Zhang, Zhuo Li, Jia Li (Female), Jia Li, Yongmin Li, et al.

Science China Information Sciences (SCIS), Volume 67, Number 191101, 20 August 2024

Structured Chain-of-Thought Prompting for Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 34, Issue 2, Pages 1-23, 21 January 2025

Structured Chain-of-Thought Prompting for Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 34, Issue 2, Pages 1-23, 21 January 2025

AceCoder: An Effective Prompting Technique Specialized in Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 8, Pages 1-26, 21 November 2024

AceCoder: An Effective Prompting Technique Specialized in Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 8, Pages 1-26, 21 November 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, et al.

The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 Findings), Pages 3603-3614, August 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, et al.

The 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024 Findings), Pages 3603-3614, August 2024

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

Zhen Yang, Fang Liu, Zhongxing Yu, Jacky Wai Keung, Jia Li, Shuo Liu, et al.

The ACM International Conference on the Foundations of Software Engineering (FSE 2024), Volume 1, Issue FSE, Pages 1585-1608, 12 July 2024 Oral

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

Zhen Yang, Fang Liu, Zhongxing Yu, Jacky Wai Keung, Jia Li, Shuo Liu, et al.

The ACM International Conference on the Foundations of Software Engineering (FSE 2024), Volume 1, Issue FSE, Pages 1585-1608, 12 July 2024 Oral

2023

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models

Yuqi Zhu, Jia Li, Ge Li, Yunfei Zhao, Jia Li (Female), Zhi Jin, Hong Mei

The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024), Pages 437-445, 20 February 2024

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models

Yuqi Zhu, Jia Li, Ge Li, Yunfei Zhao, Jia Li (Female), Zhi Jin, Hong Mei

The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024), Pages 437-445, 20 February 2024

ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation

Zejun Wang, Jia Li, Ge Li, Zhi Jin

arXiv, 2023

ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation

Zejun Wang, Jia Li, Ge Li, Zhi Jin

arXiv, 2023

Poison Attack and Poison Detection on Deep Source Code Processing Models

Jia Li, Zhuo Li, Huangzhao Zhang, Ge Li, Zhi Jin, Xing Hu, Xin Xia

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 3, Pages 1-31, 14 March 2024

Poison Attack and Poison Detection on Deep Source Code Processing Models

Jia Li, Zhuo Li, Huangzhao Zhang, Ge Li, Zhi Jin, Xing Hu, Xin Xia

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 3, Pages 1-31, 14 March 2024

ToolCoder: Teach Code Generation Models to Use API Search Tools

Kechi Zhang, Huangzhao Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin

arXiv, 2023

ToolCoder: Teach Code Generation Models to Use API Search Tools

Kechi Zhang, Huangzhao Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin

arXiv, 2023

ZC3: Zero-Shot Cross-Language Code Clone Detection

Jia Li (Female), Chongyang Tao, Zhi Jin, Fang Liu, Jia Li, Ge Li

The 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023), Pages 875-887, 26 September 2024

ZC3: Zero-Shot Cross-Language Code Clone Detection

Jia Li (Female), Chongyang Tao, Zhi Jin, Fang Liu, Jia Li, Ge Li

The 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023), Pages 875-887, 26 September 2024

CodeEditor: Learning to Edit Source Code with Pre-trained Models

Jia Li, Ge Li, Zhuo Li, Zhi Jin, Xing Hu, Kechi Zhang, Zhiyi Fu

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 32, Issue 6, Pages 1-22, 30 September 2023

CodeEditor: Learning to Edit Source Code with Pre-trained Models

Jia Li, Ge Li, Zhuo Li, Zhi Jin, Xing Hu, Kechi Zhang, Zhiyi Fu

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 32, Issue 6, Pages 1-22, 30 September 2023

MCodeSearcher: Multi-View Contrastive Learning for Code Search

Jia Li (Female), Fang Liu, Jia Li, Yunfei Zhao, Ge Li, Zhi Jin

The 14th International Conference on Internetware (Internetware 2023), Pages 270-280, 05 October 2023 Oral

MCodeSearcher: Multi-View Contrastive Learning for Code Search

Jia Li (Female), Fang Liu, Jia Li, Yunfei Zhao, Ge Li, Zhi Jin

The 14th International Conference on Internetware (Internetware 2023), Pages 270-280, 05 October 2023 Oral

Self-Edit: Fault-Aware Code Editor for Code Generation

Kechi Zhang, Zhuo Li, Jia Li, Ge Li, Zhi Jin

The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Pages 769-787, July 2024

Self-Edit: Fault-Aware Code Editor for Code Generation

Kechi Zhang, Zhuo Li, Jia Li, Ge Li, Zhi Jin

The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Pages 769-787, July 2024

2022

SkCoder: A Sketch-based Approach for Automatic Code Generation

Jia Li, Yongmin Li, Ge Li, Zhi Jin, Yiyang Hao, Xing Hu

The 45th International Conference on Software Engineering (ICSE 2023), Pages 2124-2135, 26 July 2023 Oral

SkCoder: A Sketch-based Approach for Automatic Code Generation

Jia Li, Yongmin Li, Ge Li, Zhi Jin, Yiyang Hao, Xing Hu

The 45th International Conference on Software Engineering (ICSE 2023), Pages 2124-2135, 26 July 2023 Oral

Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively

Haojie Zhang, Ge Li, Jia Li, Zhongjin Zhang, Yuqi Zhu, Zhi Jin

The 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Pages 21442-21454, 28 November 2022

Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively

Haojie Zhang, Ge Li, Jia Li, Zhongjin Zhang, Yuqi Zhu, Zhi Jin

The 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Pages 21442-21454, 28 November 2022

2021

EditSum: A Retrieve-and-Edit Framework for Source Code Summarization

Jia Li, Yongmin Li, Ge Li, Xing Hu, Xin Xia, Zhi Jin

The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021), Pages 155-166, 24 June 2022 Oral

EditSum: A Retrieve-and-Edit Framework for Source Code Summarization

Jia Li, Yongmin Li, Ge Li, Xing Hu, Xin Xia, Zhi Jin

The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021), Pages 155-166, 24 June 2022 Oral