2025

Line-level Semantic Structure Learning for Code Vulnerability Detection

Ziliang Wang, Ge Li, Jia Li, Yihong Dong, Yingfei Xiong, Zhi Jin

The 16th International Conference on Internetware (Internetware 2025), 2025 CCF-C, Oral

Line-level Semantic Structure Learning for Code Vulnerability Detection

Ziliang Wang, Ge Li, Jia Li, Yihong Dong, Yingfei Xiong, Zhi Jin

The 16th International Conference on Internetware (Internetware 2025), 2025 CCF-C, Oral

CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation

Jia Li (Female), Xianjie Shi, Kechi Zhang, Lei Li, Ge Li, Zhengwei Tao, Jia Li, Fang Liu, et al.

arXiv, 2025

CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation

Jia Li (Female), Xianjie Shi, Kechi Zhang, Lei Li, Ge Li, Zhengwei Tao, Jia Li, Fang Liu, et al.

arXiv, 2025

FAN: Fourier Analysis Networks

Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, et al.

arXiv, 2025

FAN: Fourier Analysis Networks

Yihong Dong, Ge Li, Yongding Tao, Xue Jiang, Kechi Zhang, Jia Li, et al.

arXiv, 2025

aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion

Jia Li*, Hao Zhu*, Huanyu Liu*, Xianjie Shi, He Zong, Yihong Dong, Kechi Zhang, et al. (* equal contribution)

arXiv, 2025

aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion

Jia Li*, Hao Zhu*, Huanyu Liu*, Xianjie Shi, He Zong, Yihong Dong, Kechi Zhang, et al. (* equal contribution)

arXiv, 2025

LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding

Jia Li (Female), Xuyuan Guo, Lei Li, Kechi Zhang, Ge Li, Jia Li, et al.

arXiv, 2025

LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding

Jia Li (Female), Xuyuan Guo, Lei Li, Kechi Zhang, Ge Li, Jia Li, et al.

arXiv, 2025

Escalating LLM-based Code Translation Benchmarking into the Class-level Era

Pengyu Xue, Linhao Wu, Zhen Yang, Chengyi Wang, Xiang Li, Yuxiang Zhang, Jia Li, Some Other Name

The 34th International Symposium on Software Testing and Analysis (ISSTA 2025), 2025 CCF-A, Oral

Escalating LLM-based Code Translation Benchmarking into the Class-level Era

Pengyu Xue, Linhao Wu, Zhen Yang, Chengyi Wang, Xiang Li, Yuxiang Zhang, Jia Li, Some Other Name

The 34th International Symposium on Software Testing and Analysis (ISSTA 2025), 2025 CCF-A, Oral

FANformer: Improving Large Language Models Through Effective Periodicity Modeling

Yihong Dong, Ge Li, Xue Jiang, Yongding Tao, Kechi Zhang, Hao Zhu, Huanyu Liu, Jiazheng Ding, Jia Li, et al.

arXiv, 2025

FANformer: Improving Large Language Models Through Effective Periodicity Modeling

Yihong Dong, Ge Li, Xue Jiang, Yongding Tao, Kechi Zhang, Hao Zhu, Huanyu Liu, Jiazheng Ding, Jia Li, et al.

arXiv, 2025

Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points

Kechi Zhang, Ge Li, Jia Li, Yihong Dong, Jia Li (Female), Zhi Jin

arXiv, 2025

Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points

Kechi Zhang, Ge Li, Jia Li, Yihong Dong, Jia Li (Female), Zhi Jin

arXiv, 2025

Theoretical Proof that Generated Text in the Corpus Leads to the Collapse of Auto-regressive Language Models

Lecheng Wang, Xianjie Shi, Ge Li, Jia Li, Xuanming Zhang, Yihong Dong, et al.

arXiv, 2025

Theoretical Proof that Generated Text in the Corpus Leads to the Collapse of Auto-regressive Language Models

Lecheng Wang, Xianjie Shi, Ge Li, Jia Li, Xuanming Zhang, Yihong Dong, et al.

arXiv, 2025

Large Language Model-Aware In-Context Learning for Code Generation

Jia Li (Female), Ge Li, Chongyang Tao, Jia Li, Huangzhao Zhang, Fang Liu, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), 2025 CCF-A

Large Language Model-Aware In-Context Learning for Code Generation

Jia Li (Female), Ge Li, Chongyang Tao, Jia Li, Huangzhao Zhang, Fang Liu, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), 2025 CCF-A

2024

aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing

Siyuan Jiang*, Jia Li*, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Ge Li (* equal contribution)

The 47th International Conference on Software Engineering (ICSE 2025), 2025 CCF-A, Oral

aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing

Siyuan Jiang*, Jia Li*, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Ge Li (* equal contribution)

The 47th International Conference on Software Engineering (ICSE 2025), 2025 CCF-A, Oral

SCodeSearcher: Soft Contrastive Learning for Code Search

Jia Li (Female), Zheng Fang, Xianjie Shi, Zhi Jin, Fang Liu, Jia Li, Yunfei Zhao, Ge Li

Empirical Software Engineering (EMSE), Volume 30, Issue 3, 28 March 2025 CCF-B

SCodeSearcher: Soft Contrastive Learning for Code Search

Jia Li (Female), Zheng Fang, Xianjie Shi, Zhi Jin, Fang Liu, Jia Li, Yunfei Zhao, Ge Li

Empirical Software Engineering (EMSE), Volume 30, Issue 3, 28 March 2025 CCF-B

Generating Equivalent Representations of Code By A Self-Reflection Approach

Jia Li, Ge Li, Lecheng Wang, Hao Zhu, Zhi Jin

arXiv, 2024

Generating Equivalent Representations of Code By A Self-Reflection Approach

Jia Li, Ge Li, Lecheng Wang, Hao Zhu, Zhi Jin

arXiv, 2024

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations

Jia Li, Ge Li, Xuanming Zhang, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

The 38th Conference on Neural Information Processing Systems (NeurIPS 2024), 2024 CCF-A, Poster

EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations

Jia Li, Ge Li, Xuanming Zhang, Yihong Dong, Zhi Jin, Binhua Li, Fei Huang, Yongbin Li

The 38th Conference on Neural Information Processing Systems (NeurIPS 2024), 2024 CCF-A, Poster

Deep Learning for Code Generation: A Survey

Huangzhao Zhang, Kechi Zhang, Zhuo Li, Jia Li (Female), Jia Li, Yongmin Li, et al.

Science China Information Sciences (SCIS), Volume 67, Number 191101, 20 August 2024 CCF-A

Deep Learning for Code Generation: A Survey

Huangzhao Zhang, Kechi Zhang, Zhuo Li, Jia Li (Female), Jia Li, Yongmin Li, et al.

Science China Information Sciences (SCIS), Volume 67, Number 191101, 20 August 2024 CCF-A

Structured Chain-of-Thought Prompting for Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 34, Issue 2, Pages 1-23, 21 January 2025 CCF-A

Structured Chain-of-Thought Prompting for Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 34, Issue 2, Pages 1-23, 21 January 2025 CCF-A

AceCoder: An Effective Prompting Technique Specialized in Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 8, Pages 1-26, 21 November 2024 CCF-A

AceCoder: An Effective Prompting Technique Specialized in Code Generation

Jia Li, Ge Li, Yongmin Li, Zhi Jin

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 8, Pages 1-26, 21 November 2024 CCF-A

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, et al.

Proceedings of the 62st Annual Meeting of the Association for Computational Linguistics (ACL 2024), Pages 3603-3614, August 2024 CCF-A, Poster

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

Jia Li, Ge Li, Yunfei Zhao, Yongmin Li, Huanyu Liu, Hao Zhu, et al.

Proceedings of the 62st Annual Meeting of the Association for Computational Linguistics (ACL 2024), Pages 3603-3614, August 2024 CCF-A, Poster

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

Zhen Yang, Fang Liu, Zhongxing Yu, Jacky Wai Keung, Jia Li, Shuo Liu, et al.

The ACM International Conference on the Foundations of Software Engineering (FSE 2024), Volume 1, Issue FSE, Pages 1585-1608, 12 July 2024 CCF-A, Oral

Exploring and Unleashing the Power of Large Language Models in Automated Code Translation

Zhen Yang, Fang Liu, Zhongxing Yu, Jacky Wai Keung, Jia Li, Shuo Liu, et al.

The ACM International Conference on the Foundations of Software Engineering (FSE 2024), Volume 1, Issue FSE, Pages 1585-1608, 12 July 2024 CCF-A, Oral

2023

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models

Yuqi Zhu, Jia Li, Ge Li, Yunfei Zhao, Jia Li (Female), Zhi Jin, Hong Mei

The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024), Pages 437-445, 20 February 2024 CCF-A, Poster

Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models

Yuqi Zhu, Jia Li, Ge Li, Yunfei Zhao, Jia Li (Female), Zhi Jin, Hong Mei

The 38th Annual AAAI Conference on Artificial Intelligence (AAAI 2024), Pages 437-445, 20 February 2024 CCF-A, Poster

ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation

Zejun Wang, Jia Li, Ge Li, Zhi Jin

arXiv, 2023

ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation

Zejun Wang, Jia Li, Ge Li, Zhi Jin

arXiv, 2023

Poison Attack and Poison Detection on Deep Source Code Processing Models

Jia Li, Zhuo Li, Huangzhao Zhang, Ge Li, Zhi Jin, Xing Hu, Xin Xia

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 3, Pages 1-31, 14 March 2024 CCF-A

Poison Attack and Poison Detection on Deep Source Code Processing Models

Jia Li, Zhuo Li, Huangzhao Zhang, Ge Li, Zhi Jin, Xing Hu, Xin Xia

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 33, Issue 3, Pages 1-31, 14 March 2024 CCF-A

ToolCoder: Teach Code Generation Models to Use API Search Tools

Kechi Zhang, Huangzhao Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin

arXiv, 2023

ToolCoder: Teach Code Generation Models to Use API Search Tools

Kechi Zhang, Huangzhao Zhang, Ge Li, Jia Li, Zhuo Li, Zhi Jin

arXiv, 2023

ZC3: Zero-Shot Cross-Language Code Clone Detection

Jia Li (Female), Chongyang Tao, Zhi Jin, Fang Liu, Jia Li, Ge Li

The 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023), Pages 875-887, 26 September 2024 CCF-A, Oral

ZC3: Zero-Shot Cross-Language Code Clone Detection

Jia Li (Female), Chongyang Tao, Zhi Jin, Fang Liu, Jia Li, Ge Li

The 38th IEEE/ACM International Conference on Automated Software Engineering (ASE 2023), Pages 875-887, 26 September 2024 CCF-A, Oral

CodeEditor: Learning to Edit Source Code with Pre-trained Models

Jia Li, Ge Li, Zhuo Li, Zhi Jin, Xing Hu, Kechi Zhang, Zhiyi Fu

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 32, Issue 6, Pages 1-22, 30 September 2023 CCF-A

CodeEditor: Learning to Edit Source Code with Pre-trained Models

Jia Li, Ge Li, Zhuo Li, Zhi Jin, Xing Hu, Kechi Zhang, Zhiyi Fu

ACM Transactions on Software Engineering and Methodology (TOSEM), Volume 32, Issue 6, Pages 1-22, 30 September 2023 CCF-A

MCodeSearcher: Multi-View Contrastive Learning for Code Search

Jia Li (Female), Fang Liu, Jia Li, Yunfei Zhao, Ge Li, Zhi Jin

The 14th International Conference on Internetware (Internetware 2023), Pages 270-280, 05 October 2023 CCF-C, Oral

MCodeSearcher: Multi-View Contrastive Learning for Code Search

Jia Li (Female), Fang Liu, Jia Li, Yunfei Zhao, Ge Li, Zhi Jin

The 14th International Conference on Internetware (Internetware 2023), Pages 270-280, 05 October 2023 CCF-C, Oral

Self-Edit: Fault-Aware Code Editor for Code Generation

Kechi Zhang, Zhuo Li, Jia Li, Ge Li, Zhi Jin

The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Pages 769-787, July 2024 CCF-A, Poster

Self-Edit: Fault-Aware Code Editor for Code Generation

Kechi Zhang, Zhuo Li, Jia Li, Ge Li, Zhi Jin

The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Pages 769-787, July 2024 CCF-A, Poster

2022

SkCoder: A Sketch-based Approach for Automatic Code Generation

Jia Li, Yongmin Li, Ge Li, Zhi Jin, Yiyang Hao, Xing Hu

The 45th International Conference on Software Engineering (ICSE 2023), Pages 2124-2135, 26 July 2023 CCF-A, Oral

SkCoder: A Sketch-based Approach for Automatic Code Generation

Jia Li, Yongmin Li, Ge Li, Zhi Jin, Yiyang Hao, Xing Hu

The 45th International Conference on Software Engineering (ICSE 2023), Pages 2124-2135, 26 July 2023 CCF-A, Oral

Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively

Haojie Zhang, Ge Li, Jia Li, Zhongjin Zhang, Yuqi Zhu, Zhi Jin

The 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Pages 21442-21454, 28 November 2022 CCF-A, Poster

Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively

Haojie Zhang, Ge Li, Jia Li, Zhongjin Zhang, Yuqi Zhu, Zhi Jin

The 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Pages 21442-21454, 28 November 2022 CCF-A, Poster

2021

EditSum: A Retrieve-and-Edit Framework for Source Code Summarization

Jia Li, Yongmin Li, Ge Li, Xing Hu, Xin Xia, Zhi Jin

The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021), Pages 155-166, 24 June 2022 CCF-A, Oral

EditSum: A Retrieve-and-Edit Framework for Source Code Summarization

Jia Li, Yongmin Li, Ge Li, Xing Hu, Xin Xia, Zhi Jin

The 36th IEEE/ACM International Conference on Automated Software Engineering (ASE 2021), Pages 155-166, 24 June 2022 CCF-A, Oral