English name: Jia Li (♂)
Chinese name: 李佳

PhD Student
School of Computer Science
Peking University (PKU)
No.5 Yiheyuan Road, Haidian District, Beijing, China
Email: lijia AT stu DOT pku DOT edu DOT cn

I am a doctoral student at the School of Computer Science, Peking University (PKU). My supervisors are Prof. Zhi Jin and Prof. Ge Li. I expect to graduate in July 2025.

I will join the College of AI at Tsinghua University as an assistant professor in September 2025!

My research focuses on intelligent software development, including code generation and large language models (LLMs) for code.

Please check out our released LLM, aiXcoder-7B. It surpasses existing LLMs of similar scale on code generation and code completion tasks.


News

  • December 16, 2024. Our paper "aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion" has been accepted by ICSE 2025.
  • October 18, 2024. We released the technical report of aiXcoder-7B, a lightweight and effective large language model for code completion. [Paper]
  • September 26, 2024. Our paper "EvoCodeBench: An Evolving Code Generation Benchmark with Domain-Specific Evaluations" has been accepted by NeurIPS 2024.
  • May 16, 2024. Our paper "DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories" has been accepted by ACL 2024.
  • April 15, 2024. Our paper "Exploring and Unleashing the Power of Large Language Models in Automated Code Translation" has been accepted by FSE 2024.
  • March 30, 2024. We proposed EvoCodeBench, an evolving benchmark for evaluating models on repository-level code generation. [Paper], [Data]
  • January 30, 2024. Our paper "Deep Learning for Code Generation: A Survey" has been accepted by SCIS.
  • December 9, 2023. Our paper "Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Language Models" has been accepted by AAAI 2024.
  • October 9, 2023. Our paper "Poison Attack and Poison Detection on Deep Source Code Processing Models" has been accepted by TOSEM.
  • July 28, 2023. Our paper "ZC3: Zero-Shot Cross-Language Code Clone Detection" has been accepted by ASE 2023 Technical Research Track.
  • June 28, 2023. Our paper "CodeEditor: Learning to Edit Source Code with Pre-trained Models" has been accepted by ASE 2023 Journal-First Track.
  • May 29, 2023. Our paper "MCodeSearcher: Multi-View Contrastive Learning for Code Search" has been accepted by Internetware 2023.
  • May 2, 2023. Our paper "Self-Edit: Fault-Aware Code Editor for Code Generation" has been accepted by ACL 2023.
  • April 7, 2023. Our paper "CodeEditor: Learning to Edit Source Code with Pre-trained Models" has been accepted by TOSEM.
  • December 9, 2022. Our paper "SkCoder: A Sketch-based Approach for Automatic Code Generation" has been accepted by ICSE 2023.
  • September 15, 2022. Our paper "Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively" has been accepted by NeurIPS 2022.
  • July 8, 2021. Our paper "EditSum: A Retrieve-and-Edit Framework for Source Code Summarization" has been accepted by ASE 2021.