About me

News

  • 2024.1: We propose a new instruction-tuning dataset (INTERS) for unlocking the power of LLMs on search tasks. See more details.
  • 2023.11: We analyze the risk of data leakage in LLM pre-training and write a new paper to raise awareness of this problem. See more details.
  • 2023.8: We write a new survey on applying large language models to information retrieval. See more details.
  • 2023.8: We publish a new version of YuLan-Chat. It achieves better performance than the official LLaMA-2 and LLaMA-2-Chat on the MMLU, C-Eval, and AGI-Gaokao benchmarks!

Publications

2024

2023

2022

2021

2020

2019

2018

Experiences

  • 2021.12 - 2022.12, Research Intern, Poisson Lab, Huawei. Supervised by Xinyu Zhang
  • 2018.8 - 2019.6, Research Intern, XiaoIce, Microsoft Asia. Supervised by Ruihua Song
  • 2016.9 - 2019.6, Research Assistant, Beijing Key Lab of Big Data Management and Analysis Methods. Supervised by Zhicheng Dou and Ji-Rong Wen
  • 2016.6 - 2016.9, Software Engineer, Infosys Technology Limited. Supervised by Anjaneyulu Pasala

Academic Services

  • PC Member: ACL, SIGIR, WWW, NeurIPS, SIGKDD, AAAI, EMNLP, CIKM, WSDM, COLING, COLM
  • Journal Reviewer: TOIS, JASIST, KAIS, TALLIP, Computing Surveys, ACL Rolling Review