publications

2025

  1. Minder: Faulty Machine Detection for Large-scale Distributed Model Training
    Yangtao Deng ,  Xiang Shi ,  Zhuo Jiang , and 12 more authors
    Apr 2025

2023

  1. fKPISelect: Fault-Injection Based Automated KPI Selection for Practical Multivariate Anomaly Detection
    Xingjian Zhang ,  Yinqin Zhao ,  Chang Liu , and 9 more authors
    In 2023 IEEE 34th International Symposium on Software Reliability Engineering (ISSRE) , Oct 2023
    ISSN: 2332-6549

2022

  1. Modeling Composition of Cloud Services with Complex Dependencies for Availability Assessment
    Xingjian Zhang ,  and  Long Wang
    In 2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks - Supplemental Volume (DSN-S) , Jun 2022
  2. OAG-BERT: Towards a Unified Backbone Language Model for Academic Knowledge Services
    Xiao Liu ,  Da Yin ,  Jingnan Zheng , and 5 more authors
    In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining , Aug 2022
  3. Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers
    Weng Lam Tam ,  Xiao Liu ,  Kaixuan Ji , and 6 more authors
    Jul 2022
    arXiv:2207.07087 [cs]