Publications
[Google scholar] | [DBLP] | [View by topic] |
Recent Preprints
- CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries. Shudong Liu, Yiqiao Jin, Cheng Li, Derek F. Wong, Qingsong Wen, Lichao Sun, Haipeng Chen, Xing Xie, Jindong Wang. [arxiv]
- MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders. Cheng Li, May Fung, Qingyun Wang, Chi Han, Manling Li, Jindong Wang, Heng Ji. [arxiv]
- StringLLM: Understanding the String Processing Capability of Large Language Models. Xilong Wang, Hao Fu, Jindong Wang, Neil Zhenqiang Gong. [arxiv]
- On the Diversity of Synthetic Data and its Impact on Training Large Language Models. Hao Chen, Abdul Waheed, Xiang Li, Yidong Wang, Jindong Wang, Bhiksha Raj, Marah I. Abdin. [arxiv]
- SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer. Hao Chen, Ze Wang, Xiang Li, Ximeng Sun, Fangyi Chen, Jiang Liu, Jindong Wang, Bhiksha Raj, Zicheng Liu, Emad Barsoum. [arxiv]
- Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist. Zihao Zhou, Shudong Liu, Maizhen Ning, Wei Liu, Jindong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang. [arxiv]
- Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations? Yue Huang, Zhengqing Yuan, Yujun Zhou, Kehan Guo, Xiangqi Wang, Haomin Zhuang, Weixiang Sun, Lichao Sun, Jindong Wang, Yanfang Ye, Xiangliang Zhang. [arxiv]
- CycleResearcher: Improving Automated Research via Automated Review. Yixuan Weng, Minjun Zhu, Guangsheng Bao, Hongbo Zhang, Jindong Wang, Yue Zhang, Linyi Yang. [arxiv]
- Scito2M: A 2 Million, 30-Year Cross-disciplinary Dataset for Temporal Scientometric Analysis. Yiqiao Jin, Yijia Xiao, Yiyang Wang, Jindong Wang. [arxiv]
- Can I understand what I create? Self-Knowledge Evaluation of Large Language Models. Zhiquan Tan, Lai Wei, Jindong Wang, Xing Xie, Weiran Huang. [arxiv]
- Learning with noisy foundation models. Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj. [arxiv]
More preprints
- Meta Semantic Template for Evaluation of Large Language Models. Yachuan Liu, Liang Chen, Jindong Wang, Qiaozhu Mei, Xing Xie. [arxiv]
- Frustratingly Easy Model Generalization by Dummy Risk Minimization. Juncheng Wang, Jindong Wang, Xixu Hu, Shujun Wang, Xing Xie. [arxiv]
- Equivariant Disentangled Transformation for Domain Generalization under Combination Shift. Yivan Zhang, Jindong Wang, Xing Xie, and Masashi Sugiyama. [arxiv]
- Learning Invariant Representations across Domains and Tasks. Jindong Wang, Wenjie Feng, Chang Liu, Chaohui Yu, Mingxuan Du, Renjun Xu, Tao Qin, and Tie-Yan Liu. [arxiv]
- Learning to match distributions for domain adaptation. Chaohui Yu, Jindong Wang, Chang Liu, Tao Qin, Renjun Xu, Wenjie Feng, Yiqiang Chen, and Tie-Yan Liu. [arxiv]
Books
Papers
2024
- RESTful-Llama: Connecting User Queries to RESTful APIsProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP): Industry Track 2024 | [ PDF ]
- On catastrophic inheritance of large foundation modelsData-centric Machine Learning Research (DMLR) 2024 | [ arXiv ]
- SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value PenalizationECCV 2024 | [ arXiv ]
- Selective mixup helps with distribution shifts, but not (only) because of mixupInternational Conference on Machine Learning (ICML) 2024 | [ arXiv ]
- Open-Vocabulary Calibration for Vision-Language ModelsInternational Conference on Machine Learning (ICML) 2024 | [ arXiv ]
- Position Paper: What Can Large Language Models Tell Us about Time Series AnalysisInternational Conference on Machine Learning (ICML) 2024 | [ arXiv ]
- MM-Soc: Benchmarking Multimodal Large Language Models in Social Media PlatformsThe 62nd Annual Meeting of the Association for Computational Linguistics (ACL) Findings 2024 | [ arXiv ]
- LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term PromptingThe 62nd Annual Meeting of the Association for Computational Linguistics (ACL) Findings 2024 | [ arXiv ]
- Supervised Knowledge Makes Large Language Models Better In-context LearnersInternational Conference on Learning Representation (ICLR) 2024 | [ arXiv ]
- UP-Net: An Uncertainty-Driven Prototypical Network for Few-Shot Fault DiagnosisIEEE Transactions on Neural Networks and Learning Systems (TNNLS) 2024 | [ ]
- Towards Optimization and Model Selection for Domain Generalization: A Mixup-guided SolutionSIAM Conference on Data Mining (SDM) 2024 | [ arXiv ]
- Conv-adapter: Exploring parameter efficient transfer learning for convnetsCVPR 2024 Workshop Prompting in Vision 2024 | [ arXiv ]
2023
- Out-of-Distribution Generalization in Text Classification: Past, Present, and FutureThe 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023 | [ arXiv ]
- Boosting cross-domain speech recognition with self-supervisionIEEE Transactions on Audio, Speech and Language Processing (TASLP) 2023 | [ arXiv ]
2022
- Hierarchical knowledge amalgamation with dual discriminative feature alignmentInformation Sciences 2022 | [ Website ]
- Local and global alignments for generalizable sensor-based human activity recognitionIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022 | [ HTML ]
2021
- MixSpeech: Data Augmentation for Low-resource Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021 | [ arXiv ]
2020
2019
- DrowsyDet: A Mobile Application for Real-time Driver Drowsiness DetectionUbiquitous Intelligence Computing 2019 | [ ]
2018
2017
2016