Publications
[Google scholar] | [DBLP] | [View by topic] |
Recent Preprints
- Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems. Jiaxi Yang, Mengqi Zhang, Yiqiao Jin, Hao Chen, Qingsong Wen, Lu Lin, Yi He, Weijie Xu, James Evans, Jindong Wang. [arxiv]
- Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation. Chika Maduabuchi, Hao Chen, Yujin Han, Jindong Wang. [arxiv] [code]
- Personalized Safety in LLMs: A Benchmark and A Planning-Based Agent Approach. Yuchen Wu, Edward Sun, Kaijie Zhu, Jianxun Lian, Jose Hernandez-Orallo, Aylin Caliskan, Jindong Wang. [arxiv] [website]
- Understanding and Mitigating the Bias Inheritance in LLM-based Data Augmentation on Downstream Tasks. Miaomiao Li, Hao Chen, Yang Wang, Tingyuan Zhu, Weijia Zhang, Kaijie Zhu, Kam-Fai Wong, Jindong Wang. [arxiv]
- CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries. Shudong Liu, Yiqiao Jin, Cheng Li, Derek F. Wong, Qingsong Wen, Lichao Sun, Haipeng Chen, Xing Xie, Jindong Wang. [arxiv]
- MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders. Cheng Li, May Fung, Qingyun Wang, Chi Han, Manling Li, Jindong Wang, Heng Ji. [arxiv]
- On the Diversity of Synthetic Data and its Impact on Training Large Language Models. Hao Chen, Abdul Waheed, Xiang Li, Yidong Wang, Jindong Wang, Bhiksha Raj, Marah I. Abdin. [arxiv]
- Can I understand what I create? Self-Knowledge Evaluation of Large Language Models. Zhiquan Tan, Lai Wei, Jindong Wang, Weiran Huang. [arxiv]
Previous preprints
- Meta Semantic Template for Evaluation of Large Language Models. Yachuan Liu, Liang Chen, Jindong Wang, Qiaozhu Mei, Xing Xie. [arxiv]
- Frustratingly Easy Model Generalization by Dummy Risk Minimization. Juncheng Wang, Jindong Wang, Xixu Hu, Shujun Wang, Xing Xie. [arxiv]
- Equivariant Disentangled Transformation for Domain Generalization under Combination Shift. Yivan Zhang, Jindong Wang, Xing Xie, and Masashi Sugiyama. [arxiv]
- Learning Invariant Representations across Domains and Tasks. Jindong Wang, Wenjie Feng, Chang Liu, Chaohui Yu, Mingxuan Du, Renjun Xu, Tao Qin, and Tie-Yan Liu. [arxiv]
- Learning to match distributions for domain adaptation. Chaohui Yu, Jindong Wang, Chang Liu, Tao Qin, Renjun Xu, Wenjie Feng, Yiqiang Chen, and Tie-Yan Liu. [arxiv]
Books
Papers
2025
- Exposing and Patching the Flaws of Large Language Models in Social Character SimulationConference on Language Modeling (COLM) 2025 | [ arXiv ]
- Value compass leaderboard: A platform for fundamental and validated evaluation of llms values2025 | [ arXiv ]
- MELON: Indirect Prompt Injection Defense via Masked Re-execution and Tool ComparisonInternational Conference on Machine Learning (ICML) 2025 | [ arXiv ]
- Topology-aware Neural Flux Prediction Guided by PhysicsInternational Conference on Machine Learning (ICML) 2025 | [ ]
- SoftVQ-VAE: Efficient 1-Dimensional Continuous TokenizerProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2025 | [ arXiv ]
- CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 CountriesCVPR 2025 workshop on VLMs4ALL 2025 | [ arXiv ]
2024
- RESTful-Llama: Connecting User Queries to RESTful APIsProceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP): Industry Track 2024 | [ PDF ]
- On catastrophic inheritance of large foundation modelsData-centric Machine Learning Research (DMLR) 2024 | [ arXiv ]
- SpecFormer: Guarding Vision Transformer Robustness via Maximum Singular Value PenalizationECCV 2024 | [ arXiv ]
- Selective mixup helps with distribution shifts, but not (only) because of mixupInternational Conference on Machine Learning (ICML) 2024 | [ arXiv ]
- Open-Vocabulary Calibration for Vision-Language ModelsInternational Conference on Machine Learning (ICML) 2024 | [ arXiv ]
- Position Paper: What Can Large Language Models Tell Us about Time Series AnalysisInternational Conference on Machine Learning (ICML) 2024 | [ arXiv ]
- MM-Soc: Benchmarking Multimodal Large Language Models in Social Media PlatformsThe 62nd Annual Meeting of the Association for Computational Linguistics (ACL) Findings 2024 | [ arXiv ]
- LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term PromptingThe 62nd Annual Meeting of the Association for Computational Linguistics (ACL) Findings 2024 | [ arXiv ]
- Supervised Knowledge Makes Large Language Models Better In-context LearnersInternational Conference on Learning Representation (ICLR) 2024 | [ arXiv ]
- UP-Net: An Uncertainty-Driven Prototypical Network for Few-Shot Fault DiagnosisIEEE Transactions on Neural Networks and Learning Systems (TNNLS) 2024 | [ ]
- Towards Optimization and Model Selection for Domain Generalization: A Mixup-guided SolutionSIAM Conference on Data Mining (SDM) 2024 | [ arXiv ]
- Conv-adapter: Exploring parameter efficient transfer learning for convnetsCVPR 2024 Workshop Prompting in Vision 2024 | [ arXiv ]
2023
- Out-of-Distribution Generalization in Text Classification: Past, Present, and FutureThe 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023 | [ arXiv ]
- Boosting cross-domain speech recognition with self-supervisionIEEE Transactions on Audio, Speech and Language Processing (TASLP) 2023 | [ arXiv ]
2022
- Hierarchical knowledge amalgamation with dual discriminative feature alignmentInformation Sciences 2022 | [ Website ]
- Local and global alignments for generalizable sensor-based human activity recognitionIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022 | [ HTML ]
2021
- MixSpeech: Data Augmentation for Low-resource Automatic Speech RecognitionIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021 | [ arXiv ]
2020
2019
- DrowsyDet: A Mobile Application for Real-time Driver Drowsiness DetectionUbiquitous Intelligence Computing 2019 | [ ]
2018
2017
2016