Shanxiu He
Hello! I am Shanxiu He, a Computer Science PhD candidate at UCSB, working with Professor Tao Yang. My research focuses on the tradeoffs between effectiveness and efficiency in information retrieval, and on building retrieval models with strong zero-shot abilities.
Research Experience
Sep. 2022 - Present, UCSB Information Retrieval Lab
Jun. 2024 - Sep. 2024, Applied Scientist Intern at Amazon
Dec. 2019 - Mar. 2022, NLP Researcher at UCLA-NLP
Jun. 2021 - Oct. 2021, NLP Research Intern at USC ISI
Publications
Low-Cost Document Retrieval with Dense Pseudo-Query Encoding
Shanxiu He, Wentai Xie, Yifan Qiao, Parker Carlson, Tao Yang.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval 2025.
Dynamic Superblock Pruning for Fast Learned Sparse Retrieval 🔗
Parker Carlson, Wentai Xie, Shanxiu He, Tao Yang.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval 2025.
Token Pruning Optimization for Efficient Dense Retrieval with Multi-Vector Representations 🔗
Shanxiu He, Mutasem Al-Darabsah, Suraj Nair, Jonathan May, Tarun Agarwal, Tao Yang, Choon Hui Teo.
The 47th European Conference on Information Retrieval (ECIR) 2025.
LSTM-Based Selective Dense Text Retrieval Guided by Sparse Lexical Retrieval 🔗
Yingrui Yang, Parker Carlson, Yifan Qiao, Wentai Xie, Shanxiu He, Tao Yang.
The 47th European Conference on Information Retrieval (ECIR) 2025.
Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval 🔗
Yifan Qiao, Shanxiu He, Parker Carlson, Yingrui Yang, Tao Yang.
The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), main conference.
Cluster-based Partial Dense Retrieval Fused with Sparse Text Retrieval 🔗
Yingrui Yang, Parker Carlson, Yifan Qiao, Shanxiu He, Tao Yang.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval 2024.
Weighted KL-Divergence for Document Ranking Model Refinement 🔗
Yingrui Yang, Yifan Qiao, Shanxiu He, Tao Yang.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval 2024.
Balanced Knowledge Distillation with Contrastive Learning for Document Re-ranking 🔗
Yingrui Yang, Shanxiu He, Yifan Qiao, Wentai Xie, Tao Yang.
Proceedings of the 2023 ACM SIGIR International Conference on Theory of Information Retrieval (ICTIR).
Representation Sparsification with Hybrid Thresholding for Fast SPLADE-based Document Retrieval 🔗
Yifan Qiao, Yingrui Yang, Shanxiu He, Tao Yang.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval 2023.