Qilin is a large-scale multimodal dataset designed for advancing research in search, recommendation, and Retrieval-Augmented Generation (RAG) systems. This repository contains the official implementation of the dataset paper, baseline models, and evaluation tools.
Qilin provides comprehensive data for three main scenarios:
- Training set: 44,024 samples
- Testing set: 6,192 samples
- Features:
- Rich query metadata
- User interaction logs
- Ground clicked labels
- Training set: 83,437 samples
- Testing set: 11,115 samples
- Features:
- Detailed user interaction history
- Candidate note pools
- Contextual features
- Ground clicked labels
- Multiple content modalities (text, images, video thumbnails)
- Rich user interaction data
- Comprehensive evaluation metrics
- Support for RAG system development
baselines/
: Implementation of state-of-the-art baseline modelsdatasets/
: Dataset files and processing scriptstoy_data/
: Small sample dataset for quick explorationqilin/
: Complete dataset (after downloading)
pip install -r baselines/requirements.txt
- Download the Qilin dataset from Hugging Face
- Extract and place the dataset in the
datasets/qilin
directory - Download the required models:
- Place the downloaded models in the
model
directory
If you use Qilin in your research, please cite our paper:
@misc{chen2025qilinmultimodalinformationretrieval,
title={Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions},
author={Jia Chen and Qian Dong and Haitao Li and Xiaohui He and Yan Gao and Shaosheng Cao and Yi Wu and Ping Yang and Chen Xu and Yao Hu and Qingyao Ai and Yiqun Liu},
year={2025},
eprint={2503.00501},
archivePrefix={arXiv},
primaryClass={cs.IR},
url={https://arxiv.org/abs/2503.00501},
}
This project is licensed under the MIT License - see the LICENSE file for details.