Abstract: In recent years, remote sensing cross-modal text-image retrieval (RSCTIR) has attracted considerable attention owing to its convenience and information mining capabilities. However, two ...
Abstract: Multi-modal data presents a promising opportunity for improving multimedia recommendation models, but it also introduces task-irrelevant noise that can reduce model robustness. In this paper ...
If you just want to use MIR as the pre-training indicator of your own model, no additional environment is required. python mir.py --model_path PATH/TO/MODEL --base_llm PATH/TO/LLM --text_data_path ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results