If you just want to use MIR as the pre-training indicator of your own model, no additional environment is required. python mir.py --model_path PATH/TO/MODEL --base_llm PATH/TO/LLM --text_data_path ...
Abstract: To fulfill the demands of emerging multi-modal services, the cross-modal semantic communication paradigm comes into being. It fully utilizes potential semantic correlations among modalities ...
Abstract: Multi-modal salient object detection (MSOD) aims to boost saliency detection performance by integrating visible sources with depth or thermal infrared ones. Existing methods generally design ...