Abstract: Visual reinforcement learning (VRL) aims to learn optimal policies directly from pixel data, which holds significant potential for applications in control systems characterized by data ...