Abstract: To address the optimization scheduling problem of photovoltaic storage charging stations, the traditional Proximal Policy Optimization (PPO) algorithm, which uses a fixed penalty coefficient ...