Temperature control for cyber-physical thermal systems over wireless networks: A model-assisted deep reinforcement learning approach