Multi-objective scheduling in wireless networks with deep reinforcement learning