DEEP REINFORCEMENT LEARNING BASED MULTI OBJECTIVE X2026