In a quantum device composed of multiple qubits, a reinforcement learning agent learns to preserve quantum information by selecting gate sequences and performing measurements. Based on measurement ...