Reinforcement learning-based spectrum management for cognitive radio networks: A literature review and case study