A reinforcement learning approach to age of information in multi-user networks with HARQ

Ceran, E. T.; Gunduz, D.; Gyorgy, A.

doi:10.1109/JSAC.2021.3065057

Scheduling the transmission of time-sensitive information from a source node to multiple users over error-prone communication channels is studied with the goal of minimizing the long-term average age of information (AoI) at the users. A long-term average resource constraint is imposed on the source, which limits the average number of transmissions. The source can transmit only to a single user at each time slot, and after each transmission, it receives an instantaneous ACK/NACK feedback from the intended receiver, and decides when and to which user to transmit the next update. Assuming the channel statistics are known, the optimal scheduling policy is studied for both the standard automatic repeat request (ARQ) and hybrid ARQ (HARQ) protocols. Then, a reinforcement learning (RL) approach is introduced to find a near-optimal policy, which does not assume any a priori information on the random processes governing the channel states. Different RL methods including average-cost SARSA with linear function approximation (LFA), upper confidence reinforcement learning (UCRL2), and deep Q-network (DQN) are applied and compared through numerical simulations.

A reinforcement learning approach to age of information in multi-user networks with HARQ / Ceran, E.T., Gunduz, D., Gyorgy, A.. - In: IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS. - ISSN 0733-8716. - 39:5(2021), pp. 1412-1426. [10.1109/JSAC.2021.3065057]

A reinforcement learning approach to age of information in multi-user networks with HARQ

Ceran E. T.;Gunduz D.;Gyorgy A.

2021

Abstract

Scheduling the transmission of time-sensitive information from a source node to multiple users over error-prone communication channels is studied with the goal of minimizing the long-term average age of information (AoI) at the users. A long-term average resource constraint is imposed on the source, which limits the average number of transmissions. The source can transmit only to a single user at each time slot, and after each transmission, it receives an instantaneous ACK/NACK feedback from the intended receiver, and decides when and to which user to transmit the next update. Assuming the channel statistics are known, the optimal scheduling policy is studied for both the standard automatic repeat request (ARQ) and hybrid ARQ (HARQ) protocols. Then, a reinforcement learning (RL) approach is introduced to find a near-optimal policy, which does not assume any a priori information on the random processes governing the channel states. Different RL methods including average-cost SARSA with linear function approximation (LFA), upper confidence reinforcement learning (UCRL2), and deep Q-network (DQN) are applied and compared through numerical simulations.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2021
			
	Rivista
	
				IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS
			
	N° del Volume
	
				39
			
	Fascicolo
	
				5
			
	Pagina iniziale
	
				1412
			
	Pagina finale
	
				1426
			
	Codice DOI
	
				https://dx.doi.org/10.1109/JSAC.2021.3065057
			
	Codice WoS
	
				WOS:000641962200017
			
	Codice Scopus
	
				2-s2.0-85102694022
			
	Citazione
	
				A reinforcement learning approach to age of information in multi-user networks with HARQ / Ceran, E.T., Gunduz, D., Gyorgy, A.. - In: IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS. - ISSN 0733-8716. - 39:5(2021), pp. 1412-1426. [10.1109/JSAC.2021.3065057]
			
	Tutti gli autori
	
						Ceran, E. T.; Gunduz, D.; Gyorgy, A.
					
	Tipologia
	
				Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
A_Reinforcement_Learning_Approach_to_Age_of_Information_in_Multi-User_Networks_With_HARQ.pdf Accesso riservato Tipologia: VOR - Versione pubblicata dall'editore Dimensione 3.03 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	3.03 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

Pubblicazioni consigliate

I metadati presenti in IRIS UNIMORE sono rilasciati con licenza Creative Commons CC0 1.0 Universal, mentre i file delle pubblicazioni sono rilasciati con licenza Attribuzione 4.0 Internazionale (CC BY 4.0), salvo diversa indicazione.
In caso di violazione di copyright, contattare Supporto Iris