Reinforcement Learning for Proactive Caching of Contents with Different Demand Probabilities / Somuyiwa, S.; Gunduz, D.; Gyorgy, A. - (2018), pp. 1-6. (Paper presented at the 15th International Symposium on Wireless Communication Systems, ISWCS 2018, held in Portugal, 2018) [10.1109/ISWCS.2018.8491205].
Reinforcement Learning for Proactive Caching of Contents with Different Demand Probabilities
D. Gunduz
2018
Abstract
A mobile user randomly accessing a dynamic content library over a wireless channel is considered. At each time instant, a random number of contents are added to the library, and each content remains relevant to the user for a random period of time. Contents are classified into finitely many classes such that, whenever the user accesses the system, each content is requested randomly with a class-specific demand probability. Contents are downloaded to the user equipment (UE) through a wireless link whose quality also varies randomly with time. The UE has a cache memory of finite capacity, which can be used to proactively store contents before they are requested by the user. Whenever contents are downloaded, the system incurs a cost (energy, bandwidth, etc.) that depends on the channel state at the time of download and scales linearly with the number of contents downloaded. Our goal is to minimize the expected long-term average cost. The problem is modeled as a Markov decision process, and the optimal policy is shown to exhibit a threshold structure; however, since finding the optimal policy is computationally infeasible, parametric approximations to the optimal policy are considered, whose parameters are optimized using the policy gradient method. Numerical simulations show that the performance gain of the resulting scheme over traditional reactive content delivery is significant and increases with the cache capacity. Comparisons with two performance lower bounds, one computed assuming infinite cache capacity and the other assuming non-causal knowledge of the user access times and content requests, demonstrate that our scheme can perform close to the theoretical optimum.
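The abstract describes a parametric approximation to the optimal threshold policy whose parameters are tuned with the policy gradient method. As an illustration only, the minimal sketch below (not the authors' code) trains a one-parameter soft threshold on the channel cost with REINFORCE, under a heavily simplified toy model: a single content, i.i.d. uniform channel costs, and a geometrically distributed user-access time. All names and numerical values (P_ACCESS, LR, TEMP, EPISODES) are assumptions made for the example, not taken from the paper.

# Minimal policy-gradient sketch for a soft-threshold proactive-caching policy.
# Toy model (assumed, not from the paper): one content, uniform channel costs,
# geometric user-access time; the agent pre-fetches when the channel is cheap.
import numpy as np

rng = np.random.default_rng(0)

P_ACCESS = 0.2      # assumed per-slot probability that the user requests the content
LR = 0.05           # learning rate for the policy parameter
TEMP = 0.1          # softness of the threshold (sigmoid temperature)
EPISODES = 20000

def run_episode(theta):
    """Simulate one content lifetime; return the total download cost and the
    gradient of the log-probability of the taken actions w.r.t. theta (REINFORCE)."""
    cost, grad_logp = 0.0, 0.0
    while True:
        c = rng.uniform()                                    # current channel cost in [0, 1]
        p_fetch = 1.0 / (1.0 + np.exp(-(theta - c) / TEMP))  # soft threshold on the cost
        fetch = rng.uniform() < p_fetch
        # d/dtheta of log pi(action | c) for a Bernoulli(p_fetch) action
        grad_logp += ((1.0 - p_fetch) if fetch else -p_fetch) / TEMP
        if fetch:                           # proactive download at the current cost
            return cost + c, grad_logp
        if rng.uniform() < P_ACCESS:        # user requests it now: reactive download
            return cost + c, grad_logp
        # otherwise keep waiting for a better channel state

theta, baseline = 0.5, 0.0
for ep in range(EPISODES):
    total_cost, grad_logp = run_episode(theta)
    baseline += 0.01 * (total_cost - baseline)           # running-average cost baseline
    theta -= LR * grad_logp * (total_cost - baseline)    # gradient step to reduce expected cost
print(f"learned soft threshold on channel cost: {theta:.3f}")

In this toy version the single parameter theta plays the role of the channel-cost threshold below which proactive caching is worthwhile; the paper's scheme optimizes a richer parametric policy over multiple content classes, cache occupancy, and channel states in the same spirit.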