Reinforcement Learning algorithms for IoT communications over uncoordinated access channels

This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.

Reinforcement Learning algorithms for IoT communications over uncoordinated access channels

CAVALAGLI, CHIARA

2023/2024

Abstract

This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.

Scheda

Scheda DC

	Facoltà/Dipartimento
	
				Dipartimento di Matematica "Tullio Levi-Civita" - DM
			
	Corso di studio
	
				DATA SCIENCE Laurea Magistrale (D.M. 270/2004)
			
	Anno Accademico
	
				2023
			
	Titolo inglese
	
				Reinforcement Learning algorithms for IoT communications over uncoordinated access channels
			
	Abstract in italiano
	
				This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.
			
	Parola chiave
	
				Reinforcement
IoT
Algorithms
Learning
Channels
			
	Relatore
	
				BADIA, LEONARDO
			
	Appare nelle tipologie:
	
				Lauree magistrali

File in questo prodotto:

File	Dimensione	Formato
Cavalagli_Chiara.pdf.pdf Accesso riservato Dimensione 792.45 kB Formato Adobe PDF	792.45 kB	Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/68799