This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.

This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.

Reinforcement Learning algorithms for IoT communications over uncoordinated access channels

CAVALAGLI, CHIARA
2023/2024

Abstract

This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.
2023
Reinforcement Learning algorithms for IoT communications over uncoordinated access channels
This thesis focuses on massive remote monitoring applications, where thousands of devices send time stamped updates over a wireless channel to a common receiver. An uncoordinated communication protocol based on slotted ALOHA is adopted, with the overall objective to keep an up-to-date perception at the receiver, in terms of data freshness - Age of Information (AoI). Under the paradigm of Reinforcement Learning, this thesis proposes and evaluates a Q-learning Multi-Agent algorithm, the AoI-Q-ALOHA method, where the users develop ad-hoc strategies for the minimisation of their local mean AoI. Without any communication with the others, except for the central receiver, the agents will learn on a binary success/collision feedback. Team behaviour is encouraged through a tailored individual reward designation, without any assumption on the network population. We compare the performance in terms of the overall AoI, Throughput and Fairness index, to the standard slotted-ALOHA protocol, and to the threshold-ALOHA policy, a benchmark protocol that resorts to a central optimization of the access parameters. Interesting insights on the distributed setup are derived, as well an exhaustive survey on the properties and robustness of the algorithm.
Reinforcement
IoT
Algorithms
Learning
Channels
File in questo prodotto:
File Dimensione Formato  
Cavalagli_Chiara.pdf.pdf

accesso riservato

Dimensione 792.45 kB
Formato Adobe PDF
792.45 kB Adobe PDF

The text of this website © Università degli studi di Padova. Full Text are published under a non-exclusive license. Metadata are under a CC0 License

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12608/68799