Wordle Duel: Can Deep Reinforcement Learning Architectures Outperform a Heuristic Entropy Maximizer?
KOGUT, MICHAL KRZYSZTOF
2024/2025
Abstract
This thesis provides a comparative analysis of information-theoretic and deep reinforcement learning approaches to solving a popular word puzzle game: Wordle. The game challenges players to deduce a hidden five-letter word within six attempts, with feedback after each guess indicating letter correctness and positioning. Its constrained action space, limited to valid five-letter words, combined with its stochastic nature and sequential decision-making, makes it an ideal test bed for contrasting an entropy-driven strategy, focused on uncertainty reduction, with deep reinforcement learning's adaptive decision-making under uncertainty. The primary objective is to rigorously compare these methodologies, particularly under the critical constraint of a restricted guess space: the algorithms may guess only words consistent with the feedback pattern observed after each attempt. This restriction mimics the intuitive "narrowing down" strategy often used by casual players. At the same time, it reduces computational complexity and yields a dynamically limited action space in which valid guesses are strictly confined to the set of words still possible given the clues received, simplifying the comparison across the tested approaches. Additionally, the target word is uniformly sampled from a predefined lexicon, eliminating real-world frequency biases so that algorithmic performance can be evaluated under fair conditions. By enforcing these constraints, the analysis evaluates how reinforcement learning's capacity for long-term reward optimization compares with the entropy method's stepwise uncertainty reduction, with the goal of minimizing the number of puzzle-solving attempts.
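The two mechanisms the abstract relies on, filtering candidates to those consistent with observed feedback and scoring guesses by expected information gain, can be sketched in a few lines. This is a minimal illustrative sketch, not code from the thesis; all function names are hypothetical, and it assumes the thesis's setting of a uniformly sampled target word.

```python
from collections import Counter
from math import log2

def feedback(guess: str, target: str) -> str:
    """Wordle feedback: 'g' = right letter, right spot; 'y' = right letter,
    wrong spot; 'b' = letter absent (after greens and earlier yellows)."""
    result = ["b"] * 5
    remaining = Counter()           # target letters not matched by a green
    for i, (g, t) in enumerate(zip(guess, target)):
        if g == t:
            result[i] = "g"
        else:
            remaining[t] += 1
    for i, g in enumerate(guess):   # second pass assigns yellows
        if result[i] == "b" and remaining[g] > 0:
            result[i] = "y"
            remaining[g] -= 1
    return "".join(result)

def filter_candidates(candidates, guess, pattern):
    """Restricted guess space: keep only words still consistent
    with the feedback the guess produced."""
    return [w for w in candidates if feedback(guess, w) == pattern]

def entropy(guess, candidates):
    """Expected information (in bits) from playing `guess`, assuming the
    hidden word is uniform over the remaining candidates."""
    counts = Counter(feedback(guess, w) for w in candidates)
    n = len(candidates)
    return -sum((c / n) * log2(c / n) for c in counts.values())

def best_guess(candidates):
    """Entropy maximizer confined to the feedback-consistent words."""
    return max(candidates, key=lambda g: entropy(g, candidates))
```

The `best_guess` step is what the abstract calls the heuristic entropy maximizer; a reinforcement learning agent would instead select from the same filtered candidate set using a learned policy.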
https://hdl.handle.net/20.500.12608/89789