The aim of this project is to solve problems involving sequential decision-making under uncertainty by building agents that learn by reinforcement. Problems of this nature involve situations in which an agent (be it a human, an organization or a computer program) needs to make a series of decisions over time, and each decision can influence the future options available and the possible outcomes.   APPLICATION EXAMPLES:   MANDATORY INFRASTRUCTURE RESOURCES:   COMPLETION AND DELIVERY OF THE PROJECT: All the prototypes generated during the project are delivered at the end of the 10th week.