Abstract

The most representative allocation strategies for the multi-armed bandit problem are analyzed in a setting with delayed rewards, by means of a numerical study based on discrete event simulation. The scenario addressed is a digital marketing content recommendation system, known as campaign management, which marketers use to create digital content that can be issued or configured for viewing by specific population segments according to business variables, user profiles, or user behavior. Both batch-mode and online-update architectures are considered for the feedback received from the different contents displayed to users. The results show that possibilistic reward (PR) methods outperform the other allocation strategies in this delayed-reward scenario.
| Field | Value |
|---|---|
| International | Yes |
| Congress | 11th International Conference of the ERCIM (European Research Consortium for Informatics and Mathematics) Working Group on Computational and Methodological Statistics (CMStatistics 2018) |
| | 960 |
| Place | Pisa, Italy |
| Reviewers | Yes |
| ISBN/ISSN | 978-9963-2227-5-9 |
| Start Date | 14/12/2018 |
| End Date | 16/12/2018 |
| From page | 10 |
| To page | 10 |
| | Programme and Abstracts |