Approximate information maximization for bandit games

Published on 6 Oct. 2023

Alex Barbier-Chebbah, Christian L. Vestergaard, Jean-Baptiste Masson, Etienne Boursier

Link to HAL: hal-04264220

2023

Entropy maximization and free energy minimization are general physical principles for modeling the dynamics of various physical systems. Notable examples include modeling decision-making within the brain using the free-energy principle, optimizing the accuracy-complexity trade-off when accessing hidden variables with the information bottleneck principle (Tishby et al., 2000), and navigation in random environments using information maximization (Vergassola et al., 2007). Building on this principle, we propose a new class of bandit algorithms that maximize an approximation to the information of a key variable within the system. To this end, we develop an approximate, analytical, physics-based representation of an entropy to forecast the information gain of each action, and we greedily choose the action with the largest information gain. This method yields strong performance in classical bandit settings. Motivated by its empirical success, we prove its asymptotic optimality for the two-armed bandit problem with Gaussian rewards. Owing to its ability to encompass the system’s properties in a global physical functional, this approach can be efficiently adapted to more complex bandit settings, calling for further investigation of information maximization approaches for multi-armed bandit problems.
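The greedy rule described in the abstract (forecast the information gain of each action, then pick the largest) can be illustrated with a minimal sketch. This is not the paper's approximate entropy functional. As a stand-in, the sketch assumes a two-armed bandit with unit-variance Gaussian rewards and a flat prior, takes the identity of the better arm as the "key variable", and computes the one-step expected entropy reduction by numerical quadrature. All function names are hypothetical, and this simplified target favors identifying the best arm rather than the regret-optimal behavior analyzed in the paper.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def binary_entropy(p):
    """Shannon entropy (in nats) of a Bernoulli(p) variable."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log(p) - (1.0 - p) * math.log(1.0 - p)


def best_arm_entropy(means, counts):
    """Entropy of the event 'arm 0 is the better arm'.

    Assumption: the posterior of each arm's mean is N(means[i], 1 / counts[i]),
    which holds for unit-variance Gaussian rewards under a flat prior.
    """
    scale = math.sqrt(1.0 / counts[0] + 1.0 / counts[1])
    p_arm0_best = normal_cdf((means[0] - means[1]) / scale)
    return binary_entropy(p_arm0_best)


def expected_information_gain(arm, means, counts, grid=201, width=6.0):
    """One-step expected entropy reduction from pulling `arm`.

    The next reward y follows the posterior predictive
    N(means[arm], 1 + 1 / counts[arm]); the expectation over y is taken
    on a uniform grid of standardized values z in [-width, width].
    """
    predictive_std = math.sqrt(1.0 + 1.0 / counts[arm])
    step = 2.0 * width / (grid - 1)
    weighted_entropy = 0.0
    total_weight = 0.0
    for k in range(grid):
        z = -width + k * step
        weight = math.exp(-0.5 * z * z)  # unnormalized Gaussian weight
        y = means[arm] + predictive_std * z
        new_means = list(means)
        new_counts = list(counts)
        new_counts[arm] = counts[arm] + 1
        new_means[arm] = (means[arm] * counts[arm] + y) / new_counts[arm]
        weighted_entropy += weight * best_arm_entropy(new_means, new_counts)
        total_weight += weight
    return best_arm_entropy(means, counts) - weighted_entropy / total_weight


def choose_arm(means, counts):
    """Greedily pick the arm with the largest forecast information gain."""
    gains = [expected_information_gain(a, means, counts) for a in (0, 1)]
    return max((0, 1), key=lambda a: gains[a])
```

For example, `choose_arm([0.0, 0.0], [1, 100])` selects arm 0: with equal empirical means, one more sample from the arm observed only once is forecast to reduce the uncertainty far more than one more sample from the arm already observed 100 times.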
