PubMed link [PMID] – 20581402
DOI link – 10.1093/bioinformatics/btq323
Bioinformatics 2010 Aug; 26(16): 1990–1998
In statistical bioinformatics research, different optimization mechanisms can lead to ‘over-optimism’ in published papers. So far, however, a systematic critical study of the various sources underlying this over-optimism has been lacking.

We present an empirical study of over-optimism using high-dimensional classification as an example. Specifically, we consider a ‘promising’ new classification algorithm: linear discriminant analysis incorporating prior knowledge on gene functional groups through an appropriate shrinkage of the within-group covariance matrix. Although this approach yields poor results in terms of error rate, we quantitatively demonstrate that it can artificially seem superior to existing approaches if we ‘fish for significance’. The investigated sources of over-optimism include the optimization of datasets, of settings, of competing methods and, most importantly, of the method’s characteristics. We conclude that, if the improvement of a quantitative criterion such as the error rate is the main contribution of a paper, the superiority of new algorithms should always be demonstrated on independent validation data.

The R code and relevant data can be downloaded from http://www.ibe.med.uni-muenchen.de/organisation/mitarbeiter/020_professuren/boulesteix/overoptimism/, so that the study is completely reproducible.
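For readers unfamiliar with the technique the abstract refers to, the R sketch below illustrates the general idea of shrinking a pooled within-group covariance matrix toward a target that encodes gene functional groups, and then classifying by Mahalanobis distance. It is a minimal illustration under assumptions, not the authors' implementation (that is available at the URL above): the block-diagonal target, the fixed shrinkage intensity lambda, the ridge term eps and all function names here are hypothetical choices made for exposition.

    ## Minimal sketch of LDA with a shrunken within-group covariance matrix.
    ## Assumption: prior knowledge enters via a block-diagonal target that
    ## keeps covariances only between genes in the same functional group.

    shrink_lda_train <- function(X, y, groups, lambda = 0.5, eps = 1e-3) {
      # X: n x p expression matrix; y: class labels;
      # groups: length-p vector assigning each gene to a functional group
      classes <- levels(factor(y))
      p <- ncol(X)

      # pooled within-class covariance estimate S
      S <- matrix(0, p, p)
      for (cl in classes) {
        Xc <- scale(X[y == cl, , drop = FALSE], center = TRUE, scale = FALSE)
        S <- S + crossprod(Xc)
      }
      S <- S / (nrow(X) - length(classes))

      # block-diagonal target: zero out covariances across gene groups
      tgt <- S * outer(groups, groups, "==")

      # convex-combination shrinkage estimator; the small ridge eps keeps
      # the matrix invertible when p exceeds n (typical for microarray data)
      Sigma <- lambda * tgt + (1 - lambda) * S + diag(eps, p)

      means <- sapply(classes, function(cl) colMeans(X[y == cl, , drop = FALSE]))
      list(Sigma_inv = solve(Sigma), means = means, classes = classes)
    }

    shrink_lda_predict <- function(fit, Xnew) {
      # equal class priors assumed: assign each row of Xnew to the class
      # whose mean is closest in Mahalanobis distance
      d <- apply(fit$means, 2, function(m) {
        D <- sweep(Xnew, 2, m)
        rowSums((D %*% fit$Sigma_inv) * D)
      })
      fit$classes[max.col(-d)]
    }

The block-diagonal target is only one plausible way to encode gene-group knowledge, and in practice the shrinkage intensity would be tuned rather than fixed; that tuning step is itself one of the optimization mechanisms the paper identifies as a source of over-optimism, which is why it should be validated on independent data.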