Education
-
2019
Bioinformatic PhD- CRBM (CNRS), Montpellier - France
Bioinformatic methods development for the study of proteins non-globular regions and their conservation over evolution.
Thesis funded over 3 years by state funding and extended for 1 year thanks to the association “la ligue contre le cancer”. Thesis memoir is not yet publicly available. -
2015
Software engineering diploma- ENSIBS (Université de Bretagne Sud), Vannes - France
Focus on software engineering, software architecture, software quality assessment. -
2014
Study aboard semester- Software engineering, UQAC (Université du Québec à Chicoutimi), Chicoutimi - Quebec, Canada
Focus on software architecture and data mining. -
2012
Diplome universitaire de technologie d’analyste programmeur. - IUT de l’Université de Bretagne Sud, Vannes
Focus on programming, algorithm and software engineering.
Generic technical skills
- Main programming languages :
– Ruby for everyday scripting (thanks to its high expressivity) and in a lesser extent for standalone application development with JRuby (that allow to pseudo-compile ruby code in java byte code)
– Java for standalone application development, the high verbosity and strong structuration of the language make it a perfect tool for long-lasting applications
– SQL for relational databases query (MySQL) - Other programming languages :
– Bash, good notions : short script writing and modification mainly for jobs management on calcul cluster
– Python, good notions : short script writing and modification
– Perl, notions, modifications de scripts existants
– R/Julia, good notions, short scripts writing for statistical treatment and plotting - Tools for software development:
– UML/OCL, for software modelling
– Git, for source management, versioning and collaborative work - Operating system – GNU/Linux on calcul cluster, server or desktop (Debian and Debian based distributions mainly) – Windows … sometimes, to help colleagues
- Other tools
– Calcul on cluster (with both Slurm and SGE scheduler)
– Machine learning with weka (both through the graphical interface and as a Java/JRuby API)
– LaTex (and classical Microsoft Office suite) for redaction
Bioinformatic related technical skills
-
Protein sequences annotation For previous studies I wrote a large protein sequence annotation pipeline, I got a strong experience in the annotation of various kinds of protein regions using state-of-the-art-predictors : disordered regions, amyloidogenic regions, domains (Pfam Cath), transmembrane regions, signal peptide, short eukaryotic linear motifs
-
Relational database design for OMICS data For previous studies I organised a vast amount of data (~1.000.000 proteins fully annotated) as a relational database to make it “readable” for other treatments (web interface, statistical analysis, machine learning …)
-
Protein mutations mapping on structure prediction of mutation (truncation and point mutation) effects on an already determined protein structure
Teaching
- Teaching of the graphical interface theory at Montpellier IUT (> 30 H per group, ~ 25 students per group, 2 groups) : how to collect, reshape and organise users need, how to produce a user interface that fit the needs and how to produce an evolutive interface. And more theoretically what is a “good quality” interface.
Publications
-
Villain E, Nikekhin AA, Kajava AV.
Porins and Amyloids are Coded by Similar Sequence Motifs.
Proteomics. 2019 Mar;19(6):1800075. -
Roche DB, Villain E, Kajava AV.
Usage of a dataset of NMR resolved protein structures to test aggregation versus solubility prediction algorithms.
Protein Science. 2017 Sep;26(9):1864-9. -
Arsic N., Gadea G., Slatter T. , Villain E. , Fournet A., Kazantseva M., Allemand F., Sibille N., Seveno M., Urbach S., Bourdon J-C., Bernado P., Kajava A.V., Braithwaite. A., Roux P.
∆133p53β isoform pro-invasive activity is regulated through an aggregation-dependent mechanism.
Nature Communication, under revision. -
Villain E., Fort P. and Kajava A.V.
Aspartate-phobia of thermophiles as a reaction to deleterious chemical transformations.
Submitted.
Conferences
-
2019
Microbiology department external seminar, Paris Sud XI Orsay University :
Relationship between protein’s disordered regions and organism thermal adaptation
Villain E. -
2016
Journées ouvertes de biologie, informatique et mathématiques, avec la présentation :
Large-scale analysis of amyloidogenic proteins in a dataset of 93 evolutionary diverse proteomes
Villain E., Richard F., Fort P. and Kajava A.V.
Posters
-
2020
Pipeline for large-scale analysis of amyloidogenic regions in proteomes.
Falgarone T., Villain E., Leclercq J., Richard F. and Kajava A.V. -
2019
Winter workshop du COST action “Non-globular protein network” :
Amyloidogenic regions selection over evolution, a proteome wide study.
Villain E., Richard F., Fort P. and Kajava A.V.
Miscellaneous
- Writing of 3 theses extra year funding applications (2 successful)
- 3 Interns supervised : 2 master II (6-month internship) and a master I (3-month internship)