BioModels Database logo

BioModels Database


Internship opportunities

Curation internship: Literature curation of mechanistic models of disease pathways

BioModels is a central repository of mathematical models of biological/biomedical processes. It is hosted at EMBL-EBI and is one of the resources of the molecular systems cluster. The models distributed through BioModels are extensively tested and encoded in standard formats (for example, SBML (Systems Biology Markup Language)) and are free to use. In addition, the models and their components are cross-referenced with external data resources and ontologies, which facilitates search and retrieval, and maximises the benefits of the growing number of already existing models.

Curating models of biological processes is an effective training in computational systems biology, where the curators gain an integrative knowledge on biological systems, modelling and bioinformatics. We have a number of internship opportunities available within the team, to carry out targeted curation of mechanistic models of certain disease pathways. The topics likely to be focused, but not limited to, are cancer, diabetes [1], arthritis (inflammation & immunology), plant systems, and neurodegeneration [2].

The internships might be ideal for a master degree thesis or for candidates who are in between their masters and PhD. An initial knowledge of biology, mathematics and computing is desirable. The successful candidate will be a master student in bioinformatics or systems biology. Training will be given to understand basics of modelling, and on modelling and simulation tools. The position is ideally for 6 months, but can be extended up to one year. A fixed monthly allowance is provided to help towards living costs.

For further enquiries or to make an application (attach your CV and a cover letter), please contact: Rahuman Sheriff (sheriff AT

  1. Ajmera I., Swat M., Laibe C., Le Novère N., Chelliah V. The impact of mathematical modeling on the understanding of diabetes and related complications. CPT: Pharmacometrics & Systems Pharmacology. Jul 10;2:e54. 2013.
  2. Lloret-Villas A., Varusai T.M., Juty N., Laibe C., Le Novère N., Hermjakob H., Chelliah V. The impact of mathematical modeling in understanding the mechanisms underlying neurodegeneration: evolving dimensions and future directions. CPT: Pharmacometrics & Systems Pharmacology. (in press)

Software development internship: Cluster Analysis of BioModels using Biomedical Ontologies

Ontologies provide formal means of defining concepts and their interrelationships as a knowledge graph routinely consisting of thousands or tens of thousands of terms arranged hierarchically. Amid the exponential increase of data available in life sciences, ontologies have become the cornerstone of data integration and search.

BioModels at the European Bioinformatics Institute is a repository of mathematical models describing biological processes. It hosts over 8400 submissions derived from the literature and over 142000 models that have been automatically generated from pathway resources. All depositions are linked, using semantic annotations, to terms from established resources like the NCBI Taxonomy database or biomedical ontologies such as Gene Ontology, ChEBI or Systems Biology Ontology.

The hierarchical nature of these classifications can be leveraged to help users progressively find the content of interest (e.g., but a common usability challenge with such approaches is the need to prune intermediate categories. Previous attempts to do so ( have relied on manual input and are not immediately reusable for other ontologies.

In this context, this project strives to (1) develop automated methods for clustering models based on their semantic annotations and an arbitrarily-chosen set of ontologies; (2) create novel visualisation components which use the results of the cluster analysis to provide user-friendly ways of browsing BioModels content.

Required skills and experience

  • solid understanding of Computer Science underpinnings, in particular, algorithms and data structures commonly-used in graph theory;
  • familiarity with machine learning approaches and toolkits;
  • good command of one or more mainstream programming languages (Java, C++, Python);
  • willingness to develop new abilities as the project requires;
  • self-motivated and capable of working both independently and as part of a team;
  • proficient communication, interpersonal and English language skills;

Desirable experience:

  • hands-on experience of working with Semantic Web technologies;
  • familiarity with modern JavaScript language features and libraries

This project is expected to last around six months and would be ideally suited for a dissertation project or placement. For further enquiries or to make an application (attach your CV and a cover letter), please contact: Mihai Glont (mglont AT