Edit

Microbiome Informatics

Suite of services for the assembly, archiving, and analysis of microbiome-derived sequence data

The Finn team develops the software and analysis pipelines that underpin the MGnify metagenomics database, and are responsible for the HMMER webservers. The team also includes the HGNC and VGNC, which are responsible for the assignment of gene symbols and names in human and selected vertebrates.

About our data services

Our services use complex mathematical models tailored for life-science research. For example, the HMMER algorithm offers fast detection of distantly related proteins and is available through our website infrastructure. We aim to simplify access to curated, complex data, and to maximise biological knowledge by extending annotation based on sequence similarity.

Understanding environmental samples

Our MGnify service enables researchers to submit sequence data and associated descriptive metadata about environmental samples to public nucleotide archives. Once deposited, our team helps ensure the data is functionally analysed (using an InterPro-based pipeline), taxonomically analysed and visualised via a web interface. 

This graphic shows the workflow for our metagenomics analysis.

Training

We participate in EMBL-EBI’s Training Programme, offering courses in metagenomics and other approaches to sequence analysis.

Collaboration

We welcome new collaborations in all areas, and are particularly interested in working with people who have developed new tools for analysis, or who are working with metagenomics datasets generated with new sequencing technology.

Data resources

Ensembl

Ensembl

The Ensembl project, founded in 1999 to support the results of the Human Genome Project, supports over 80 vertebrate species and provides resources such as reference gene sets, whole genome alignments, gene homology annotation, gene sequence alignments, variant annotation and regulatory regions. Man…

Ensembl Genomes

Ensembl Genomes

Ensembl Genomes is an integrating portal providing access to genome-scale data from across the taxonomic space. Using the same infrastructure developed in the context of the vertebrate-focused Ensembl project, it offers consistent interactive and programmatic user interfaces to data from important i…

Edit