Virtual course

Next generation sequencing bioinformatics

A guide to the technology, analysis workflows, tools, and resources for next generation sequencing data analysis.

This virtual course will provide insights into how biological knowledge can be derived from genomics experiments and explain different approaches in analysing such data. The main focus will be on assembly, re-sequencing, and variant calling during the analysis of higher-eukaryotes, with a particular emphasis on human genetic research. Throughout the week, more advanced topics will introduce the creation of pipelines, automation, and the scaling-up of analysis experiments.

Practical sessions will enable participants to process training datasets and apply appropriate statistical methods in their analyses. There will be no opportunity to work with personal research data during the course.

Virtual course

Participants will learn via a mix of pre-recorded lectures, live presentations, and trainer Q&A sessions. Practical experience will be developed through group activities and trainer-led computational exercises. Live sessions will be delivered using Zoom with additional support and communication via Slack

Pre-recorded material will be made available to registered participants prior to the start of the course and in the week before the course there will be a brief induction session. Computational practicals will run on EMBL-EBI's virtual training infrastructure, meaning participants will not require access to a powerful computer or install complex software on their own machines.

Participants will need to be available between the hours of 09:30-17:30 GMT each day of the course. Trainers will be available to assist, answer questions and further explain the analysis during these times.

Who is this course for?

The course is aimed at PhD students and post-doctoral researchers who are starting to use high-throughput sequencing technologies and bioinformatics methods in their research. The content is most applicable for those working with eukaryotic genomes, especially in the area of human genetics and rare-disease research.

Participants will require a basic knowledge of the Unix command line and the Ubuntu 18 operating system. We recommend these free tutorials:

Please note: participants without basic knowledge of these resources will have difficulty in completing the practical sessions.

What will I learn?

Learning outcomes

After this course you should be able to:

  • State the advantages and limitations of high-throughput assays
  • Apply appropriate short read aligners to unassembled reads
  • Perform variant calling analysis and annotation
  • Scale-up and automate simple genomics pipelines
  • Access genomic datasets from online public resources

Course content

During this course you will learn about:

  • Quality control methods for cleaning raw read data
  • Alignment of reads to a reference genome
  • File format conversion and processing
  • Tools for variant calling
  • Methodologies for variant annotation
  • Approaches for scaling up and reproducing data
  • Data resources for genomics data

Trainers

Tom Hancocks
EMBL-EBI
Chiara Batini
University of Leicester
Charles Solomon
University of Leicester
Kayesha Coley
University of Leicester
Noemi Piga
University of Leicester
Sean Laidlaw
Wellcome Sanger Institute
Raheleh Rabhari
Wellcome Sanger Institute
Malvika Sharan
The Alan Turing Institute
Emily Perry
EMBL-EBI
Marcela Uliano-Silva
Wellcome Sanger Institute
Sam Holt
EMBL-EBI
Baron Koylass
EMBL-EBI
Alan Tracey
Wellcome Sanger Institute
This course has ended

15 - 19 February 2021
£200
Contact
Marina Pujol

Organisers
  • Tom Hancocks
    EMBL-EBI
  • Chiara Batini
    University of Leicester

Share this event with: