COG-UK captures SARS-CoV-2 samples from COVID-19 infections from across the UK. These weekly summaries include information on sequencing coverage (the number of sequences available in COG-UK out of the total number of people with a COVID-19 infection) across Wales, Scotland, England and Northern Ireland.
Public Data and Analysis
COG-UK sits at the interface of public health action and academic research. We are committed to open science, and sharing all data that we can as rapidly as possible. This includes sharing data for use by Public Health authorities internationally, to support COVID-19 response, and sharing data in such a way that the academic community can access and use the data and analysis according to FAIR data principles.
Please note that the data that is being shared is the work of hundreds of people and should be treated as unpublished data. If you wish to publish research using the data, please get in contact with COG-UK first to ensure that those who have generated the data can be involved in its analysis.
From this website are aiming to share the results of genomic analysis that feed into weekly updates to the UK Government and Public Health Agencies to help guide its healthcare strategies in responding to, and minimising, the spread of COVID-19 across the UK.
The Centre for Genomic Pathogen Surveillance maintain a Microreact website which permits continuous evaluation of the lineages circulating in the UK (currently updated weekly).
The MRC-University of Glasgow Centre for Virus Research have developed a web application for the analysis of SARS-CoV-2 genome sequences: CoV-GLUE
Latest Sequence Data
Sequences are released rapidly and should be considered draft and subject to change.
All links refer to the latest data build, typically updated daily. The very latest datasets and analysis are available via MRC CLIMB:
- All sequences
- Trimmed and masked alignment
- Unmasked alignment
- Tree (Newick format) of GISAID and COG-UK samples
We regularly deposit datasets into public databases (GISAID for consensus sequences, ENA for raw data).
GISAID - Global Initiative on Sharing All Influenza Data
The Global Initiative on Sharing All Influenza Data (GISAID) database has emerged as the standard database for sharing of SARS-CoV-2 data internationally.
The COG-UK consortium regularly deposits sequences into this database, which can be viewed in international context via Nextstrain.