Summary statistics

Data are available for both the Affymetrix 500K and Infinium 15K chips. All files are available on the Sanger Institute FTP site.

Affymetrix 500K

The result files for the analyses described in the consortium paper for (i) each case collection against the pooled control groups 58C and UKBS, (ii) combining other case collections as controls and (iii) combining phenotypically relevant case collections (RA/T1D, autoimmune - CD/RA/T1D, cardiovascular - HT/CAD/T2D) are available below. Data are split by chromosome. Chromosome X is split into 'chromosome 23' (the part not pseudoautosomal with Y) and 'chromosome 24' (PAR).

Autosomes and pseudoautosomal region on chromosome X

Download the complete data set or subsets with just the data relevant to each disease (Bipolar Disorder, Coronary Artery Disease, Crohn's Disease, Hypertension, Rheumatoid Arthritis, Type 1 Diabetes, Type 2 Diabetes).

The files have columns:

Chromosome X (non-pseudoautosomal)

Download the complete data set. The files are named snptest_{case}_{control}_23.txt with the abbreviations:

These files include the information above but with some additional columns:

The columns cases_ and controls_ without the _male suffix refer to females. Note that some of the columns listed above for autosomes are not relevant here; in which cases a "." will be found in place of data.

Imputed genotypes

Summary statistics as above for HapMap SNPs not present on the Affymetrix 500K Genechip and for which the genotypes of the WTCCC samples have been obtained by imputation as described in the paper.

Download the complete data set or subsets, one per collection (1958 British Birth Cohort, UK Blood Service, Bipolar Disorder, Coronary Artery Disease, Crohn's Disease, Hypertension, Rheumatoid Arthritis, Type 1 Diabetes, Type 2 Diabetes).

The files have columns:

Illumina Infinium 15K (non-synonymous)

There are four files. Download them all in a ZIP archive.

The file genotype_counts.txt contains genotype counts for each cohort used in the analysis. Counts for the homozygous genotype containing the minor allele are listed first, followed by counts for the heterozygote and other homozygote. Poorly genotyping SNPs that were excluded from analyses are set to missing:

The file original_analyses.txt documents the analyses performed on each case group versus the 1958 birth cohort control group:

The file pooled_analyses.txt documents the analyses for each case group versus a combined control group consisting of each of the other disease groups and the 1958 Birth Cohort:

The file top_hits_pooled_analyses.txt contains results for SNPs that have p values less than 0.001 in the combined analyses: