Various filters were applied to the raw genotyping results to derive a unique set of high-reliability Calls. This file summarises, on a batch-by-batch basis, the results of the checking operations used to produce the final distributed values. In addition, some loci were tested using multiple probesets and it was necesary to identify which (if any) probeset gave the best result on any particular batch.
The columns in the file are as follows:
- Batch ID;
- Affymetrix SNP ID;
- Affymetric probset ID;
- Test: locus sufficiently well-defined in annotation dictionary;
- Test: locus present on the array used to genotype current batch;
- Test: probeset regarded as performing well by Affymetrix;
- Test: probeset judged to be the best available (when more than one used for a SNP), or only one;
- Test: WTCHG quality-control indicates final Calls are reliable.
To appear in the final Calls results, a locus/probeset combination must pass all 5 tests and would have 5 'u' values listed alongside it. These rows (around 85% of combinations) have been removed from the file to save space.
This resource is not suitable for displaying within a web-browser.
It can be downloaded or viewed using the link: gen_probe_discard.zip
If you have wget available (typically on linux systems), then you can also obtain a copy using the command
wget -nd biobank.ctsu.ox.ac.uk/ukb/ukb/docs/gen_probe_discard.zip