| |
The International Union of Pure and Applied Chemistry
(IUPAC) has defined a standard
representation of DNA bases by single characters that specify either a
single base (e.g. G for guanine, A for adenine) or a set of bases
(e.g. R for either G or A). UCSC uses these single character codes to
represent multiple observed alleles of single-base polymorphisms.
| Symbol |
Bases |
Origin of designation |
| G |
G |
Guanine |
| A |
A |
Adenine |
| T |
T |
Thymine |
| C |
C |
Cytosine |
| R |
G or A |
puRine |
| Y |
T or C |
pYrimidine |
| M |
A or C |
aMino |
| K |
G or T |
Keto |
| S |
G or C |
Strong interaction (3 H bonds) |
| W |
A or T |
Weak interaction (2 H bonds) |
| H |
A or C or T |
not-G, H follows G in the alphabet |
| B |
G or T or C |
not-A, B follows A |
| V |
G or C or A |
not-T (not-U), V follows U |
| D |
G or A or T |
not-C, D follows C |
| N |
G or A or T or C |
aNy |
| |