FASTA file format
Import
Marvin import the FASTA files if the comment line of it starts with a '>' character. If the sequence type is not specified, Marvin will try to guess it from the comment header and from the sequence.
Codename: fasta
DNA, RNA
We import the following codes.
|
Nucleic Acid Code |
Meaning |
Mnemonic |
|
A |
A |
Adenine |
|
C |
C |
Cytosine |
|
G |
G |
Guanine |
|
U |
U |
Uracil (only in RNA) |
|
T |
U |
Timin (only in DNA) |
|
R |
A or G |
puRine |
|
Y |
C, T or U |
pYrimidines |
|
K |
G, T or U |
bases which are Ketones |
|
M |
A or C |
bases with aMino groups |
|
S |
C or G |
Strong interaction |
|
W |
A, T or U |
Weak interaction |
|
B |
not A (i.e. C, G, T or U) |
B comes after A |
|
D |
not C (i.e. A, G, T or U) |
D comes after C |
|
H |
not G (i.e., A, C, T or U) |
H comes after G |
|
V |
neither T nor U (i.e. A, C or G) |
V comes after U |
|
N |
A C G T U |
aNy |
|
X |
masked |
|
|
- |
gap of indeterminate length |
|
Ambigous nucleosides are represented with a labeled pseudoatom.
Codename: fasta:dna, fasta:rna
Peptide
|
Code |
Aminoacid |
|
A |
Alanine |
|
B |
Aspartic acid or Asparagine |
|
C |
Cysteine |
|
D |
Aspartic acid |
|
E |
Glutamic acid |
|
F |
Phenylalanine |
|
G |
Glycine |
|
H |
Histidine |
|
I |
Isoleucine |
|
K |
Lysine |
|
L |
Leucine |
|
M |
Methionine |
|
N |
Asparagine |
|
O |
Pyrrolysine |
|
P |
Proline |
|
Q |
Glutamine |
|
R |
Arginine |
|
S |
Serine |
|
T |
Threonine |
|
U |
Selenocysteine |
|
V |
Valine |
|
W |
Tryptophan |
|
Y |
Tyrosine |
|
Z |
Glutamic acid or Glutamine |
|
X |
any |
|
* |
translation stop |
|
- |
gap of indeterminate length |
Codename: fasta:peptide