front 1 1. | back 1 In any segment of DNA, typically only one frame |
front 2 2. | back 2 Genes do not often overlap by more than a few |
front 3 3. | back 3 The gene density in phage genomes is very
high, so |
front 4 4. | back 4 Most protein-coding genes will have coding potential predicted
by |
front 5 5. | back 5 Many phage genes are unique, and will not have any |
front 6 6. | back 6 Some protein-coding genes may not be predicted by |
front 7 7. | back 7 If there are two genes transcribed in opposite directions
whose |
front 8 8. | back 8 Protein-coding genes are generally at |
front 9 9. | back 9
Switches in gene orientation (from forward
to |
front 10 10. | back 10 Each protein-coding gene ends with a stop codon
(TAG, |
front 11 11. | back 11 Each protein-coding gene starts with an initiation |
front 12 12. | back 12 An important task is choosing between different |
front 13 12a. | back 13 The relationship to the closest upstream gene is
important. |
front 14 12b. | back 14 The position of the start site is often conserved among
homologues of genes. |
front 15 12c. | back 15 The preferred start site usually has a favorable RBS score within
all |
front 16 12d. | back 16 Manual inspection can be helpful to distinguish between |
front 17 12e. | back 17 Your final start-site selection will likely represent a compromise of
these sub-principles. A corollary to the choosing start guidelines:
Sometimes the best start leads to the choice between 2 tandem start
codons (i.e. one is right after the other). From a small amount of
mass spec data and some basic biology principles, always choose the
second start codon. For example, the Met-Met “ATGATG” or Met-Leu
“TGATGTTGA” start codons |
front 18 13. | back 18 tRNA genes are not called precisely in the program |
front 19 14. | back 19 Protein assignments require rigorous review of the
ever- |
front 20 15. | back 20 Iteration is key. Annotation is like writing a paper;
after |