We present a combinatorial algorithm for recognizing protein-coding regions in DNA sequences of higher eukaryotes. The algorithm recognizes protein-coding segments with high specificity and is intended for constructing oligonucleotide probes and PCR primers for the analysis of cDNA libraries or total cell RNA. Unlike other methods, it relies on simple statistical indices (codon usage and positional nucleotide frequencies at splice sites) and is therefore applicable to poorly studied genomes for which large training samples are unavailable. The structure of the algorithm allows the researcher to adapt it readily to various experimental requirements.
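The two statistical indices named above can be illustrated with a minimal sketch. It is not the published algorithm: the function names, the uniform background model, the pseudocount, and the toy frequency tables are all assumptions made for illustration. It scores a candidate segment by the mean log-odds of its codons and scores a candidate splice-site window against a positional nucleotide frequency matrix.

```python
import math
from collections import Counter


def codon_usage_score(seq, codon_freq, background=1 / 64):
    """Mean log-odds of the frame-0 codons of seq against a uniform background.

    codon_freq maps codon -> relative frequency (hypothetical table);
    unseen codons get a small floor value instead of zero.
    """
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    if not codons:
        return 0.0
    total = sum(math.log(codon_freq.get(c, 1e-6) / background) for c in codons)
    return total / len(codons)


def build_pwm(sites, pseudocount=1.0):
    """Positional nucleotide frequencies from aligned, equal-length site examples."""
    pwm = []
    for pos in range(len(sites[0])):
        counts = Counter(site[pos] for site in sites)
        total = len(sites) + 4 * pseudocount
        pwm.append({b: (counts[b] + pseudocount) / total for b in "ACGT"})
    return pwm


def pwm_score(window, pwm, background=0.25):
    """Log-odds of a window under the positional model versus a uniform background."""
    return sum(math.log(col[b] / background) for b, col in zip(window, pwm))


# Toy data, invented for the example only.
toy_codon_freq = {"ATG": 0.5, "GCC": 0.5}
toy_sites = ["GTAAGT", "GTGAGT", "GTAAGA"]
toy_pwm = build_pwm(toy_sites)

print(codon_usage_score("ATGGCC", toy_codon_freq))
print(pwm_score("GTAAGT", toy_pwm), pwm_score("CCCCCC", toy_pwm))
```

Because both indices need only small frequency tables, a sketch of this kind can be parameterized from a handful of known genes, which is the property the abstract highlights for genomes lacking large training samples.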
Number of pages: 6
Publication status: Published - Jan 1997
- Computational molecular biology
- Computer analysis
- DNA sequences
- Protein-coding regions