Statistical analysis and prediction of the exonic structure of human genes

Research output: Contribution to journalArticlepeer-review

21 Citations (Scopus)


Nonhomologous fully sequenced human protein-coding genes were studied. Three sets of exon-exon junctions were formed defined by the intron (shadow) position relative to the reading frame. For the analysis of intron shadow signals in exons, information content and discrimination energy approaches were used with the correction allowing one to ignore the influence of a protein-coding message. The corrected formulas allow one to define the consensuses for the three types of intron shadow signals as aAG/guwn, cAG/GUnn, and cAG/gunU, and provide better recognition than the original formulas. The analysis of the codon usage in the signal positions leads to the conclusion that the prevalence of some amino acids in corresponding protein sites is caused by the signal requirements and not vice versa. The distribution of potential intron shadow signals in exons contradicts the hypothesis of intron insertion into suitable preexisting sites. There exists a correlation between the intron types and/or the exon length modulo 3.

Original languageEnglish
Pages (from-to)239-252
Number of pages14
JournalJournal of Molecular Evolution
Issue number3
Publication statusPublished - Sep 1992
Externally publishedYes


  • Computer analysis
  • Exon-exon junctions
  • Exon-intron structure
  • Exons
  • Human
  • Introns
  • Splicing


Dive into the research topics of 'Statistical analysis and prediction of the exonic structure of human genes'. Together they form a unique fingerprint.

Cite this