International E-publication: Publish Projects, Dissertation, Theses, Books, Souvenir, Conference Proceeding with ISBN.  International E-Bulletin: Information/News regarding: Academics and Research

ORF Investigator: A New ORF finding tool combining Pairwise Global Gene Alignment

Author Affiliations

  • 1Department of Bioinformatics, UCST, Dehradun, INDIA
  • 2 Department of Biotechnology, DDU University, Gorakhpur, INDIA

Res. J. Recent Sci., Volume 1, Issue (11), Pages 32-35, November,2 (2012)


Bioinformatics tools have become an integral part of the molecular data generated during the DNA fingerprinting of fungal pathogens. Finding and annoting the coding and non coding regions and final product in the form of its amino acid sequences is prerequisite for understanding the evolutionary processes in different pathogenic, fungi, as well as the species used for bioremediation, the medicinal and for biofertilizers applications. In the present study an attempt has been made to develop a tool “ORF Investigator” which not only gives information about the coding and non coding sequences but also can perform pairwise global alignment of different gene/DNA regions sequences. The tool efficiently finds out the ORFs for corresponding amino acid sequences and converts them into their one letter amino acid code declaring their respecting positions in the sequence stretch. The pairwise global alignment between the sequences makes it convenient to detect the different mutations including single nucleotide polymorphism. Needleman and Wunsch algorithms are used for the gene alignment and the coding has been done in PERL language making it suitable for windows user.


  1. Frishman D., Mironov A., Mewes H.W. and Gelfand M., Combining diverse evidence for gene recognition in completely sequenced bacterial genomes, Nucleic Acids Research, 26(12), 2941-2947 (1998)
  2. Skovgaard M., Jensen L.J., Brunak S., Ussery D. and Krogh A., On the total number of genes and their length distribution in completemicrobial genomes, Trends in Genetics, 17(8)425-428 (2001)
  3. Kawarabayasi Y., Complete genome sequence of an aerobic hyperthermophilic crenarchaeon, Aeropyrum pernix K1. DNA Res, , 83-101 (1999)
  4. Fickett J., Recognition of protein coding regions in DNA sequences, Nucleic Acids Research, 17, 5303-5318 (1982)
  5. Gribskov M., Devereux J. and Burgess R., The codon preference plot: Graphic analysis of protein coding sequences and prediction of gene expression, Nucleic Acids Research, 12, 539-549 (1984)
  6. Staden R., Measurements of the effects that coding for a protein has on a DNA sequence and their use in finding genes, Nucleic Acids Research, 12, 551-567 (1984)
  7. Borodovsky M. and McIninch J., GENMARK: Parallel gene recognition for both DNA strands, Computers and Chemistry, 17(2), 123-133 (1993)
  8. Krogh A., Mian I.S. and Haussler D., A hidden Markov model that finds genes in E. coli DNA, Nucleic Acids Research, 22, 4768-4778 (1994)
  9. Salzberg S.L., Delcher A.L., Kasif S. and White O., Microbial gene identification using interpolated Markov models, Nucleic AcidsResearch, 26(2), 544-548 (1998)
  10. Lukashin A.V. and Borodovsky M., GeneMark.hmm: new solutions for gene finding, Nucleic Acids Research, 26(4), 1107-1115 (1998)
  11. Besemer J., Lomsadze A. and Borodovsky M., GeneMarkS: a selftraining method for prediction of gene starts in microbial genomes implications for finding sequence motifs in regulatory regions, Nucleic Acids Research, 29(12), 2607-2618 (2001)
  12. Besemer J. and Borodovsky M., Heuristic approach to deriving models for gene finding, Nucleic Acids Research, 27(19), 3911-3920 (1999)
  13. ORF investigator Program can be downloaded from website.