Upcoming Events

July 19-23, 2013: 21st Annual International Conference on Intelligent Systems for Molecular Biology (ISMB).

April 7-10, 2013: 17th Annual International Conference on Research in Computational Molecular Biology (RECOMB).

April 15-19, 2013: IEEE Symposium on Computational Intelligence in Bioinformatics & Computational Biology.

March 4-6, 2013: 5th international conference on Bioinformatics & Computational Biology (BICoB).

More events....


Oct 18, 2012: A new Postdoctoral Fellow, Dr. Sitanshu Sekhar Sahu has joined our lab.

Aug 13-17, 2012: A comprehensive 1-week Bioinformatics Workshop was organized on campus; co-organized by OSU's iCREST center. Visit facebook page for details.

Apr 23, 2012: Co-hosted Dr. James Tiedje (Director, NSF Center for Microbial Ecology, Michigan State University) as an invited iCREST speaker; see flyer for details.

Apr 13, 2012: World renowned Computational Biologist, Dr. Eugene Koonin (NCBI) visited our lab, and delivered an invited lecture on campus as part of iCREST speaker series; see flyer for details. Video on YouTube.

Mar 16, 2012: We welcome Dr. Chris Town (Group leader, Plant Genomics, JCVI) as an invited iCREST speaker; see flyer for details.

Feb 14, 2012: KBL receives new grant from OCAST to develop bioinformatics systems for plant-microbe interaction networks; immediate Postdoc opening available.

Oct 21, 2011: We welcome Dr. Patrick X. Zhao (Head, Bioinformatics Lab, Noble Foundation) as an invited iCREST speaker; see flyer for details.

Sep 17, 2011: Tyler Weirick joins our lab (under iCREST) as a Graduate Research Assistant.

Aug 17, 2011: Robyn Kelley, a new master's student joins our lab as a Graduate Research Assistant.

July 21, 2011: KBL receives OSU funding to establish an iCREST center for Bioinformatics and Computational Biology.

June 08, 2011: KBL welcomes its first student, Kalpana Varala to work as a summer scholar in lab.


Research Projects

Agricultural Biosecurity Metagenomics Host-microbe Interactions Genome Annotation

Being an interface between the modern biology and the information technology, the discipline of Bioinformatics & Computational Biology involves discovery, development and implementation of computational algorithms and software tools that facilitate an understanding of the biological processes with the goal to serve primarily agriculture and healthcare sectors with several spin-offs. Our lab is applying statistical pattern recognition, artificial intelligence and machine learning technologies to develop novel computational tools and algorithms, primarily in the following areas of research interest:


Criminal investigations of bioterrorism attacks, tracking of disease outbreaks, and medical intelligence operations require bioinformatics applications that are mission oriented and task specific. The KBL is involved in developing novel computational tools & algorithms for pathogen detection and discrimination, identification of species-specific signatures, and using artificial intelligence to predict biosecurity threats. For example, discriminating pathogen genotypes in a fundamentally different way from distance-based and BLAST algorithms and instead, using the Neural Networks, Support Vector Machine or Decision Tree classifiers to build patterns from genome regions (e.g. DNA barcodes) that are under selective pressure; and ultimately incorporating them into a database(s) / visualization tool(s) for community use.

In addition, we are also engaged in modeling of high-consequence pathogens based on epidemiological factors, e.g. develop tools for disease prediction / forecasting based on significant weather parameters and/or other correlated features.


Organizing metagenomic sequence data (Huge!):-

  • Clustering: a large dataset clustered into distinct subsets based on some specific measures (e.g. clustering of proteins/genes to get families, binning into taxon).

  • Binning: a clustering method that uses DNA composition (e.g. GC content), codon usage, assembly, phylogenetic analysis, database search etc. to cluster reads into specific genomes or groups of genomes.

  • Gene prediction: genes coding for proteins, RNA and other regulatory elements.

  • Gene annotation: in metagenomics, a substantial percentage of sequences cannot be easily classified; new algorithms development is needed.


The interplay between plants and their microbial co-habitants is regulated by extensive chemical signaling. Most of what we know about these complex community interactions has been derived through study of organisms in pure culture, but it is well known that the vast majority of microbes have not been cultivated. While high-throughput DNA sequence analysis will be an important tool for these studies, the immense richness and diversity of such communities present a strong mandate for the use of functional metagenomics strategies that involve a broad variety of screening methodologies to discover and study the currently unknown key biological processes. Thus, developing computational methods that identify which interactions enable successful host-microbe associations have great implications, especially in metagenomics.


Annotation of sequenced genomes is an important process of attaching biological information to the sequences. Apart from using the conventional similarity-based approaches for the annotation of genomes, we are using Artificial Intelligence techniques (Neural Networks, Support Vector Machine, Decision Tree etc.) to develop various machine learned algorithms for faster and accurate structural / functional annotations. The hybrid approach (combining homology and machine learning techniques) has shown relatively better accuracy over the individual approaches and we are actively developing such hybrid methodologies for the genome annotation process. One of such usages has been successfully demonstrated in Subcellular Localization prediction where a range of prediction tools have been developed.