DroSpeGe About Arthropods BLAST BioMart Maps Data News

EvidentialGene : Evidence Directed Gene Prediction for Eukaryotes

      Name                    Last modified       Description

[DIR] Parent Directory 04-Sep-2012 15:52 [DIR] about/ 30-Apr-2013 01:22 [DIR] arthropods/ 11-Apr-2013 22:01 [DIR] cacao/ 22-Jun-2012 13:35 [DIR] daphnia/ 06-Sep-2011 15:01 [DIR] evigene/ 11-Apr-2013 11:37 [DIR] killifish/ 30-Jan-2013 19:52 [DIR] nasonia/ 03-Apr-2013 13:28 [DIR] pea_aphid2/ 29-Jan-2012 19:30 [DIR] plants/ 15-Apr-2013 16:32 [DIR] vertebrates/ 11-Apr-2013 21:50

 
2012 Poster : Perfect Genes Constructed with Gigabases of RNA

EvidentialGene mRNA Transcript Assembly Software 2013 April

EvidentialGene Animal & Plant Genes Quality Comparison
evigenequal1

EvidentialGene :
Evidence Directed Gene Prediction for Eukaryotes

  Don Gilbert, gilbertd at indiana edu, 2010 .. 2013
  http://arthropods.eugenes.org/EvidentialGene/

Annotation summary

Informant evidence for gene models is transcript and protein
data.  Transcripts from long (e.g. dbEST) and short reads (e.g.
Illumina), are mapped to the genome assembly with GMAP (long) 
or GSNAP (short).

EST assemblies are constructed using PASA, from all transcript
data.  RNA-seq assemblies are first constructed from aligned short
reads with Cufflinks, and combined with EST reads for a full 
assembly.

Protein genes from related sequenced genomes are BLASTX aligned to
repeat/transposon-soft-masked genome, then refined with Exonerate to 
protein gene models.

Gene models are predicted with evidence-directed AUGUSTUS
predictor. Augustus is trained for gene parameters
using full cDNA genes derirved from the PASA EST assemblies.

Several prediction sets from different evidence sets and
parameters are combined, selecting highest evidence scored model
per locus, and avoiding artifactual gene splits and joins.  
This consensus gene set has the best match to EST and protein 
evidence.

Predicted genes are UTR-extended and/or improved by PASA. Genes
are annotated with Uniprot descriptions, and classified by
evidence scores including transposable elements.

Software references:
  AUGUSTUS: http://augustus.gobics.de/
  GMAP-GSNAP: http://research-pub.gene.com/gmap/
  PASA: http://pasa.sourceforge.net/
  Cufflinks: http://cufflinks.cbcb.umd.edu/
  exonerate: http://www.ebi.ac.uk/~guy/exonerate/
  NCBI BLAST:  blast.ncbi.nlm.nih.gov/


Developed at the Genome Informatics Lab of Indiana University Biology Department