NCGAS makes robust transcriptome analysis easier with a readily usable workflow following de novo assembly best practices

Plant and Animal Genomics 2018


The National Center for Genome Analysis Support (NCGAS) assists research groups with de novo transcriptome assembly. Best practices for such analyses include pooling samples, running multiple assemblers with multiple parameter settings, combining the resulting assemblies, and filtering out redundant or erroneously assembled transcripts. These combined de novo transcriptome assemblies can place a technical burden on genomic researchers who may not be fully trained in the efficient use of HPC clusters. NCGAS has created a workflow template that moves client data through 19 parallelized assemblies using four software packages (Trinity, SOAP-denovo, transABySS, and VelvetOases) and multiple k-mer sizes. The transcripts are then combined and filtered using EviGenes to output putative transcripts and alternative forms in a replicable manner. The process is semi-automated but flexible enough to allow researchers to adjust parameters if they wish. Although the workflow was designed for IU machines and XSEDE’s Bridges, allocations on these machines are available to any genomics researcher in the US, and the job scripts can be easily adjusted for other job handlers and clusters. This workflow lowers the bar for entry into robust transcriptome assembly that follows best practices, while also providing a replicable means of filtering large numbers of transcripts into a draft transcriptome. Scripts can be found at
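
The assembly grid described in the abstract can be pictured with a minimal Python sketch. It only enumerates assembler and k-mer combinations and tags transcript IDs with their origin before pooling; it is not the NCGAS job scripts. The k-mer values and the way the 19 runs are split across the four packages are assumptions chosen for illustration (the abstract gives only the total and the package names), and the function names are hypothetical.

```python
# Hypothetical k-mer sizes per assembler. The abstract states only that
# 19 assemblies are run across four packages; this split is an assumption.
KMERS_BY_ASSEMBLER = {
    "Trinity": [25],
    "SOAP-denovo": [35, 45, 55, 65, 75, 85],
    "transABySS": [35, 45, 55, 65, 75, 85],
    "VelvetOases": [35, 45, 55, 65, 75, 85],
}


def plan_assemblies(kmers_by_assembler):
    """Return one (assembler, k) job per assembler/k-mer combination."""
    return [
        (assembler, k)
        for assembler, kmers in kmers_by_assembler.items()
        for k in kmers
    ]


def tag_transcripts(assembler, k, transcript_ids):
    """Prefix transcript IDs so their origin survives pooling."""
    return [f"{assembler}_k{k}_{tid}" for tid in transcript_ids]


if __name__ == "__main__":
    jobs = plan_assemblies(KMERS_BY_ASSEMBLER)
    for assembler, k in jobs:
        # Each pair would correspond to one independent cluster job.
        print(f"{assembler}\tk={k}")
    print(f"total assemblies: {len(jobs)}")
```

Because each pair is independent, the runs can be submitted to a scheduler in parallel, and the tagged IDs keep transcripts from different runs distinguishable once they are combined for filtering.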



NCGAS, Transcriptomics


Sanders, S., C. Ganote, B. Papudeshi, K. Mockaitis, and T. Doak. (2018). NCGAS makes robust transcriptome analysis easier with a readily usable workflow following de novo assembly best practices. Plant and Animal Genomics 2018, San Diego, CA. Retrieved from


Except where otherwise noted, the contents of this presentation are copyright of the Trustees of Indiana University. This license includes the following terms: You are free to share (to copy, distribute and transmit the work) and to remix (to adapt the work) under the following conditions: Attribution: you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.