National Center for Genome Analysis Support (NCGAS): Genomics and other Science in the NSF-Funded Jetstream Cloud

dc.contributor.authorDoak, Thomas
dc.contributor.authorSanders, Sheri
dc.contributor.authorGanote, Carrie
dc.contributor.authorPapudeshi, Bhavya
dc.contributor.authorFischer, Jeremy
dc.contributor.authorHancock, David Y.
dc.date.accessioned2020-03-20T20:32:53Z
dc.date.available2020-03-20T20:32:53Z
dc.date.issued2020-01-13
dc.description.abstractThe National Center for Genome Analysis Support (NCGAS) is an NSF-funded (NSF-1445604) center that helps all NSF-funded researchers doing genomics research. Genomics includes transcriptomics, metagenomics, genome annotation, etc. Our support includes providing access to large memory computing, maintaining curated sets of genomics applications, providing one-on-one consultation, and creating educational opportunities. A resource that we have come to rely on for providing these services is the NSF-funded Jetstream Cloud—maintained by Indiana University (led by the Indiana University Pervasive Technology Institute (PTI) and the University of Texas at Austin's Texas Advanced Computing Center (TACC). Additionally, we leverage Globus data transfer tools. Globus at the University of Chicago is responsible for integrating Jetstream with the NSF-funded Extreme Science and Engineering Discovery Environment (XSEDE), and for integrating Globus data movement and management tools, as well as Globus-based secure user authentication. With a focus on ease of use and broad accessibility, Jetstream is designed for those who have not previously used high performance computing and software resources—for researchers who need more than desktop-strength computing but less than full-scale High Performance Computing (HPC). Jetstream features a web-based user interface based on the popular Atmosphere cloud computing environment—developed by CyVerse—extended to support science and engineering research generally. The system is particularly geared toward 21st-century workforce development at small colleges and universities – especially historically black colleges and universities, minority serving institutions, tribal colleges, and higher education institutions in EPSCoR States. Jetstream provides a library of virtual machines designed to do discipline-specific scientific analysis, but researchers can also develop their own VMs, with their own software sets, or sets specialized to a particular task. These VMs can be both saved and shared with collaborators. Currently there are 19 genomics VMs, including RStudio instances with bioconductor, ready-made genome browsers with JBrowse/Tripal, and metagenomic tools like QIIME2 and Anvi’o. biology and molecular biology researchers are the largest users of Jetstream. NCGAS has found VMs extremely useful in education and workshops: we develop class-specific VMs, with all the applications needed, then clone, so that each student has their own VM to work on (making courses easy to scale). In addition to on-demand VMs, persistent science gateways can be established using template VMs NCGAS has built. These can be used to provide services to collaborators or to the world. Users can easily create Galaxy servers on Jetstream: each server comes preconfigured with hundreds of tools and commonly used reference datasets—once running, researchers can use it or customize it. Many NCGAS users establish genome browsers—specific to their organism—that are shared with small sets of collaborating researchers—but can be shared to the world. Jetstream is accessed via an allocation process at XSEDE—a startup allocation is typically approved within a day.en
dc.description.sponsorshipThis research is based upon work supported by the National Science Foundation under grant No. ABI-1759906 to Indiana University.en
dc.identifier.citationDoak TG, Sanders SA, Ganote C, Papudeshi B, Fischer J, Hancock DY. (2020). National Center for Genome Analysis Support (NCGAS): Genomics and other Science in the NSF-Funded Jetstream Cloud. Plant and Animal Genome 2020, San Diego, California. Available at http://hdl.handle.net/2022/25301.en
dc.identifier.urihttps://hdl.handle.net/2022/25301
dc.language.isoenen
dc.rightsExcept where otherwise noted, the contents of this presentation are copyright of the Trustees of Indiana University. This license includes the following terms: You are free to share -to copy, distribute and transmit the work and to remix -to adapt the work under the following conditions: attribution -you must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work). For any reuse or distribution, you must make clear to others the license terms of this work.en
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en
dc.subjectJetstreamen
dc.subjectNCGASen
dc.subjectCloud Computingen
dc.titleNational Center for Genome Analysis Support (NCGAS): Genomics and other Science in the NSF-Funded Jetstream Clouden
dc.typePresentationen

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
Jetstream-Outreach-PAG 2020-Oct2019-short.pdf
Size:
218.53 MB
Format:
Adobe Portable Document Format
Description:
PDF
No Thumbnail Available
Name:
Jetstream-Outreach-PAG 2020-Oct2019-short.pptx
Size:
232.66 MB
Format:
Microsoft Powerpoint XML
Description:
PPTX

Collections

Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.