Anton Kovalsky, Author at Terra https://terra.bio/author/akovalsk/ Science at Scale Wed, 27 Dec 2023 04:55:52 +0000 en-US hourly 1 https://wordpress.org/?v=6.9.1 https://terra.bio/wp-content/uploads/2023/12/Terra-Color-logo-300-150x150.pngAnton Kovalsky, Author at Terrahttps://terra.bio/author/akovalsk/ 32 32 Terra 101 workshop: Learn how to use Terra quickly and efficientlyhttps://terra.bio/terra-101-workshop-learn-how-to-use-terra-quickly-and-efficiently/ https://terra.bio/terra-101-workshop-learn-how-to-use-terra-quickly-and-efficiently/#respond Thu, 05 Jan 2023 19:27:58 +0000 https://terrabioappdev.wpenginepowered.com/terra-101-workshop-learn-how-to-use-terra-quickly-and-efficiently/Register for a free hands-on virtual workshop to get started with Terra and level up your skills.

The post Terra 101 workshop: Learn how to use Terra quickly and efficiently appeared first on Terra.

]]>
What better way to kick off a new year than to learn some new skills that deliver immediate benefits?

We’re offering a hands-on virtual workshop for those who want to learn how to use Terra quickly and efficiently. It’s open to everyone and is completely free (we’ll take care of the compute costs for the exercises) so make sure to register today

 

Specifically, you’ll learn about:

  • Terra architecture as it relates to cloud-based data sets, tools, and computing resources
  • How data is organized in Terra
  • How to run a WDL workflow to automate analysis
  • How to launch a Cloud Environment from your Terra workspace to run interactive analysis tools like Jupyter Notebooks, RStudio, and Galaxy

 

Save the date(s)!

The workshop will consist of two two-hour sessions scheduled on consecutive days:

January 31st, 2023 at 10:00am – 12:00pm ET
February 1st, 2023 at 10:00am – 12:00pm ET

Be sure to register today so you don’t miss this opportunity.

We look forward to seeing you! If you have any questions, please don’t hesitate to contact us at dsp-workshops@broadinstitute.org.

 

 

The post Terra 101 workshop: Learn how to use Terra quickly and efficiently appeared first on Terra.

]]>
https://terra.bio/terra-101-workshop-learn-how-to-use-terra-quickly-and-efficiently/feed/ 0
COVID-19 Genomic Surveillance in the Boston Area – Powered by Terrahttps://terra.bio/covid-19-genomic-surveillance-in-the-boston-area-powered-by-terra/ https://terra.bio/covid-19-genomic-surveillance-in-the-boston-area-powered-by-terra/#respond Thu, 04 Jun 2020 16:16:14 +0000 https://terrabioappdev.wpenginepowered.com/covid-19-genomic-surveillance-in-the-boston-area-powered-by-terra/Anton Kovalsky As the global battle against COVID-19 continues, researchers in the viral genomics group at the Broad Institute have been hard at work using viral sequencing and genomic epidemiology to understand the spread of SARS-CoV-2 close to home, yielding new insights into the arrival and spread of the epidemic in the greater Boston area. [...]

The post COVID-19 Genomic Surveillance in the Boston Area – Powered by Terra appeared first on Terra.

]]>
As the global battle against COVID-19 continues, researchers in the viral genomics group at the Broad Institute have been hard at work using viral sequencing and genomic epidemiology to understand the spread of SARS-CoV-2 close to home, yielding new insights into the arrival and spread of the epidemic in the greater Boston area.

The Broad’s Data Sciences Platform has been supporting this work and hopes to enable anyone around the world to do this as well, by harnessing the power of Terra as a community data access and analysis platform for pathogen genomic surveillance. Using the Broad team’s SARS-CoV-2 genomic data as a pilot, this initiative will provide the data management, analysis tools, and compute needed to help make genomic data for public health epidemiology more accessible, interoperable, and user friendly.

In their first data release, the Broad research team focused on the early phase of the local epidemic in the Boston area, from early March to early April, revealing how the virus was introduced from multiple sources, both domestically and abroad. They also investigated two “superspreading” events, one an international conference, and the other within a congregate living facility. A working draft version of the manuscript describing the details of this data and analysis can be found on Virological.org. The team has been working in close collaboration with their partners at Massachusetts General Hospital, and the Massachusetts Department of Public Health, sharing their findings in real-time.

The team is sharing their data and analytical tools openly in a newly featured COVID-19 Broad Viral NGS workspace in Terra, so other research and public health labs can explore and use these resources for their own work. The workspace is designed to enable researchers to go all the way from raw reads to phylogenetic trees using their own data, and to make batch data submission to GenBank easier.

The current dataset contains 331 assembled and high-quality SARS-CoV-2 viral genomes from human-depleted, metagenomic Illumina short read data, from samples taken from nasal swabs of confirmed COVID-19 patients tested at the Massachusetts General Hospital. It also contains 3065 publicly available SARS-CoV-2 genomes from the global dataset available on GenBank as of May 14th, 2020 to include in phylogenetic analyses of the primary dataset.

Along with the data, the workspace contains road-tested analysis workflows for viral genomic data that allow researchers to easily:

  • perform reference-based assembly of viral genomes from PCR amplified or metagenomic Illumina reads, called assemble_refbased
  • perform phylogenetic analysis and interactive visualization using the community standard Nextstrain.org toolset, called nextstrain_augur_viral_pipelines
  • simplifying viral genome submissions to NCBI Genbank via direct bulk submission, in a manner consistent with SARS-CoV-2 community standards, called genbank

generic_viralpipelinesdataflow.png

With this workspace, researchers are also able to recreate the phylogenetic trees shown in the Terra workspace dashboard and can run the workflows on their own data in Terra to create new phylogenetic trees and interactive visualizations. Both the Nextstrain visualization and the GenBank submission workflows can operate on genomes generated within Terra or imported from external assembly software.

Sabeti_Blog_2.png

All of these resources have also been distributed via public repositories—the genomic dataset has been released publicly on NCBI under BioProject PRJNA622837 and the featured, portable analysis workflows are published on the Dockstore tool repository service. The workflows have been validated to run on other commercial cloud platforms as well as local compute hardware. However, the viral genomics group has found from years of experience that Terra increases access to these tools to a much wider set of researchers, as it does not require traditional bioinformatic expertise.

Terra, a product of the Broad Institute in collaboration with Verily Life Sciences, leverages cloud resources to enable computation at a scale not possible in a lab with limited compute resources. Terra supports several flagship human genetic research initiatives such as NHGRI’s AnVIL, NHLBI’s BioData Catalyst, and NIH’s All Of Us. Terra is designed to make it easier to share and access large datasets, run analyses at scale, and work collaboratively across groups and organizations without having to worry about the underlying computational infrastructure or keeping track of large datasets. The team hopes providing data and tools on Terra can help to empower public health labs as they engage in viral sequencing work.

This initiative was sponsored in part by the National Institute of Allergy and Infectious Diseases and the Bill and Melinda Gates Foundation and is working in partnership with the Public Health Alliance for Genomic Epidemiology (PHA4GE) and the Virus Pathogen Resource (ViPR).

The post COVID-19 Genomic Surveillance in the Boston Area – Powered by Terra appeared first on Terra.

]]>
https://terra.bio/covid-19-genomic-surveillance-in-the-boston-area-powered-by-terra/feed/ 0
COVID-19 Integrated Analyses using Single Cell Datahttps://terra.bio/covid-19-integrated-analyses-using-single-cell-data/ https://terra.bio/covid-19-integrated-analyses-using-single-cell-data/#respond Tue, 28 Apr 2020 18:32:44 +0000 https://terrabioappdev.wpenginepowered.com/covid-19-integrated-analyses-using-single-cell-data/Recently, we’ve been writing here about the work of the Viral Genomics group at Broad and how they’re using Terra to support their workflows for genome assembly and phylogenomic analysis of novel coronavirus (SARS-CoV-2) genomes recovered from patients. Today, I’d like to share with you a project that focuses on the cells and [...]

The post COVID-19 Integrated Analyses using Single Cell Data appeared first on Terra.

]]>
Recently, we’ve been writing here about the work of the Viral Genomics group at Broad and how they’re using Terra to support their workflows for genome assembly and phylogenomic analysis of novel coronavirus (SARS-CoV-2) genomes recovered from patients.

Today, I’d like to share with you a project that focuses on the cells and organs in the human body that may be infected by the virus. With more information about the specific cell types that can be infected by the virus, researchers will be better able to generate hypotheses for transmission, pathogenesis, clinical associations, and therapeutics.

The Human Cell Atlas Lung Biological Network, including our colleagues at Broad’s Klarman Cell Observatory (KCO), recently shared a manuscript pre-print describing how they used millions of single-cell RNA-seq profiles from 25 tissues and organs to identify the cells likely to be infected with SARS-CoV-2 throughout the body, and the gene expression programs within those cells that could provide clues to disease mechanisms and therapies. The KCO conducted this work as part of a large international collaboration, including the Human Cell Atlas Lung Biological Network, the National Heart, Lung, and Blood Institute’s LungMap Consortium, and many more institutions.

Building upon previous research that identified specific genes that mediate SARS-CoV-2 infection, this work is important because it not only pinpoints specific types of cells that express potential mediators of infection, but also relates the level of expression of the viral entry genes in specific cell types to key factors that were associated with more severe COVID-19: age, sex, and smoking status.

The authors of the study wanted to ensure it was possible for others to build on their work by making all of the necessary data and code publicly available. They’ve published a Terra workspace containing a set of Jupyter notebooks that will allow researchers to reproduce many of the core analyses in their manuscript. The workspace’s Data section also contains metadata from a variety of organ and tissue samples that are necessary for the analysis (age, sex, or smoking status).

This will help researchers build upon the authors’ work by providing the clinical and biomedical research community with an efficient way to apply the consortium’s analysis to new data. Interested researchers can create their own copies of the workspace, upload their own data into their copy, and share their analysis with colleagues.

Additionally, our Single Cell Portal team has created a dedicated page to collect these new studies, with interactive visualizations of this data enabling gene query within cell types across many organs in the human body.

We hope these resources will be useful to those working on a solution to the current crisis. To get in touch with the authors of the study, please reach out to lung@humancellatlas.org

The post COVID-19 Integrated Analyses using Single Cell Data appeared first on Terra.

]]>
https://terra.bio/covid-19-integrated-analyses-using-single-cell-data/feed/ 0