{"id":601,"date":"2022-08-09T15:48:17","date_gmt":"2022-08-09T15:48:17","guid":{"rendered":"https:\/\/terrabioappdev.wpenginepowered.com\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/"},"modified":"2023-12-27T04:55:39","modified_gmt":"2023-12-27T04:55:39","slug":"harness-machine-learning-to-clean-up-your-scrna-seq-data","status":"publish","type":"post","link":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/","title":{"rendered":"Harness machine learning to clean up your scRNA-seq data"},"content":{"rendered":"<p><i><span style=\"font-weight: 400;\">Stephen Fleming is a Machine Learning Scientist in the Methods group of the Data Sciences Platform at the Broad Institute. As part of Mehrtash Babadi\u2019s team, he works to develop analysis tools for single-cell RNA sequencing (scRNA-seq) data, and he analyzes scRNA-seq datasets in Patrick Ellinor\u2019s group as part of the Precision Cardiology Lab. In this guest blog post, Stephen introduces CellBender, a software package for eliminating technical artifacts from high-throughput scRNA-seq and other multi-omics data. The Cellbender manuscript preprint is available <\/span><\/i><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/791699v2\"><i><span style=\"font-weight: 400;\">on biorxiv<\/span><\/i><\/a><i><span style=\"font-weight: 400;\">.<\/span><\/i><\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">Microfluidics-based technologies for probing the contents of individual cells have been revolutionizing how we approach cell biology. In less than a decade, single-cell RNA sequencing has gone from being a cool new concept with a lot of potential, to a mainstream technique that has already enabled important discoveries (e.g. <\/span><a href=\"https:\/\/www.nature.com\/articles\/s41586-018-0393-7\"><span style=\"font-weight: 400;\">Montoro <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2018<\/span><\/a><span style=\"font-weight: 400;\">; <\/span><a href=\"https:\/\/www.nature.com\/articles\/s41586-021-03570-8\"><span style=\"font-weight: 400;\">Delorey <\/span><i><span style=\"font-weight: 400;\">et al<\/span><\/i><span style=\"font-weight: 400;\">., 2021<\/span><\/a><span style=\"font-weight: 400;\">), and is now inspiring spin-offs like single-cell proteomics and spatial transcriptomics.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Yet like any data generation technology that has come before, the process of turning biological material into digital signal retains some inherent messiness. On the wetlab side of things, even the most stringent protocol cannot completely eliminate enzymatic side-processes that produce spurious library fragments, contamination by exogenous or endogenous ambient transcripts, potential impurity of barcode beads, and barcode swapping during amplification and\/or sequencing.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img fetchpriority=\"high\" decoding=\"async\" class=\"alignnone size-full wp-image-1401\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/phenomenology.png\" alt=\"\" width=\"501\" height=\"192\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Cell dissociation and nuclei extraction lead to the presence of cell-free RNA in solution. (b) Schematic diagram of the proposed source of ambient RNA background counts. Cell-free \u201cambient\u201d RNAs (black lines) and other cellular debris are present in the cell-containing solution, and these RNAs are packaged up into the same droplet as a cell (red), or into an otherwise empty droplet that contains only a barcoded capture oligo bead (green hexagon).<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">In the generated data, these issues show up as systematic background noise. For example, a highly-expressed marker gene might look as though it is expressed in all cell types at a low level, rather than being specific to a certain cell type. This sort of background noise\u00a0 <\/span><span style=\"font-weight: 400;\">masks the true cell-type-specificity of gene expression \u2014and manifests incorrectly as spurious differential expression signals\u2014 in downstream analyses.<\/span><span style=\"font-weight: 400;\"> Systematic background noise is dataset-specific, and can cause a batch effect that hinders dataset integration and atlasing efforts (see Supplementary Fig. S1 in <\/span><a href=\"https:\/\/www.science.org\/doi\/10.1126\/science.abl4290\"><span style=\"font-weight: 400;\">Eraslan et al., 2022<\/span><\/a><span style=\"font-weight: 400;\">).<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3><span style=\"font-weight: 400;\">Identifying and removing background noise with machine learning<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">To address this pervasive problem, we first investigated the phenomenology of background RNA counts in a variety of scRNA-seq and snRNA-seq experiments, then developed a modeling approach that uses machine learning to identify and subtract those background counts, resulting in a cleaner and more reliable dataset. <\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-1402\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/pcl_hcm_dcm_atlas.png\" alt=\"\" width=\"607\" height=\"314\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Removal of background RNA from a published human heart snRNA-seq atlas from <\/span><\/i><a href=\"https:\/\/www.nature.com\/articles\/s41586-022-04817-8\"><span style=\"font-weight: 400;\">Chaffin <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2022<\/span><\/a><i><span style=\"font-weight: 400;\">. a) Dotplot showing several highly-expressed genes in the raw dataset; b) Same dotplot after noise removal with CellBender.<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">We implemented this approach in Python, using the popular <a href=\"https:\/\/pytorch.org\/\">Pytorch<\/a> framework and the probabilistic programming language <a href=\"https:\/\/pyro.ai\/\">Pyro<\/a>. Our software package, called Cellbender, is fully open-source and available <a href=\"https:\/\/github.com\/broadinstitute\/CellBender\">in Github<\/a>.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When we initially reported our results in a <\/span><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/791699v1.full\"><span style=\"font-weight: 400;\">2019 preprint<\/span><\/a><span style=\"font-weight: 400;\">, CellBender was the only truly unsupervised option for background noise removal in single-cell datasets. As a bonus, in addition to producing a clean dataset, CellBender computes several other quantities of interest, including the probability that each droplet in the experiment contains a cell (since lots of droplets in these experiments are empty, and it can be hard to tell). We found that our approach outperformed previously existing tools for cell-calling, including CellRanger and EmptyDrops.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Since the 2019 preprint, CellBender has been applied to a range of real world studies by our collaborators as well as independent groups. Some notable works include studies of the human heart in health and disease (<\/span><a href=\"https:\/\/www.ahajournals.org\/doi\/10.1161\/CIRCULATIONAHA.119.045401\"><span style=\"font-weight: 400;\">Tucker <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2020<\/span><\/a><span style=\"font-weight: 400;\">; <\/span><a href=\"https:\/\/www.nature.com\/articles\/s41586-022-04817-8\"><span style=\"font-weight: 400;\">Chaffin <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2022<\/span><\/a><span style=\"font-weight: 400;\">; ), the human intestine (<\/span><a href=\"https:\/\/www.cell.com\/cell-stem-cell\/fulltext\/S1934-5909(20)30546-4\"><span style=\"font-weight: 400;\">Holloway<\/span><i><span style=\"font-weight: 400;\"> et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2021<\/span><\/a><span style=\"font-weight: 400;\">), human and mouse adipocytes (<\/span><a href=\"https:\/\/www.nature.com\/articles\/s41586-020-2856-x\"><span style=\"font-weight: 400;\">Sun <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2020<\/span><\/a><span style=\"font-weight: 400;\">; <\/span><a href=\"https:\/\/www.nature.com\/articles\/s42255-021-00509-1\"><span style=\"font-weight: 400;\">Dong <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2022<\/span><\/a><span style=\"font-weight: 400;\">), human tissues infected by SARS-CoV-2 (<\/span><a href=\"https:\/\/www.nature.com\/articles\/s41421-020-00225-2\"><span style=\"font-weight: 400;\">Xu <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2020<\/span><\/a><span style=\"font-weight: 400;\">; <\/span><a href=\"https:\/\/www.nature.com\/articles\/s41586-021-03570-8\"><span style=\"font-weight: 400;\">Delorey <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2021<\/span><\/a><span style=\"font-weight: 400;\">; <\/span><a href=\"https:\/\/www.cell.com\/cell\/fulltext\/S0092-8674(21)00882-5\"><span style=\"font-weight: 400;\">Ziegler <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2021<\/span><\/a><span style=\"font-weight: 400;\">; <\/span><a href=\"https:\/\/www.nature.com\/articles\/s41586-021-03569-1\"><span style=\"font-weight: 400;\">Melms <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2021<\/span><\/a><span style=\"font-weight: 400;\">), and a large human cross-tissue atlas (<\/span><a href=\"https:\/\/www.science.org\/doi\/10.1126\/science.abl4290\"><span style=\"font-weight: 400;\">Eraslan <\/span><i><span style=\"font-weight: 400;\">et al.<\/span><\/i><span style=\"font-weight: 400;\">, 2022<\/span><\/a><span style=\"font-weight: 400;\">).\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">All this road-testing has led to further improvements and a <\/span><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/791699v2\"><span style=\"font-weight: 400;\">new updated preprint<\/span><\/a><span style=\"font-weight: 400;\"> that we are submitting for publication. In the new preprint, we additionally demonstrate that our approach outperforms a 2020 method for noise removal called <\/span><a href=\"https:\/\/genomebiology.biomedcentral.com\/articles\/10.1186\/s13059-020-1950-6\"><span style=\"font-weight: 400;\">DecontX<\/span><\/a><span style=\"font-weight: 400;\">, and can be applied to other omics data types, such as the antibody capture features that are measured together with RNA in a CITE-seq experiment.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img decoding=\"async\" class=\"alignnone size-full wp-image-1403\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/citeseq.png\" alt=\"\" width=\"524\" height=\"378\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Removal of background counts from a mutimodal CITE-seq dataset, showing gene expression in blue and antibody capture in red. (a) Raw data from a <\/span><\/i><a href=\"https:\/\/www.10xgenomics.com\/resources\/datasets\/5-k-peripheral-blood-mononuclear-cells-pbm-cs-from-a-healthy-donor-with-cell-surface-proteins-next-gem-3-1-standard-3-1-0\"><i><span style=\"font-weight: 400;\">public 10x genomics PBMC dataset<\/span><\/i><\/a><i><span style=\"font-weight: 400;\"> shows that most antibodies are measured in all cell types. (b) After CellBender, antibody counts become much more cell-type specific.<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<h3><span style=\"font-weight: 400;\">Get started with CellBender today<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">The Cellbender background removal tool is designed as a drop-in pre-processing step that can be run as a standalone operation or integrated into an existing workflow.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For convenience, we provide a <\/span><a href=\"https:\/\/github.com\/broadinstitute\/CellBender\/blob\/master\/wdl\/cellbender_remove_background.wdl\"><span style=\"font-weight: 400;\">WDL workflow<\/span><\/a><span style=\"font-weight: 400;\"> that is preconfigured to run the background removal tool with GPU acceleration (Tesla K80) on a virtual machine in Google Cloud. We also created a <\/span><a href=\"https:\/\/app.terra.bio\/#workspaces\/help-terra\/CellBender\"><span style=\"font-weight: 400;\">public Terra workspace<\/span><\/a><span style=\"font-weight: 400;\"> showcasing the workflow along with some test data, so you can view configuration details, browse the tool&#8217;s outputs, and even try it out yourself without having to install anything on your own computer.\u00a0<\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">If you&#8217;re not familiar with running workflows in Terra, check out the <\/span><\/i><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/articles\/360043454592-Workflows-Quickstart-Part-1-Run-pre-configured-workflow\"><i><span style=\"font-weight: 400;\">Workflows Quickstart<\/span><\/i><\/a><i><span style=\"font-weight: 400;\"> and its accompanying <\/span><\/i><a href=\"https:\/\/youtu.be\/rUBrJNqLyfU\"><i><span style=\"font-weight: 400;\">video<\/span><\/i><\/a><i><span style=\"font-weight: 400;\">.<\/span><\/i><\/p>\n<p><span style=\"font-weight: 400;\">Among its outputs, the CellBender background removal tool produces an HTML report that includes quality metrics, as well as automatically generated commentary that is intended to guide the interpretation of those metrics. You can see an example of this in the public workspace as described below.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-1405\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_data_tab-1024x210.png\" alt=\"\" width=\"800\" height=\"164\" \/><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">To view the report for a CellBender background removal run, navigate to the workspace&#8217;s <\/span><a href=\"https:\/\/app.terra.bio\/#workspaces\/help-terra\/CellBender\/data\"><span style=\"font-weight: 400;\">DATA tab<\/span><\/a><span style=\"font-weight: 400;\">, click on &#8220;sample_set&#8221; in the left-hand TABLES menu, and then scroll the table view horizontally until you see the &#8220;html_report_array&#8221;. Click on one of the filenames in that column and hit the download button; this should open the html file in your browser. Scroll through the report to view metrics, plots and accompanying commentary.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-1404\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender-report-1024x481.png\" alt=\"\" width=\"800\" height=\"376\" \/><\/p>\n<p><em>One of several sections in the Cellbender background removal quality report, consisting of a plot, summary statistics and a statement to guide interpretation.<\/em><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">We encourage you to try this out for yourself and let us know how it goes. If you experience any issues getting started with Terra, please reach out to Terra helpdesk. If you run into any problems with CellBender itself, feel free to open an <\/span><a href=\"https:\/\/github.com\/broadinstitute\/CellBender\/issues\"><span style=\"font-weight: 400;\">issue in the CellBender repository on Github<\/span><\/a><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<h4><\/h4>\n<h4><span style=\"color: #008000;\"><b>Resources<\/b><\/span><\/h4>\n<p><a href=\"https:\/\/github.com\/broadinstitute\/CellBender\"><span style=\"font-weight: 400;\">CellBender repository on Github<\/span><\/a><\/p>\n<p><a href=\"https:\/\/github.com\/broadinstitute\/CellBender\/tree\/master\/wdl\"><span style=\"font-weight: 400;\">WDL workflow<\/span><\/a><\/p>\n<p><a href=\"https:\/\/app.terra.bio\/#workspaces\/help-terra\/CellBender\"><span style=\"font-weight: 400;\">Public Terra workspace<\/span><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Guest author Stephen Fleming introduces CellBender, a software package for eliminating technical artifacts from high-throughput scRNA-seq and other multi-omics data.<\/p>\n","protected":false},"author":42,"featured_media":607,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[13,156,119,66,59,14],"tags":[],"class_list":["post-601","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-guest-author","category-machine-learning","category-most-recent","category-single-cell-transcriptomics","category-single-cell","category-workspaces"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.0 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Harness machine learning to clean up your scRNA-seq data - Terra<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Harness machine learning to clean up your scRNA-seq data - Terra\" \/>\n<meta property=\"og:description\" content=\"Guest author Stephen Fleming introduces CellBender, a software package for eliminating technical artifacts from high-throughput scRNA-seq and other multi-omics data.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\" \/>\n<meta property=\"og:site_name\" content=\"Terra\" \/>\n<meta property=\"article:published_time\" content=\"2022-08-09T15:48:17+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-12-27T04:55:39+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"627\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Stephen Fleming\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Stephen Fleming\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\"},\"author\":{\"name\":\"Stephen Fleming\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/64ad39ad60c72c2b8b70c94698b2128a\"},\"headline\":\"Harness machine learning to clean up your scRNA-seq data\",\"datePublished\":\"2022-08-09T15:48:17+00:00\",\"dateModified\":\"2023-12-27T04:55:39+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\"},\"wordCount\":1120,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/terra.bio\/#organization\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png\",\"articleSection\":[\"Guest Author\",\"Machine Learning\",\"Most Recent\",\"Single Cell Transcriptomics\",\"Single-Cell\",\"Workspaces\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\",\"url\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\",\"name\":\"Harness machine learning to clean up your scRNA-seq data - Terra\",\"isPartOf\":{\"@id\":\"https:\/\/terra.bio\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png\",\"datePublished\":\"2022-08-09T15:48:17+00:00\",\"dateModified\":\"2023-12-27T04:55:39+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage\",\"url\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png\",\"contentUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png\",\"width\":1200,\"height\":627,\"caption\":\"cellbender_OG\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/terra.bio\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Harness machine learning to clean up your scRNA-seq data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/terra.bio\/#website\",\"url\":\"https:\/\/terra.bio\/\",\"name\":\"Terra\",\"description\":\"Science at Scale\",\"publisher\":{\"@id\":\"https:\/\/terra.bio\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/terra.bio\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/terra.bio\/#organization\",\"name\":\"Terra\",\"url\":\"https:\/\/terra.bio\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp\",\"contentUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp\",\"width\":287,\"height\":318,\"caption\":\"Terra\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/64ad39ad60c72c2b8b70c94698b2128a\",\"name\":\"Stephen Fleming\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/be388cc5d4fcc232c97ffbfe41d6a13a3cfb78ed62317c8a46221a4c2a9ec7c6?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/be388cc5d4fcc232c97ffbfe41d6a13a3cfb78ed62317c8a46221a4c2a9ec7c6?s=96&d=mm&r=g\",\"caption\":\"Stephen Fleming\"},\"url\":\"https:\/\/terra.bio\/author\/sfleming\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Harness machine learning to clean up your scRNA-seq data - Terra","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/","og_locale":"en_US","og_type":"article","og_title":"Harness machine learning to clean up your scRNA-seq data - Terra","og_description":"Guest author Stephen Fleming introduces CellBender, a software package for eliminating technical artifacts from high-throughput scRNA-seq and other multi-omics data.","og_url":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/","og_site_name":"Terra","article_published_time":"2022-08-09T15:48:17+00:00","article_modified_time":"2023-12-27T04:55:39+00:00","og_image":[{"width":1200,"height":627,"url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png","type":"image\/png"}],"author":"Stephen Fleming","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Stephen Fleming","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#article","isPartOf":{"@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/"},"author":{"name":"Stephen Fleming","@id":"https:\/\/terra.bio\/#\/schema\/person\/64ad39ad60c72c2b8b70c94698b2128a"},"headline":"Harness machine learning to clean up your scRNA-seq data","datePublished":"2022-08-09T15:48:17+00:00","dateModified":"2023-12-27T04:55:39+00:00","mainEntityOfPage":{"@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/"},"wordCount":1120,"commentCount":0,"publisher":{"@id":"https:\/\/terra.bio\/#organization"},"image":{"@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage"},"thumbnailUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png","articleSection":["Guest Author","Machine Learning","Most Recent","Single Cell Transcriptomics","Single-Cell","Workspaces"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/","url":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/","name":"Harness machine learning to clean up your scRNA-seq data - Terra","isPartOf":{"@id":"https:\/\/terra.bio\/#website"},"primaryImageOfPage":{"@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage"},"image":{"@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage"},"thumbnailUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png","datePublished":"2022-08-09T15:48:17+00:00","dateModified":"2023-12-27T04:55:39+00:00","breadcrumb":{"@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#primaryimage","url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png","contentUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/cellbender_OG.png","width":1200,"height":627,"caption":"cellbender_OG"},{"@type":"BreadcrumbList","@id":"https:\/\/terra.bio\/harness-machine-learning-to-clean-up-your-scrna-seq-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/terra.bio\/"},{"@type":"ListItem","position":2,"name":"Harness machine learning to clean up your scRNA-seq data"}]},{"@type":"WebSite","@id":"https:\/\/terra.bio\/#website","url":"https:\/\/terra.bio\/","name":"Terra","description":"Science at Scale","publisher":{"@id":"https:\/\/terra.bio\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/terra.bio\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/terra.bio\/#organization","name":"Terra","url":"https:\/\/terra.bio\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/#\/schema\/logo\/image\/","url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp","contentUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp","width":287,"height":318,"caption":"Terra"},"image":{"@id":"https:\/\/terra.bio\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/terra.bio\/#\/schema\/person\/64ad39ad60c72c2b8b70c94698b2128a","name":"Stephen Fleming","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/be388cc5d4fcc232c97ffbfe41d6a13a3cfb78ed62317c8a46221a4c2a9ec7c6?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/be388cc5d4fcc232c97ffbfe41d6a13a3cfb78ed62317c8a46221a4c2a9ec7c6?s=96&d=mm&r=g","caption":"Stephen Fleming"},"url":"https:\/\/terra.bio\/author\/sfleming\/"}]}},"_links":{"self":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts\/601","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/users\/42"}],"replies":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/comments?post=601"}],"version-history":[{"count":0,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts\/601\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/media\/607"}],"wp:attachment":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/media?parent=601"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/categories?post=601"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/tags?post=601"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}