{"id":458,"date":"2021-11-04T13:46:11","date_gmt":"2021-11-04T13:46:11","guid":{"rendered":"https:\/\/terrabioappdev.wpenginepowered.com\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/"},"modified":"2023-12-27T04:55:06","modified_gmt":"2023-12-27T04:55:06","slug":"introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield","status":"publish","type":"post","link":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/","title":{"rendered":"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield"},"content":{"rendered":"<p><i><span style=\"font-weight: 400;\">Dr. Kiran Garimella is the Associate Director for Genomic Medicine in the Data Sciences Platform at the Broad Institute. In this guest blog post, Kiran gives us an overview of MAS-ISO-seq: a new method for high-throughput long-read transcriptome sequencing, collaboratively developed with Dr. Aziz Al\u2019Khafaji (Hacohen\/Blainey labs); Jonn Smith (DSP), Dr. Mehrtash Babadi (DSP), and the help and support of many others at the Broad Institute.<\/span><\/i><\/p>\n<p><em><span style=\"font-weight: 400;\"><i data-stringify-type=\"italic\">You can see the data processing part of the MAS-ISO-seq method in action in the\u00a0<\/i><i data-stringify-type=\"italic\"><a class=\"c-link\" tabindex=\"-1\" href=\"https:\/\/app.terra.bio\/#workspaces\/broad-firecloud-dsde-methods\/MAS-seq%20-%20Data%20Segmentation%20and%20Alignment\" target=\"_blank\" rel=\"noopener noreferrer\" data-stringify-link=\"https:\/\/app.terra.bio\/#workspaces\/broad-firecloud-dsde-methods\/MAS-seq%20-%20Data%20Segmentation%20and%20Alignment\" data-sk=\"tooltip_parent\" data-remove-tab-index=\"true\">public Terra workspace<\/a><\/i><i data-stringify-type=\"italic\">\u00a0created by the development team, which features a workflow going from raw PacBio output to properly segmented and filtered reads ready for analysis, configured to run on a publicly accessible example dataset<\/i>.<\/span><\/em><\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">Short reads (~150-200 bp) are the standard brush for painting a picture of gene expression activity. But long read bristles are much finer, revealing more detail than possible with short reads alone. PacBio instruments are capable of generating long (~15,000-25,000 bp) and accurate (~Q30) reads via circular consensus sequencing (aka \u201cHiFi\u201d). As the average gene transcript isoform lengths are only ~1,500 bp, most full-length isoforms are easily capturable. Long reads enable identification of alternatively\/aberrantly spliced isoforms and gene fusions, comparison across samples and cell types, all without the need for <\/span><i><span style=\"font-weight: 400;\">de novo<\/span><\/i><span style=\"font-weight: 400;\"> assembly or other complex reconstruction methodologies.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The downside, however, is yield: long read output is comparatively low, limiting their scalability and applicability to interesting biological problems.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">On PacBio\u2019s long read sequencing platforms specifically, yield is dictated by the number of active nanoscopic wells on each flowcell. The current flowcell design has around 3 to 6 million active wells, which generates an equal number of reads regardless of their length: you only get one read out per well, whether the transcript is 1,500 bp or 25,000 bp long.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">So here&#8217;s the big question: if the instrument can sequence such long stretches of DNA, and cDNA sequences are much shorter than that typical length, is there some way to overcome the one-read-per-well limitation and get much more data from each flowcell?<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Yes! By applying a new within-read multiplexing approach to PacBio sequencing, we were able to develop a new protocol that increases the yield of full-length RNA isoform sequence data by 15x. The protocol is called\u00a0 <\/span><b>M<\/b><span style=\"font-weight: 400;\">ultiplexed <\/span><b>A<\/b><span style=\"font-weight: 400;\">rray<\/span><b>S<\/b> <b>iso<\/b><span style=\"font-weight: 400;\">form <\/span><b>seq<\/b><span style=\"font-weight: 400;\">uencing, or <\/span><b>MAS-ISO-seq<\/b><span style=\"font-weight: 400;\">, and is suitable for both single-cell and bulk sequencing applications.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">You can find the full details of how it works in <\/span><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2021.10.01.462818v1\"><span style=\"font-weight: 400;\">our preprint<\/span><\/a><span style=\"font-weight: 400;\"> in bioRxiv, or keep reading to get the highlights from this blog.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3>Using intramolecular multiplexing to maximize yield<\/h3>\n<p><span style=\"font-weight: 400;\">The central idea of this approach is to combine several cDNA sequences into a single molecule (which we term an \u201carray\u201d), sequence that, then segment the resulting long read into its constituent fragments for analysis.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">To do this, we start with a cDNA library (usually a single cell library that has been prepared with a 10x Genomics kit) and split it up among several PCR reactions. In each PCR reaction, we add a different barcode pair. <\/span><span style=\"font-weight: 400;\">What&#8217;s really neat is that the barcode pairs are designed to be complementary across parallel reactions, so when we process the ends to make them &#8220;sticky&#8221; and pool them all together, the barcoded cDNA fragments self-assemble into the array.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img fetchpriority=\"high\" decoding=\"async\" class=\"alignnone wp-image-1136\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/kiran-blog_fig_1-929x1024.png\" alt=\"\" width=\"600\" height=\"662\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Schematic of the MAS-ISO-seq intramolecular cDNA multiplexing workflow<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">The barcodes in the array have a predictable, programmatic order (A, B, C, D, E, \u2026 etc.), and the maximum number of cDNA sequences in the read is constrained by the number of parallel PCR reactions (we usually do 15, which works out to a 22,500 bp read if every transcript is 1,500 bp).<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3>A probabilistic approach to read segmentation<\/h3>\n<p><span style=\"font-weight: 400;\">Once we&#8217;ve sequenced each multiplexed array, we need to split the resulting long read up into the individual sequences corresponding to the original cDNA fragments.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We initially tried using an iterative BLAST-like search to find the adapters and split the read at those positions, but that approach turned out to be quite brittle. It doesn\u2019t understand context (\u201cdoes it make sense that this substring in the read is a barcode given what\u2019s surrounding it?\u201d) and it\u2019s extremely sensitive to sequencing error (\u201cthis read wasn\u2019t error corrected as well as the last one; can I really believe that the subsequence I\u2019m looking at is one of my barcodes?\u201d). This is a big problem: mis-segmented reads may be misleading, masquerading for instance as false fusion genes or other seemingly interesting \u2014but wrong\u2014 biological findings.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The quality control evaluation of our initial pilot run confirmed that the BLAST-like adapter search was not going to cut it (so to speak) so we pivoted to a different solution that leverages what we know about the structure of our MAS-ISO-seq reads:<\/span><\/p>\n<ol>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Array elements (i.e. a transcript with poly-A tail and optional single cell adapters, etc.) are flanked by our barcodes, referred to here as M<\/span><span style=\"font-weight: 400;\">i<\/span><span style=\"font-weight: 400;\">\u00a0and M<\/span><span style=\"font-weight: 400;\">i+1<\/span><span style=\"font-weight: 400;\">. Across the length of the read, these adapters appear as an ordered sequence (M<\/span><span style=\"font-weight: 400;\">1<\/span><span style=\"font-weight: 400;\">, M<\/span><span style=\"font-weight: 400;\">2<\/span><span style=\"font-weight: 400;\">, M<\/span><span style=\"font-weight: 400;\">3<\/span><span style=\"font-weight: 400;\">, \u2026, M<\/span><span style=\"font-weight: 400;\">n<\/span><span style=\"font-weight: 400;\">) where n is the total number of elements in the array.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Each array element itself contains several sequences known\u00a0<\/span><i><span style=\"font-weight: 400;\">a priori<\/span><\/i><span style=\"font-weight: 400;\">\u00a0(e.g. 10x Genomics single-cell 5\u2019 and 3\u2019 adapters, poly-A tails).<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">The cDNA sequence itself can be considered not known\u00a0<\/span><i><span style=\"font-weight: 400;\">a priori<\/span><\/i><span style=\"font-weight: 400;\">. Cell barcodes, spatial barcodes, UMIs, or other adapters that come from a very large sequence space may also effectively be considered unknown.<\/span><\/li>\n<\/ol>\n<p>&nbsp;<\/p>\n<p><img decoding=\"async\" class=\"alignnone wp-image-1139 size-large\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/kiran-blog_fig_2-1-1024x185.png\" alt=\"\" width=\"800\" height=\"145\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">MAS-ISO-seq read structure barcodes and other adapters arising from the single-cell library preparation additionally highlighted.<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">This codified read structure lays out useful landmarks and constraints, which enabled us to develop a model for annotating then segmenting them with high fidelity, even in the presence of high sequencing error rates.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">In particular, the latter two observations\u00a0 enabled us to design a composite hidden Markov model with two separate submodels: &#8220;global alignment&#8221; and &#8220;random, repeat&#8221;. The \u201cglobal alignment\u201d submodel enables the recognition of known sequences (not just the MAS-ISO-seq barcodes, but all of them), allowing for mismatches and indels along the length of the sequence. The \u201crandom, repeat\u201d submodel enables the recognition of sequences that are not known in advance. These models are connected to one another and repeated as necessary according to a given MAS-ISO-seq array design (for example, a 15-element array). See the preprint if you want full technicolor detail (and some nifty grayscale figures) about the submodels; the point is that this modeling approach gives us much higher confidence in our annotation of where the sequences of interest start and stop within each array read.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">With the annotated reads in hand, we can apply the ordering constraint outlined in the first observation as a quality control check. Based on the patterns recorded during the annotation stage, we can confirm whether each array read&#8217;s MAS-ISO-seq adapters appear in the order specified by the array design. Reads failing this check are potentially mis-segmented, so we filter them out. And that&#8217;s it!<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We implemented this method as an open-source software package called <a href=\"https:\/\/github.com\/broadinstitute\/longbow\">Longbow<\/a>, which is available for download on Github and through Pypi, and we also provide a workflow script along with example data in a Terra workspace (see Resources at the end of this post for details).<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3>A &gt;15x increase in single-cell full-length transcript isoform yield<\/h3>\n<p><span style=\"font-weight: 400;\">To evaluate the performance of our approach on a full-size dataset, we applied Longbow to 15-element MAS-ISO-seq data for a test dataset on T-cells. We started with an initial set of ~5.6M input reads, of which ~1.6M reads were successfully error-corrected to ~Q30 by PacBio\u2019s\u00a0on-board software. Of these error-corrected reads, ~99% were successfully demultiplexed into 22.7M CCS-corrected transcripts, a ~14x increase from the initial 1.6M corrected read set.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img decoding=\"async\" class=\"alignnone wp-image-1134\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/kiran-blog_fig_3-1024x655.png\" alt=\"\" width=\"600\" height=\"384\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Sankey diagram of final processing status for reads from an exemplar 15-element MAS-ISO-seq library<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">What happened to the other 3.9M reads, you ask? Well, usually those get thrown away. PacBio\u2019s error correction protocol relies on molecules being redundantly sequenced within each well and then transformed into a consensus sequence. Many reads, however, don\u2019t get enough redundant passes to be corrected, and are therefore discarded.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">But uncorrected doesn\u2019t mean unusable; we can rescue this data! Because our HMM is robust to sequencing error, we also process the ~3.9M reads that\u00a0PacBio\u2019s software does not correct. Here, substantially more reads fail Longbow\u2019s filtering (~55%). However, 1.7M reads do segment properly and pass filtration, yielding another ~12.8M transcripts (additional ~8x increase from the initial 1.6M corrected reads). These transcripts are not error-corrected, but still represent valid data that can be used for a variety of purposes (e.g. transcriptome annotation).<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3>More power to identify cell types and differential expression<\/h3>\n<p><span style=\"font-weight: 400;\">What can we do with the massive increase in full-length transcriptome data enabled by MAS-ISO-seq? Let\u2019s look at a single single-cell library on ~6,000 tumor-infiltrating cytotoxic T cells, prepared both for short read sequencing and as a MAS-ISO-seq library. The figure below shows our short read dataset, with cells clearly separated into clusters representing different states of dormancy\/activation\/exhaustion (panel A).\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When we downsample our MAS-ISO-seq output to different levels (1.6M to 33M reads, panel B), we can see how more long read data influences a similar clustering analysis. As we add more data, cell clustering becomes more stable, increasingly resembling that of the short reads.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-1137\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/kiran-blog_fig_4-1024x558.png\" alt=\"\" width=\"800\" height=\"436\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Comparison of short read clustering to in silico downsampling of long read data. (A) UMAP embedding of single-cell gene expression of 6,260 CD8+ T cells from short (B) In silico downsampling analysis of MAS-ISO-seq reads; (top) evolution of UMAP embedding vs. depth (the long-read UMAPs are annotated with the cell identities determined from the short-read data); (middle) adjusted Rand index (ARI) between short-reads reference annotations and downsampled long reads vs. depth; (bottom) number of statistically significant differentially spliced genes vs. depth.<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">Adding more data also allows us to do progressively better at annotating a <\/span><i><span style=\"font-weight: 400;\">custom<\/span><\/i><span style=\"font-weight: 400;\"> transcriptome for these cells \u2013 using StringTie2 to augment the canonical Gencode annotations with high-confidence transcripts found in these cells. Our ability to identify differentially spliced genes among cell clusters increases substantially as a function of overall read count: up to a 34-fold gain (1604 vs 47) when we use the entire dataset!<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3>Future directions and availability of the software<\/h3>\n<p><span style=\"font-weight: 400;\">We&#8217;re very excited by MAS-ISO-seq&#8217;s ability to produce substantially higher throughput full-length isoform sequencing in single cells. And this is just the start! Over the coming months, we plan to add more capabilities to our processing software, Longbow. Support for more array designs (e.g. 10x Genomics 3\u2019 kit, spatial transcriptomics, etc.), support for full-length poly(A) tail capture, more robust training for the HMM, speed and usability enhancements, and more.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Like all software authored in the Data Sciences Platform, Longbow is open-source (BSD-3-Clause License) and should work on any reasonable HPC environment. We also provide a Docker image with all of the relevant dependencies preinstalled, as well as an easy-to-install <a href=\"https:\/\/pypi.org\/project\/maslongbow\/\">PyPi package<\/a>.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">For your convenience, we also provide a reproducible workflow for going from raw PacBio output to properly segmented and filtered reads ready for analysis. We plan to make further improvements to this workflow\u00a0 in the coming months, such as the ability to output gene-level and isoform-level quantification matrices that can be imported directly into Scanpy for analysis. <\/span><span style=\"font-weight: 400;\">The workflow is available from Github and Dockstore, and we make it available in a <a href=\"https:\/\/app.terra.bio\/#workspaces\/broad-firecloud-dsde-methods\/MAS-seq%20-%20Data%20Segmentation%20and%20Alignment\">public Terra workspace<\/a>, preconfigured to run on some example data, to enable anyone to try out the method quickly without any setup work. Please check it out and let us know what you think!<\/span><\/p>\n<p>&nbsp;<\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<h4><span style=\"color: #008000;\"><b>Resources<\/b><\/span><\/h4>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">The MAS-ISO-seq preprint can be found at <\/span><a href=\"https:\/\/www.biorxiv.org\/content\/10.1101\/2021.10.01.462818v1\"><span style=\"font-weight: 400;\">https:\/\/www.biorxiv.org\/content\/10.1101\/2021.10.01.462818v1<\/span><\/a><span style=\"font-weight: 400;\">. Comments\/reviews are most welcome!<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">The Longbow software can be found at <\/span><a href=\"https:\/\/github.com\/broadinstitute\/longbow\"><span style=\"font-weight: 400;\">https:\/\/github.com\/broadinstitute\/longbow<\/span><\/a><span style=\"font-weight: 400;\"> and on Pypi at \u200b\u200b<\/span><a href=\"https:\/\/pypi.org\/project\/maslongbow\/\"><span style=\"font-weight: 400;\">https:\/\/pypi.org\/project\/maslongbow\/<\/span><\/a><span style=\"font-weight: 400;\">, with documentation at <\/span><a href=\"https:\/\/broadinstitute.github.io\/longbow\/how.html\"><span style=\"font-weight: 400;\">https:\/\/broadinstitute.github.io\/longbow\/<\/span><\/a><span style=\"font-weight: 400;\">. Comments and issue reports are welcome there too.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">An example dataset of Spike-In RNA Variants (Lexogen SIRV-Set 4) and the workflow for initial data processing can be found in <a href=\"https:\/\/app.terra.bio\/#workspaces\/broad-firecloud-dsde-methods\/MAS-seq%20-%20Data%20Segmentation%20and%20Alignment\">this Terra workspace<\/a>.<\/span><\/li>\n<li aria-level=\"1\">If you are new to running workflows in Terra, see the <a href=\"https:\/\/support.terra.bio\/hc\/en-us\/sections\/360013095471-Workflows-QuickStart-Tutorial\">Workflows Quickstart Tutorial<\/a> to learn how you can use the preconfigured workspace to try out the method yourself.<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Dr. Kiran Garimella gives an overview of MAS-ISO-seq, a new method for generating a lot more data per run with long-read sequencing technologies such as PacBio, and shares a workspace that demonstrates the method&#8217;s data processing.<\/p>\n","protected":false},"author":29,"featured_media":463,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[12,18,155,13,19,119,66,59,32,14],"tags":[166,63,67],"class_list":["post-458","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-analysis","category-data","category-developers","category-guest-author","category-most-popular","category-most-recent","category-single-cell-transcriptomics","category-single-cell","category-workflows","category-workspaces","tag-pacbio","tag-rnaseq","tag-transcriptome"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.0 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield - Terra<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield - Terra\" \/>\n<meta property=\"og:description\" content=\"Dr. Kiran Garimella gives an overview of MAS-ISO-seq, a new method for generating a lot more data per run with long-read sequencing technologies such as PacBio, and shares a workspace that demonstrates the method&#039;s data processing.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\" \/>\n<meta property=\"og:site_name\" content=\"Terra\" \/>\n<meta property=\"article:published_time\" content=\"2021-11-04T13:46:11+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-12-27T04:55:06+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"627\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Kiran Garimella\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Kiran Garimella\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"10 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\"},\"author\":{\"name\":\"Kiran Garimella\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/a630ad1f4a2eb7fe5446858fd2ec93ee\"},\"headline\":\"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield\",\"datePublished\":\"2021-11-04T13:46:11+00:00\",\"dateModified\":\"2023-12-27T04:55:06+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\"},\"wordCount\":2051,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/terra.bio\/#organization\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png\",\"keywords\":[\"pacbio\",\"rnaseq\",\"transcriptome\"],\"articleSection\":[\"Analysis\",\"Data\",\"Developers\",\"Guest Author\",\"Most Popular\",\"Most Recent\",\"Single Cell Transcriptomics\",\"Single-Cell\",\"Workflows\",\"Workspaces\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\",\"url\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\",\"name\":\"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield - Terra\",\"isPartOf\":{\"@id\":\"https:\/\/terra.bio\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png\",\"datePublished\":\"2021-11-04T13:46:11+00:00\",\"dateModified\":\"2023-12-27T04:55:06+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage\",\"url\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png\",\"contentUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png\",\"width\":1200,\"height\":627,\"caption\":\"mas seq harvest gourds_OG\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/terra.bio\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/terra.bio\/#website\",\"url\":\"https:\/\/terra.bio\/\",\"name\":\"Terra\",\"description\":\"Science at Scale\",\"publisher\":{\"@id\":\"https:\/\/terra.bio\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/terra.bio\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/terra.bio\/#organization\",\"name\":\"Terra\",\"url\":\"https:\/\/terra.bio\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp\",\"contentUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp\",\"width\":287,\"height\":318,\"caption\":\"Terra\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/a630ad1f4a2eb7fe5446858fd2ec93ee\",\"name\":\"Kiran Garimella\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/0f70f836b6a8658d710b38fc165d7980de1a0f574416256a0beb7c15238f6f9a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/0f70f836b6a8658d710b38fc165d7980de1a0f574416256a0beb7c15238f6f9a?s=96&d=mm&r=g\",\"caption\":\"Kiran Garimella\"},\"url\":\"https:\/\/terra.bio\/author\/kirang\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield - Terra","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/","og_locale":"en_US","og_type":"article","og_title":"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield - Terra","og_description":"Dr. Kiran Garimella gives an overview of MAS-ISO-seq, a new method for generating a lot more data per run with long-read sequencing technologies such as PacBio, and shares a workspace that demonstrates the method's data processing.","og_url":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/","og_site_name":"Terra","article_published_time":"2021-11-04T13:46:11+00:00","article_modified_time":"2023-12-27T04:55:06+00:00","og_image":[{"width":1200,"height":627,"url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png","type":"image\/png"}],"author":"Kiran Garimella","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Kiran Garimella","Est. reading time":"10 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#article","isPartOf":{"@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/"},"author":{"name":"Kiran Garimella","@id":"https:\/\/terra.bio\/#\/schema\/person\/a630ad1f4a2eb7fe5446858fd2ec93ee"},"headline":"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield","datePublished":"2021-11-04T13:46:11+00:00","dateModified":"2023-12-27T04:55:06+00:00","mainEntityOfPage":{"@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/"},"wordCount":2051,"commentCount":0,"publisher":{"@id":"https:\/\/terra.bio\/#organization"},"image":{"@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage"},"thumbnailUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png","keywords":["pacbio","rnaseq","transcriptome"],"articleSection":["Analysis","Data","Developers","Guest Author","Most Popular","Most Recent","Single Cell Transcriptomics","Single-Cell","Workflows","Workspaces"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/","url":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/","name":"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield - Terra","isPartOf":{"@id":"https:\/\/terra.bio\/#website"},"primaryImageOfPage":{"@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage"},"image":{"@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage"},"thumbnailUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png","datePublished":"2021-11-04T13:46:11+00:00","dateModified":"2023-12-27T04:55:06+00:00","breadcrumb":{"@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#primaryimage","url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png","contentUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/mas-seq-harvest-gourds_OG.png","width":1200,"height":627,"caption":"mas seq harvest gourds_OG"},{"@type":"BreadcrumbList","@id":"https:\/\/terra.bio\/introducing-mas-iso-seq-a-new-long-read-sequencing-protocol-for-dramatically-increasing-rna-transcript-isoform-yield\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/terra.bio\/"},{"@type":"ListItem","position":2,"name":"Introducing MAS-ISO-seq: A new long-read sequencing protocol for dramatically increasing RNA transcript isoform yield"}]},{"@type":"WebSite","@id":"https:\/\/terra.bio\/#website","url":"https:\/\/terra.bio\/","name":"Terra","description":"Science at Scale","publisher":{"@id":"https:\/\/terra.bio\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/terra.bio\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/terra.bio\/#organization","name":"Terra","url":"https:\/\/terra.bio\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/#\/schema\/logo\/image\/","url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp","contentUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp","width":287,"height":318,"caption":"Terra"},"image":{"@id":"https:\/\/terra.bio\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/terra.bio\/#\/schema\/person\/a630ad1f4a2eb7fe5446858fd2ec93ee","name":"Kiran Garimella","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/0f70f836b6a8658d710b38fc165d7980de1a0f574416256a0beb7c15238f6f9a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/0f70f836b6a8658d710b38fc165d7980de1a0f574416256a0beb7c15238f6f9a?s=96&d=mm&r=g","caption":"Kiran Garimella"},"url":"https:\/\/terra.bio\/author\/kirang\/"}]}},"_links":{"self":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts\/458","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/users\/29"}],"replies":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/comments?post=458"}],"version-history":[{"count":0,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts\/458\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/media\/463"}],"wp:attachment":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/media?parent=458"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/categories?post=458"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/tags?post=458"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}