{"id":503,"date":"2022-03-10T20:03:19","date_gmt":"2022-03-10T20:03:19","guid":{"rendered":"https:\/\/terrabioappdev.wpenginepowered.com\/ten-simple-rules-2-document-everything\/"},"modified":"2023-12-27T04:55:17","modified_gmt":"2023-12-27T04:55:17","slug":"ten-simple-rules-2-document-everything","status":"publish","type":"post","link":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/","title":{"rendered":"Ten simple rules \u2014 #2 Document everything"},"content":{"rendered":"<p><i><span style=\"font-weight: 400;\">This blog post is part of a series based on the paper &#8220;Ten simple rules for large-scale data processing&#8221; by Arkarachai Fungtammasan et al. (PLOS Computational Biology, 2022). Each installment reviews one of the rules proposed by the authors and illustrates how it can be applied when working in Terra. In this installment, we cover features that Terra users can take advantage of to communicate key project information to collaborators, keep records of workflow executions, and document analyses done in interactive environments for reproducibility.\u00a0<\/span><\/i><\/p>\n<hr \/>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">Hot on the heels of &#8220;<\/span><a href=\"https:\/\/terra.bio\/ten-simple-rules-1-dont-reinvent-the-wheel\/\"><span style=\"font-weight: 400;\">Don&#8217;t reinvent the wheel<\/span><\/a><span style=\"font-weight: 400;\">&#8220;, we tackle another deceptively simple rule: &#8220;Document everything&#8221;, an exhortation that may seem self-evident but can be quite challenging to apply consistently in practice.<\/span><\/p>\n<blockquote><p><span style=\"font-weight: 400;\">&#8220;If it\u2019s not written down, it didn\u2019t happen.\u201d<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400;\">In <\/span><a href=\"https:\/\/journals.plos.org\/ploscompbiol\/article?id=10.1371\/journal.pcbi.1009757\"><span style=\"font-weight: 400;\">their paper<\/span><\/a><span style=\"font-weight: 400;\">, Arkarachai Fungtammasan and colleagues motivate this rule primarily by calling out the necessity of ensuring effective transfer of knowledge within teams, particularly large collaborative teams that experience staff turnover. They also wisely point out the utility of progressively recording information that will later need to be collated for documentation, which I interpret as documentation intended for an external audience, e.g. for a research publication.\u00a0<\/span><\/p>\n<blockquote><p><span style=\"font-weight: 400;\">&#8220;[&#8230;] As members join and leave the team working on a large-scale data processing project, remembering why each decision was made can become difficult. A simple log with what the decision is, what the rationale for it is, who contributed to making it, and who agreed or approved with it can be enough. [&#8230;] This information can also be helpful to have consolidated when creating documentation explaining the pipelines used [<\/span><a href=\"https:\/\/journals.plos.org\/ploscompbiol\/article?id=10.1371\/journal.pcbi.1009757#pcbi.1009757.ref011\"><span style=\"font-weight: 400;\">11<\/span><\/a><span style=\"font-weight: 400;\">].&#8221;<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400;\">This is a compelling argument, deeply relatable. Even folks who are working mostly solo rather than as part of a large team should recognize the value of setting up their future self for success when the time comes to write the Materials and Methods section of their manuscript. To (badly) paraphrase the great Ru Paul, if you don&#8217;t document the work for yourself today, how the heck are you going to document it for somebody else in six months or more?<\/span><\/p>\n<p><span style=\"font-weight: 400;\">When it comes to solutions, the authors&#8217; recommendation centers on the use of project management tools.<\/span><\/p>\n<blockquote><p><span style=\"font-weight: 400;\">&#8220;There are multiple approaches to logging decisions, but a powerful and common approach is to repurpose project management systems such as GitHub Issues. Doing so effectively ties decisions to code or documentation changes, allows for historical record including discussion, can handle decision dependencies, and can incorporate formal review process automation.&#8221;<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400;\">We can certainly agree it&#8217;s a great idea to use a formal project management system to track work in general, and I for one wish that had been covered in my graduate school education. The specific suggestion of <\/span><a href=\"https:\/\/github.com\/features\/issues\"><span style=\"font-weight: 400;\">Github Issues<\/span><\/a><span style=\"font-weight: 400;\"> will work particularly well for people whose work has a strong code development component, since they&#8217;re likely to be using Github already. I&#8217;ll note that there is also an add-on for Github Issues called <\/span><a href=\"https:\/\/www.zenhub.com\/\"><span style=\"font-weight: 400;\">Zenhub<\/span><\/a><span style=\"font-weight: 400;\"> that provides additional project management functionality, and is free to use with public repositories. And of course, there are plenty of other options with different feature sets for teams that have different needs and preferences.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Yet while this recommendation does a great job of addressing the need to capture information about decision-making and code development processes for posterity, it doesn&#8217;t really touch the question of how to document analysis work at a granular level \u2014 e.g. which pipeline was run on which data, what was the command line, what were the outputs etc \u2014 without having to do an inhuman amount of manual input.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This is admittedly a difficult question to address in a generic way, because the answer depends so much on the specific platform or environment you&#8217;ll be using to do the work.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Fortunately in this post I have the luxury of focusing on how you can apply the &#8220;Document everything&#8221; rule specifically within the Terra ecosystem. So let&#8217;s review a few key features of Terra that can help you apply this rule in three main areas: keeping records of workflow executions, documenting the twists and turns of your interactive analyses, and communicating the purpose and contents of a project workspace.\u00a0<\/span><\/p>\n<p><i><span style=\"font-weight: 400;\">FYI, the closely related topics of version control and monitoring execution are the object of separate rules, which we&#8217;ll get to in a few weeks.<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<h3><span style=\"font-weight: 400;\">Keep detailed records of workflow executions<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Running workflows at scale can be very challenging due to a range of factors including the amount of data involved, the complexity of the workflows, and the importance of processing all samples in a dataset in the same way. So it&#8217;s absolutely essential that whatever system you use enables you to find out exactly what was done in any given workflow run.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">I&#8217;m happy to say that is something the Terra workflow management system does particularly well. Whenever you launch a workflow (or set of workflows), the system records all relevant metadata automatically, including all command-line parameter values as well as direct links to the workflow code, input files and output files. The system also retains (and links to) copies of all execution logs, which contain information such as the exact command line that was run at each step, what logging information the tool itself produced (stdin and stderr), and additional metadata that we&#8217;ll discuss more when we get to later rules such as #10: Monitor Execution.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This logging system (and its user-friendly web interface) makes it possible to quickly find all the information and inputs you would need to reproduce a workflow-based analysis with perfect fidelity.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img fetchpriority=\"high\" decoding=\"async\" class=\"alignnone size-large wp-image-1224\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/job-history-with-comment-1024x291.png\" alt=\"\" width=\"800\" height=\"227\" \/><\/p>\n<p><img decoding=\"async\" class=\"alignnone size-large wp-image-1222\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/wf-submission-details-1024x364.png\" alt=\"\" width=\"800\" height=\"284\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Screenshots of a workspace&#8217;s job history view: list of submissions (top) and detailed view of one submission (bottom) (browse this workspace&#8217;s job history <\/span><\/i><a href=\"https:\/\/app.terra.bio\/#workspaces\/warp-pipelines\/DRAGEN-GATK-Whole-Genome-Germline-Pipeline\/job_history\"><i><span style=\"font-weight: 400;\">here<\/span><\/i><\/a><i><span style=\"font-weight: 400;\">)<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">In addition to the automated logging, you also have the option of adding comments to your workflow submission, either at the time you launch it, or after the fact (including during execution). For example, in the list of submissions shown above, the second row in the table (with &#8220;Aborted&#8221; status) includes a comment that was added after the workflow run was aborted. This can be very useful for keeping track of decisions or troubleshooting notes, especially in projects with a heavy development component, where multiple attempts may need to be made on the path to success.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">It&#8217;s worth noting that this commenting feature was added in response to community requests and has proved hugely popular with researchers who need to manage a lot of workflow submissions. Community feedback works!<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3><span style=\"font-weight: 400;\">Use Jupyter Notebooks to document interactive analyses<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">In my experience, the part of people&#8217;s projects that tends to be the least adequately documented is the phase of iterative data exploration, analysis and visualization that is generally typical of tertiary data analysis, which we lump under the term &#8220;interactive analysis&#8221; in contrast to automated workflows. This phase typically involves applying a variety of commands, sometimes scripted, sometimes not, within an interactive environment such as a terminal shell or an application like RStudio.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">By its very nature this can be a messy, non-linear process, and unfortunately the entire process often ends up summarized in Methods sections as &#8220;We applied methods X and Y using base R and this list of packages which are available in CRAN.&#8221; It goes without saying that this is not sufficient to enable a reader, collaborator or even your future self to reproduce the work.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">One increasingly popular way to address this problem, which is <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/articles\/360059009571-Notebooks-Quickstart-Guide\"><span style=\"font-weight: 400;\">fully supported in Terra<\/span><\/a><span style=\"font-weight: 400;\">, is to perform most if not all of the work within a Jupyter Notebook. The Notebook environment allows you to progressively document every step and every attempted command, alternating documentation cells and code execution cells, with full logging of command outputs. This provides a much richer documentation record than &#8220;just&#8221; including code comments in a script, for example. And keep in mind that you can run almost any command-line analysis tool from within a notebook; you&#8217;re not limited to running Python and R code.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><img decoding=\"async\" class=\"alignnone size-large wp-image-1225\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/notebook-gatk-1024x353.png\" alt=\"\" width=\"800\" height=\"276\" \/><\/p>\n<p><i><span style=\"font-weight: 400;\">Screenshot of a Jupyter Notebook in Terra (preview mode) showing invocation and partial log output of a GATK command (see the full tutorial notebook <\/span><\/i><a href=\"https:\/\/app.terra.bio\/#workspaces\/help-gatk\/Genomics-in-the-Cloud-v1\/notebooks\/launch\/Genomics-Notebook-executed.ipynb\"><i><span style=\"font-weight: 400;\">here<\/span><\/i><\/a><i><span style=\"font-weight: 400;\">)<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">That being said, the resulting &#8220;complete record&#8221; can be a little overwhelming, so I personally like to maintain a parallel notebook in which I only include &#8220;the bits that worked&#8221;. This allows me to progressively build (and regularly re-check) the minimal end-to-end path necessary to reproduce the work. The result is a more easily readable documentation record that is pretty much ready to publish.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">This approach may also be a good fit for people who find the experience of developing an analysis in Jupyter Notebook to be too constraining, and who prefer to work in an environment like RStudio (which is also <\/span><a href=\"https:\/\/terra.bio\/try-rstudio-in-terra\/\"><span style=\"font-weight: 400;\">available in Terra<\/span><\/a><span style=\"font-weight: 400;\">). As you progress through your analysis, record chunks of the work in a notebook in parallel, alternating descriptions of your decision-making process and the actual code executions applied to the data. If you&#8217;re used to saving analysis code in R scripts, you can simply invoke the scripts from your notebook, and combine the advantages of both sides \u2014 the flexibility of RStudio as a development environment and the documentation power of Jupyter Notebooks.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Does that mean you&#8217;ll be running a lot of computations twice or more? Why yes, it does indeed, and that&#8217;s a good thing: it&#8217;s a built-in way to verify the reproducibility of your work as you go.\u00a0<\/span><\/p>\n<p>&nbsp;<\/p>\n<h3><span style=\"font-weight: 400;\">Communicate the purpose and contents of your workspace<\/span><\/h3>\n<p><span style=\"font-weight: 400;\">Giving teammates access to your work in Terra is straightforward; simply share your project workspace with them through the <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/articles\/360024617851\"><span style=\"font-weight: 400;\">workspace sharing<\/span><\/a><span style=\"font-weight: 400;\"> menu. However, there can be a lot of assets in your workspace (data, code, tools) and it&#8217;s not necessarily trivial for someone coming in to understand how it all ties together, especially if they are new to Terra themselves.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We encourage you to take advantage of the editable <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/articles\/360024743371-Working-with-workspaces#zp-3-0\"><span style=\"font-weight: 400;\">&#8220;dashboard&#8221;<\/span><\/a><span style=\"font-weight: 400;\"> of your workspace to provide collaborators with an overview of the project that the workspace is meant to tackle, and summarize key information: what are the main assets used in the workspace (e.g. data, tools, code), instructions for running the analyses, plus meta-level information like authorship and any applicable licensing conditions. The Terra User Education team provides <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/articles\/360043450112\"><span style=\"font-weight: 400;\">best-practices recommendations<\/span><\/a><span style=\"font-weight: 400;\"> for how to structure dashboard documentation, based on their extensive experience developing public workspaces for educational purposes.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><a href=\"https:\/\/app.terra.bio\/#workspaces\/featured-workspaces-hca\/Intro-to-HCA-data-on-Terra\"><i><span style=\"font-weight: 400;\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-1226\" src=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/hca-dashboard-1024x609.png\" alt=\"\" width=\"800\" height=\"476\" \/><\/span><\/i><\/a><\/p>\n<p><i><span style=\"font-weight: 400;\">Screenshot showing part of a workspace dashboard (see full dashboard <\/span><\/i><a href=\"https:\/\/app.terra.bio\/#workspaces\/featured-workspaces-hca\/Intro-to-HCA-data-on-Terra\"><i><span style=\"font-weight: 400;\">here<\/span><\/i><\/a><i><span style=\"font-weight: 400;\">)<\/span><\/i><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"font-weight: 400;\">One limitation here is that the workspace dashboard documentation functionality does require manual input (the information is not collated automatically) and it&#8217;s not version-controlled. Its main advantage is that it makes it possible to attach summary information as an integral part to the workspace itself, rather than relying on external\/separate documents.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Some enhancements to this functionality have been discussed, like the possibility of <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/community\/posts\/4607419732123-Add-comment-log-to-the-workspace\"><span style=\"font-weight: 400;\">adding a comment log<\/span><\/a><span style=\"font-weight: 400;\"> so that multiple people collaborating within a workspace could post timestamped notes, flag issues and ask questions within the context of the workspace itself (rather than having to switch to an outside application).\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">As I hinted at earlier, <\/span><a href=\"https:\/\/terra.bio\/submit-feature-requests-and-vote-on-community-ideas\/\"><span style=\"font-weight: 400;\">we welcome feature requests<\/span><\/a><span style=\"font-weight: 400;\">, so feel free to upvote <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/community\/posts\/4607419732123-Add-comment-log-to-the-workspace\"><span style=\"font-weight: 400;\">this idea<\/span><\/a><span style=\"font-weight: 400;\"> or suggest your own in the <\/span><a href=\"https:\/\/support.terra.bio\/hc\/en-us\/community\/topics\/360000500452-Active-Feature-Requests\"><span style=\"font-weight: 400;\">Feature Requests<\/span><\/a><span style=\"font-weight: 400;\"> section of the Terra community forum!<\/span><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A review of Terra features supporting documentation for transfer of knowledge and reproducibility, inspired by \u201cTen simple rules for large-scale data processing\u201d (Fungtammasan 2022)<\/p>\n","protected":false},"author":4,"featured_media":508,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[31,43,53,45,32],"tags":[],"class_list":["post-503","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ecosystem","category-features","category-getting-started","category-notebooks","category-workflows"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.0 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Ten simple rules \u2014 #2 Document everything - Terra<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Ten simple rules \u2014 #2 Document everything - Terra\" \/>\n<meta property=\"og:description\" content=\"A review of Terra features supporting documentation for transfer of knowledge and reproducibility, inspired by \u201cTen simple rules for large-scale data processing\u201d (Fungtammasan 2022)\" \/>\n<meta property=\"og:url\" content=\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\" \/>\n<meta property=\"og:site_name\" content=\"Terra\" \/>\n<meta property=\"article:published_time\" content=\"2022-03-10T20:03:19+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-12-27T04:55:17+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"627\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Geraldine Van der Auwera\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Geraldine Van der Auwera\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"9 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\"},\"author\":{\"name\":\"Geraldine Van der Auwera\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/ad0522d0b331a5e08fa1733f65086ee2\"},\"headline\":\"Ten simple rules \u2014 #2 Document everything\",\"datePublished\":\"2022-03-10T20:03:19+00:00\",\"dateModified\":\"2023-12-27T04:55:17+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\"},\"wordCount\":1887,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/terra.bio\/#organization\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png\",\"articleSection\":[\"Ecosystem\",\"Features\",\"Getting Started\",\"Notebooks\",\"Workflows\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\",\"url\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\",\"name\":\"Ten simple rules \u2014 #2 Document everything - Terra\",\"isPartOf\":{\"@id\":\"https:\/\/terra.bio\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png\",\"datePublished\":\"2022-03-10T20:03:19+00:00\",\"dateModified\":\"2023-12-27T04:55:17+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage\",\"url\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png\",\"contentUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png\",\"width\":1200,\"height\":627,\"caption\":\"document everything_OG\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/terra.bio\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Ten simple rules \u2014 #2 Document everything\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/terra.bio\/#website\",\"url\":\"https:\/\/terra.bio\/\",\"name\":\"Terra\",\"description\":\"Science at Scale\",\"publisher\":{\"@id\":\"https:\/\/terra.bio\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/terra.bio\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/terra.bio\/#organization\",\"name\":\"Terra\",\"url\":\"https:\/\/terra.bio\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp\",\"contentUrl\":\"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp\",\"width\":287,\"height\":318,\"caption\":\"Terra\"},\"image\":{\"@id\":\"https:\/\/terra.bio\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/ad0522d0b331a5e08fa1733f65086ee2\",\"name\":\"Geraldine Van der Auwera\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/terra.bio\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/d73bdaf6740465b385e0e3b290786d8cb9d9d548eadec23364254ba06c85204b?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/d73bdaf6740465b385e0e3b290786d8cb9d9d548eadec23364254ba06c85204b?s=96&d=mm&r=g\",\"caption\":\"Geraldine Van der Auwera\"},\"sameAs\":[\"https:\/\/app.terra.bio\/\"],\"url\":\"https:\/\/terra.bio\/author\/geraldinevanterra\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Ten simple rules \u2014 #2 Document everything - Terra","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/","og_locale":"en_US","og_type":"article","og_title":"Ten simple rules \u2014 #2 Document everything - Terra","og_description":"A review of Terra features supporting documentation for transfer of knowledge and reproducibility, inspired by \u201cTen simple rules for large-scale data processing\u201d (Fungtammasan 2022)","og_url":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/","og_site_name":"Terra","article_published_time":"2022-03-10T20:03:19+00:00","article_modified_time":"2023-12-27T04:55:17+00:00","og_image":[{"width":1200,"height":627,"url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png","type":"image\/png"}],"author":"Geraldine Van der Auwera","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Geraldine Van der Auwera","Est. reading time":"9 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#article","isPartOf":{"@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/"},"author":{"name":"Geraldine Van der Auwera","@id":"https:\/\/terra.bio\/#\/schema\/person\/ad0522d0b331a5e08fa1733f65086ee2"},"headline":"Ten simple rules \u2014 #2 Document everything","datePublished":"2022-03-10T20:03:19+00:00","dateModified":"2023-12-27T04:55:17+00:00","mainEntityOfPage":{"@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/"},"wordCount":1887,"commentCount":0,"publisher":{"@id":"https:\/\/terra.bio\/#organization"},"image":{"@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage"},"thumbnailUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png","articleSection":["Ecosystem","Features","Getting Started","Notebooks","Workflows"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/","url":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/","name":"Ten simple rules \u2014 #2 Document everything - Terra","isPartOf":{"@id":"https:\/\/terra.bio\/#website"},"primaryImageOfPage":{"@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage"},"image":{"@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage"},"thumbnailUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png","datePublished":"2022-03-10T20:03:19+00:00","dateModified":"2023-12-27T04:55:17+00:00","breadcrumb":{"@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#primaryimage","url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png","contentUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/document-everything_OG.png","width":1200,"height":627,"caption":"document everything_OG"},{"@type":"BreadcrumbList","@id":"https:\/\/terra.bio\/ten-simple-rules-2-document-everything\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/terra.bio\/"},{"@type":"ListItem","position":2,"name":"Ten simple rules \u2014 #2 Document everything"}]},{"@type":"WebSite","@id":"https:\/\/terra.bio\/#website","url":"https:\/\/terra.bio\/","name":"Terra","description":"Science at Scale","publisher":{"@id":"https:\/\/terra.bio\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/terra.bio\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/terra.bio\/#organization","name":"Terra","url":"https:\/\/terra.bio\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/#\/schema\/logo\/image\/","url":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp","contentUrl":"https:\/\/terra.bio\/wp-content\/uploads\/2023\/12\/Terra-Bio-App@2x.webp","width":287,"height":318,"caption":"Terra"},"image":{"@id":"https:\/\/terra.bio\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/terra.bio\/#\/schema\/person\/ad0522d0b331a5e08fa1733f65086ee2","name":"Geraldine Van der Auwera","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/terra.bio\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/d73bdaf6740465b385e0e3b290786d8cb9d9d548eadec23364254ba06c85204b?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/d73bdaf6740465b385e0e3b290786d8cb9d9d548eadec23364254ba06c85204b?s=96&d=mm&r=g","caption":"Geraldine Van der Auwera"},"sameAs":["https:\/\/app.terra.bio\/"],"url":"https:\/\/terra.bio\/author\/geraldinevanterra\/"}]}},"_links":{"self":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts\/503","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/comments?post=503"}],"version-history":[{"count":0,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/posts\/503\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/media\/508"}],"wp:attachment":[{"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/media?parent=503"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/categories?post=503"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/terra.bio\/wp-json\/wp\/v2\/tags?post=503"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}