Annotating Metabolic Processes: Instructions for Using Argo

This document is part of Annotating Metabolic Processes.

Prerequisites

This task requires some proficiency in using Argo. Please make sure that you know how to:

  • set up component parameters,
  • run workflows,
  • track the progress of processing workflows and open the Manual Annotation Editor,
  • use the Manual Annotation Editor.

Most of these actions are demonstrated in the step-by-step instructions below; however, you are encouraged to visit the tutorials page before continuing.

Workflows

For the purpose of evaluating the usability of Argo, there are three workflows prepared for the curators:

  1. Manual annotation: This workflow reads PubMed abstracts and opens a Manual Annotation Editor for a curator to tag the entities of interest without any support from automatic processing available in Argo.
    • Input: PubMed abstract identifiers.
    • Output: Annotation files in XMI (interchangeable) format.
  2. Automatic annotation: This workflow reads PubMed abstracts and performs automatic recognition of the entities of interest. This workflow is purely automatic and does not involve any manual intervention from the curators. The objective of this workflow is to automatically “pre-annotate” the input abstracts for later manual inspection.
    • Input: PubMed abstract identifiers.
    • Output: Annotation files in XMI format.
  3. Manual correction: This workflow reads files that already contain annotations (coming from the second, automatic workflow) and opens a Manual Annotation Editor for a curator to correct (remove, add, modify) the automatically recognised annotations.
    • Input: Annotation files in XMI format.
    • Output: Annotation files in XMI format.

Although the second and third workflow could be combined into a single workflow, for the purpose of evaluation they are defined separately.

The three workflows are publicly available in Argo.

Step-by-step Instructions

For convenience, you are advised to open Argo’s main application in a separate window and keep these instructions and the annotation guidelines handy.

Launch the Argo application in a separate window now!

A. Initial setup

The initial setup involves signing in to or registering an account with Argo, as well as creating folders that will be used to output annotations to. You will also make copies of publicly available, read-only workflows so that you can make some adjustments on them.

initial setup step-by-step instructions

B. Manual annotation

This phase involves configuring the manual annotation workflow by setting up input PubMed IDs and folders for output annotations as well as running the workflows and performing the manual annotation.

manual annotation step-by-step instructions

C. Automatic Annotation

This phase involves configuring the input and output annotation folders of the automatic annotation workflow and running it.

automatic annotation step-by-step instructions

D. Manual correction

This phase is similar to the manual annotation phase. The difference lies in the source of input annotations which comes from the output of the previous, automatic annotation workflow.

manual correction step-by-step instructions

Frequently Asked Questions

  • Can I try the workflows with other PubMed IDs for practice?
  • Yes, you may enter any (preferably pertaining to metabolic processes) PubMed IDs as input to the workflows. Alternatively you can select some from the list below:
    • 21223739 22064374 22585829 3037304 7906730
    • 8076368  8445979  8558417  8915012 9972481
  • What should I do if a recognised entity cannot be identified in ChEBI/UniProt/CTD?
  • Nothing. It is expected that some entities will be new to the external dictionaries. They will be automatically pulled from the annotations in post-processing (not part of this curation task) and will constitute a set of new entries for the external dictionaries.
  • How can I view or change annotations at a later visit?
  • As long as the workflow with a Manual Annotation Editor is running, you can view or make changes to any of the documents that are part of this workflow by switching to the Processes view, identifying the right process, and clicking on the Launch Manual Annotation Editor button.
  • If the process has finished, you can still view or change annotations; however, you have to create or edit another workflow. Notice that the “Metabolic processes: Manual correction” workflow is exactly what is needed to accomplish it. Simply edit this workflow, create a copy, rename it to something memorable, and change the input of the XMI Reader to the files/documents you want to view/edit. For the XMI Writer, if you wish to overwrite the input files, select the same folder and tick the “overwrite” checkbox.
  • If you only want to view the files, remove the XMI Writer component from the workflow altogether.
  • Can I see if XMI files contain annotations in the Documents view?
  • XMI files contain annotations even if you have not created any explicitly; they include document metadata. Currently, the Documents view does not visually differentiate between XMI files with “default” annotations and those with added annotations.
  • Can I download the annotations?
  • Yes. Each of the workflows includes the XMI Writer component that writes the original text and annotations to your document space. Simply switch to the Documents view, locate and select the documents you wish to download and click on the Download button in the toolbar.
  • Note: Currently you may download only one document at a time.