Create a Workflow with Composer
The Arvados Workflow Composer is a graphical interface for building Common Workflow Language (CWL) workflows to run on Arvados.
This tutorial will demonstrate:
- Creating a new git repository through Arvados to store the workflow
- Creating CommandLineTools for “sort” and “uniq”
- Creating a Workflow which uses “sort” and “uniq” to remove duplicate lines from a text file
- Submitting the Workflow to run on Arvados
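The reason the workflow chains the two tools can be sketched in a few lines of Python. This is only an illustration of the behaviour of “sort” followed by “uniq”, not part of Composer: the function names and the sample data are made up for this example, and Python's default string ordering may differ from the locale-dependent ordering of the real “sort” command.

```python
from itertools import groupby


def uniq(lines):
    """Collapse runs of adjacent identical lines, as the `uniq` command does."""
    return [key for key, _group in groupby(lines)]


def sort_then_uniq(lines):
    """Sort first so that duplicates become adjacent, then collapse them."""
    return uniq(sorted(lines))


# Hypothetical sample input: duplicates exist but are never adjacent.
lines = ["pear", "apple", "pear", "apple", "fig"]

# On unsorted input, uniq alone removes nothing here.
unsorted_result = uniq(lines)

# Sorting first lets uniq remove every duplicate.
deduplicated = sort_then_uniq(lines)

print(unsorted_result)
print(deduplicated)
```

Because “uniq” only compares neighbouring lines, the workflow needs the “sort” step in front of it to remove all duplicates.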
1. Access Composer from Workbench
2. Composer starting page
3. Manage git repositories (opens Workbench in new tab)
4. Add a new repository
5. Return to Composer. Use the refresh button to discover the new repository (it may take a few moments to show up).
6. Create a new Command Line Tool
7. Set Docker image, base command, and input port for “sort” tool
The “Docker Repository” is the name:tag of a Docker image uploaded to Arvados. (Use arv-keepdocker --pull debian:9 to pull the debian:9 image and upload it.) You can also find prepackaged bioinformatics tools on various sites, such as http://dockstore.org and http://biocontainers.pro/.
8. Redirect stdout to a file
9. Capture output file
10. Save Command Line Tool
11. Repeat steps 6-10 for “uniq” tool
Create a new tool with a “base command” of “uniq”.
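For orientation, the settings made in steps 7 to 9 correspond roughly to the fields of a CWL CommandLineTool. The sketch below builds such a description as a Python dict and prints it as JSON, which CWL accepts alongside YAML. It is an approximation under stated assumptions, not the exact file Composer saves: the helper make_tool, the port names input_file and output_file, and the stdout file names are all hypothetical.

```python
import json


def make_tool(base_command, stdout_name):
    """Build a minimal CWL v1.0 CommandLineTool description as a plain dict.

    Port names and file names are illustrative assumptions only.
    """
    return {
        "cwlVersion": "v1.0",
        "class": "CommandLineTool",
        # Step 7: base command and Docker image.
        "baseCommand": base_command,
        "hints": {"DockerRequirement": {"dockerPull": "debian:9"}},
        # Step 7: one File input passed as the first positional argument.
        "inputs": {
            "input_file": {"type": "File", "inputBinding": {"position": 1}},
        },
        # Step 8: redirect stdout to a file.
        "stdout": stdout_name,
        # Step 9: capture that file as the tool's output.
        "outputs": {
            "output_file": {
                "type": "File",
                "outputBinding": {"glob": stdout_name},
            },
        },
    }


sort_tool = make_tool("sort", "sorted.txt")
uniq_tool = make_tool("uniq", "unique.txt")

print(json.dumps(sort_tool, indent=2))
```

The two tools differ only in their base command and output file name, which is why the “uniq” tool can be created by repeating the same steps.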
12. Switch back to “Home” tab and create workflow
13. Drag and drop tools into Workflow
14. Drag from input port of “sort” to empty space to create workflow input
15. Drag from output port of “sort” to input port of “uniq”
16. Drag from output port of “uniq” to empty space to create workflow output
17. Save Workflow
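The connections drawn in steps 14 to 16 map onto a CWL Workflow along the following lines. This is a minimal sketch written as a Python dict and printed as JSON (CWL accepts JSON as well as YAML); the file Composer actually saves may be laid out differently. The file names sort.cwl and uniq.cwl, the port names input_file and output_file, and the workflow output name unique_lines are assumptions made for this example.

```python
import json

# Hypothetical names throughout; adjust to match the tools you saved.
workflow = {
    "cwlVersion": "v1.0",
    "class": "Workflow",
    # Step 14: dragging from the input port of "sort" creates a workflow input.
    "inputs": {"input_file": "File"},
    # Step 16: dragging from the output port of "uniq" creates a workflow output.
    "outputs": {
        "unique_lines": {"type": "File", "outputSource": "uniq/output_file"},
    },
    "steps": {
        "sort": {
            "run": "sort.cwl",
            "in": {"input_file": "input_file"},
            "out": ["output_file"],
        },
        "uniq": {
            "run": "uniq.cwl",
            # Step 15: the output of "sort" feeds the input of "uniq".
            "in": {"input_file": "sort/output_file"},
            "out": ["output_file"],
        },
    },
}

print(json.dumps(workflow, indent=2))
```

Each edge drawn in the graphical editor becomes a source reference of the form step/port, so the saved workflow can be read as a plain description of the same graph.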
18. Click on “Test” tab then click “Run”
19. Choose input file
You may need to upload an input file first.
20. Run the workflow
21. Monitor progress (may take several minutes)
22. Get workflow output