Create a Workflow with Composer
The Arvados Workflow Composer is a graphical interface for building Common Workflow Language (CWL) workflows to run on Arvados.
This tutorial will demonstrate:
- Creating a new git repository through Arvados to store the workflow
- Creating CommandLineTools for “sort” and “uniq”
- Creating a Workflow which uses “sort” and “uniq” to remove duplicate lines from a text file
- Submitting the Workflow to run on Arvados
1. Access from workbench
data:image/s3,"s3://crabby-images/9e7ca/9e7ca58adbc9cb40b16cf1a5944f6dfeb6058b2a" alt=""
2. Composer starting page
data:image/s3,"s3://crabby-images/adc73/adc73f04a3b306c3828314eb068b3aeaf16c94e0" alt=""
3. Manage git repositories (opens Workbench in new tab)
data:image/s3,"s3://crabby-images/4b6d1/4b6d18c45d574e054099539c3efd86dbb99a9337" alt=""
4. Add a new repository
data:image/s3,"s3://crabby-images/9c187/9c18754e9f37493d9c6a26d70998d41a47558e5a" alt=""
data:image/s3,"s3://crabby-images/827c8/827c8580590996d68fc5788b8ee49fb58f4dbfc3" alt=""
5. Return to Composer. Use refresh button to discover new repository (may take a few moments to show up).
data:image/s3,"s3://crabby-images/41a2c/41a2caf06971578b96a5049a790e01014f1f285c" alt=""
6. Create a new Command Line Tool
data:image/s3,"s3://crabby-images/e8edd/e8edd652e5e4f5fdd4a6433d87057987d9738f98" alt=""
data:image/s3,"s3://crabby-images/f854b/f854bb8dc1026834be3a3986f867395c72d961ec" alt=""
7. Set Docker image, base command, and input port for “sort” tool
The “Docker Repository” is the name:tag of a Docker image uploaded Arvados. (Use arv-keepdocker --pull debian:8
) You can also find prepackaged bioinformatics tools on various sites, such as http://dockstore.org and http://biocontainers.pro/ .
data:image/s3,"s3://crabby-images/78a85/78a8543a5caf175e16921a28822e7a0317bf18b2" alt=""
8. Redirect stdout to a file
data:image/s3,"s3://crabby-images/af4ad/af4ade84ebd48805f4ed5a645080c15026cc3eb9" alt=""
9. Capture output file
data:image/s3,"s3://crabby-images/514cd/514cdb28de296be59dcaf2b0ae5b869dd771b021" alt=""
10. Save Command Line Tool
data:image/s3,"s3://crabby-images/f148f/f148f8bd29a872149e3d243a44e5c277977e2b1d" alt=""
11. Repeat steps 6-10 for “uniq” tool
Create a new tool with a “base command” of “uniq”.
12. Switch back to “Home” tab and create workflow
data:image/s3,"s3://crabby-images/84e07/84e074fd4444a59af115fe66a51ad41a9bbaec26" alt=""
data:image/s3,"s3://crabby-images/e1c02/e1c0214311c859063f0ae629c395ca499dd917ce" alt=""
data:image/s3,"s3://crabby-images/b80b8/b80b85a8eaa659cbd4217bda3a394c7d701866ed" alt=""
13. Drag and drop tools into Workflow
data:image/s3,"s3://crabby-images/2a428/2a4280797abd9a6a6290f700a5af477c914c113f" alt=""
14. Drag from input port of “sort” to empty space to create workflow input
data:image/s3,"s3://crabby-images/3a9a8/3a9a84259642bdc049efc563caf583de7488a5e8" alt=""
15. Drag from output port of “sort” to input port of “uniq”
data:image/s3,"s3://crabby-images/150ce/150ce961323205e79dd085b91668970a04728e39" alt=""
16. Drag from output port of “uniq” to empty space to create workflow output
data:image/s3,"s3://crabby-images/0b7b7/0b7b7c5c4bc1650a51c0f0cdb2ff3d03054fadc4" alt=""
17. Save Workflow
data:image/s3,"s3://crabby-images/c88ad/c88ad8a5a421e18a14a02e9ecb94fe61946227b8" alt=""
18. Click on “Test” tab then click “Run”
data:image/s3,"s3://crabby-images/9f421/9f42125f742b6c575811e549e017b8096c93517d" alt=""
19. Choose input file
You may need to upload an input file
data:image/s3,"s3://crabby-images/bdff0/bdff0e2aa45b9ea319d0015ce09e089c6c581211" alt=""
20. Run the workflow
data:image/s3,"s3://crabby-images/2f181/2f181553eb08be6c1fb95fd9749b31e3582c28dc" alt=""
21. Monitor progress (may take several minutes)
data:image/s3,"s3://crabby-images/d8640/d8640a8897663a80ad4dcd46fe54fb3b9d929fdc" alt=""
22. Get workflow output
data:image/s3,"s3://crabby-images/81856/8185695ec27119eeb2862f8ffc4fcae850e858ff" alt=""
Previous: Running a workflow using Workbench
Next: Accessing an Arvados VM with Webshell