Developing CWL Workflows with VSCode
These lessons give step by step instructions for using Visual Studio
Code (abbreviated “vscode”) to develop CWL workflows on Arvados.
- Set up SSH
- Install vscode and necessary extensions, then use vscode to connect to an Arvados shell node for development
- Register a workflow, run it on workbench, and view the log
- Upload input, run a workflow on it, and view the output
- Register a workflow with default inputs
- Run a workflow without registering it
1. SSH Setup
- (Windows only) Install Git for Windows https://git-scm.com/download/win
- Choose “64-bit Git for Windows Setup”. It does not require admin privileges to install.
- Hit “Next” a bunch of times to accept the defaults
- The most important things is that “install git bash” and “install OpenSSH” are enabled (this is the default).
- At the end of the installation, you can launch tick a box to git bash directly.
- Open “Git Bash” (installed in the “Git” folder of the start menu)
- (All operating systems) Starting from bash shell (on MacOS or Linux you will open “Terminal”)
- Shell: Run
ssh-keygen
- Hit enter to save to a default location
- You can choose to protect the key with a password, or just hit enter for no password.
- Shell: Look for a message like
Your public key has been saved
in /c/Users/MyUsername/.ssh/id_rsa.pub
(Windows git bash
example, on MacOS or Linux this will probably start with /Users
or /home
)
- Shell: Run
cat /c/Users/MyUsername/.ssh/id_rsa.pub
- Shell: Use the pointer to highlight and copy the lines starting
with
ssh-rsa …
up to the next blank line. Right click and
select “Copy”
- Open Arvados workbench 2. If necessary, go to the user menu and
select “Go to Workbench 2”
- Workbench: Go to
SSH keys
in the user menu
- Workbench:Click
+Add new ssh key
- Workbench: Paste the key into
Public key
and enter something for name
- Workbench: Go to
Virtual Machines
in the user menu
- Workbench: Highlight and copy the value in in the
Command line
column.
- At the git bash command line
- Shell: paste the
ssh shell…
command line you got from workbench.
- Shell: type “yes” if it asks
Are you sure you want to continue connecting
.
- Note: it can take up to two minutes for the SSH key to be copied to
the shell node. If you get “Permission denied” the first time, wait 60
seconds and try again.
- Shell: You should now be logged into the Arvados shell node.
- Shell: Log out by typing
exit
2. VSCode setup
- Install Visual Studio Code and start it up
- Vscode: On the left sidebar, select
Extensions
- In
Search Extensions in Marketplace
enter “remote development”.
- Choose and install the “Remote Development” extension pack from Microsoft
- Vscode: On the left sidebar, choose
Remote Explorer
- At the top of the Remote Explorer panel choose
SSH targets
- Click
Add New
- Enter the
ssh shell…
command line you used in the previous section, step 1.4.1
- If it asks you
Select SSH configuration file to update
choose the first one in the list.
- Right click the newly added ssh target in the list and select “connect to host in current window`
- If it asks
Select platform of the remote host
select Linux
.
- Vscode: On the left sidebar, go back to
Extensions
- Search for “benten”, then look for
CWL (Rabix/Benten)
and click Install
- On the information page for
CWL (Rabix/Benten)
- If you see a warning
Install the extension on 'SSH: ...' to enable
then click the button Install in SSH: ...
- You should now see a message
Extension is enabled on 'SSH: ...' and disabled locally.
- Vscode: On the left sidebar, choose
Explorer
- Select
Clone Repository
and enter https://github.com/arvados/arvados-vscode-cwl-training, then click Open
- If asked
Would you like to open the cloned repository?
choose Open
- Go to Arvados Workbench
- Workbench: In the user menu, select
Current token
- Workbench: Click on
Copy to Clipboard
.
- Workbench: You should see a notification
Token copied to clipboard
.
- Go to Vscode
- Vscode: Click on the
Terminal
menu
- Vscode: Click
Run Task…
- Vscode: Select
Set Arvados Host
- Vscode: Paste the value of API Host from the Workbench
Get API Token
dialog (found in the User menu) at the prompt
- Vscode: Next, run task
Set Arvados Token
- Vscode: Paste the value of API Token from the Workbench
Get API Token
dialog
- Vscode: These will create files called
API_HOST
and API_TOKEN
3. Register & run a workflow
- Vscode: Click on the
lesson1/main.cwl
file
- Click on the
Terminal
menu
- Click
Run Task…
- Select
Register or update CWL workflow on Arvados Workbench
- This will create a file called
WORKFLOW_UUID
- Workbench: Go to
+NEW
and select New project
- Enter a name for the project like “Lesson 1”
- You should arrive at the panel for the new project
- Workbench: With
Lesson 1
selected
- Click on
+NEW
and select Run a process
- Select
CWL training lesson 1
from the list and click Next
- Enter a name for this run like
First training run
- Enter a message (under
#main/message
) like “Hello world”
- Click
Run process
- This should take you to a panel showing the workflow run status
- Workbench: workflow run status panel
- Wait for the badge in the upper right to say
Completed
- In the lower panel, double click on the
echo
workflow step
- This will take you to the status panel for the
echo
step
- Click on the three vertical dots in the top-right corner next to
Completed
- Choose
Log
- This will take you to the log viewer panel
- Under
Event Type
choose stdout
- You should see your message
- Vscode: Click on the
lesson2/main.cwl
file
- Click on the
Terminal
menu
- Click
Run Task…
- Select
Register or update CWL workflow on Arvados Workbench
- Go to your desktop
- Right click on the desktop, select
New > Text Document
- Name the file
message
- Enter a message like “Hello earth” and save
- Workbench: Go to
+NEW
and select New project
- Enter a name for the project like “Lesson 2”
- You should arrive at the panel for the new project
- Arvados workbench: With
Lesson 2
project selected
- Click on +NEW and select
New collection
- For Collection Name enter “my message”
- Drag and drop
message.txt
into the browser
- Click
Create a collection
- The file should be uploaded and then you will be on the collection page
- Workbench: Select the
Lesson 2
project
- Click on
+NEW
and select Run a process
- Select
CWL training lesson 2
from the list and click Next
- Enter a name for this run like “Second training run”
- Click on
#main/message
- A selection dialog box will appear
- Navigate to the collection you created in step (4.4.4) and choose
message.txt
- Click
Run process
- This should take you to a panel showing the workflow run status
- Workbench: workflow run status panel
- Wait for the process to complete
- Click on the dot menu
- Choose
Outputs
- Right click on
reverse.txt
- Click on
Open in new tab
- The results should be visible in a new browser tab.
The default value for the message
parameter will taken from the lesson3/defaults.yaml
file
- Vscode: Click on the
lesson3/main.cwl
file
- Click on the
Terminal
menu
- Click
Run Task…
- Select
Register or update CWL workflow on Arvados Workbench
- Workbench: Go to
+NEW
and select New project
- Enter a name for the project like “Lesson 3”
- You should arrive at the panel for the new project
- Workbench: With
Lesson 3
selected
- Click on
+NEW
and select Run a process
- Select
CWL training lesson 3
from the list and click Next
- Enter a name for this run like “Third training run”
- The
#main/message
parameter will be pre-filled with your default value. You can choose to change it or use the default.
- Click
Run process
- This should take you to the status page for this workflow
- The greeting will appear in the
Log
of the echo
task, which
can be found the same way as described earlier in section 3.
6. Run a workflow without registering it
The message
parameter will be taken from the file lesson4/main-input.yaml
. This is useful during development.
- Workbench: Go to
+NEW
and select New project
- Enter a name for the project like “Lesson 4”
- You should arrive at the panel for the new project
- Click on
Additional info
in the upper right to expand the info
panel
- Under
Project UUID
click the Copy to clipboard
button
- Vscode: Select the file
lesson4/main.cwl
- Click on the
Terminal
menu
- Click
Run Task…
- Select
Set Arvados project UUID
- Paste the project UUID from workbench at the prompt
- Vscode: Select the file
lesson4/main.cwl
- Click on the
Terminal
menu
- Click
Run Task…
- Select
Run CWL workflow on Arvados
- Vscode: In the bottom panel select the
Terminal
tab
- In the upper right corner of the Terminal tab select
Task - Run CWL Workflow
from the drop-down
- Look for logging text like
submitted container_request zzzzz-xvhdp-0123456789abcde
- Highlight and copy the workflow identifier (this the string containing
-xvhdp-
in the middle)
- The results of this run will appear in the terminal when the run completes.
- Workbench: Paste the workflow identifier into the search box
- This will take you to the status page for this workflow
Notes
If you need to change something about the environment of the user on
the remote host (for example, the user has been added to a new unix
group) you need to restart the vscode server that runs on the remote
host. Do this in vscode:
ctrl+shift+p: Remote-SSH: Kill VS Code Server on Host
This is because the vscode server remains running on the remote host
even after you disconnect, so exiting/restarting vscode on the desktop
has no effect.
Previous: Using storage classes
Next: Starting a Workflow at the Command Line