The Arvados core API server consists of four services: PostgreSQL, Arvados Rails API, Arvados Controller, and Nginx.
Here is a simplified diagram showing the relationship between the core services. Client requests arrive at the public-facing Nginx reverse proxy. The request is forwarded to Arvados controller. The controller is able handle some requests itself, the rest are forwarded to the Arvados Rails API. The Rails API server implements the majority of business logic, communicating with the PostgreSQL database to fetch data and make transactional updates. All services are stateless, except the PostgreSQL database. This guide assumes all of these services will be installed on the same node, but it is possible to install these services across multiple nodes.
# su postgres
postgres$ tr -dc 0-9a-zA-Z </dev/urandom | head -c25; echo
yourgeneratedpassword
Record this. You’ll need it when you set up the Rails server later.postgres$ createuser --encrypted --no-createrole --no-superuser --pwprompt arvados
Enter password for new role: yourgeneratedpassword
Enter it again: yourgeneratedpassword
postgres$ createdb arvados_production -T template0 -E UTF8 -O arvados
postgres$ psql arvados_production -c "CREATE EXTENSION IF NOT EXISTS pg_trgm"
postgres$ exit
Starting from an empty config.yml file, add the following configuration keys.
SystemRootToken: "$system_root_token"
ManagementToken: "$management_token"
Collections:
BlobSigningKey: "$blob_signing_key"
These secret tokens are used to authenticate messages between Arvados components.
SystemRootToken
is used by Arvados system services to authenticate as the system (root) user when communicating with the API server.ManagementToken
is used to authenticate access to system metrics.Collections.BlobSigningKey
is used to control access to Keep blocks.Each token should be a string of at least 50 alphanumeric characters. You can generate a suitable token with the following command:
~$ tr -dc 0-9a-zA-Z </dev/urandom | head -c50 ; echo
PostgreSQL:
Connection:
host: localhost
user: arvados
password: $postgres_password
dbname: arvados_production
Replace the $postgres_password
placeholder with the password you generated during database setup.
Services:
Controller:
ExternalURL: "https://ClusterID.example.com"
InternalURLs:
"http://localhost:8003": {}
RailsAPI:
# Does not have an ExternalURL
InternalURLs:
"http://localhost:8004": {}
Replace ClusterID.example.com
with the hostname that you previously selected for the API server.
The Services
section of the configuration helps Arvados components contact one another (service discovery). Each service has one or more InternalURLs
and an ExternalURL
. The InternalURLs
describe where the service runs, and how the Nginx reverse proxy will connect to it. The ExternalURL
is how external clients contact the service.
Use a text editor to create a new file /etc/nginx/conf.d/arvados-controller.conf
with the following configuration. Options that need attention are marked in red.
proxy_http_version 1.1;
# When Keep clients request a list of Keep services from the API
# server, use the origin IP address to determine if the request came
# from the internal subnet or it is an external client. This sets the
# $external_client variable which in turn is used to set the
# X-External-Client header.
#
# The API server uses this header to choose whether to respond to a
# "available keep services" request with either a list of internal keep
# servers (0) or with the keepproxy (1).
#
# Following the example here, update the 10.20.30.0/24 netmask
# to match your private subnet.
# Update 1.2.3.4 and add lines as necessary with the public IP
# address of all servers that can also access the private network to
# ensure they are not considered 'external'.
geo $external_client {
default 1;
127.0.0.0/24 0;
10.20.30.0/24 0;
1.2.3.4/32 0;
}
# This is the port where nginx expects to contact arvados-controller.
upstream controller {
server localhost:8003 fail_timeout=10s;
}
server {
# This configures the public https port that clients will actually connect to,
# the request is reverse proxied to the upstream 'controller'
listen 443 ssl;
server_name ClusterID.example.com;
ssl_certificate /YOUR/PATH/TO/cert.pem;
ssl_certificate_key /YOUR/PATH/TO/cert.key;
# Refer to the comment about this setting in the passenger (arvados
# api server) section of your Nginx configuration.
client_max_body_size 128m;
location / {
proxy_pass http://controller;
proxy_redirect off;
proxy_connect_timeout 90s;
proxy_read_timeout 300s;
proxy_max_temp_file_size 0;
proxy_request_buffering off;
proxy_buffering off;
proxy_http_version 1.1;
proxy_set_header Host $http_host;
proxy_set_header Upgrade $http_upgrade;
proxy_set_header Connection "upgrade";
proxy_set_header X-External-Client $external_client;
proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
proxy_set_header X-Forwarded-Proto https;
proxy_set_header X-Real-IP $remote_addr;
}
}
# dnf install arvados-api-server arvados-controller
# apt install arvados-api-server arvados-controller
By default, the Rails API server is configured to listen on localhost:8004
, matching the example cluster configuration above. If you need to change this, edit the arvados-railsapi.service
definition to redefine the PASSENGER_ADDRESS
and PASSENGER_PORT
environment variables, like this:
# systemctl edit arvados-railsapi.service
### Editing /etc/systemd/system/arvados-railsapi.service.d/override.conf
### Anything between here and the comment below will become the new contents of the file
[Service]
Environment=PASSENGER_ADDRESS=0.0.0.0
Environment=PASSENGER_PORT=8040
### Lines below this comment will be discarded
[...]
You can similarly define other Passenger settings if desired. The Passenger Standalone reference documents all the available settings.
# systemctl enable --now arvados-railsapi arvados-controller
# systemctl status arvados-railsapi arvados-controller
[...]
If systemctl status
indicates it is not running, use journalctl
to check logs for errors:
# journalctl --since -5min -u arvados-railsapi -u arvados-controller
We recommend using the Cluster diagnostics tool. The first few tests (10, 20, 30) will succeed if you have a working API server and controller. Of course, tests for services that you have not yet installed and configured will fail.
Here are some other checks you can perform manually.
$ curl https://ClusterID.example.com/arvados/v1/config
$ curl https://ClusterID.example.com/discovery/v1/apis/arvados/v1/rest
$ curl -H "Authorization: Bearer $system_root_token" https://ClusterID.example.com/arvados/v1/users/current
If you are getting TLS errors, make sure the ssl_certificate
directive in your nginx configuration has the full certificate chain.
Logs can be found in /var/www/arvados-api/current/log/production.log
and using journalctl -u arvados-controller
. See also the admin page on Logging.
The content of this documentation is licensed under the
Creative
Commons Attribution-Share Alike 3.0 United States licence.
Code samples in this documentation are licensed under the
Apache License, Version 2.0.