Database integration

The Scientific Data Warehouse (SDW) is the Data Platform in Nuvolos

Nuvolos is not just an online computer lab, it is also an online data platform. We offer strong data integration to support modern research and education use cases that rely on large amounts of data.

As a data platform, Nuvolos integrates strongly with both online (Nuvolos based) and offline (non-Nuvolos based) applications. Please review our access documentation to learn more. Nuvolos also offers the ability to define data pipelines, and ingest data from various sources. Please reach out to our support for more information.

In this documentation, we will regularly refer to the data platform service as the Scientific Data Warehouse (SDW). The Scientific Data Warehouse is built on the Snowflake service, you can refer to the Snowflake SQL documentation here.

Connection modes

Nuvolos users automatically get an account on the SDW once they have access to a space with tables enabled. Whenever you're interacting with the SDW, a connector first connects to the Snowflake service using some credentials. To follow industry best practices regarding data security, Nuvolos offers 2 connection modes for SDW accounts:

Service mode

In this mode, connection is done based on a username and an RSA key, and connecting to the SDW is only allowed from within Nuvolos. Nuvolos applications receive the RSA key automatically. Consult the internal application guides on how to work with the SDW.

While it is highly unlikely that such an RSA key gets compromised, it is possible to rotate the RSA key in the Table Access menu. You'll need to restart running Nuvolos applications to receive the rotated key.

User mode

In this mode, connecting to the SDW is possible from both within and outside Nuvolos. Connections are done using

  1. username and an RSA key from within Nuvolos. Nuvolos applications receive the RSA key automatically. Consult the internal application guides on how to work with the SDW.

  2. username and token from outside Nuvolos. After switching to User mode, please visit first https://alphacruncher.eu-central-1.snowflakecomputing.com/console and log in with your username and access token. On this site, you'll be asked to set up multi-factor authentication (MFA) using a smartphone. From then on, you can connect to the SDW using username and access token from any connector, however connection attempts need to be approved on the MFA-linked smartphone. Consult the external application guides on how to work with the SDW.

If you need to regularly interact with the SDW from outside Nuvolos, reach out to [email protected] about possible solutions.

Change the connection mode

Every user can change their SDW connection mode anytime in the Table Access menu. By default, new users are created with Service mode, but you can change back and forth between Service and User mode.

Start working with data

Nuvolos differentiates two types of data:

  1. Tabular data stored in some database management system

  2. Data stored in regular files

This page describes working with tabular data. For working with files, consult our guide to the file system.

This documentation distinguishes tabular and file based data.

Tabular data refers to data stored in the Scientific Data Warehouse (SDW), a SQL compliant cloud-based data warehouse.

File based data means data that is stored on a regular file system. This guide focuses on database-stored datasets.

Dataset spaces

Datasets are special kinds of spaces, with the main goal of containing only tabular data and documentation related to that tabular data and potentially other descriptor files. For obtaining a full list of the datasets available to you in your current organisation, navigate to your dashboard, and pick the datasets menu.

Viewing the list of available datasets

Distribute data you need

Suppose that your project is called 'Demo research project' and you need two tables from the 'Correlation Risk Proxy' dataset.

Using the distribute feature, you can set your research project up with the required data:

Distributing tables to a personal space

Work with data in your workspace

Once the distribution has completed, you will receive an e-mail to your account you registered with. If the distribution was successful, the data should be available in the instance you distributed to.

You will now be able to work with data in your workspace, without having to worry about backing up your data and causing inadvertent changes.

Please follow our detailed guides:

Last updated

Was this helpful?