Overview

Overview

Datasets are the core component of Redivis. Datasets contain various metadata, documentation, and a versioned history of their tabular data.

All datasets have their own page and are uploaded by either a user or an organization. Datasets can be added to projects to merge with other datasets across Redivis.

Some components of a dataset may not be available to you until you are granted access, while others are completely public for you to use.

You can create your own datasets to use in your projects and share with colleagues.

Overview tab

This tab contains all of the cataloging information for this dataset. On the left side is an overview of the statistics about the table generated by Redivis, and contextual information about how it has been managed and organized. In the main middle section, there is more in-depth information written up by the dataset's editor(s).

Field

Notes

Version

All datasets start with their version tag as v1.0 As data is modified over time, a dataset's version will be incremented. Clicking the version will allow you to see this dataset's version history.

Row count

Total number of records in the table.

Variable count

Total number of variables in the table.

Total size

Size (in bytes) of the most recent version of the table.

Entity

Documents the concept that one record in this dataset represents. For example, the dataset's entity might represent a unique patient, or a specific hospitalization, or a prescription.

Temporal range

The range of time the current version of the dataset covers (if applicable). This can either be set by the dataset editor manually, or calculated from the data itself. You can view the Release tab of the dataset's version history to further inspect how the temporal range was created.

Permission setting

For datasets that are uploaded by an organization, this will specify the dataset's permission group, or "public" if the dataset is not permissioned. For user-uploaded datasets this will specify either "public" or "private", depending on its access setting.

Project count

The total number of projects using this dataset.

Collections

Only available to organization-uploaded datasets. Shows all collections that this dataset is a part of.

Introduction

A short, editor-generated summary of the dataset. Limited to 256 characters.

Tags

Editor-assigned tags for this dataset to help users understand it and find similar datasets.

Documentation

Editors can use this space to include long-form information about the dataset. Embedded pictures, links, and headers are all supported. For organization-uploaded datasets, editors can set some parts of the documentation to only be visible to you if you're authorized for a certain access level.

Data tab

Clicking on the Data tab underneath the dataset title will allow you to view the table of the current version of this dataset.

If you have metadata access to the dataset, you will be able to view the list of variables, variable metadata, and summary statistics on individual variables.

If you have data access to the dataset, you will also be able to see a cells preview of the dataset and run custom SQL queries.

Applying for access

Your current access level to the dataset will be displayed in the box at the right of the dataset page. If you don't have data access to the dataset, you will be able to "Apply for access", opening the permissioning interface where you can view and submit any requirements necessary to gain access.

Adding to a project

If you have data access to a dataset, you will be able to add it to any of your projects (or create a new project with the dataset in it). Click on the "Add to project" at the right of the page to add this dataset to a project. Projects are a powerful tool that allow you to merge, query, and analyze datasets from across Redivis, all within a collaborative data pipeline environment.

Add a dataset to one of your projects, or create a new project with the dataset in it.

Downloading a dataset

Clicking on the "Download" button at the right of the page will open the dataset download interface. From here, you will be able to export metadata (requires metadata access) and download the raw data (requires data access). You will also be able to view API endpoints for this dataset to help you interact with it programmatically.

Note that some datasets may have additional export restrictions, preventing you from downloading the dataset even when you have data access. In this case, the download button will be displayed as "Download (limited)".

The data download interface