Datasets are the core component of Redivis. Each dataset contains metadata, documentation, and a versioned history of its tabular data.
Some components of a dataset may not be available to you until you are granted access, while others are completely public for you to use.
You can create your own datasets to use in your projects and share with colleagues.
This tab contains all of the cataloging information for this dataset. On the left is an overview of statistics that Redivis generates about the table, along with contextual information about how the dataset has been managed and organized. The main middle section holds more in-depth information written by the dataset's editor(s).
All datasets start with their version tag as
Total number of records in the table.
Total number of variables in the table.
Size (in bytes) of the most recent version of the table.
Documents the concept that one record in this dataset represents. For example, the dataset's entity might represent a unique patient, or a specific hospitalization, or a prescription.
The range of time that the current version of the dataset covers (if applicable). This can be either set manually by the dataset editor or calculated from the data itself. You can view the Release tab of the dataset's version history to further inspect how the temporal range was created.
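To illustrate what "calculated from the data itself" could mean, here is a minimal sketch that takes the earliest and latest values of a date variable. It is not Redivis's actual method, which this page does not describe; the function name temporal_range and the sample admission dates are hypothetical.

```python
from datetime import date
from typing import Iterable, Optional, Tuple


def temporal_range(values: Iterable[Optional[date]]) -> Optional[Tuple[date, date]]:
    """Return (earliest, latest) over the non-missing dates, or None if there are none.

    Illustrative only: an assumed min/max rule, not Redivis's documented behavior.
    """
    present = [v for v in values if v is not None]
    if not present:
        return None
    return min(present), max(present)


# Hypothetical admission dates, one per record; None marks a missing value.
admission_dates = [date(2015, 3, 1), None, date(2012, 7, 15), date(2019, 11, 30)]
admission_range = temporal_range(admission_dates)
```

Missing values are skipped rather than treated as an error, so a variable with no dates at all yields no range, which matches the "if applicable" wording.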
For datasets that are uploaded by an organization, this will specify the dataset's permission group, or "public" if the dataset is not permissioned. For user-uploaded datasets this will specify either "public" or "private", depending on its access setting.
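The rule above can be sketched as a small function. This is only an illustration of the logic as described here; the function name access_label and its parameters (owner_type, permission_group, is_public) are hypothetical and are not part of any Redivis API.

```python
from typing import Optional


def access_label(
    owner_type: str,
    permission_group: Optional[str] = None,
    is_public: bool = True,
) -> str:
    """Return the access label shown for a dataset (hypothetical helper)."""
    if owner_type == "organization":
        # Organization datasets show their permission group, or "public" if they have none.
        return permission_group if permission_group else "public"
    if owner_type == "user":
        # User datasets are simply public or private.
        return "public" if is_public else "private"
    raise ValueError(f"unknown owner type: {owner_type!r}")


org_label = access_label("organization", permission_group="Restricted claims data")
user_label = access_label("user", is_public=False)
```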
The total number of projects using this dataset.
A short, editor-generated summary of the dataset. Limited to 256 characters.
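If you draft summaries outside of Redivis, a quick length check can catch text that is too long before you paste it in. The sketch below assumes the limit counts characters the way Python's len() does and that surrounding whitespace is not counted; both are assumptions, and validate_abstract is a hypothetical helper, not a Redivis function.

```python
MAX_ABSTRACT_LENGTH = 256  # limit stated above


def validate_abstract(text: str) -> str:
    """Return the trimmed summary, or raise ValueError if it exceeds the limit."""
    trimmed = text.strip()
    if len(trimmed) > MAX_ABSTRACT_LENGTH:
        raise ValueError(
            f"summary is {len(trimmed)} characters; the limit is {MAX_ABSTRACT_LENGTH}"
        )
    return trimmed


# Hypothetical summary text.
summary = validate_abstract("  Inpatient hospitalizations, one record per stay.  ")
```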
Editor-assigned tags for this dataset to help users understand it and find similar datasets.
Editors can use this space to include long-form information about the dataset. Embedded pictures, links, and headers are all supported. For organization-uploaded datasets, editors can restrict some parts of the documentation so that they are visible only to users authorized at a certain access level.
Clicking on the Data tab underneath the dataset title will allow you to view the table of the current version of this dataset.
If you have metadata access to the dataset, you will be able to view the list of variables, variable metadata, and summary statistics on individual variables.
Your current access level to the dataset will be displayed in the box at the right of the dataset page. If you don't have data access to the dataset, you will be able to "Apply for access", opening the permissioning interface where you can view and submit any requirements necessary to gain access.
If you have data access to a dataset, you will be able to add it to any of your projects (or create a new project with the dataset in it). Click on the "Add to project" button at the right of the page to add this dataset to a project. Projects are a powerful tool that allows you to merge, query, and analyze datasets from across Redivis, all within a collaborative data pipeline environment.
Clicking on the "Download" button at the right of the page will open the dataset download interface. From here, you will be able to export metadata (requires metadata access) and download the raw data (requires data access). You will also be able to view API endpoints for this dataset to help you interact with it programmatically.
Note that some datasets may have additional export restrictions, preventing you from downloading the dataset even when you have data access. In this case, the download button will be displayed as "Download (limited)".
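The access rules described above can be summarized in a short sketch. It models metadata access, data access, and export restrictions as independent flags, which is a simplifying assumption; the class DatasetAccess and the functions available_actions and download_button_label are hypothetical names used for illustration, not part of Redivis.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class DatasetAccess:
    """Hypothetical model of one user's access to one dataset."""
    has_metadata_access: bool
    has_data_access: bool
    has_export_restrictions: bool = False


def available_actions(access: DatasetAccess) -> List[str]:
    """List what the user can do, following the rules described above."""
    actions = []
    if access.has_metadata_access:
        actions.append("View variables, variable metadata, and summary statistics")
        actions.append("Export metadata")
    if access.has_data_access:
        actions.append("Add to a project")
        # Export restrictions can block raw downloads even with data access.
        if not access.has_export_restrictions:
            actions.append("Download raw data")
    else:
        actions.append("Apply for access")
    return actions


def download_button_label(access: DatasetAccess) -> str:
    """Return the label described above for the download button."""
    if access.has_data_access and access.has_export_restrictions:
        return "Download (limited)"
    return "Download"


restricted = DatasetAccess(has_metadata_access=True, has_data_access=True,
                           has_export_restrictions=True)
restricted_actions = available_actions(restricted)
restricted_label = download_button_label(restricted)
```

In this sketch, a user with data access to an export-restricted dataset can still add it to a project, but the raw download is left out and the button label changes.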