Redivis Documentation
API DocumentationRedivis Home
  • Introduction
  • Redivis for open science
    • FAIR data practices
    • Open access
    • Data repository characteristics
    • Data retention policy
    • Citations
  • Guides
    • Getting started
    • Discover & access data
      • Discover datasets
      • Apply to access restricted data
      • Create a study
    • Analyze data in a workflow
      • Reshape data in transforms
      • Work with data in notebooks
      • Running ML workloads
      • Example workflows
        • Analyzing large tabular data
        • Create an image classification model
        • Fine tuning a Large Language Model (LLM)
        • No-code visualization
        • Continuous enrollment
        • Select first/last encounter
    • Export & publish your work
      • Export to other environments
      • Build your own site with Observable
    • Create & manage datasets
      • Create and populate a dataset
      • Upload tabular data as tables
      • Upload unstructured data as files
      • Cleaning tabular data
    • Administer an organization
      • Configure access systems
      • Grant access to data
      • Generate a report
      • Example tasks
        • Emailing subsets of members
    • Video guides
  • Reference
    • Your account
      • Creating an account
      • Managing logins
      • Single Sign-On (SSO)
      • Workspace
      • Studies
      • Compute credits and billing
    • Datasets
      • Documentation
      • Tables
      • Variables
      • Files
      • Creating & editing datasets
      • Uploading data
        • Tabular data
        • Geospatial data
        • Unstructured data
        • Metadata
        • Data sources
        • Programmatic uploads
      • Version control
      • Sampling
      • Exporting data
        • Download
        • Programmatic
        • Google Data Studio
        • Google Cloud Storage
        • Google BigQuery
        • Embedding tables
    • Workflows
      • Workflow concepts
      • Documentation
      • Data sources
      • Tables
      • Transforms
        • Transform concepts
        • Step: Aggregate
        • Step: Create variables
        • Step: Filter
        • Step: Join
        • Step: Limit
        • Step: Stack
        • Step: Order
        • Step: Pivot
        • Step: Rename
        • Step: Retype
        • Step: SQL query
        • Variable selection
        • Value lists
        • Optimization and errors
        • Variable creation methods
          • Common elements
          • Aggregate
          • Case (if/else)
          • Date
          • DateTime
          • Geography
          • JSON
          • Math
          • Navigation
          • Numbering
          • Other
          • Statistical
          • String
          • Time
      • Notebooks
        • Notebook concepts
        • Compute resources
        • Python notebooks
        • R notebooks
        • Stata notebooks
        • SAS notebooks
        • Using the Jupyter interface
      • Access and privacy
    • Data access
      • Access levels
      • Configuring access
      • Requesting access
      • Approving access
      • Usage rules
      • Data access in workflows
    • Organizations
      • Administrator panel
      • Members
      • Studies
      • Workflows
      • Datasets
      • Permission groups
      • Requirements
      • Reports
      • Logs
      • Billing
      • Settings and branding
        • Account
        • Public profile
        • Membership
        • Export environments
        • Advanced: DOI configuration
        • Advanced: Stata & SAS setup
        • Advanced: Data storage locations
        • Advanced: Data egress configuration
    • Institutions
      • Administrator panel
      • Organizations
      • Members
      • Datasets
      • Reports
      • Settings and branding
    • Quotas and limits
    • Glossary
  • Additional Resources
    • Events and press
    • API documentation
    • Redivis Labs
    • Office hours
    • Contact us
    • More information
      • Product updates
      • Roadmap
      • System status
      • Security
      • Feature requests
      • Report a bug
Powered by GitBook
On this page
  • Overview
  • Table characteristics
  • Viewing variables
  • Viewing cells
  • Querying tables

Was this helpful?

Export as PDF
  1. Reference
  2. Datasets

Tables

Last updated 26 days ago

Was this helpful?

Overview

A table is where tabular data is stored on Redivis. A table can be thought of as a grid made up of variables (columns) and rows (records). You'll see tables:

  • On the Tables tab of datasets

  • On in a workflow

Every table has a table viewer where you can explore the data, given you have appropriate . There are three tabs where you can view the data and interact with it in different ways:

  • Variables view (requires )

  • Cells view (requires )

  • Query view (requires )

You can also learn more about the table and take further action

  • View this table's (if applicable)

  • View this table's Bibliography

  • this table (requires )

Table characteristics

Field

Notes

Name

The table's name. If in a dataset, must be unique across all tables for that version of the dataset. If in a workflow, must be unique across all tables currently in the workflow.

Description

Optional. A free-form description of the table's contents. May not exceed 5000 characters.

Bibliography

This table's citation, and any recorded related identifiers.

Variable count

Total number of variables in the table.

Row count

Total number of rows, or records, in the table.

Size

Total size of the table, in bytes.

Entity

Optional. Documents the concept that one record in this table represents. For example, the table's entity might represent a unique patient, or a specific hospitalization, or a prescription.

Temporal range

Optional. The range of time that this table covers. This can either be set manually, or calculated from the min/max of a particular variable.

Sample

If this table is sampled, you will see a marker for whether you are looking at the full dataset or the 1% sample. To interact with sampled tables, add the dataset to a workflow.

Viewing variables

Viewing cells

Clicking any the header or cells in any column will display the variable's summary statistics. You can also right click a cell to filter by a specific value, or sort on that variable.

For any Geography type variables, you can hover on the cell or click to see a preview.

This behavior allows for the performance and scaling characteristics of tabular data on Redivis, and is typical for relational databases and other tabular data stores.

Querying tables

Choose one of the templates or type your own SQL query, and click the Run button to see the results. You can select any column in the query results to view the summary statistics for that column.

If you update the query after execution, it will become "stale" (marked in yellow), which is a reminder that the results no longer match the query you are typing. The previous results will still be shown to help inform the new query you are writing.

Accessibility note for keyboard navigation: When you open the query tab, your cursor will be focused on the query editor. To navigate away using the keyboard, first press Escape, then press Tab to focus with other non-editor elements.

If calculated from a variable, that variable must have date, dateTime, or integer. If the variable is an integer, its values will be assumed to represent a year and must be in the range [0, 9999].

The Variables tab provides a searchable list of the table's variables. In order to view this tab, you will need to the corresponding table.

Clicking any row will display the variable's statistics panel, which shows some common calculated from the variable's data.

The Cells tab allows you to quickly preview and explore the table's data as a grid, regardless of the table's size. In order to view this tab, you will need to the corresponding table.

The order of records within a table is arbitrary and non-deterministic, unless the table is the result of a transform or query that specifies an .

If the order of records is considered to be part of the "data" in your table, you should create an additional "record number" variable that can be sorted on. Alternatively, you can upload your source files as , in which case an exact copy of the original file will be stored.

Workflow provide the most robust and reproducible mechanism for querying tables on Redivis. However, you can execute SQL queries directly on a table for when you want to perform a quick, one-off analysis. In order to perform a query, you will need to the corresponding table.

The Query tab on a table contains a SQL editor with scaffolding to reference that table. The query syntax follows the same rules as the . Additionally query results may not exceed 10GB – if you need to materialize larger results, use a transform for your query.

order clause
unstructured data
table nodes
access
Dataset
Export
transforms
Embedded view of a table with 2.7B records. Click around to interact!
metadata access
data access
data access
data access
metadata access
data access
data access
transform SQL query syntax
Query the table to filter for a particular value, or explore variable correlations via a cross-tab
summary statistics
type