Tables
Last updated
Was this helpful?
Last updated
Was this helpful?
A table is where tabular data is stored on Redivis. A table can be thought of as a grid made up of variables (columns) and rows (records). You'll see tables:
On the Tables tab of datasets
On in a workflow
Every table has a table viewer where you can explore the data, given you have appropriate . There are three tabs where you can view the data and interact with it in different ways:
Variables view (requires )
Cells view (requires )
Query view (requires )
You can also learn more about the table and take further action
View this table's (if applicable)
View this table's Bibliography
this table (requires )
Field
Notes
Name
The table's name. If in a dataset, must be unique across all tables for that version of the dataset. If in a workflow, must be unique across all tables currently in the workflow.
Description
Optional. A free-form description of the table's contents. May not exceed 5000 characters.
Bibliography
This table's citation, and any recorded related identifiers.
Variable count
Total number of variables in the table.
Row count
Total number of rows, or records, in the table.
Size
Total size of the table, in bytes.
Entity
Optional. Documents the concept that one record in this table represents. For example, the table's entity might represent a unique patient, or a specific hospitalization, or a prescription.
Temporal range
Optional. The range of time that this table covers. This can either be set manually, or calculated from the min/max of a particular variable.
Sample
If this table is sampled, you will see a marker for whether you are looking at the full dataset or the 1% sample. To interact with sampled tables, add the dataset to a workflow.
Clicking any the header or cells in any column will display the variable's summary statistics. You can also right click a cell to filter by a specific value, or sort on that variable.
For any Geography type variables, you can hover on the cell or click to see a preview.
Choose one of the templates or type your own SQL query, and click the Run button to see the results. You can select any column in the query results to view the summary statistics for that column.
If you update the query after execution, it will become "stale" (marked in yellow), which is a reminder that the results no longer match the query you are typing. The previous results will still be shown to help inform the new query you are writing.
If calculated from a variable, that variable must have date
, dateTime
, or integer
. If the variable is an integer, its values will be assumed to represent a year and must be in the range [0, 9999]
.
The Variables tab provides a searchable list of the table's variables. In order to view this tab, you will need to the corresponding table.
Clicking any row will display the variable's statistics panel, which shows some common calculated from the variable's data.
The Cells tab allows you to quickly preview and explore the table's data as a grid, regardless of the table's size. In order to view this tab, you will need to the corresponding table.
The order of records within a table is arbitrary and non-deterministic, unless the table is the result of a transform or query that specifies an .
If the order of records is considered to be part of the "data" in your table, you should create an additional "record number" variable that can be sorted on. Alternatively, you can upload your source files as , in which case an exact copy of the original file will be stored.
Workflow provide the most robust and reproducible mechanism for querying tables on Redivis. However, you can execute SQL queries directly on a table for when you want to perform a quick, one-off analysis. In order to perform a query, you will need to the corresponding table.
The Query tab on a table contains a SQL editor with scaffolding to reference that table. The query syntax follows the same rules as the . Additionally query results may not exceed 10GB – if you need to materialize larger results, use a transform for your query.