# Data

Datasets can contain data in one of two formats, available on the corresponding tab of the dataset page.&#x20;

* **Tables** contain tabular data which can be quickly viewed and analyzed on Redivis using transforms and notebooks in a workflow.
* **Files** can be any type of data with a basic inspection tool and available for analysis in notebooks.

## Tables

All datasets that contain data will have at least one table, displayed on the **Tables** tab of the dataset page. Tables are created by data editors when they [upload tabular data](https://docs.redivis.com/reference/datasets/create-and-edit-datasets/import-sources), and users' access to these tables will be governed by the dataset's [access configuration](https://docs.redivis.com/reference/data-access/configuring-access).&#x20;

{% embed url="<https://redivis.com/datasets/7br5-41440fjzk/tables/6fff-7hqb797cb>" %}
Embedded view of a table with 2.7B records. Click around to interact!
{% endembed %}

#### Working with tables

Tables can be explored from the dataset page, including the ability to view [variables and summary statistics](https://docs.redivis.com/reference/tables/variables), [cell contents](https://docs.redivis.com/reference/tables/cells), and run [one-off queries](https://docs.redivis.com/reference/tables/queries) against the table.

These tables are then further utilized from within a [workflow](https://docs.redivis.com/reference/workflows).

Learn more about [tables](https://docs.redivis.com/reference/tables) on Redivis.&#x20;

*For editors: How to import* [*tabular*](https://docs.redivis.com/reference/datasets/create-and-edit-datasets/import-tabular-data) *or* [*geospatial*](https://docs.redivis.com/reference/datasets/create-and-edit-datasets/import-geospatial-data) *data.*

## Non-tabular files

Files are data entities uploaded to [datasets](https://docs.redivis.com/reference/organizations/datasets) on Redivis. Files are used to store non-tabular (a.k.a. unstructured) data of any file type. You can view a dataset's files by clicking on the **Files** tab of any dataset or [dataset node](https://docs.redivis.com/reference/workflows/data-sources) in a workflow.

<figure><img src="https://1672950126-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-LVodLwUXgJUGcm5Cvso%2Fuploads%2FfZj23DrFSqUUV3idfBDn%2FScreenshot%202024-12-09%20at%207.09.15%E2%80%AFPM_out.png?alt=media&#x26;token=c9b44def-9c7b-4ba3-993a-d563a4aa945a" alt=""><figcaption></figcaption></figure>

#### Inspecting files

You can inspect an individual file by clicking on its name to launch the file viewer. Any file can be downloaded or referenced within a notebook, and many file types can be previewed directly within Redivis, including:

* 3D models&#x20;
* Audio files&#x20;
* CIF + PDB files (molecular + protein structures)&#x20;
* Documents + spreadsheets
* FITS files (common in astronomy)&#x20;
* DICOM
* Geospatial (shapefiles, geojson, etc)
* HDF5&#x20;
* HTML
* Images&#x20;
* NIFTI (multi-dimensional raster)
* PDFs&#x20;
* Videos&#x20;
* Text/code&#x20;
* TIFFs&#x20;
* ZIPs&#x20;
* TEI

&#x20;If you have a file type that you think should be supported that isn't, please [let us know](https://redivis.com/contact)!

<figure><img src="https://1672950126-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-LVodLwUXgJUGcm5Cvso%2Fuploads%2FprFGCeWZLshiA0wQdYZC%2FScreenshot%202023-10-11%20at%2011.37.46_out.png?alt=media&#x26;token=7f0324e9-9e84-471c-a002-6340d1658617" alt=""><figcaption><p>View files in Redivis</p></figcaption></figure>

{% hint style="info" %}
You can view [example files](https://redivis.com/datasets/yz1s-d09009dbb/files) on the Redivis Demo organization.
{% endhint %}

#### Folders and index tables

All files are assigned to a "folder" within the dataset. You can click on the folder name in the right grey bar to filter this dataset's files by the folder they are in.

Each folder has a corresponding index table that is present on the **Tables** tab of the dataset. These will match the folder name and have a **File index** label.&#x20;

In these index tables, each row of the table represents a file in that folder.

| Variable name | Description                                                                                                                       |
| ------------- | --------------------------------------------------------------------------------------------------------------------------------- |
| file\_id      | A unique, system generated identifier for the file. Use the file\_id to reference and download specific file(s) in your analysis. |
| file\_name    | The name of the file                                                                                                              |
| size          | Size of the file in bytes                                                                                                         |
| added\_at     | Timestamp for when the file was added to this folder                                                                              |
| md5\_hash     | MD5 checksum of the file contents, encoded as a base64 string                                                                     |

<figure><img src="https://1672950126-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2F-LVodLwUXgJUGcm5Cvso%2Fuploads%2Fg9dF34RrQWmHTkhha0SG%2FScreenshot%202023-10-11%20at%2011.54.55_out.png?alt=media&#x26;token=c9d5c733-e9f0-4a18-ae2e-2cd73e3e9cb7" alt=""><figcaption><p>Yes, Redivis supports cat images</p></figcaption></figure>

#### Working with files

By representing files within an index table, we can query and subset the files [within a workflow](https://docs.redivis.com/reference/workflows), while joining them with other tabular metadata. In this way, file index tables behave like any other tabular metadata on Redivis.

To do deeper analysis of files, we can [load these files in a notebook ](https://docs.redivis.com/reference/workflows/notebooks/notebook-concepts)for further analysis. Consult the documentation for your preferred programming language to learn more:

[Working with unstructured data files in Python](https://docs.redivis.com/workflows/notebooks/python-notebooks#working-with-non-tabular-files)

[Working with unstructured data files in R](https://docs.redivis.com/workflows/notebooks/r-notebooks#working-with-unstructured-data-files)

*For editors:* [*How to import files*](https://docs.redivis.com/reference/datasets/create-and-edit-datasets/import-files)
