Files in CSV, TSV (column delimited), XLS, JSON, and Parquet formats become datasets in Neebo. Technically, because all data in Neebo is virtualized, every data asset in Neebo is a "dataset" - a reference to another data asset. A dataset is Workspace-specific: it can exist inside only one Workspace.

However, a central principle in Neebo is that when data is first added, either to the tool or to a Workspace, it is a non-editable data source. A data source is a specific type of dataset that represents the external data source (like a table in a database) and can be referenced from multiple Workspaces. Data source icons differ according to their connection type (S3, PostgreSQL, etc.).

Only when you create a reference to the data source does it become editable, as a dataset. Dataset icons are the same regardless of their data type. Any Neebo dataset can in turn be used as a data source in another Workspace, in which case a new copy is created that does not reference or reflect the content of any other datasets.

Once a dataset is added to a Workspace, it can be viewed and interacted with inside that Workspace's Workbench.

Creating datasets


In the example shown below, "Chromosome X1" and "Blue Eyes" are both data sources that have been added to the Workspace. "Chromosome X1" happens to be a new data connection added to Neebo, whereas "Blue Eyes" is a dataset that already existed in Neebo. When the "Blue Eyes" dataset was added to the Workspace, it became a data source reference there. Regardless of the origin, as data sources they are functionally the same.


The Add Assets and Connect pages describe the various ways to add data to Neebo. See the Workspaces page for details concerning adding an asset to a Workspace.

Deleting and removing datasets

When a dataset is removed from a Workspace, it still exists in Neebo and can be added to Workspaces in the future. Only the owner or a collaborator can remove or delete a dataset. To remove a dataset from a Workspace, select the dataset in the Workbench Flow area and choose "Remove" from its context menu.

When a dataset is deleted, it no longer exists in Neebo. From the dataset's details page, use the rightmost button and choose "Delete." If the dataset has no downstream dependencies, you will be prompted to confirm the action. If the dataset is referenced in one or more other Workspaces, it must be removed from those Workspaces before it can be deleted. Note that removing a dataset takes only that specific dataset reference out of a Workspace. A data source cannot be deleted.
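The deletion rules above can be summarized as a small decision function. This is only an illustrative sketch in Python, not part of Neebo or any Neebo API: the `Dataset` class, its `is_data_source` and `referenced_in` fields, and the `can_delete` function are all hypothetical names introduced here to model the rules as described.

```python
from dataclasses import dataclass, field


@dataclass
class Dataset:
    """Hypothetical model of a Neebo dataset, for illustration only."""
    name: str
    # True when the asset is a non-editable data source (assumed flag).
    is_data_source: bool = False
    # Names of other Workspaces that still reference this dataset (assumed field).
    referenced_in: set[str] = field(default_factory=set)


def can_delete(dataset: Dataset) -> tuple[bool, str]:
    """Return whether deletion may proceed, plus a short explanation."""
    # Rule: a data source cannot be deleted.
    if dataset.is_data_source:
        return False, "a data source cannot be deleted"
    # Rule: references in other Workspaces must be removed first.
    if dataset.referenced_in:
        names = ", ".join(sorted(dataset.referenced_in))
        return False, f"remove it from these Workspaces first: {names}"
    # Rule: with no remaining dependencies, the user is asked to confirm.
    return True, "no remaining references; confirm to delete"
```

For example, under these assumptions `can_delete(Dataset("Blue Eyes", referenced_in={"Lab B"}))` reports that the dataset must first be removed from "Lab B", where "Lab B" is a made-up Workspace name.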


The breadcrumb path in the header shows the Workspace name, then (separated by a > character) the dataset name, so you can identify the specific dataset you are viewing.

The Details page also provides a set of buttons that allow you to add a dataset to one or more other Workspaces, open the dataset in the Workbench for the current Workspace, or delete the dataset.


Tags are searchable attributes, intended for categorizing and finding assets, that can be applied to datasets and Workspaces. A tag can be up to 80 characters long and cannot contain spaces. Clicking a Workspace tag starts a search for that tag.
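If you prepare tags outside Neebo (for example, in a spreadsheet or script), it can help to check them against the limits above before entering them. The following is a minimal Python sketch, not a Neebo function: `is_valid_tag` and `MAX_TAG_LENGTH` are hypothetical names, and treating an empty tag as invalid is an assumption that the text above does not state.

```python
MAX_TAG_LENGTH = 80  # limit stated above


def is_valid_tag(tag: str) -> bool:
    """Check a candidate tag against the documented limits."""
    # Assumption: an empty tag is not useful, so it is rejected here.
    if not tag:
        return False
    # Tags can be up to 80 characters.
    if len(tag) > MAX_TAG_LENGTH:
        return False
    # Spaces are not allowed.
    return " " not in tag
```

Under these assumptions, `is_valid_tag("blue-eyes")` passes while `is_valid_tag("blue eyes")` does not, because of the space.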



Shows the Workspace to which the selected dataset belongs.