Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: s/Dataspace/Workspace/

Files in CSV, TSV (column delimited) XLS, JSON, and Parquet formats become datasets in Neebo. Technically, because all data in Neebo is virtualized, every data asset in Neebo is a "dataset" - a reference to another data asset. A dataset is Dataspace Workspace specific - it can only exist inside one DataspaceWorkspace.

However, a central principle in Neebo is that when data is first added, to either the tool or to a DataspaceWorkspace, it is a non-editable data source. A data sources is a specific type of dataset that represents the external data source (like a table in a database) and can be referenced from multiple DataspacesWorkspaces. Data source icons differ according to their connection type (S3, PostgreSQL, etc.).

Only when you create a reference to the data source does it become editable, as a dataset. Dataset icons are the same regardless of their data type. Any Neebo dataset can in turn be used as a data source in another DataspaceWorkspace, in which case a new copy is created that does not reference or reflect the content of any other datasets.

Once a dataset is added to a Dataspace Workspace it can be viewed and interacted with inside a Dataspacea Workspace's Workbench.

Creating datasets

...

In the example shown below, "Chromosome X1" and "Blue Eyes" are both data sources that have been added to the DataspaceWorkspace. "Chromosome X1" happens to be a new data connection added to Neebo, whereas "Blue Eyes" is a dataset that already existed in Neebo. When the "Blue Eyes" dataset was added to the DataspaceWorkspace, it became a data source reference there. Regardless of the origin, as data sources they are functionally the same.

...

The Add Assets and Connect pages describe the various ways to add data to Neebo. See the DataspacesWorkspaces page for details concerning adding an asset to a DataspaceWorkspace.

Deleting and removing datasets

When a dataset is removed from a Dataspace Workspace it still exists in Neebo and can be be added to Dataspaces Workspaces in the future. Only the owner or a collaborator can remove or delete a dataset. To remove a dataset from a DataspaceWorkspace, select the dataset in the Workbench Flow area and choose "Remove" from its context menu.  

When a dataset is deleted, it no longer exists in Neebo. From the dataset's details page, use the rightmost  button and choose "Delete." If the dataset has no downstream dependencies, you will be prompted to confirm the action. If the dataset is referenced in one or more other DataspacesWorkspaces, it must be removed from those Dataspaces Workspaces in order to be deleted. Note that a specific dataset reference is being removed from a DataspaceWorkspace. A data source cannot be deleted.

...

The breadcrumb path in the header shows the Dataspace Workspace name, then (separated with a carat >) the dataset name, so you can identify the specific dataset you are viewing.

The Details page also provides a set of buttons that allow you to add a dataset to one or more other DataspacesWorkspaces, open the dataset in the Workbench for the current DataspaceWorkspace, or to delete the dataset .

...

Tags are a searchable attribute intended for categorization and finding, that can be applied to datasets and DataspacesWorkspaces. They can be up to 80 characters (spaces are not allowed). Clicking on a Dataspace Workspace tag starts a search for that tag.

...

Workspace

Shows the Dataspace Workspace to which the selected dataset belongs. 

...