Each workspace can store both structured and unstructured data. It can also store standard files such as text files and data analysis scripts.
We generally categorise these assets in one of two ways:
- “Data” is considered to be data in a tabular form, (e.g. a CSV file) or other structured form (e.g. an SPSS file). It can also be semi-structured data (e.g. a head MRI scan or a genome sequence).
- “Files” are considered to be physical files that you would not normally perform data analysis on (e.g. scripts that you can run, or study protocol documents).
See the summary table in the guidance and best practice for uploading article for more examples and comprehensive guidance on how the workspace is set up to store these different types of data and files.
Where does my data belong?
The workspace allows you to store data as files on the file system, or as tables within the database. How you upload data and the destination you choose within the workspace affects how much data you can upload, and what happens to it before you see it in the workspace.
If you have tabular data, e.g. a CSV file, you need to decide whether you would like it to appear as a file on the workspace file system (the Datafiles folder), or be loaded into a database table (the Data folder).
Where do my files belong?
We provide two pre-defined folders in the workspace for efficiently organising the common types of assets you may want to upload. The Scripts folder is intended for storing data analysis scripts, such as R or SQL scripts. The Documents folder is meant for holding PDF or Office files, such as analysis reports or study protocol documents.
If you have other kinds of files, such as images or sensor data, we suggest that you store these in the Datafiles folder. This folder is specifically configured so that uploading a large collection of files is efficient, due to the fact that there is no pre-processing of these files as they are uploaded through the SFTP service. It also provides a dedicated folder for pointing your data analysis tool of choice to, e.g. a DICOM viewer. Please note that files stored in the Datafiles folder are not currently displayed in the web interface, but are accessible from virtual desktops and the R console.
Users can create folders within the Files section of the workspace. To do so, navigate to the 'Add' menu and select 'New folder'. You should then name the folder and choose whether to create it in the Scripts or Documents section of your workspace. Alternatively you can choose to create the new folder in the Current folder, which is the folder that you have navigated to in the Files tab.
Select ‘Create new folder’ to create the folder.