Skip to main content

Creating Datasets

Creating an Account at OpenCSG Community

To create and upload datasets to StarHub, you will need to create an account at OpenCSG Community. If you do not have an account, click here to create.

Sign up page

If you already have an account at OpenCSG Community, please login and create a new dataset repository using the web interface.

Login page

Creating a Dataset Repository

To create a new dataset repository, login to OpenCSG Community, click on New Dataset in the top right corner.

Click create dataset

In the dataset repository creation page, fill in the following information, and then click the Create Dataset.

  • Specify the owner of the repository: this can be either you or any of the organizations you are affiliated with.
  • Enter your dataset name, alias and description.
  • Specify the license.
  • Creating public datasets is currently not supported. Please contact the administrator contact@opencsg.com for manual review if needed.

Create dataset repo

After creating your dataset repository, you should see a page like this:

Dataset repo file

The README.md file that the system generates for you can be accessed and edited online on the Files tab. Once you are finished, the README.md file will be automatically rendered as dataset card by the system and shown on the Summary tab.

Dataset card can help users better understand your dataset and make your dataset easier to retrieve and discover. We recommend that you create your dataset card according to the dataset card specification. See Dataset Card Specification for more details.