There are two main classes related to datasets handling in Dataiku’s Python APIs:

  • dataiku.core.dataset.Dataset in the dataiku package, which deals primarily with reading and writing data. It has the most flexibility when it comes to reading and writing

  • dataikuapi.dss.dataset.DSSDataset in the dataikuapi package which is mostly used for creating datasets, managing their settings, building flows, creating ML models, and performing a wider range of operations on datasets.

For more details on the two packages, please see Getting started

For starting code samples, please see Python Recipes.

Detailed samples about interacting with datasets can be found in:

Reference documentation for the classes supporting interaction with datasets can be found in Datasets