Dataiku Developer Guide
You are viewing the developer guide for version 12 of DSS.

API for plugin datasets

Reference documentation

dataiku.connector.Connector(config[, ...])

The base interface for a Custom Python connector
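As a rough illustration of how this base interface is typically subclassed, the sketch below defines a hypothetical `CountingConnector` that emits a fixed number of rows. The class name, the `row_count` parameter, and the column names are invented for the example. The method names `get_read_schema` and `generate_rows`, and their arguments, follow the custom dataset template as best recalled and should be checked against the version of DSS in use. The fallback base class exists only so the sketch can run outside DSS, where the `dataiku` package is not importable.

```python
try:
    from dataiku.connector import Connector
except ImportError:
    # Stand-in used only outside DSS so this sketch can run.
    # It is NOT the real base class, just an assumption about its constructor.
    class Connector(object):
        def __init__(self, config, plugin_config=None):
            self.config = config
            self.plugin_config = plugin_config


class CountingConnector(Connector):
    """Hypothetical read-only connector that yields a fixed number of rows."""

    def __init__(self, config, plugin_config=None):
        Connector.__init__(self, config, plugin_config)
        # "row_count" is an illustrative parameter name, not a DSS setting.
        self.row_count = int(self.config.get("row_count", 3))

    def get_read_schema(self):
        # Declare the columns produced by generate_rows.
        return {
            "columns": [
                {"name": "id", "type": "int"},
                {"name": "label", "type": "string"},
            ]
        }

    def generate_rows(self, dataset_schema=None, dataset_partitioning=None,
                      partition_id=None, records_limit=-1):
        # Assumption: a negative records_limit means "no limit".
        for i in range(self.row_count):
            if records_limit >= 0 and i >= records_limit:
                break
            yield {"id": i, "label": "row-%d" % i}
```

Outside DSS, the connector can be exercised directly, for example with `list(CountingConnector({"row_count": 2}).generate_rows())`.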

dataiku.connector.CustomDatasetWriter()
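A writer is usually handed back by a connector that supports writing. The sketch below is a minimal, hedged example: `ListWriter` and `InMemoryConnector` are hypothetical names, and the `write_row`, `close`, and `get_writer` methods reflect the custom dataset template as best recalled rather than a confirmed signature list. The fallback classes are stand-ins that let the sketch run where the `dataiku` package is unavailable.

```python
try:
    from dataiku.connector import Connector, CustomDatasetWriter
except ImportError:
    # Stand-ins used only outside DSS; they are assumptions, not the real classes.
    class Connector(object):
        def __init__(self, config, plugin_config=None):
            self.config = config
            self.plugin_config = plugin_config

    class CustomDatasetWriter(object):
        def __init__(self):
            pass


class ListWriter(CustomDatasetWriter):
    """Hypothetical writer that appends each row to a Python list."""

    def __init__(self, sink):
        CustomDatasetWriter.__init__(self)
        self.sink = sink
        self.closed = False

    def write_row(self, row):
        # Store an immutable copy of the incoming row.
        self.sink.append(tuple(row))

    def close(self):
        # A real writer would flush buffers or commit here.
        self.closed = True


class InMemoryConnector(Connector):
    """Hypothetical connector that keeps written rows in memory."""

    def __init__(self, config, plugin_config=None):
        Connector.__init__(self, config, plugin_config)
        self.rows = []

    def get_writer(self, dataset_schema=None, dataset_partitioning=None,
                   partition_id=None):
        return ListWriter(self.rows)
```

In this sketch the caller is expected to call `close()` once all rows have been written, so that a real implementation has a single place to flush or commit.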

Copyright © 2022, Dataiku