kedro.extras.datasets

Description

kedro.extras.datasets contains all of Kedro’s data connectors. Each connector is an implementation of AbstractDataSet.
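To make the idea of a connector as an implementation of a common base class concrete, here is a minimal sketch of the pattern using only the standard library. SketchDataSet and SketchJSONDataSet are hypothetical stand-ins written for illustration; they are not Kedro classes, and the real AbstractDataSet does more than this sketch.

```python
import json
import tempfile
from abc import ABC, abstractmethod
from pathlib import Path
from typing import Any, Dict


class SketchDataSet(ABC):
    """Hypothetical stand-in for the connector contract (not Kedro's class)."""

    def load(self) -> Any:
        # Public entry point; delegates to the subclass hook.
        return self._load()

    def save(self, data: Any) -> None:
        # Public entry point; delegates to the subclass hook.
        self._save(data)

    @abstractmethod
    def _load(self) -> Any:
        """Read the data from its storage location."""

    @abstractmethod
    def _save(self, data: Any) -> None:
        """Write the data to its storage location."""

    @abstractmethod
    def _describe(self) -> Dict[str, Any]:
        """Return the settings that identify this data set."""


class SketchJSONDataSet(SketchDataSet):
    """Hypothetical connector that stores JSON-serialisable data on local disk."""

    def __init__(self, filepath: str) -> None:
        self._filepath = Path(filepath)

    def _load(self) -> Any:
        with self._filepath.open(encoding="utf-8") as handle:
            return json.load(handle)

    def _save(self, data: Any) -> None:
        # Create the parent directory so a first save does not fail.
        self._filepath.parent.mkdir(parents=True, exist_ok=True)
        with self._filepath.open("w", encoding="utf-8") as handle:
            json.dump(data, handle)

    def _describe(self) -> Dict[str, Any]:
        return {"filepath": str(self._filepath)}


def round_trip(data: Any, directory: str) -> Any:
    """Save data with the sketch connector, then load it back."""
    data_set = SketchJSONDataSet(str(Path(directory) / "example.json"))
    data_set.save(data)
    return data_set.load()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp_dir:
        print(round_trip({"rows": [1, 2, 3]}, tmp_dir))
```

In Kedro itself, a custom data set would subclass AbstractDataSet from kedro.io and implement _load, _save and _describe, rather than defining its own base class as this sketch does.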

Data Sets

kedro.extras.datasets.biosequence.BioSequenceDataSet(…) BioSequenceDataSet loads and saves data to a sequence file.
kedro.extras.datasets.dask.ParquetDataSet(…) ParquetDataSet loads and saves data to parquet file(s).
kedro.extras.datasets.matplotlib.MatplotlibWriter(…) MatplotlibWriter saves matplotlib objects to image file(s) in an underlying filesystem.
kedro.extras.datasets.networkx.NetworkXDataSet(…) NetworkXDataSet loads and saves graphs to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.pandas.CSVBlobDataSet(…) CSVBlobDataSet loads and saves CSV files in Microsoft’s Azure Blob Storage.
kedro.extras.datasets.pandas.CSVDataSet(filepath) CSVDataSet loads/saves data from/to a CSV file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.pandas.ExcelDataSet(…) ExcelDataSet loads/saves data from/to an Excel file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.pandas.FeatherDataSet(…) FeatherDataSet loads and saves data to a feather file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.pandas.GBQTableDataSet(…) GBQTableDataSet loads and saves data from/to Google BigQuery.
kedro.extras.datasets.pandas.HDFDataSet(…) HDFDataSet loads/saves data from/to an HDF file using an underlying filesystem.
kedro.extras.datasets.pandas.JSONBlobDataSet(…) JSONBlobDataSet loads and saves JSON files in Microsoft’s Azure Blob Storage.
kedro.extras.datasets.pandas.JSONDataSet(…) JSONDataSet loads/saves data from/to a JSON file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.pandas.ParquetDataSet(…) ParquetDataSet loads/saves data from/to a Parquet file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.pandas.SQLQueryDataSet(…) SQLQueryDataSet loads data from a provided SQL query.
kedro.extras.datasets.pandas.SQLTableDataSet(…) SQLTableDataSet loads data from a SQL table and saves a pandas dataframe to a table.
kedro.extras.datasets.spark.SparkDataSet(…) SparkDataSet loads and saves Spark dataframes.
kedro.extras.datasets.spark.SparkHiveDataSet(…) SparkHiveDataSet loads and saves Spark dataframes stored on Hive.
kedro.extras.datasets.spark.SparkJDBCDataSet(…) SparkJDBCDataSet loads data from a database table accessible via a JDBC URL (url) and connection properties, and saves the content of a PySpark DataFrame to an external database table via JDBC.
kedro.extras.datasets.text.TextDataSet(filepath) TextDataSet loads/saves data from/to a text file using an underlying filesystem (e.g.: local, S3, GCS).
kedro.extras.datasets.yaml.YAMLDataSet(filepath) YAMLDataSet loads/saves data from/to a YAML file using an underlying filesystem (e.g.: local, S3, GCS).