gpointer | partitioning | Write / Construct Only |
gboolean | infer-dictionary | Read / Write |
GArrowSchema * | schema | Read / Write |
GADatasetSegmentEncoding | segment-encoding | Read / Write |
GEnum
╰── GADatasetSegmentEncoding
GObject
├── GADatasetPartitioning
│   ╰── GADatasetKeyValuePartitioning
│       ╰── GADatasetDirectoryPartitioning
╰── GADatasetPartitioningOptions
GADatasetPartitioningOptions is a class for partitioning options.
GADatasetPartitioning is a base class for partitioning classes such as GADatasetDirectoryPartitioning.
GADatasetKeyValuePartitioning is a base class for key-value style partitioning classes such as GADatasetDirectoryPartitioning.
GADatasetDirectoryPartitioning is a class for partitioning that uses directory structure.
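To make "uses directory structure" concrete, here is a minimal sketch in plain C of the idea behind directory partitioning: the i-th directory segment of a path is paired with the i-th field of the partitioning schema. The helper name `sketch_parse_directory_path` and the fixed 64-byte output slots are assumptions made for illustration; they are not part of Arrow GLib. The real implementation also converts each value to the field's data type.

```c
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper, not part of Arrow GLib.
 * Pairs the i-th directory segment of `path` with the i-th entry of
 * `field_names` and writes "name=value" strings into `out`.
 * Segments beyond `n_fields` (such as a file name) are ignored.
 * Returns the number of pairs written. */
static size_t
sketch_parse_directory_path(const char *path,
                            const char *const *field_names,
                            size_t n_fields,
                            char out[][64])
{
  size_t n = 0;
  const char *p = path;

  while (n < n_fields && *p != '\0') {
    /* Skip separators so leading or doubled '/' are ignored. */
    while (*p == '/') {
      p++;
    }
    if (*p == '\0') {
      break;
    }
    /* Length of the current segment, up to the next '/' or the end. */
    size_t len = strcspn(p, "/");
    snprintf(out[n], 64, "%s=%.*s", field_names[n], (int)len, p);
    p += len;
    n++;
  }
  return n;
}
```

With the field names `year` and `month`, the path `/2021/08/data.parquet` yields the two pairs `year=2021` and `month=08`.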
GADatasetPartitioningOptions *
gadataset_partitioning_options_new (void);
Since: 6.0.0
GADatasetPartitioning *
gadataset_partitioning_new (void);
Since: 6.0.0
gchar *
gadataset_partitioning_get_type_name (GADatasetPartitioning *partitioning);
Since: 6.0.0
GADatasetDirectoryPartitioning *
gadataset_directory_partitioning_new (GArrowSchema *schema,
                                      GList *dictionaries,
                                      GADatasetPartitioningOptions *options,
                                      GError **error);
schema | A GArrowSchema that describes all partitioned segments. |
dictionaries | A list of GArrowArray for dictionary data types in schema. | [nullable][element-type GArrowArray]
options | A GADatasetPartitioningOptions. | [nullable]
error | Return location for a GError or NULL. | [nullable]
Since: 6.0.0
The values of GADatasetSegmentEncoding correspond to arrow::dataset::SegmentEncoding values.
Since: 6.0.0
struct GADatasetPartitioningOptionsClass { GObjectClass parent_class; };
struct GADatasetKeyValuePartitioningClass { GADatasetPartitioningClass parent_class; };
struct GADatasetDirectoryPartitioningClass { GADatasetKeyValuePartitioningClass parent_class; };
typedef struct _GADatasetDirectoryPartitioning GADatasetDirectoryPartitioning;
typedef struct _GADatasetKeyValuePartitioning GADatasetKeyValuePartitioning;
“partitioning”
property “partitioning” gpointer
The raw std::shared_ptr<arrow::dataset::Partitioning> *.
Owner: GADatasetPartitioning
Flags: Write / Construct Only
“infer-dictionary”
property “infer-dictionary” gboolean
When inferring a schema for partition fields, yield dictionary-encoded types instead of plain types. This can be more efficient when materializing virtual columns, and Expressions parsed by the finished Partitioning will include dictionaries of all unique inspected values for each field.
Owner: GADatasetPartitioningOptions
Flags: Read / Write
Default value: FALSE
Since: 6.0.0
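As an illustration of what "dictionary-encoded" means for a partition field, the following plain C sketch replaces each value with an index into a list of the unique values seen so far. The helper name `sketch_dictionary_encode` is hypothetical and is not part of Arrow GLib; it only illustrates the representation, not how Arrow builds its dictionaries internally.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not part of Arrow GLib.
 * Dictionary-encodes `values`: each distinct string is stored once in
 * `dictionary` (in order of first appearance) and `indices[i]` holds the
 * position of `values[i]` in that dictionary.
 * `dictionary` and `indices` must each have room for `n_values` entries.
 * Returns the number of unique values. */
static size_t
sketch_dictionary_encode(const char *const *values,
                         size_t n_values,
                         const char **dictionary,
                         int *indices)
{
  size_t n_unique = 0;

  for (size_t i = 0; i < n_values; i++) {
    /* Linear search is enough for a sketch with few unique values. */
    size_t j = 0;
    while (j < n_unique && strcmp(dictionary[j], values[i]) != 0) {
      j++;
    }
    if (j == n_unique) {
      dictionary[n_unique++] = values[i];
    }
    indices[i] = (int)j;
  }
  return n_unique;
}
```

For the inspected values `2020, 2021, 2020, 2022, 2021`, the dictionary becomes `2020, 2021, 2022` and the indices become `0, 1, 0, 2, 1`. Repeated partition values are then stored as small integers rather than repeated strings.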
“schema”
property “schema” GArrowSchema *
Optionally, an expected schema can be provided, in which case inference will only check discovered fields against the schema and update internal state (such as dictionaries).
Owner: GADatasetPartitioningOptions
Flags: Read / Write
Since: 6.0.0
“segment-encoding”
property “segment-encoding” GADatasetSegmentEncoding
After splitting a path into components, decode the path components before parsing according to this scheme.
Owner: GADatasetPartitioningOptions
Flags: Read / Write
Default value: GADATASET_SEGMENT_ENCODING_URI
Since: 6.0.0
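The decoding step for the URI scheme can be sketched in plain C as percent-decoding of a single path component. The helper names below are hypothetical and not part of Arrow GLib, and copying malformed escapes through unchanged is an assumption of this sketch rather than documented Arrow behavior.

```c
#include <stddef.h>

/* Returns the numeric value of a hexadecimal digit, or -1 if `c` is not one. */
static int
sketch_hex_value(char c)
{
  if (c >= '0' && c <= '9') return c - '0';
  if (c >= 'a' && c <= 'f') return c - 'a' + 10;
  if (c >= 'A' && c <= 'F') return c - 'A' + 10;
  return -1;
}

/* Hypothetical helper, not part of Arrow GLib.
 * Decodes "%XX" escapes in `segment` into `out`, which has capacity
 * `out_size` bytes including the terminating NUL.
 * Assumption of this sketch: malformed escapes are copied unchanged.
 * Returns the length of the decoded string. */
static size_t
sketch_uri_decode_segment(const char *segment, char *out, size_t out_size)
{
  size_t n = 0;

  if (out_size == 0) {
    return 0;
  }
  while (*segment != '\0' && n + 1 < out_size) {
    int hi = -1;
    int lo = -1;
    if (segment[0] == '%') {
      hi = sketch_hex_value(segment[1]);
      /* Only look at the second digit if the first one was valid, so the
       * terminating NUL is never read past. */
      if (hi >= 0) {
        lo = sketch_hex_value(segment[2]);
      }
    }
    if (hi >= 0 && lo >= 0) {
      out[n++] = (char)(hi * 16 + lo);
      segment += 3;
    } else {
      out[n++] = *segment++;
    }
  }
  out[n] = '\0';
  return n;
}
```

For example, the path component `New%20York` decodes to `New York`. Because decoding happens after the path has been split into components, an escaped separator such as `%2F` ends up inside a value instead of starting a new segment.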