This page covers the MinIO (default) provider — the built-in object store that ships with Datris. To write to AWS S3, see S3 Destination — same code path, same file formats, just per-pipeline credentials instead of the global MinIO ones.
Output Path
Files are written to:{environment}-data. You can override this per pipeline with the destinationBucketOverride field.
File Formats
| Format | Description |
|---|---|
parquet | Apache Parquet columnar format (default) |
orc | Apache ORC columnar format |
fileFormat field.
Write Modes
| Mode | Behavior |
|---|---|
append | Append new files to the output path (default) |
overwrite | Replace all existing files at the output path |
ignore | Do nothing if data already exists at the output path |
errorifexists | Fail the job if data already exists at the output path |
Partitioning
Partition output files by one or more columns using thepartitionBy array. Spark creates a directory structure based on the distinct values of the specified columns.
Delete Before Write
SetdeleteBeforeWrite to true to remove all existing objects under the output prefix before writing new data. This is useful when you need a clean target path but want finer control than the overwrite write mode provides.
Type Casting
Column types from the source schema are mapped to Spark types before writing:| Schema Type | Spark Type |
|---|---|
tinyint | ByteType |
smallint | ShortType |
int | IntegerType |
bigint | LongType |
float | FloatType |
double | DoubleType |
string | StringType |
boolean | BooleanType |
date | DateType |
timestamp | TimestampType |
Configuration Example
Field Reference
| Field | Required | Default | Description |
|---|---|---|---|
prefixKey | yes | Key prefix under the bucket | |
fileFormat | no | parquet | Output file format: parquet or orc |
writeMode | no | append | Spark write mode |
destinationBucketOverride | no | {environment}-data | Override the default bucket name |
partitionBy | no | Array of column names to partition by | |
deleteBeforeWrite | no | false | Delete existing objects before writing |
