Documentation Index
Fetch the complete documentation index at: https://knowledge.cloudquant.com/llms.txt
Use this file to discover all available pages before exploring further.
Amazon S3
S3 datasources allow CloudQuant Data Liberator to read CSV, TSV, and Parquet files directly from Amazon S3 buckets or S3-compatible object storage services (MinIO, Wasabi, Backblaze B2, etc.).Connection Configuration
Required Fields
| Field | Type | Description |
|---|---|---|
connection_type | string | Must be "s3" |
aws_access_key_id | string | AWS access key ID |
aws_secret_access_key | string | AWS secret access key |
bucket | string | S3 bucket name |
endpoint | string | S3 endpoint URL, e.g., "https://s3.amazonaws.com" |
Optional Fields
| Field | Type | Default | Description |
|---|---|---|---|
prefix | string | "" | Key prefix (virtual directory) within the bucket |
request_style | string | "virtual" | S3 request style: "path" or "virtual" |
mount_point | string | Local mount path for FUSE-based access | |
config_name | string | Internal configuration identifier | |
object_key | string | Object key pattern for file selection |
For S3-compatible services (MinIO, Wasabi, etc.), set
request_style to "path" and update the endpoint to point to your service. Virtual-hosted style is the default for AWS S3.Example Connection
CSV/TSV Dataset
Thedata_args are identical to Local File sources. The file_pattern is evaluated relative to the prefix configured on the connection.
Required data_args
| Field | Type | Description |
|---|---|---|
file_pattern | string | Glob pattern relative to the prefix, e.g., "*.csv" |
data_dt_column | string or list | Column(s) containing the datetime value |
data_dt_format | string or list | strptime format or special values ("muts", "uts", "datetime") |
data_key_column | string or list | Symbol/key column(s) |
Optional data_args
| Field | Type | Default | Description |
|---|---|---|---|
sep_override | string | "," | Delimiter character |
encoding | string | "utf-8" | File encoding |
data_dt_timezone | string | "UTC" | Source data timezone |
fname_dt_regex | string | Regex to extract date from filename | |
fname_dt_format | string | strptime format for filename date | |
fname_dt_timezone | string | Timezone of filename date | |
fname_dt_nudge | int | 0 | Microsecond offset for filename date |
fname_dt_approx_seconds | int | Approximate seconds per file | |
arrow_sort | list | ["symbol", "muts"] | Sort order |
arrow_timestamp | bool | true | Generate human-readable timestamp column |
Complete CSV Example
Parquet Dataset
This provides passthrough access to Parquet data without intermediate caching, leveraging Arrow’s native Parquet reader.Complete Parquet Example
S3-Compatible Storage
MinIO Example
Most S3-compatible services require
request_style set to "path". Only AWS S3 defaults to virtual-hosted style.IAM Permissions
The IAM user or role associated with the access key needs at minimum:s3:GetObjecton the bucket objectss3:ListBucketon the bucket

