File-globbing interfaces: Prepares the initial globbing API for matching
file patterns in S3, GCS, and locally. Globbing can be tested using the glob
table function. An upcoming release will integrate globbing into existing file
scan functions (read_parquet
, read_csv
).
glob
table function: Table function return file names that match a glob
using the new file-globbing interfaces.
For example, querying a public GCS bucket using a glob:
SELECT filename
FROM glob('s3://glaredb-public/testdata/csv/glob_numbers/**/{3,5}00.csv');
Authenticated GCS access: Enable accessing Google Cloud Storage buckets using service accounts.
GCS documentation: Add documentation for querying files in GCS buckets using the "GCS File System": https://glaredb.com/docs/reference/filesystems/gcs
approx_count_distinct
: Add a new aggregate function for efficient
estimation of distinct values.
Correlated subquery fix: Ensure joins used for decorrelating subqueries honor proper set semantics. See https://github.com/GlareDB/glaredb/issues/3621
Unauthenticated GCS access: Introduce read-only, unauthenticated access to public GCS buckets.
High‐core‐count performance: Partitioned aggregate hash table is now fully lock-free, reducing contention on 64+-core machines.
Parallel build optimization: Hash table initialization has been moved into the normal execution path, enabling multiple hash tables to be initialized in parallel.
Under-the-hood improvements: Removed Mutex locks from the build phase and defer aggregate table allocation to execution time.
Parquet scan filters: Added early pruning of row-groups via scan filters, significantly reducing IO on large datasets.
Casting refinements: Tweaked cast rules to avoid unnecessary runtime casts and improve filter pushdown accuracy.
Aggregate hash table tweaks: General performance boosts, including specialized integer-sum implementations and DataType refactoring.
LIMIT hint pushdown: Limit rows processed in Sort operators, speeding up queries that combine ORDER BY and LIMIT.
Expanded bitwise/exponent operators:
<<
) and shift right (>>
) scalar functions.&
), OR (|
), XOR (#/XOR
), NOT (~
) functions.^
) for Float64.Optimizer rule: Introduced common sub-expression elimination for query plans.
Versioning: Switched to a new, date-based versioning scheme.
Performance fixes: