Commits · main · core / sbroad

Dec 26, 2024
- ci: add .gitleaksignore · b49ce8c5
  Виталий Шунков authored 3 months ago
  
  b49ce8c5
Nov 18, 2024
- refactor: make DEFAULT_VTABLE_MAX_ROWS and DEFAULT_VDBE_MAX_STEPS public · 9a021c28
  Maksim Kaitmazian authored 4 months ago
  
  9a021c28
Nov 14, 2024

feat: add `has_output` and `arena_type` functions · c144e8f4
EmirVildanov authored 5 months ago

c144e8f4
feat: fix tests buid warnings · f5cbca36
EmirVildanov authored 5 months ago

f5cbca36
feat: fix newest clippy warnings · e4801f51
EmirVildanov authored 5 months ago

e4801f51

feat: optimize DELETE without WHERE filter · 4ee134ce

EmirVildanov authored 5 months ago

Previously all DELETE operations were executed via space api. Primary keys were materialized and passed tuple by tuple to the `delete` function. Under Delete node we always put a Motion with policy Local to materialize tuples to delete.

This commit brings an optimization for the case of DELETE without WHERE condition. Such queries are translated into local SQL and don't require tuples materialization from target relation (from which we delete). We don't add a Motion node under Delete in such cases.

4ee134ce

feat: add sbroad-cartridge Makefile task to run integration tests only by... · 90739853
EmirVildanov authored 5 months ago
```
feat: add sbroad-cartridge Makefile task to run integration tests only by pattern (without group specification)
```
90739853
feat: support COALESCE · aaa0da6b
Maksim Kaitmazian authored 5 months ago and Maksim Kaitmazian committed 5 months ago

aaa0da6b

Nov 13, 2024
- fix: error formatting · ae91231b
  Maksim Kaitmazian authored 5 months ago
  
  Previously, format! macro was missed, so an error was displayed as plain format string (i.e. "failed to compile stmt: {_e:?}").
  ae91231b
- chore: fix lint errors · 042671c5
  Maksim Kaitmazian authored 5 months ago
  
  042671c5
- chore: fix dead code and unused imports warnings · 9386ff69
  Maksim Kaitmazian authored 5 months ago
  
  9386ff69
- BREAKING CHANGE: get rid of tracing · fb1aef8a
  Maksim Kaitmazian authored 5 months ago
  
  fb1aef8a
Nov 05, 2024

feat: mock PARTITION BY syntax · 57cb5b4c

EmirVildanov authored 5 months ago

in case such a clause met we ignore it and return an error of it being unimplemented yet

57cb5b4c

Nov 02, 2024
- feat: update test_app Makefile with rules running specific tests · c884d0b4
  EmirVildanov authored 5 months ago
  
  c884d0b4
- feat: apply lints · 4f5d2025
  EmirVildanov authored 5 months ago
  
  4f5d2025
- feat: add tests for distinct asterisk, fix unit tests · ce940442
  EmirVildanov authored 5 months ago
  
  ce940442
- feat: remove duplicate functions for cloning subtrees and checking their equality · 67bf298b
  EmirVildanov authored 5 months ago
  
  67bf298b
- fix: make expressions with references under GroupBy and parent relational... · 88a56ac2
  EmirVildanov authored 5 months ago
  
  fix: make expressions with references under GroupBy and parent relational operators be compared by Reference `targets` and `position` fields instead of Reference aliases
  88a56ac2
Oct 29, 2024

fix: incorrect equivalence classes · b2833f6a

Arseniy Volynets authored 5 months ago

- `propagate_equality` transformation did not
compute equality classes correctly, its 'merge'
function was completely wrong: it tried to add
intersection of classes to a another class,
instead of doing union
- to merge classes correctly we must do it
when we add a new pair of equal expressions:
otherwise later there will too many classes
that contain common elements, so 'merge'
function was removed and 'insert' now merges
two classes that contain common elements
- Also this logic is now covered by tests

b2833f6a

Oct 28, 2024

fix: incorrect equality cols for Eq · 29b7bd56

Arseniy Volynets authored 5 months ago

- In case we have equality between columns
of the same table, `eq_cols_from_eq` should
return empty equality cols, but it returned
None, which led to wrong motion. Fixed that.
- It was found after a merge tuples transform
fix, in front_sql_join_single_left_5 test
where now there are two equalities instead of one.
Earlier, there always was one equality because
merge tuples produced a single (..) = (..)
term for an and-chain. And this test didn't work,
because this term contained one equality pair.
Now there are two terms: (..) = (..) and (..)=(..)
and one of them does not contain any equality
pairs.

29b7bd56

fix: merge tuple transformation didn't group cols · 7fc3ff62

Arseniy Volynets authored 5 months ago

- merge tuple transformation that merges several
and-ed equalities into equlities of rows didn't
group columns by child they refer to. This led
to rows where we couldn't find sharding keys,
because they were scattered across the different
rows:

```
sk(t1) = (a, b), sk(t2) = (e, f)
... on (t1.a, t2.f) = (t2.e, t1.b)
```

But now correct rows are generated:

```
... on (t1.a, t1.b) = (t2.e, t2.f)
```

7fc3ff62

Oct 23, 2024
- feat: support IS [NOT] expression · f84401ec
  EmirVildanov authored 5 months ago
  
  f84401ec
Oct 18, 2024
- query.ebnf: remove duplicate line · 2d86c413
  Alexander Tolstoy authored 5 months ago
  
  2d86c413
Oct 17, 2024
- feat: add timeout to wait_masters_connect() call · 7c362b44
  Andrey Strochuk authored 6 months ago
  
  7c362b44
Oct 15, 2024

fix: modify WHITESPACE requirements in grammar to support failing queries · 73e6815a
EmirVildanov authored 6 months ago

73e6815a

feat: show buckets estimation in explain · 56933ca1

Arseniy Volynets authored 6 months ago and

Denis Smirnov committed 6 months ago

- Add new line in explain reporting on which buckets query will
  be executed.
- For queries consisting of a single subtree we can say exactly on
  which buckets it will be executed, for queries with more subtrees
  (with motions), we provide an upper bound of total buckets used
  in the query. Upper bound is computed by merging buckets from the
  leaf subtrees.
- In case for DML query with non-local motion we can't provide an
  upper bound, and print 'buckets: unknown'

Examples:
```
explain select a from t
->
projection ("t"."a"::integer -> "a")
    scan "t"
execution options:
    vdbe_max_steps = 45000
    vtable_max_rows = 5000
buckets = [1-3000]

explain select t.a from t join t as t2
on t.a = t2.b
->
projection ("t"."a"::integer -> "a")
  ...
execution options:
    vdbe_max_steps = 45000
    vtable_max_rows = 5000
buckets <= [1-3000]

explain select id from _pico_table
->
projection ("_pico_table"."id"::unsigned -> "id")
    scan "_pico_table"
execution options:
    vdbe_max_steps = 45000
    vtable_max_rows = 5000
buckets = any

explain insert into t values (1, 2)
->
insert "t" on conflict: fail
motion [policy: segment([ref("COLUMN_1")])]
    values
        value row (...)
execution options:
    vdbe_max_steps = 45000
    vtable_max_rows = 5000
buckets = unknown
```

56933ca1

Oct 14, 2024
- feat: support mandatory WHITESPACEs in the grammar · 7166193f
  EmirVildanov authored 6 months ago
  
  7166193f
Oct 07, 2024

fix: bucket calculation for duplicated columns · ed9808c2

Denis Smirnov authored 6 months ago


The queries `select * from t where sk = 1 and sk = 2` discovered
the bucket for the constant 1, rather then an empty set. The reason
was that the tuple merge transformed `sk = 1 and sk = 2` to
`(sk, sk) = (1, 2)`, while the distribution took into account only
the first position (constant 1).

To compute all keys we now take a cartesian product between all
groups of columns of a tuple, where each group consists of columns
corresponding to single column of sharding key.

Suppose tuple is (a, b, a). (a, b) refer to sharding columns, then
we have two groups:
a -> {0, 2}
b -> {1}

And the distribution keys are:
{0, 2} x {1} = {(0, 1), (2, 1)}

Co-authored-by: Arseniy Volynets <a.volynets@picodata.io>

ed9808c2

fix: bucket discovery for local motions · 8507c789

Denis Smirnov authored 6 months ago


Previously, during bucket discovery, we used Buckets::Any for
Motion(Local) nodes. This caused DML queries to be executed on all
nodes instead of targeting specific bucket children. We now apply
Motion(Local) only in the following cases:

- update/delete. When materializing the reading subtree for DML
  operations, Buckets::Any was used, but the reason for this is
  unclear.
- union all between sharded and local tables. To prevent duplicates,
  we materialize the global subtree only on a single storage node.
  Consequently, the subtree with Motion(Local) must have the same
  buckets as its child (the child node will always have Buckets::Any).

Co-authored-by: Arseniy Volynets <a.volynets@picodata.io>

8507c789

Oct 04, 2024

fix: wrong slices calculation · d5aa241e

Arseniy Volynets authored 6 months ago

- Previously we computed slices based on their
level in bfs tree traversal. This was wrong, as
motions that were independent could be in
different slices
- Fix that, now we the slice in which motion will
be is the max number of other motion nodes in path
from this motion to any leaf node

d5aa241e

Oct 03, 2024

refactor: remove ExecuteOptions · 093f271f

Arseniy Volynets authored 6 months ago

- ExecuteOptions is a hashmap that always
stores only 1 entry. Moreover this led to
a bug, when we expected that it stores ALL
execution options, while it stored only
vdbe_max_steps

093f271f

fix: vtable max rows limit not applied · b6574768

Arseniy Volynets authored 6 months ago

- We didn't apply vtable max rows value
when executing local sql. We tried to lookup
it in the execute options hashmap and took
the default value instead, though it was
not stored in the hashmap.

b6574768

Oct 02, 2024
- feat: introduce WAIT APPLIED (GLOBALLY | LOCALLY ) options for DDL operations · 18f4db0d
  Maksim Kaitmazian authored 7 months ago and Maksim Kaitmazian committed 6 months ago
  
  18f4db0d
- fix: use Buckets::Any for Values · 0a5a4902
  Arseniy Volynets authored 6 months ago
  
  - Earlier we returned empty buckets for Values/ValuesRow nodes and then choosed random bucket for execution if they were a subtree root. - Buckets::Any provides the same semantics, use it instead and simplify `empty_query_result` function, which now always returns empty result.
  0a5a4902
- fixup: EBNF for documentation · c4848ec4
  Denis Smirnov authored 6 months ago
  
  c4848ec4
Oct 01, 2024
- feat: introduce IF EXISTS and IF NOT EXISTS options · 79d1d796
  Maksim Kaitmazian authored 7 months ago and Maksim Kaitmazian committed 6 months ago
  
  79d1d796
- feat: indent execution options in explain · 497d6b02
  Arseniy Volynets authored 6 months ago
  
  497d6b02
Sep 30, 2024
- feat: change default auth method to md5 · 4e5d87d8
  Вартан Бабаян authored 7 months ago
  
  4e5d87d8
- fix: gitlab MR pattern · f02f1627
  Denis Smirnov authored 6 months ago
  
  View commits for tag 1.7.3 1.7.3
  
  f02f1627
- fix: used to panic on different values rows length · 2139dc67
  Arseniy Volynets authored 6 months ago and Denis Smirnov committed 6 months ago
  
  - When creating Values node in IR we didn't check that all values rows have the same length. - This led to panic on earlier pipeline stages: syntax plan build - Add a check that all values rows have the same length
  2139dc67