Commits · 08a9d0a89c058d49307ba85bb97a5bf5dd823714 · core / picodata

Dec 27, 2024

fix: bug in conf_test when demoting excess voters · 08a9d0a8

Georgy Moshkin authored 2 months ago

The bug was introduced when I changed the behaviour of conf_change in
regard to instances with target_state Expelled. As a result we would
sometimes arbitrarily demote healthy voters in presence of degenerate
ones.

For example we could have this situation:

instance i1: raft_id=1, target_state=Online, raft_configuration=voter
instance i2: raft_id=2, target_state=Expelled, raft_configuration=voter (!)
instance i3: raft_id=3, target_state=Offline, raft_configuration=voter (!)
instance i4: raft_id=4, target_state=Online, raft_configuration=learner (!)
instance i5: raft_id=5, target_state=Online, raft_configuration=learner (!)

08a9d0a8

test: check that expelling offline replicas works · 7863470b
Georgy Moshkin authored 3 months ago

7863470b
fix: picodata expel wasn't waiting for current_state = Expelled · 817c4097
Georgy Moshkin authored 3 months ago

817c4097

test: remove unreliable check from test_restart_both · 3b80afff

Dmitry Rodionov authored 2 months ago and

Егор Ивков committed 2 months ago

The check arrived late when elections were already completed.
Circumstances are a bit weird. For some reason the check hang for two
seconds. Only after that it received conn refused, and during these two
seconds elections happened and leader was chosen so check can never
succeed after that.

3b80afff

ci: add timeout for mac job · 3c8a241a
Виталий Шунков authored 2 months ago

3c8a241a
query.ebnf: add coalesce rule · 67eb0eee
Artur Sabirov authored 2 months ago and Maksim Kaitmazian committed 2 months ago
```
Follow-up for !1425
```
67eb0eee
query.ebnf: proofread and edit · b410d8d4
Artur Sabirov authored 2 months ago and Maksim Kaitmazian committed 2 months ago

b410d8d4

feat(pgproto)!: enable pgproto by default · 8f7f68c1

Кирилл Безуглый authored 2 months ago

BREAKING CHANGE: as soon as PostgreSQL protocol
is enabled by default, we must be careful with
`--pg-listen` cli flag when creating more than
a single instance, because we will suddenly get
a port confict error (busy port)

8f7f68c1

Dec 26, 2024
- ci: add .gitleaksignore · 72d248a8
  Виталий Шунков authored 2 months ago
  
  72d248a8
Dec 25, 2024

test: setup logging in tests · e2de36ac

Dmitry Rodionov authored 2 months ago

Logs help to debug flaky tests. Because sometimes just by instance logs
it is not that easy to guess what the test was doing at that time.
Especially when there is a repetitive pattern in the test.

For now instance logs and test output are printed in separate sections.
It still allows to match events by time easier compared to looking only
at instance log (it is way bigger compared to test log).

Additionally this patch sets up a `framework` package intended to host
reusable test code without polluting conftest.py

e2de36ac

fix: flaky test_bootstrap_from_snapshot · fc739b7b
Egor Ivkov authored 2 months ago and Егор Ивков committed 2 months ago

fc739b7b
fix: null was used instead of nil in proc sharding lua snippet · 5fe74577
Dmitry Rodionov authored 2 months ago
```
I found this by compiling tarantool with CMAKE_BUILD_TYPE=Debug
```
5fe74577

refactor: assert index is <= applied in compact_log instead of truncating · 0957624d

Georgy Moshkin authored 2 months ago

We used to automatically truncate the index in compact_log if the caller
requested to compact too many entries. This made it so that the
requirement of not compacting any un-applied entries was implicit in our
code base, which is not good as it allows for some bugs to creep in
(like the one we fix a couple commits ago).

Now this is changed and instead of silently adjusting the index of last
compacted entry, we just assert that it's no greater than the applied
index. As a consequence there's a minor improvement in do_raft_log_auto_compaction
function.

0957624d

test: fix CI=1 hack for local runs · 1161519a
Georgy Moshkin authored 3 months ago

1161519a

fix: bug in raft snapshot application · e7c8582d

Georgy Moshkin authored 2 months ago

There was a hard-to-reproduce bug in our snapshot application code.
We always compact the raft log before applying the snapshot, because
the snapshot replaces the entries and some of the logic in raft-rs
seems to rely on this. The problem was, that our compact_log function
would not remove any unapplied entries, which makes sense for compaction
triggered automatically by raft log size, but doesn't make sense for
raft snapshot, as the snapshot contains the state corresponding to the
newer entries. The fix is simple: don't guard from unapplied entry
compaction in case the compaction is for raft snapshot.

We don't add any regression tests for this, because the implementation
would be too difficult and would need us to pollute the code with error
injection logic, which is not a worthy trade off in this case. But also
the logic will still be tested, because this bug was responsible for a
large amount of flaky tests, so we should see a significant reduction in
flakiness from now on in tests concerning raft snapshots.

e7c8582d

test: reduce test_large_snapshot flakiness · 36a59324
Georgy Moshkin authored 2 months ago

36a59324
feat(sql): support VARCHAR without limit · cd1802d9
Erik Khamitov authored 2 months ago

cd1802d9

Dec 24, 2024
- chore: fix adr naming convention so files are properly ordered by date · 62413f03
  Dmitry Rodionov authored 2 months ago and Егор Ивков committed 2 months ago
  
  62413f03
Dec 23, 2024
- ci: annotate test job log using collapsible sections · 9d98ba5b
  Dmitry Rodionov authored 3 months ago
  
  9d98ba5b
- chore: document tips on debugging python tests · 12cb955f
  Dmitry Rodionov authored 2 months ago
  
  12cb955f
Dec 20, 2024
- refactor: rename admin.sock to admin.socket · 4233b7ba
  Вартан Бабаян authored 2 months ago
  
  4233b7ba
- refactor: rename data_dir to instance_dir · ddacea09
  Вартан Бабаян authored 2 months ago
  
  ddacea09
- feat: set activation_deadline to 2 hours · 1d0f3dd1
  Вартан Бабаян authored 2 months ago
  
  1d0f3dd1
- feat(sql): mock DDL create schema, drop schema · 2e512c5f
  Erik Khamitov authored 3 months ago
  
  2e512c5f
Dec 19, 2024
- test: add proper test for pico.abort_ddl · 63fcd5d3
  Вартан Бабаян authored 3 months ago
  
  63fcd5d3
- fix: check max users on user creation · 484a32ed
  Egor Ivkov authored 3 months ago and Егор Ивков committed 2 months ago
  
  484a32ed
- test: role and user limits · abb23911
  Egor Ivkov authored 3 months ago and Егор Ивков committed 2 months ago
  
  abb23911
- refactor(sbroad): avoid redundant error allocation in Value::cast · dded2b2b
  Maksim Kaitmazian authored 2 months ago and Maksim Kaitmazian committed 2 months ago
  
  dded2b2b
- opt(sbroad): fold constants under casts · 818b3337
  Maksim Kaitmazian authored 2 months ago and Maksim Kaitmazian committed 2 months ago
  
  Close #1228
  818b3337
- refactor(sbroad): Value::cast · 8c4a931f
  Maksim Kaitmazian authored 2 months ago and Maksim Kaitmazian committed 2 months ago
  
  8c4a931f
- ci: rename /helm to /docker · bb5d388e
  Виталий Шунков authored 2 months ago and Dmitry Rodionov committed 2 months ago
  
  bb5d388e
- perf: introduce TopologyCache · 5bf9d8a5
  Georgy Moshkin authored 3 months ago
  
  TopologyCache is a collection of deserialized structures with information about cluster topology. This currently includes data from _pico_instance, _pico_replicaset, _pico_tier & _pico_service_route_table. The info is automatically kept up to date with the corresponding system tables. The TopologyCache also caches the immutable info related to the current instance, like instance name, replicaset uuid, etc. From now on we should be reading this data from TopologyCache whenever possible instead of going directly to system tables as we were doing previosly. At the moment only the plugin RPC module has transitioned to using TopologyCache but other modules should be refactored as well. Especially governor.
  5bf9d8a5
- refactor: simplify code in resolve_rpc_target · f4d66cab
  Georgy Moshkin authored 3 months ago
  
  f4d66cab
- chore: util::DebugDiff for pretty struct comparison · abec4647
  Georgy Moshkin authored 3 months ago
  
  abec4647
- fix: explicitly check bucket id for validity · 1307a31f
  Georgy Moshkin authored 3 months ago
  
  Before this fix if requesting RPC by an invalid bucket_id we would send an RPC to every replicaset to check if they have such a bucket_id. This is not needed because we know the allowed range.
  1307a31f
- ci: display cargo version in linting stage · 330f853e
  Georgy Moshkin authored 3 months ago
  
  330f853e
- chore: new rustfmt dropped · 8858dc42
  Georgy Moshkin authored 3 months ago
  
  8858dc42
- refactor: clean up handle_dml_entry · 71146eef
  Georgy Moshkin authored 3 months ago
  
  71146eef
- adr: TRUNCATE sql command · 959411b7
  EmirVildanov authored 4 months ago
  
  this adr is a proposal of implementing TRUNCATE sql command, its context and possible problems
  959411b7
Dec 18, 2024
- chore: add justfile as an alternative to make · 86584dc9
  Кирилл Безуглый authored 4 months ago
  
  86584dc9