This check was too strict:
assert i2.raft_first_index() == i1.raft_first_index() + 2
This is because governor can manage to apply more than 2 entries after log was compacted, so we can't rely on this strict difference of 2 commits. For example in this log 3 entries were applied: #1127 (comment 111700)