  Feb 02, 2022
    • Optimize link repository ID migration · f7d0bd08
      Stan Hu authored
      The previous migration was slow at times because the update would cause
      PostgreSQL to do a merge join and then filter out rows matching
      `repository_id IS NULL`. As more migrated rows gained a `repository_id`,
      the query time for each batch increased significantly.
      
      The batching had been added to keep the payload size of a trigger
      update within PostgreSQL's limits.
      
      We can make this migration go faster by disabling the triggers inside
      the transaction, rolling back to 2bbec66c, and re-enabling the triggers
      afterwards, as sketched below.
      
      Relates to https://gitlab.com/gitlab-org/gitaly/-/issues/3973
      
      Changelog: fixed
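
      A minimal sketch of that pattern, assuming a hypothetical trigger name
      `notify_on_change` and simplified table and column names (the real
      migration lives in Praefect's sql-migrate files):

      ```go
      package migrate

      import (
          "context"
          "database/sql"
      )

      // migrateRepositoryIDs sketches the approach from the commit message:
      // disable the notification triggers inside the transaction, run the
      // update as a single statement instead of in small batches, and
      // re-enable the triggers before committing.
      func migrateRepositoryIDs(ctx context.Context, db *sql.DB) error {
          tx, err := db.BeginTx(ctx, nil)
          if err != nil {
              return err
          }
          defer tx.Rollback() // no-op once Commit has succeeded

          // With the trigger disabled, the update produces no NOTIFY
          // payloads, so the payload size limit that forced the batching
          // no longer applies.
          if _, err := tx.ExecContext(ctx,
              `ALTER TABLE storage_repositories DISABLE TRIGGER notify_on_change`); err != nil {
              return err
          }

          if _, err := tx.ExecContext(ctx, `
              UPDATE storage_repositories AS sr
              SET repository_id = r.repository_id
              FROM repositories AS r
              WHERE sr.virtual_storage = r.virtual_storage
                AND sr.relative_path = r.relative_path
                AND sr.repository_id IS NULL`); err != nil {
              return err
          }

          if _, err := tx.ExecContext(ctx,
              `ALTER TABLE storage_repositories ENABLE TRIGGER notify_on_change`); err != nil {
              return err
          }

          return tx.Commit()
      }
      ```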
    • Merge branch 'jc-fix-cache-test' into 'master' · b1859840
      Patrick Steinhardt authored
      Fail Read if objectReader is closed
      
      Closes #3823
      
      See merge request gitlab-org/gitaly!3944
    • Merge branch 'pks-supervisor-timeout-fixes' into 'master' · 46b9f457
      Sami Hiltunen authored
      supervisor: Fix bugs related to timeouts
      
      See merge request gitlab-org/gitaly!4029
    • gitaly-lfs-smudge: Fix missing close for HTTP body · 08973448
      Patrick Steinhardt authored
      The gitaly-lfs-smudge command is a smudge filter for Git which replaces
      the contents of LFS pointers with the actual LFS object's contents. To
      do so, we need to request the object's contents from Rails via an HTTP
      request. The tests exercising this code all of a sudden started failing
      due to leaked goroutines, where the leak happens in the HTTP handling
      code. And sure enough: we never close the `http.Response` body, which
      is likely the root cause here.
      
      Fix this by always closing the body, as sketched below. While I have no
      idea why the leaks only started to happen now, chances are high that
      this fixes the new flake.
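
      A minimal sketch of the fix, assuming an illustrative URL rather than
      the real Rails-internal API:

      ```go
      package smudge

      import (
          "fmt"
          "io"
          "net/http"
      )

      // fetchLFSObject shows the pattern: the response body must be closed
      // on every return path, or the HTTP client keeps the underlying
      // connection (and its goroutine) alive, which is exactly what the
      // leak-detecting tests flagged.
      func fetchLFSObject(url string) ([]byte, error) {
          resp, err := http.Get(url)
          if err != nil {
              return nil, err
          }
          defer resp.Body.Close()

          if resp.StatusCode != http.StatusOK {
              return nil, fmt.Errorf("unexpected status: %s", resp.Status)
          }
          return io.ReadAll(resp.Body)
      }
      ```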
  Nov 22, 2021
    • datastore: Revert use of materialized views · 38b5c332
      Patrick Steinhardt authored
      Revert the introduction of materialized views for `valid_primaries`. As
      it turns out, the changes cause incompatibilities with Postgres 11,
      which is still actively in use (likely because the `MATERIALIZED`
      keyword for common table expressions is only supported by Postgres 12
      and later). Furthermore, the performance issues we have seen have not
      been fully fixed by this change, and we do not yet fully understand
      their root cause.
      
      Changelog: fixed
  Nov 20, 2021
    • list-untracked-repositories: Praefect sub-command to show untracked repositories · b6fb5c33
      Pavlo Strokov authored
      The change is a backport of the functionality implemented to resolve
      https://gitlab.com/gitlab-org/gitaly/-/issues/3792. It adds a new
      sub-command to the praefect binary. When run, it connects to all
      Gitaly storages set in the configuration file and receives from each a
      list of the repositories existing on disk. Each repository is then
      checked against the Praefect database; if it is not tracked there, the
      location of the repository is printed in JSON format to the stdout of
      the process (see the sketch after this entry).
      
      Part of: https://gitlab.com/gitlab-org/gitaly/-/issues/3792
      
      Changelog: added
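
      A rough sketch of that loop, with `walkStorage` and `isTracked`
      standing in for the real Gitaly RPC and Praefect database lookup (both
      hypothetical here), and illustrative JSON field names:

      ```go
      package praefect

      import (
          "context"
          "encoding/json"
          "fmt"
          "os"
      )

      // untrackedRepository is the JSON shape printed for every repository
      // found on disk but missing from the Praefect database.
      type untrackedRepository struct {
          Storage      string `json:"storage"`
          RelativePath string `json:"relative_path"`
      }

      func listUntracked(
          ctx context.Context,
          storages []string,
          walkStorage func(ctx context.Context, storage string) ([]string, error),
          isTracked func(ctx context.Context, storage, relativePath string) (bool, error),
      ) error {
          enc := json.NewEncoder(os.Stdout)
          for _, storage := range storages {
              paths, err := walkStorage(ctx, storage)
              if err != nil {
                  return fmt.Errorf("walk storage %q: %w", storage, err)
              }
              for _, relativePath := range paths {
                  tracked, err := isTracked(ctx, storage, relativePath)
                  if err != nil {
                      return err
                  }
                  if !tracked {
                      // Repositories unknown to Praefect are reported on stdout.
                      if err := enc.Encode(untrackedRepository{
                          Storage:      storage,
                          RelativePath: relativePath,
                      }); err != nil {
                          return err
                      }
                  }
              }
          }
          return nil
      }
      ```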
    • sql-migrate: Update storage_repositories table · 0e6a5d2a
      Pavlo Strokov authored
      The batch update query, introduced to mitigate PostgreSQL's limit on
      the payload size that can be sent by the NOTIFY function, was missing
      a condition in the update statements. Because of that, the payload
      contained changes not for N storage-repository entries, but for
      (N * num_of_storages) entries, so an initial batch size of 150 became
      450 with 3 storages in use.
      
      The change also significantly reduces the batch size. The calculation
      was done on test data similar to production data. The approximate
      payload size for a single row is about 470 bytes. As the maximum
      payload size is 8k bytes, we can use no more than 16-17 entries per
      batch; to be safe we reduce it to 14 (see the arithmetic sketched
      below).
      
      Part of: https://gitlab.com/gitlab-org/gitaly/-/issues/3806
      
      Changelog: fixed
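
      The batch size arithmetic as a small sketch; the 8000 byte figure is
      PostgreSQL's default NOTIFY payload limit, and the 470 bytes per row
      is the estimate quoted above:

      ```go
      package main

      import "fmt"

      const (
          maxNotifyPayload = 8000 // bytes; PostgreSQL's default NOTIFY limit
          bytesPerRow      = 470  // approximate payload per storage-repository row
      )

      func main() {
          // 8000 / 470 = 17, so 16-17 entries fit at most; the migration
          // settles on 14 to leave headroom for larger-than-average rows.
          fmt.Println(maxNotifyPayload / bytesPerRow) // 17
      }
      ```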
  Nov 19, 2021
    • Merge branch 'pks-praefect-datastore-collector-metrics-endpoint-v14.4' into '14-4-stable' · f60e6616
      Patrick Steinhardt authored
      praefect: Backport separate endpoint for datastore collector (v14.4)
      
      See merge request gitlab-org/gitaly!4094
    • praefect: Do not collect repository store metrics on startup · 3cde9b5e
      John Cai authored
      Our current code path triggers the RepositoryStoreCollector to query
      the database on startup, even if the Prometheus listener is not
      listening. This is because we call DescribeByCollect in the Describe
      method: the Prometheus client calls Describe on Register, which ends
      up triggering the Collect method and hence runs the queries. Instead,
      we can provide the descriptions separately from the Collect method,
      as sketched below.
      
      Changelog: fixed
      (cherry picked from commit 90cb7fb7)
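
      A sketch of the fix, with an illustrative collector type standing in
      for Praefect's RepositoryStoreCollector:

      ```go
      package praefect

      import "github.com/prometheus/client_golang/prometheus"

      type collector struct {
          readOnlyDesc *prometheus.Desc
      }

      func newCollector() *collector {
          return &collector{
              readOnlyDesc: prometheus.NewDesc(
                  "gitaly_praefect_read_only_repositories",
                  "Number of repositories in read-only mode.",
                  []string{"virtual_storage"}, nil,
              ),
          }
      }

      // Describe used to be implemented via prometheus.DescribeByCollect(c, ch),
      // which calls Collect and therefore hits the database as soon as the
      // collector is registered. Sending the static description keeps
      // registration free of side effects.
      func (c *collector) Describe(ch chan<- *prometheus.Desc) {
          ch <- c.readOnlyDesc
      }

      // Collect still runs the expensive database query, but now only when
      // metrics are actually scraped.
      func (c *collector) Collect(ch chan<- prometheus.Metric) {
          // ... query the database and emit metrics via
          // prometheus.MustNewConstMetric(c.readOnlyDesc, ...) ...
      }
      ```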
    • praefect: Add ability to have separate database metrics endpoint · ebaade4a
      John Cai authored
      By default, when metrics are enabled, each Praefect exposes
      information about how many read-only repositories there are, which
      requires Praefect to query the database. First, this results in the
      same metrics being exposed by every Praefect, given that the database
      is shared between all of them. And second, this causes one query per
      Praefect per scraping run. This cost adds up and generates quite some
      load on the database, especially so if there are a lot of repositories
      in that database, up to a point where it may overload the database
      completely.
      
      Fix this issue by splitting metrics which hit the database into a
      separate endpoint "/db_metrics" (sketched below). This allows admins
      to set up a separate scraper with a different scraping interval for
      this metric; furthermore, it gives the ability to only scrape this
      metric on one of the Praefect instances so the work isn't
      unnecessarily duplicated.
      
      Given that this is a breaking change which will get backported, we
      must make this behaviour opt-in for now. We thus include a new
      configuration key "prometheus_use_database_endpoint" which enables the
      new behaviour, so that existing installations' metrics won't break on
      a simple point release. The intent, though, is to eventually remove
      this configuration and enable the behaviour for all setups in a major
      release.
      
      Changelog: added
      (cherry picked from commit 7e74b733)
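
      A sketch of the wiring: the database-backed collector goes onto its
      own registry served at "/db_metrics", while everything else stays on
      the default "/metrics" handler (function name and setup are
      illustrative):

      ```go
      package praefect

      import (
          "net/http"

          "github.com/prometheus/client_golang/prometheus"
          "github.com/prometheus/client_golang/prometheus/promhttp"
      )

      func newMetricsMux(dbCollector prometheus.Collector) *http.ServeMux {
          // A dedicated registry, so that scraping "/metrics" never touches
          // the database.
          dbRegistry := prometheus.NewRegistry()
          dbRegistry.MustRegister(dbCollector)

          mux := http.NewServeMux()
          mux.Handle("/metrics", promhttp.Handler())
          mux.Handle("/db_metrics", promhttp.HandlerFor(dbRegistry, promhttp.HandlerOpts{}))
          return mux
      }
      ```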
    • prometheus: Avoid duplicated metrics registration · aac5d5e5
      Pavlo Strokov authored
      Praefect uses Prometheus to export metrics about its internals. It
      relies on the defaults of the Prometheus library to gather the set of
      metrics and to register new ones. Because of that, new metrics get
      registered on the DefaultRegisterer, a global pre-configured
      registerer, and we can't call the 'run' function multiple times (for
      testing purposes), as doing so results in a metrics registration
      error. To avoid that problem, the 'run' function is extended with a
      prometheus.Registerer parameter that is used to register Praefect's
      custom metrics (see the sketch below). The production code still uses
      the same DefaultRegisterer as before, and the test code creates a new
      instance of the registerer for each 'run' invocation, so there are no
      more duplicates.
      
      (cherry picked from commit 81368d46)
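
      A minimal sketch of the injection, with an illustrative metric:

      ```go
      package main

      import "github.com/prometheus/client_golang/prometheus"

      // run receives the Registerer from its caller instead of implicitly
      // using the global one.
      func run(reg prometheus.Registerer) error {
          requests := prometheus.NewCounter(prometheus.CounterOpts{
              Name: "praefect_example_requests_total",
              Help: "Illustrative counter registered on the injected registerer.",
          })
          if err := reg.Register(requests); err != nil {
              return err // duplicate registration would surface here
          }
          // ... start the service ...
          return nil
      }

      func main() {
          // Production keeps the global default registerer; tests instead
          // pass prometheus.NewRegistry() on every invocation so repeated
          // runs don't collide.
          _ = run(prometheus.DefaultRegisterer)
      }
      ```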
    • bootstrap: Abstract bootstrapper for testing · 43031e2d
      Pavlo Strokov authored
      The old implementation of the bootstrapper initialization did not
      allow calling the 'run' function to start a service, because the
      tableflip library doesn't support creating multiple instances in one
      process. Starting the Praefect service is required in tests to verify
      sub-command execution. The bootstrapper initialization is therefore
      extracted out of the 'run' function, which allows using a new Noop
      bootstrapper to run the service without tableflip support (a rough
      sketch follows).
      
      (cherry picked from commit 18ff3676)
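
      A rough sketch of the seam, assuming 'run' only needs listeners and a
      wait hook; the real interface in Gitaly may differ:

      ```go
      package bootstrap

      import "net"

      // bootstrapper is what 'run' now depends on instead of constructing
      // tableflip directly.
      type bootstrapper interface {
          listen(network, addr string) (net.Listener, error)
          wait() error
      }

      // noopBootstrapper hands out plain listeners and skips the tableflip
      // zero-downtime upgrade machinery entirely, which is all tests need.
      type noopBootstrapper struct{}

      func (noopBootstrapper) listen(network, addr string) (net.Listener, error) {
          return net.Listen(network, addr)
      }

      func (noopBootstrapper) wait() error {
          return nil // nothing to supervise without tableflip
      }
      ```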
  Nov 18, 2021
    • Merge branch 'smh-optimize-dataloss-query-14-4' into '14-4-stable' · c1cf3752
      Toon Claes authored
      Materialize valid_primaries view (14.4)
      
      See merge request gitlab-org/gitaly!4090
    • Materialize valid_primaries view in RepositoryStoreCollector · df6b165f
      Sami Hiltunen authored
      RepositoryStoreCollector gathers metrics on repositories which don't
      have a valid primary candidate available. This indicates the
      repository is unavailable, as the current primary is not valid and
      there are no valid candidates to fail over to. The query is currently
      extremely inefficient on some versions of Postgres, as it ends up
      computing the full valid_primaries view for each of the rows it
      checks. This doesn't seem to occur on all versions of Postgres; 12.6,
      at least, manages to push the search criteria down into the view.
      This commit fixes the situation by materializing the valid_primaries
      view prior to querying it, as sketched below. This ensures the full
      view isn't computed for all of the rows; instead, Postgres just uses
      the pre-computed result.
      
      Changelog: performance
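
      A sketch of the technique as the SQL would appear embedded in Go: the
      MATERIALIZED keyword (Postgres 12+) stops the planner from inlining
      the CTE, so the view is evaluated once and the outer query joins
      against that result. The query is a simplified stand-in for the
      collector's real one:

      ```go
      package datastore

      // The CTE shadows the valid_primaries view: inside the CTE body the
      // name still refers to the view, while the outer query sees only the
      // materialized, pre-computed rows.
      const unavailableRepositoriesQuery = `
      WITH valid_primaries AS MATERIALIZED (
          SELECT virtual_storage, relative_path, storage
          FROM valid_primaries
      )
      SELECT r.virtual_storage, COUNT(*)
      FROM repositories AS r
      WHERE NOT EXISTS (
          SELECT 1
          FROM valid_primaries AS vp
          WHERE vp.virtual_storage = r.virtual_storage
            AND vp.relative_path = r.relative_path
      )
      GROUP BY r.virtual_storage
      `
      ```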
    • Get the latest generation from repositories instead of a view · 57bef779
      Sami Hiltunen authored
      The dataloss query currently gets the latest generation of a
      repository from a view that takes the max generation from
      storage_repositories. This is unnecessary, as the repositories table
      already contains the latest generation; this commit reads it from
      there instead.
      
      Changelog: performance
  Nov 17, 2021
    • Materialize valid_primaries view in dataloss query · 6d569bb6
      Sami Hiltunen authored
      The dataloss query is extremely slow for bigger datasets. The problem
      is that for each row the dataloss query returns, Postgres computes
      the full result of the valid_primaries view only to filter it down
      to the correct record. This results in O(n²) complexity, which kills
      performance as soon as the dataset size increases. It's not clear why
      the join parameters are not pushed down into the view in this query.
      
      This commit optimizes the query by materializing the valid_primaries
      view. This ensures Postgres computes the full view only once and
      joins with the pre-computed result.
      
      Changelog: performance