Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

worktree: detect from secondary worktree if main worktree is bare #1829

Open
wants to merge 1 commit into
base: maint
Choose a base branch
from

Conversation

olga-mcbfe
Copy link

@olga-mcbfe olga-mcbfe commented Nov 14, 2024

Changes since v1:

  • no code changes
  • rebased with maint
  • CC added

Existing broken functionality forces our project to use hacks on bare repo that we'd like to avoid. I would really appreciate reviews of this patch to move closer towards fixing the issue. This is my first contribution to git/git, I apologize if I got lost in the instructions, but I tried my best to follow the rules.

CC: Patrick Steinhardt ps@pks.im, Eric Sunshine sunshine@sunshineco.com, Johannes Schindelin Johannes.Schindelin@gmx.de, René Scharfe l.s.r@web.de, Junio C Hamano gitster@pobox.com

Copy link

gitgitgadget bot commented Nov 14, 2024

Welcome to GitGitGadget

Hi @olga-mcbfe, and welcome to GitGitGadget, the GitHub App to send patch series to the Git mailing list from GitHub Pull Requests.

Please make sure that either:

  • Your Pull Request has a good description, if it consists of multiple commits, as it will be used as cover letter.
  • Your Pull Request description is empty, if it consists of a single commit, as the commit message should be descriptive enough by itself.

You can CC potential reviewers by adding a footer to the PR description with the following syntax:

CC: Revi Ewer <revi.ewer@example.com>, Ill Takalook <ill.takalook@example.net>

NOTE: DO NOT copy/paste your CC list from a previous GGG PR's description,
because it will result in a malformed CC list on the mailing list. See
example.

Also, it is a good idea to review the commit messages one last time, as the Git project expects them in a quite specific form:

  • the lines should not exceed 76 columns,
  • the first line should be like a header and typically start with a prefix like "tests:" or "revisions:" to state which subsystem the change is about, and
  • the commit messages' body should be describing the "why?" of the change.
  • Finally, the commit messages should end in a Signed-off-by: line matching the commits' author.

It is in general a good idea to await the automated test ("Checks") in this Pull Request before contributing the patches, e.g. to avoid trivial issues such as unportable code.

Contributing the patches

Before you can contribute the patches, your GitHub username needs to be added to the list of permitted users. Any already-permitted user can do that, by adding a comment to your PR of the form /allow. A good way to find other contributors is to locate recent pull requests where someone has been /allowed:

Both the person who commented /allow and the PR author are able to /allow you.

An alternative is the channel #git-devel on the Libera Chat IRC network:

<newcontributor> I've just created my first PR, could someone please /allow me? https://github.com/gitgitgadget/git/pull/12345
<veteran> newcontributor: it is done
<newcontributor> thanks!

Once on the list of permitted usernames, you can contribute the patches to the Git mailing list by adding a PR comment /submit.

If you want to see what email(s) would be sent for a /submit request, add a PR comment /preview to have the email(s) sent to you. You must have a public GitHub email address for this. Note that any reviewers CC'd via the list in the PR description will not actually be sent emails.

After you submit, GitGitGadget will respond with another comment that contains the link to the cover letter mail in the Git mailing list archive. Please make sure to monitor the discussion in that thread and to address comments and suggestions (while the comments and suggestions will be mirrored into the PR by GitGitGadget, you will still want to reply via mail).

If you do not want to subscribe to the Git mailing list just to be able to respond to a mail, you can download the mbox from the Git mailing list archive (click the (raw) link), then import it into your mail program. If you use GMail, you can do this via:

curl -g --user "<EMailAddress>:<Password>" \
    --url "imaps://imap.gmail.com/INBOX" -T /path/to/raw.txt

To iterate on your change, i.e. send a revised patch or patch series, you will first want to (force-)push to the same branch. You probably also want to modify your Pull Request description (or title). It is a good idea to summarize the revision by adding something like this to the cover letter (read: by editing the first comment on the PR, i.e. the PR description):

Changes since v1:
- Fixed a typo in the commit message (found by ...)
- Added a code comment to ... as suggested by ...
...

To send a new iteration, just add another PR comment with the contents: /submit.

Need help?

New contributors who want advice are encouraged to join git-mentoring@googlegroups.com, where volunteers who regularly contribute to Git are willing to answer newbie questions, give advice, or otherwise provide mentoring to interested contributors. You must join in order to post or view messages, but anyone can join.

You may also be able to find help in real time in the developer IRC channel, #git-devel on Libera Chat. Remember that IRC does not support offline messaging, so if you send someone a private message and log out, they cannot respond to you. The scrollback of #git-devel is archived, though.

@olga-mcbfe olga-mcbfe force-pushed the fix-bare-repo-detection-with-worktree-config branch 2 times, most recently from 47a2eea to 9e7170f Compare November 14, 2024 17:06
@Ikke
Copy link

Ikke commented Nov 15, 2024

/allow

Copy link

gitgitgadget bot commented Nov 15, 2024

User olga-mcbfe is now allowed to use GitGitGadget.

WARNING: olga-mcbfe has no public email address set on GitHub;
GitGitGadget needs an email address to Cc: you on your contribution, so that you receive any feedback on the Git mailing list. Go to https://github.com/settings/profile to make your preferred email public to let GitGitGadget know which email address to use.

@olga-mcbfe
Copy link
Author

/preview

Copy link

gitgitgadget bot commented Nov 15, 2024

Preview email sent as pull.1829.git.1731652846575.gitgitgadget@gmail.com

@olga-mcbfe
Copy link
Author

/submit

Copy link

gitgitgadget bot commented Nov 15, 2024

Submitted as pull.1829.git.1731653548549.gitgitgadget@gmail.com

To fetch this version into FETCH_HEAD:

git fetch https://github.com/gitgitgadget/git/ pr-1829/olga-mcbfe/fix-bare-repo-detection-with-worktree-config-v1

To fetch this version to local tag pr-1829/olga-mcbfe/fix-bare-repo-detection-with-worktree-config-v1:

git fetch --no-tags https://github.com/gitgitgadget/git/ tag pr-1829/olga-mcbfe/fix-bare-repo-detection-with-worktree-config-v1

Setup:
1. Have a bare repo with core.bare = true in config.worktree
2. Create a new worktree

Behavior:
From the secondary worktree the main worktree appears as non-bare.

Expected:
From the secondary worktree the main worktree should appear as bare.

Why current behavior is not good?
If the main worktree is detected as not bare it doesn't allow
checking out the branch of the main worktree. There are possibly
other problems associated with that behavior.

Why is it happening?
While we're inside the secondary worktree we don't initialize the main
worktree's repository with its configuration.

How is it fixed?
Load actual configs of the main worktree. Also, skip the config loading
step if we're already inside the current worktree because in that case we
rely on is_bare_repository() to return the correct result.

Other solutions considered:
Alternatively, instead of incorrectly always using
`the_repository` as the main worktree's repository, we can detect
and load the actual repository of the main worktree and then use
that repository's `is_bare` value extracted from correct configs.
However, this approach is a bit riskier and could also affect
performance. Since we had the assignment `worktree->repo =
the_repository` for a long time already, I decided it's safe to
keep it as it is for now; it can be still fixed separately from
this change.

Real life use case:
1. Have a bare repo
2. Create a worktree from the bare repo
3. In the secondary worktree enable sparse-checkout - this enables
extensions.worktreeConfig and keeps core.bare=true setting in
config.worktree of the bare worktree
4. The secondary worktree or any other non-bare worktree created
won't be able to use branch main (not even once), but it should be
able to.

Signed-off-by: Olga Pilipenco <olga.pilipenco@shopify.com>
@olga-mcbfe olga-mcbfe force-pushed the fix-bare-repo-detection-with-worktree-config branch from 9e7170f to 17f4b24 Compare January 16, 2025 21:14
@olga-mcbfe olga-mcbfe changed the base branch from master to maint January 16, 2025 21:18
@olga-mcbfe
Copy link
Author

/preview

Copy link

gitgitgadget bot commented Jan 16, 2025

Preview email sent as pull.1829.v2.git.1737062807126.gitgitgadget@gmail.com

@olga-mcbfe
Copy link
Author

/submit

Copy link

gitgitgadget bot commented Jan 16, 2025

Submitted as pull.1829.v2.git.1737063335673.gitgitgadget@gmail.com

To fetch this version into FETCH_HEAD:

git fetch https://github.com/gitgitgadget/git/ pr-1829/olga-mcbfe/fix-bare-repo-detection-with-worktree-config-v2

To fetch this version to local tag pr-1829/olga-mcbfe/fix-bare-repo-detection-with-worktree-config-v2:

git fetch --no-tags https://github.com/gitgitgadget/git/ tag pr-1829/olga-mcbfe/fix-bare-repo-detection-with-worktree-config-v2

Copy link

gitgitgadget bot commented Jan 19, 2025

On the Git mailing list, Eric Sunshine wrote (reply to this):

On Thu, Jan 16, 2025 at 4:35 PM Olga Pilipenco via GitGitGadget
<gitgitgadget@gmail.com> wrote:
> Setup:
> 1. Have a bare repo with core.bare = true in config.worktree
> 2. Create a new worktree
>
> Behavior:
> From the secondary worktree the main worktree appears as non-bare.
>
> Expected:
> From the secondary worktree the main worktree should appear as bare.
>
> Why current behavior is not good?
> If the main worktree is detected as not bare it doesn't allow
> checking out the branch of the main worktree. There are possibly
> other problems associated with that behavior.
>
> Why is it happening?
> While we're inside the secondary worktree we don't initialize the main
> worktree's repository with its configuration.

Okay, this is clearly a very real problem and explains this comment
added by f3534c98e4 (worktree: update is_bare heuristics, 2019-04-19):

    NEEDSWORK: If this function is called from a secondary worktree and
    config.worktree is present, is_bare_repository_cfg will reflect the
    contents of config.worktree, not the contents of the main worktree.
    This means that worktree->is_bare may be set to 0 even if the main
    worktree is configured to be bare.

(Aside: I recall reading this comment when Jonathan added it but
wasn't able to dig into it at the time to really understand it, and
never got back around to it. Now, after studying your patch, I
understand what it was about.

> How is it fixed?
> Load actual configs of the main worktree. Also, skip the config loading
> step if we're already inside the current worktree because in that case we
> rely on is_bare_repository() to return the correct result.

I found that I had to dig around a bit to fully understand the problem
expressed by this commit message. Perhaps adding a bit more detail
would help? Here's my attempt at rewriting the above (also in a way
which is more idiomatic to this project):

    When extensions.worktreeConfig is true and the main worktree is
    bare -- that is, its config.worktree file contains core.bare=true
    -- commands run from secondary worktrees incorrectly see the main
    worktree as not bare. As such, those commands incorrectly think
    that the repository's default branch (typically "main" or
    "master") is checked out in the bare repository even though it's
    not. This makes it impossible, for instance, to checkout or delete
    the default branch from a secondary worktree, among other
    shortcomings.

    This problem occurs because, when extensions.worktreeConfig is
    true, commands run in secondary worktrees only consult
    $commondir/config and $commondir/worktrees/<id>/config.worktree,
    thus they never see the main worktree's core.bare=true setting in
    $commondir/config.worktree.

    Fix this problem by consulting the main worktree's config.worktree
    file when checking whether it is bare. (This extra work is
    performed only when running from a secondary worktree.)

> Other solutions considered:
> Alternatively, instead of incorrectly always using
> `the_repository` as the main worktree's repository, we can detect
> and load the actual repository of the main worktree and then use
> that repository's `is_bare` value extracted from correct configs.
> However, this approach is a bit riskier and could also affect
> performance. Since we had the assignment `worktree->repo =
> the_repository` for a long time already, I decided it's safe to
> keep it as it is for now; it can be still fixed separately from
> this change.

I found this paragraph somewhat confusing because it seems to conflate
a repository (i.e. the shared object database) with the `struct
repository` type, and the configuration which happens to get loaded
and stored (as one of *many* members) of the repository structure. I
had to read it several times to understand that this was talking about
instantiating a separate `struct repository` initialized from the main
worktree configuration. I agree that doing so would likely be overkill
and could impact performance negatively. I understand that you added
this paragraph because SubmittingPatches suggests to do so, but I
think it can probably be omitted in this case unless it can be
rewritten to be more clear (but even then I doubt it is necessary to
keep it).

> Real life use case:
> 1. Have a bare repo
> 2. Create a worktree from the bare repo
> 3. In the secondary worktree enable sparse-checkout - this enables
> extensions.worktreeConfig and keeps core.bare=true setting in
> config.worktree of the bare worktree
> 4. The secondary worktree or any other non-bare worktree created
> won't be able to use branch main (not even once), but it should be
> able to.

This is mostly repeating what was said earlier, thus probably isn't
adding any value to the commit message. I'd probably drop it.

> Signed-off-by: Olga Pilipenco <olga.pilipenco@shopify.com>
> ---
>     Changes since v1:
>
>      * no code changes
>      * rebased with maint
>      * CC added

Sorry. I've had your v1 sitting in my ever-increasingly-large backlog
of patches to look at, but have been extra busy the last many months
and never managed to get to it.

>     Existing broken functionality forces our project to use hacks on bare
>     repo that we'd like to avoid. I would really appreciate reviews of this
>     patch to move closer towards fixing the issue. This is my first
>     contribution to git/git, I apologize if I got lost in the instructions,
>     but I tried my best to follow the rules.

Your submission is fine. Unfortunately, the project has a lack of
reviewers but no lack of submitters, so sometimes patches get
overlooked or simply buried.

> diff --git a/t/t3200-branch.sh b/t/t3200-branch.sh
> @@ -410,6 +410,20 @@ test_expect_success 'bare main worktree has HEAD at branch deleted by secondary
> +test_expect_success 'secondary worktree can switch to main if common dir is bare worktree' '

The use of "common dir" is a bit confusing. Also, this patch is fixing
the more general problem that secondary worktrees think that the bare
main worktree has a branch checked out. So, perhaps a better title
would be:

    secondary worktrees recognize core.bare=true in main config.worktree

or something?

> +       test_when_finished "rm -rf bare_repo non_bare_repo secondary_worktree" &&
> +       git init -b main non_bare_repo &&
> +       test_commit -C non_bare_repo x &&
> +
> +       git clone --bare non_bare_repo bare_repo &&
> +       git -C bare_repo config extensions.worktreeConfig true &&
> +       git -C bare_repo config unset core.bare &&
> +       git -C bare_repo config --worktree core.bare true &&
> +
> +       git -C bare_repo worktree add ../secondary_worktree &&
> +       git -C secondary_worktree checkout main
> +'

Very straightforward and exactly what I expected to see once I
understood the problem.

> diff --git a/worktree.c b/worktree.c
> @@ -65,6 +65,28 @@ static int is_current_worktree(struct worktree *wt)
> +static int is_bare_git_dir(const char *git_dir)

Nit: I wonder if a name such as is_main_worktree_bare() would clue
readers in a bit more?

> +{
> +       int bare = 0;
> +       struct config_set cs = { { 0 } };

This is not your fault since this construct is used elsewhere in this
file (from which I presume you copied it), but project consensus is
that using the notation `{{0}}` to work around a complaint from the
Apple compiler (and only the Apple compiler) should be avoided, and
that `{0}` is preferred. So, if you reroll, changing this to `{0}` may
make other reviewers happy (or you can leave it as is to be consistent
with existing precedence in this file; I don't feel strongly about
it).

> +       char *config_file;
> +       char *worktree_config_file;
> +
> +       config_file = xstrfmt("%s/config", git_dir);
> +       worktree_config_file = xstrfmt("%s/config.worktree",  git_dir);
> +
> +       git_configset_init(&cs);
> +       git_configset_add_file(&cs, config_file);
> +       git_configset_add_file(&cs, worktree_config_file);

Genuine question: I haven't thought too deeply about it, but do we
gain anything by loading $commondir/config here -- which is shared by
the main worktree and all secondary worktrees -- considering that it
was already loaded and consulted by the earlier is-bare check before
this function was even called?

> +       git_configset_get_bool(&cs, "core.bare", &bare);
> +
> +       git_configset_clear(&cs);
> +       free(config_file);
> +       free(worktree_config_file);
> +       return bare;

Everything gets cleaned up correctly. Good.

> @@ -77,18 +99,16 @@ static struct worktree *get_main_worktree(int skip_reading_head)
> +       /*
> +        * NEEDSWORK: the_repository is not always main worktree's repository
> +       */
>         worktree->repo = the_repository;
>         worktree->path = strbuf_detach(&worktree_path, NULL);

I found this new NEEDSWORK comment rather confusing the first several
times I read the patch. It wasn't until I finally realized that the
reference to `the_repository` here is the same reference to
`the_repository` in the commit message -- which confused me, as well
-- that I understood what this was trying to say. The actual problem,
of course, is that the _configuration_ stored in `the_repository` is
the secondary worktree's configuration, not the main worktree's
configuration. Considering that this patch addresses that problem, I'd
probably just drop this new comment altogether (unless, perhaps, you
rewrite it to talk about the _configuration_ stored in
`the_repository`).

> -       /*
> -        * NEEDSWORK: If this function is called from a secondary worktree and
> -        * config.worktree is present, is_bare_repository_cfg will reflect the
> -        * contents of config.worktree, not the contents of the main worktree.
> -        * This means that worktree->is_bare may be set to 0 even if the main
> -        * worktree is configured to be bare.
> -        */
> -       worktree->is_bare = (is_bare_repository_cfg == 1) ||
> -               is_bare_repository();
>         worktree->is_current = is_current_worktree(worktree);
> +       worktree->is_bare = (is_bare_repository_cfg == 1) ||
> +               is_bare_repository() ||
> +               (!worktree->is_current && is_bare_git_dir(repo_get_common_dir(the_repository)));

This is performing the expensive check only if the earlier checks left
the question unanswered. Good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants