fix: Ensure SSH connections are closed after each command execution #859
This commit addresses an issue where SSH connections were not being properly closed after each command execution, leading to timeout errors during the `kamal deploy` process.
For some reason the test on Ruby 3.1 failed. It looks like a glitch in the test run, since the change should affect all Ruby versions rather than one in particular.
    ensure
      SSHKit::Backend::Netssh.pool.close_connections
Would it be possible to run the tests again? |
I've kicked that off. I'm not sure though about this change. Creating the SSH connections can be expensive especially with large numbers of servers to deploy to. We configure keepalives with an interval of 30 seconds on the connections, so that generally should stop them from timing out. Maybe you could try reducing the keepalive interval and see if that makes any difference? |
Thanks for the suggestion, I did try to decrease the |
👍 on this one, as it's pretty much similar to delano/rye#63 I faced a while ago, and I think delano/rye#38 has more context on this |
@jfanals - we overwrite the keepalive_interval in the Kamal config, so I'm not sure it will pick up that from Could you try:
Setting the log_level to debug might also give us some useful feedback. |
I had a similar problem, which brought me to this issue. Unfortunately setting the
❯ bin/kamal config
ERROR (Kamal::ConfigurationError): ssh: unknown key: keepalive_interval
The fix for me was to increase the |
@plattenschieber how did you apply your fix? Is it through the server? |
Yes, just |
Mine does not have |
I tried setting log_level to debug and keepalive_interval to 10 by manually editing ssh.rb in /opt/homebrew/lib/ruby/gems/3.3.0/gems/kamal-2.2.2/lib/kamal/configuration, but it still doesn't work. It looks like the keepalive ping does not run while my Next.js app build runs for a long time and streams its build logs.
|
This PR actually fixed my issue. It might be worth considering merging this. |
This commit addresses an issue where SSH connections were not being properly closed after each command execution, leading to timeout errors during the `kamal deploy` process.

Problem
When executing commands using the `execute` method in `SSHKit::Runner::Parallel`, SSH connections were left open, causing subsequent commands to fail with timeout errors. This issue was particularly evident when running `kamal build deliver` after `kamal registry login` during the normal `kamal deploy` process.

Solution
The `execute` method in the `SSHKit::Runner::Parallel::CompleteAll` module has been modified to ensure that all SSH connections are closed after each command execution. This is achieved by adding an `ensure` block to the thread creation logic, which calls `SSHKit::Backend::Netssh.pool.close_connections` after each command, regardless of whether an exception occurs.

Changes
Added an `ensure` block to the `execute` method in `SSHKit::Runner::Parallel::CompleteAll` to close SSH connections after each command execution.

Impact
This change ensures that SSH connections are properly closed, preventing timeout errors and improving the reliability of the `kamal deploy` process.

Testing
Tested the changes by running `kamal deploy` commands. Verified that the timeout errors no longer occur and the deployment process completes successfully.

Closes #857
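
For readers who want to see the shape of the change without the SSHKit internals, here is a minimal sketch of the `ensure` pattern described in the Solution section. It is not the code from this PR: `FakePool` and `SketchRunner` are hypothetical stand-ins invented for illustration, and only the thread-per-host structure and the `close_connections` call in an `ensure` block mirror what the description says.

```ruby
# Hypothetical stand-in for a connection pool. It only counts checkouts and
# records how often close_connections is called; it opens no real connections.
class FakePool
  attr_reader :open_connections, :close_calls

  def initialize
    @mutex = Mutex.new
    @open_connections = 0
    @close_calls = 0
  end

  def checkout
    @mutex.synchronize { @open_connections += 1 }
  end

  def close_connections
    @mutex.synchronize do
      @open_connections = 0
      @close_calls += 1
    end
  end
end

# Hypothetical runner with one thread per host, as in the description above.
class SketchRunner
  def initialize(hosts, pool, &block)
    @hosts = hosts
    @pool = pool
    @block = block
  end

  def execute
    threads = @hosts.map do |host|
      Thread.new(host) do |h|
        # Errors are collected at join time, so silence the per-thread report.
        Thread.current.report_on_exception = false
        begin
          @pool.checkout
          @block.call(h)
        ensure
          # Runs whether the block returned normally or raised.
          @pool.close_connections
        end
      end
    end

    # Join every thread and collect failures instead of stopping at the first.
    errors = []
    threads.each do |t|
      begin
        t.join
      rescue StandardError => e
        errors << e
      end
    end
    errors
  end
end
```

Calling `SketchRunner.new(hosts, pool) { |host| ... }.execute` returns the collected errors, and the pool is closed once per host even when a block raises. The trade-off raised in the review applies to this pattern as well: closing connections after every command means each later command has to reconnect, which can be expensive when deploying to many servers.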