Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RDMA builtin support #1209

Open
wants to merge 12 commits into
base: unstable
Choose a base branch
from
Open

Conversation

pizhenwei
Copy link
Contributor

@pizhenwei pizhenwei commented Oct 22, 2024

Hi,

There are several patches in this PR:

  • Abstract set/rewrite config bind option: bind option is a special config, socket and tls are using the same one. However RDMA uses the similar style but different one. Use a bit abstract work to make it flexible for both socket and RDMA. Even for QUIC in the future.
  • Introduce closeListener for connection type: closing socket by a simple syscall would be fine, RDMA has complex logic. Introduce connection type specific close listener method.
  • RDMA: Use valkey.conf style instead of module parameters: use --rdma-bind and --rdma-port style instead of module parameters. The module style config rdma.bind and rdma.port are removed.
  • RDMA: Support builtin: support make BUILD_RDMA=yes. module style is still kept for now. Once module style is decided to drop, I volunteer to do it for TLS and RDMA together.

Each patch has independent functionality, also been tested by CI and local commands, so request no-squash merge for this PR.

Copy link

codecov bot commented Oct 22, 2024

Codecov Report

Attention: Patch coverage is 54.54545% with 40 lines in your changes missing coverage. Please review.

Project coverage is 70.71%. Comparing base (43b5026) to head (4fec63d).
Report is 3 commits behind head on unstable.

Files with missing lines Patch % Lines
src/config.c 50.84% 29 Missing ⚠️
src/server.c 27.27% 8 Missing ⚠️
src/unix.c 0.00% 3 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff              @@
##           unstable    #1209      +/-   ##
============================================
- Coverage     70.73%   70.71%   -0.03%     
============================================
  Files           116      117       +1     
  Lines         63271    63341      +70     
============================================
+ Hits          44757    44792      +35     
- Misses        18514    18549      +35     
Files with missing lines Coverage Δ
src/connection.c 78.82% <100.00%> (+0.25%) ⬆️
src/connection.h 87.64% <100.00%> (+0.43%) ⬆️
src/rdma.c 100.00% <100.00%> (ø)
src/server.h 100.00% <ø> (ø)
src/socket.c 91.52% <100.00%> (+0.40%) ⬆️
src/tls.c 100.00% <ø> (ø)
src/unix.c 73.49% <0.00%> (-2.76%) ⬇️
src/server.c 87.43% <27.27%> (-0.21%) ⬇️
src/config.c 77.63% <50.84%> (-1.21%) ⬇️

... and 12 files with indirect coverage changes

---- 🚨 Try these New Features:

@zuiderkwast zuiderkwast added the major-decision-pending Major decision pending by TSC team label Nov 1, 2024
@zuiderkwast
Copy link
Contributor

@valkey-io/core-team Please approve.

This adds make BUILD_RDMA=yes, so just like TLS it can be built as a module or as builtin. Just like for TLS, the default is no.

It is a step towards official (non-experimantal) RDMA support.

I already merged the RDMA fork-safety here: #1244. It makes sure the server doesn't crash if we try to talk to an RDMA client from a forked child process.

@pizhenwei
Copy link
Contributor Author

@valkey-io/core-team Please approve.

This adds make BUILD_RDMA=yes, so just like TLS it can be built as a module or as builtin. Just like for TLS, the default is no.

It is a step towards official (non-experimantal) RDMA support.

I already merged the RDMA fork-safety here: #1244. It makes sure the server doesn't crash if we try to talk to an RDMA client from a forked child process.

Hi @valkey-io/core-team
The CMake building support has been merged, then we can drop module build option for TLS and RDMA together.

@PingXie
Copy link
Member

PingXie commented Nov 9, 2024

The CMake building support has been merged, then we can drop module build option for TLS and RDMA together.

I vote for dropping the module packaging. Modularity is what we need and we have it here for RDMA.

@zuiderkwast
Copy link
Contributor

I believe it's safe to drop the RDMA module because it's experimental, but I'm not sure about the TLS module. Maybe someone is using it? Maybe we should drop it in a major version (9.0?).

The potential benefit of module is that you don't need OpenSSL installed to start Valkey, as long as you don't load the module. If we ship binary releases or containers with bundled modules, they can be used also by those who don't have or want TLS. Examples are embedded system, IoT devices, etc. We've already talked about packaging binaries with buldled modules like (bloom, json).

Copy link
Contributor

@zuiderkwast zuiderkwast left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I didn't review fully. Just the interface changes.

valkey.conf Outdated Show resolved Hide resolved
valkey.conf Outdated Show resolved Hide resolved
valkey.conf Outdated Show resolved Hide resolved
Originally, special config 'bind' is used for socket and TLS, however
multiple addresses handling is also workable for RDMA(QUIC in the
future). Abstract bind option as helper functions to apply more
connection types.

rewriteConfigBindOption is a local function, declare it as 'static'
function.

Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Socket family use close() syscall to close listener, however RDMA has
another style. Use an abstract function handler instead of hard code
syscall style.

Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Move 4 parameters from valkey-rdma.so to valkey-server, keep RDMA
listener similar to TCP/TLS. Also prepare to build Valkey Over RDMA
into builtin.

Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Support RDMA builtin and module together. To build a builtin version:
 $ make BUILD_RDMA=yes

Or build it by cmake:
 $ cmake .. -DBUILD_RDMA=yes

Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
@pizhenwei
Copy link
Contributor Author

Hi @zuiderkwast
Please take a loot at the latest version, modify as your suggestion, and support builtin for cmake.

Copy link
Contributor

@zuiderkwast zuiderkwast left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

Let's wait for a core team majority decision before merge since it's an interface change.

Next time, can you avoid force-push and instead just push more commits? It's easier to see what's changed since last review. We squash-merge in the end anyway, normally.

valkey.conf Outdated Show resolved Hide resolved
@pizhenwei
Copy link
Contributor Author

Next time, can you avoid force-push and instead just push more commits? It's easier to see what's changed since last review. We squash-merge in the end anyway, normally.

OK.

@zuiderkwast
Copy link
Contributor

zuiderkwast commented Nov 21, 2024

@valkey-io/core-team This adds make BUILD_RDMA=yes and some configs. Please approve or vote 👍 or 👎.

The module style configs rdma.port and rdma.bind are removed. Even if BUILD_RDMA=module, the module is using the new, normal configs. That's why it is a breaking change, but we allow it because RDMA is experimental.

@zuiderkwast zuiderkwast added the needs-doc-pr This change needs to update a documentation page. Remove label once doc PR is open. label Nov 21, 2024
Copy link
Member

@enjoy-binbin enjoy-binbin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not a full review, but LGTM.

README.md Show resolved Hide resolved
src/config.c Outdated Show resolved Hide resolved
src/config.c Outdated Show resolved Hide resolved
src/config.c Outdated Show resolved Hide resolved
pizhenwei and others added 3 commits November 21, 2024 17:01
Co-authored-by: Binbin <binloveplay1314@qq.com>
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Apply suggestion from Binbin.

Co-authored-by: Binbin <binloveplay1314@qq.com>
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Apply suggestion from Binbin.

Co-authored-by: Binbin <binloveplay1314@qq.com>
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
@zuiderkwast zuiderkwast added release-notes This issue should get a line item in the release notes breaking-change Indicates a possible backwards incompatible change labels Nov 21, 2024
Copy link
Member

@madolson madolson left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Directionally I'm inclined, I mostly just had some questions about the configs and documentation.

valkey.conf Show resolved Hide resolved
valkey.conf Outdated Show resolved Hide resolved
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
@zuiderkwast zuiderkwast removed the breaking-change Indicates a possible backwards incompatible change label Nov 22, 2024
Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
@pizhenwei
Copy link
Contributor Author

pizhenwei commented Nov 25, 2024

Hi, @madolson @zuiderkwast
Append a new commit to fix conflict against 33f42d7. Please take a look at the latest version. Thanks!

zuiderkwast pushed a commit to valkey-io/valkey-doc that referenced this pull request Nov 27, 2024
valkey-server is performance sensitive on RDMA, add man page section for deeply
performance tuning.

[Link](valkey-io/valkey#1209) for valkey
changes.

---------

Signed-off-by: zhenwei pi <pizhenwei@bytedance.com>
@zuiderkwast
Copy link
Contributor

@pizhenwei The DCO is failing because some commits are written by some email and signed off with another email. Please fix it. Feel free to squash all commits to one if you want. See DCO job Details: https://github.com/valkey-io/valkey/pull/1209/checks?check_run_id=33460190673

@PingXie @soloestoy @hwware Can we have one more ACK to merge this one? (RDMA is still experimental so it's unclear if it even needs to be a major decision, but it's marked as one now.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
major-decision-pending Major decision pending by TSC team needs-doc-pr This change needs to update a documentation page. Remove label once doc PR is open. release-notes This issue should get a line item in the release notes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants