Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[uma] Replace list of providers with explicit data and metadata providers. #641

Closed
wants to merge 3 commits into from

Conversation

igchor
Copy link
Member

@igchor igchor commented Jun 20, 2023

Existing API exposes list of memory provider but does not specify type/purpose of those providers. This makes it challenging to connect a memory allocation from memory pool with a memory provider (in case, someone wants to query properties of that memory provider or needs to get a provider-specific error message, as here: #640).

There was a discussion on #315 about passing some extra information along providers to initialize() (e.g. type) but I don't think we have a use case for that right now - passing data and metadata providers seems simpler. In future, if we'll need to pass more then 1 metadata and 1 data provider (e.g. to let pool choose) we might revert the change for initialize(). However, for querying - I think we should always return only 1 data and 1 metadata provider (those that are actually used).

by returning the name directly instead of through out param. This
function is not allowed to fail anyway.
data and metadata providers in initialize. Also, replace
get_memory_providers with get_data_memory_provider and
get_metadata_memory_provider.

Implementation of get_*_provider is supposed to be supplied
by each memory pool: this is to allow ignoring passed memory
provider (especially the metadata provider).
Copy link
Contributor

@pbalcer pbalcer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the future, we should avoid the temptation of architecting things we don't actually need yet... ;)

size_t numProviders, void *params,
uma_memory_pool_handle_t *hPool);
uma_memory_provider_handle_t data_provider,
uma_memory_provider_handle_t metadata_provider,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it is a non-flexible API. As I remember in the past you had an idea of passing a list of memory providers with the lists of types. Have you considered it as an alternative.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, as I mentioned in the PR description we might want to accept a list of provides with types. However, for that we would need to define some properties for memory providers (so that the pool can choose the appropriate one) and we don't have those right now. I feel like it's better to start with a simpler API and evolve it over time if needed.

Copy link
Contributor

@vinser52 vinser52 Jun 27, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My current feeling (and your comment confirms that) is that it is not the final API and furthermore it is specific to disjoint_pool implementation.
So why we cannot keep with the current approach when we pass just the list of memory providers and rely on the order of providers in that list?

I am worried because our goal was to upstream UMF as a part of UR but make API generic enough so that we can refer to it during discussions with other potential customers of the future standalone solution. Now it looks like we are making it USM (or disjoint_pool) specific.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't strictly need the interface to pass multiple memory providers to the memory pool right now. Even for metadata provider, it was mostly a 'future-looking' feature - the idea was that we could use it in DisjointPool (or any other) pool implementation, but we don't really use it right now (we just use regular new/delete).

As Piotr said, we should probably avoid implementing things we don't need to, perhaps for now, just having a single memory provider is enough - even without the metadata one (since we don;t have any implementation that would use it.)

I can revert the whole change that introduced the list of memory providers and just keep a single provider as it was originally. I would actually prefer that over defining which provider should be on position 0 or 1 in the list, etc. as that can change over time and we might silently break some (probably PoC) implementation silently - with function signature change, there would at least be a compilation error.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So, I think it's just a question: do we think that having separate data and metadata memory provider is useful (and generic enough) or we'd rather wait for an actual implementation that will want to leverage that distinction.

Copy link
Contributor

@vinser52 vinser52 Jun 28, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't strictly need the interface to pass multiple memory providers to the memory pool right now. Even for metadata provider, it was mostly a 'future-looking' feature - the idea was that we could use it in DisjointPool (or any other) pool implementation, but we don't really use it right now (we just use regular new/delete).

I agree with that.

I can revert the whole change that introduced the list of memory providers and just keep a single provider as it was originally.

Sounds like we came up to the conclusion that the current pool creation API and the proposed one in this PR are not required by the current UR implementation. Because disjoint_pool does not use the second memory provider today. Do we want to extend disjoint_pool to use "metadata" memory provider? If so, then the pool creation API should be able to accept more than one memory provider. But I think that this PR should be independant from #640.

My current feeling is that #640 is more important and first we should address the native error handling use case. And I think error handling should not depend on how the pool is created. Otherwise, native error handling will depend on the Pool creation API and whenever we change the pool creation API the client code for native error handling is also impacted.

We can discuss it by call. In general, we agreed in the past that API and/or architectural changes should be discussed prior to actual code changes.

@@ -49,7 +49,7 @@ struct uma_memory_provider_ops_t {
size_t *pageSize);
enum uma_result_t (*purge_lazy)(void *provider, void *ptr, size_t size);
enum uma_result_t (*purge_force)(void *provider, void *ptr, size_t size);
void (*get_name)(void *provider, const char **ppName);
const char *(*get_name)(void *provider);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: looks like a change unrelated to this PR.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes, that's why I've put it into a separate commit, it's just a small improvement

with METADATA and DATA provider specific errors.
@@ -77,7 +76,8 @@ INSTANTIATE_TEST_SUITE_P(
providerInitializeTest, providerInitializeTest,
::testing::Values(UMA_RESULT_ERROR_OUT_OF_HOST_MEMORY,
UMA_RESULT_ERROR_POOL_SPECIFIC,
UMA_RESULT_ERROR_MEMORY_PROVIDER_SPECIFIC,
UMA_RESULT_ERROR_METADATA_MEMORY_PROVIDER_SPECIFIC,
UMA_RESULT_ERROR_DATA_MEMORY_PROVIDER_SPECIFIC,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we need a test with memory provider which always returns the error. Should it be part of this PR or we are adding tests in separate PRs?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You mean memory pool that returns this error? Those two errors are supposed to be returned by umaPoolGetLastAllocationError (from my second PR) to indicate that the error is provider-specific.

There is some overlap between those 2 PR so I guess once one of the is merged, I will rebase (and I can add those tests) to the second one.

@igchor
Copy link
Member Author

igchor commented Jun 27, 2023

Also, regarding a list of memory providers - I've been thinking and perhaps a better interface might be just to expose a separate function like provider get_preferred_provider(providers...) in a memory pool API instead of passing the list to the initialize method (in case we'll need to select a provider based on some properties in future). This would simplify the most common case (just passing a single provider) and allow a memory pool to choose the best one based on some properties. But this is orthogonal to the data vs metadata provider discussion, I belive.

@igchor igchor marked this pull request as draft June 28, 2023 16:33
@igchor igchor closed this Oct 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants