Replying to both mails at once, sort of...
On 5/31/21 4:53 AM, Henri Verbeet wrote:
> Yes, mostly what Matteo said.
>
> On Fri, 28 May 2021 at 15:39, Matteo Bruni <matteo.mystral(a)gmail.com> wrote:
>> So this is, AFAIR, more or less what the staging patchset did. I don't
>> think that we want to put the data in the command stream though.
> It may not be too bad for deferred contexts, but for immediate
> contexts probably not, no.

>> Always copying the data from the user-provided source to some other
>> storage is definitely the right choice: we don't want to block the
>> application (more or less directly) to make sure that the data stays
>> around until we need it. That's bad for performance for immediate
>> contexts and just impossible for deferred, so not much to argue there.
>> At the same time we also want to minimize the amount of copies.
>> Ideally, for a GPU resource, here we'd upload the data right to the
>> GPU (e.g. inside a persistently mapped buffer), ready to be used
>> whenever it's time.

To clarify, when talking about uploading directly to the GPU, are you
just referring to map() here, or update_sub_resource() as well?

Does it even make sense to use a persistently mapped GPU buffer for a
resource mapped/updated by a deferred context? Obligatory disclaimer
that I don't know anything about GPU-side performance, but obviously you
can't reuse an earlier mapping for UpdateSubresource() or for DISCARD
maps, and I'm failing to understand why an application would ever use
NOOVERWRITE on a deferred context.
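
For reference, the pattern I understand NOOVERWRITE to exist for looks
roughly like this on an immediate context. App-side illustration only:
"context" and "buffer" are assumed to already exist, and return values
are ignored.

D3D11_MAPPED_SUBRESOURCE map_desc;

/* DISCARD hands the application a fresh piece of memory... */
ID3D11DeviceContext_Map(context, (ID3D11Resource *)buffer, 0,
        D3D11_MAP_WRITE_DISCARD, 0, &map_desc);
/* ...write the first batch of vertices... */
ID3D11DeviceContext_Unmap(context, (ID3D11Resource *)buffer, 0);

/* ...and NOOVERWRITE then promises not to touch any range the GPU may
 * still be reading, so the driver needn't stall or rename. On a
 * deferred context we can't know when the commands will execute
 * relative to GPU reads, so I don't see what the promise buys us. */
ID3D11DeviceContext_Map(context, (ID3D11Resource *)buffer, 0,
        D3D11_MAP_WRITE_NO_OVERWRITE, 0, &map_desc);
/* ...append past the previously written range... */
ID3D11DeviceContext_Unmap(context, (ID3D11Resource *)buffer, 0);
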
>> For a CPU resource instead we'd copy the data into
>> its final place in sysmem (what will eventually become its
>> "heap_memory"). We don't quite have the infrastructure for it at the
>> moment but that would be the goal IMO.
>>
>> What you do in patch 2/2 is more along the lines of this idea. The
>> sysmem allocation mechanism works okay for the time being and could be
>> extended to handle other types of memory, e.g. it could eventually
>> take charge of the vulkan allocator, making it generic and usable from
>> GL too. I don't think it has to be managed by each deferred context
>> (i.e. it should probably be per-device), although the current scheme
>> has the advantage of not having to care about concurrency.

In a sense there is no "current scheme", i.e. deferred contexts are
tracking which memory belongs to which resource because that *has* to be
done per-context, but nobody's managing custom heaps. Unless I
misunderstand what's meant here?
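
To make the concurrency point concrete, here's a purely hypothetical
sketch of what sharing an allocator across contexts would cost; none of
these names exist in the tree, and heap_alloc() is just a stand-in for
real chunk suballocation:

struct upload_heap
{
    CRITICAL_SECTION cs; /* the cost of sharing across contexts */
    struct list chunks;  /* recyclable suballocations */
};

static void *upload_heap_alloc(struct upload_heap *heap, SIZE_T size)
{
    void *mem;

    EnterCriticalSection(&heap->cs);
    mem = heap_alloc(size); /* placeholder for chunk suballocation */
    LeaveCriticalSection(&heap->cs);
    return mem;
}

A per-(deferred-)context allocator gets to skip the lock, which I take
to be the advantage mentioned above.
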
> So, at least in my view, I'd bring the allocator bits from patch 2/2
> into patch 1/2 and use that for update_sub_resource, both the deferred
> and the immediate variants (the latter could be its own patch). You'd
> only need the DISCARD portion for update_sub_resource, so the rest can
> (and probably should) stay in the map() patch.
>
> My (very rough) proposal for this would be to replace the "data" field
> in the wined3d_cs_update_sub_resource structure with a
> wined3d_bo_address, and then introduce something like the following:
>
> static bool wined3d_device_context_get_upload_bo(struct wined3d_device_context *context,
>         struct wined3d_resource *resource, unsigned int sub_resource_idx,
>         const struct wined3d_box *box, uint32_t flags,
>         struct wined3d_bo_address *address, void **map_ptr)
> {
>     ...
> }

This function strikes me as more than a little confusing, partly because
it seems to be returning two partially overlapping things, with mostly
different usages...
i.e. I'd presume the idea is for
wined3d_device_context_update_sub_resource() to retrieve this and then
call something like wined3d_context_copy_bo_address() to copy from
sysmem, which makes sense, but then what are we doing with map_ptr?
Or, if we use this in wined3d_device_context_map(), couldn't we just
call wined3d_context_map_bo_address() on the returned wined3d_bo_address?
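
To spell out the usage I'd presume, a sketch (not working code; I'm
hand-waving the size computation and the wined3d_context argument, and
assuming the flags are the usual WINED3D_MAP_* ones):

static void wined3d_device_context_update_sub_resource(struct wined3d_device_context *context,
        struct wined3d_resource *resource, unsigned int sub_resource_idx,
        const struct wined3d_box *box, const void *data, unsigned int size)
{
    struct wined3d_bo_address src, address;
    void *map_ptr;

    if (!wined3d_device_context_get_upload_bo(context, resource, sub_resource_idx,
            box, WINED3D_MAP_WRITE | WINED3D_MAP_DISCARD, &address, &map_ptr))
        return;

    src.buffer_object = 0;
    src.addr = (BYTE *)data; /* the user-provided sysmem pointer */
    wined3d_context_copy_bo_address(..., &address, &src, size);

    /* ...then queue the op with "address" replacing the old "data"
     * field. Note that map_ptr went entirely unused here, hence my
     * confusion. */
}
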
> For deferred contexts you should then be able to do something
> (conceptually) very similar to patch 2/2, and in the end we may not
> even need map/unmap/update_sub_resource in the
> wined3d_device_context_ops structure.

We'd still presumably need an unmap, though, unless the idea above is to
*always* allocate new (GPU or CPU) memory, which seems like a much
bigger step.
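
That is, even in the best case I'd expect the ops table to retain
something along these lines (the member's signature is my guess):

struct wined3d_device_context_ops
{
    ...
    /* Still needed so the client can signal it's done writing. */
    void (*unmap)(struct wined3d_device_context *context,
            struct wined3d_resource *resource, unsigned int sub_resource_idx);
};
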
Unless I'm misreading things, this also seems to imply that
*_resource_sub_resource_map() shouldn't exist anymore, right?