Buffer filled with different types of data, and strict aliasing

Asked 20/8, 2018 at 12:2 Answered 23/8, 2018 at 8:9

According to the standard, it is always undefined behavior in C++ to make, for example, a float* point to the same memory location as a int*, and then read/write from them.

In the application I have, there can be a buffer filled with 32-bit integer elements, that are overwritten by 32-bit floating point elements. (It actually contains a representation of an image, that gets transformed in multiple stages by GPU kernels, but there should also be a host implementation that does the same processing, for verification.)

The program basically does this (not actual source code):

void* buffer = allocate_buffer(); // properly aligned buffer

static_assert(sizeof(std::int32_t) == sizeof(float), "must have same size");
const std::int32_t* in = reinterpret_cast<const std::int32_t*>(buffer); 
float* out = reinterpret_cast<float*>(buffer); 
for(int i = 0; i < num_items; ++i)
   out[i] = transform(in[i]);

Is there a way to make the reinterpret_cast pointer cases well-defined, within the C++ standard, without doing additional memory copies of the whole buffer, or additional per-element copies (for example with std::bit_cast)?

Skier answered 20/8, 2018 at 12:2 Comment(10)

The standard knows nothing about GPU's. So you are already in implementation defined territory. Why not just rely on your implementation (possibly with compiler switches) making it well defined? – Artiodactyl 20/8, 2018 at 12:12

Use no-strict-aliasing flag. For std::bit_cast you will have to wait until at least C++20. There is no standard conform way without using memcpy. – Horseplay 20/8, 2018 at 12:15

why not work out what type you actually want to be working with; ie ints or floats; and then have in and out the same type. Your transform then deals with the conversion of float to int / visa versa. – Intangible 20/8, 2018 at 12:17

The first paragraph is wrong. It is OK to have pointers of different types pointing to the same location. What you aren't allowed to do is to read or write the memory as the 'wrong' type. – Jaquith 20/8, 2018 at 12:40

May find it useful to read What is strict aliasing? compilers will treat memcpy used for type punning as a noop or at least quality implementations will. As i note in my my answer that I link to we have a implementation of bit_cast you can use, although you obviously can't get constexpr w/o implementation magic. – Alice 20/8, 2018 at 13:7

Oppss just realized I forgot to include link to bit_cast proposal, fixed! – Alice 20/8, 2018 at 13:15

@Pi as I pointed out in my comment the impl of bit_cast is available but basically is just wraps memcpy the constexpr magic requires compiler support. – Alice 20/8, 2018 at 16:33

@Shafik Yaghmour Possible implementations using memcpy are also stated in the links. Thanks for the heads up. – Horseplay 21/8, 2018 at 8:53

@Shafik Yaghmour Your implementation of bit_cast is very nice! Thanks for mentioning. – Horseplay 21/8, 2018 at 9:22

@Skier See the answers to this follow-up question. – Anaphylaxis 24/8, 2018 at 17:15

Even though I wished all the time there would be a nice way, currently there is non. You will have to use no-strict-aliasing flag of the compiler of your choice.

For std::bit_cast you will have to wait until C++20. There is no standard conform way without using memcpy as far as I know.

Also have a look at this bit_cast proposal and this website.

Horseplay answered 20/8, 2018 at 12:17 Comment(7)

There is also a proposal p0593r2 which introduces std::bless that could be used in conjunction with std::launder to achieve similar functionality. – Scholarship 20/8, 2018 at 13:59

I wonder what practical (as opposed to political) difficulty there would be with saying that the "aliasing rules" only apply in cases which actually involve aliasing, recognizing that aliasing requires that a region of storage which is modified during some particular execution of a function or loop is--within said execution--accessed via two pointers or references, neither of which is visibly freshly derived [within that context] from the other. – Mucilage 20/8, 2018 at 19:21

I also wonder if there would be any difficulty with saying that the result of applying reinterpret_cast to a reference will yield a reference that may, throughout its lifetime, be used objects of either the old or new type, provided that within that lifetime either (1) the object isn't modified by any means (but may be read via any means), or (2) the object accessed exclusively via that reference and references/pointers that are derived from it. That should be easy to implement, and shouldn't interfere with any otherwise-useful optimizations. – Mucilage 20/8, 2018 at 21:26

@Mucilage Why not ask a new question? – Horseplay 21/8, 2018 at 8:53

@user2079303 Very interessting, but the link is dead. Can you provide an alternative? – Horseplay 21/8, 2018 at 9:58

@Pi: Effective discussion on the issue has been crushed by politics. For years, discussion has been dominated by arguments over what the Standard requires. Such arguments fail to recognize that the Standard allows implementations that are of such low quality as to be useless, and makes no effort to fully describe what a high-quality implementation must do to be suitable for any particular purpose. I doubt the authors of gcc would be willing to admit that they've been fighting for the right to label a deliberately-inferior compiler as "conforming", but they've become... – Mucilage 21/8, 2018 at 14:31

...heavily invested in an optimzer design that really isn't suitable for low-level programming, and insist that any code which isn't suitable for use with it is "broken". – Mucilage 21/8, 2018 at 14:32

How about using a union? For example:

union T {
    std::int32_t i;
    float f;
}

T* buffer = allocate_buffer();
for(int i = 0; i < num_items; ++i)
    buffer[i].f = transform(buffer[i].i);

Halsey answered 23/8, 2018 at 8:9 Comment(0)

Recommended topics

Hot tags