How to allocate memory for an array and a struct in one malloc call without breaking strict aliasing?

Asked 9/2, 2018 at 21:24 Answered 9/2, 2018 at 21:37

Solved c memory-management strict-aliasing

When allocating memory for a variable sized array, I often do something like this:

struct array {
    long length;
    int *mem;
};

struct array *alloc_array( long length)
{
    struct array *arr = malloc( sizeof(struct array) + sizeof(int)*length);
    arr->length = length;
    arr->mem = (int *)(arr + 1); /* dubious pointer manipulation */
    return arr;
}

I then use the arrray like this:

int main()
{
    struct array *arr = alloc_array( 10);
    for( int i = 0; i < 10; i++)
        arr->mem[i] = i;
    /* do something more meaningful */
    free( arr);
    return 0;
}

This works and compiles without warnings. Recently however, I read about strict aliasing. To my understanding, the code above is legal with regard to strict aliasing, because the memory being accessed through the int * is not the memory being accessed through the struct array *. Does the code in fact break strict aliasing rules? If so, how can it be modified not to break them?

I am aware that I could allocate the struct and array separately, but then I would need to free them separately too, presumably in some sort of free_array function. That would mean that I have to know the type of the memory I am freeing when I free it, which would complicate code. It would also likely be slower. That is not what I am looking for.

Candlestick answered 9/2, 2018 at 21:24 Comment(1)

Prefer sizeof expr over sizeof(TYPE). Repeating the type is error-prone. – Paternalism 9/2, 2018 at 23:42

The proper way to declare a flexible array member in a struct is as follows:

struct array {
    long length;
    int mem[];
};

Then you can allocate the space as before without having to assign anything to mem:

struct array *alloc_array( long length)
{
    struct array *arr = malloc( sizeof(struct array) + sizeof(int)*length);
    arr->length = length;
    return arr;
}

Hydrazine answered 9/2, 2018 at 21:27 Comment(7)

This works only if the struct contains one variable length element. If there are more, then as far as I know pointer manipulation like I outlined is necessary. Is it legal with regard to strict aliasing rules? – Candlestick 9/2, 2018 at 21:32

@Candlestick Your question didn't specify that. In that case, you won't have any strict aliasing violations because the memory in question was malloc'ed and does not yet have a type, however you may run into alignment issues. – Hydrazine 9/2, 2018 at 21:36

@Candlestick Your best bet in that case would probably be to simply do separate allocations for the struct and the arrays it contains. You need to know something about what you free when you free it anyway, and you won't see any noticeable change in speed. – Hydrazine 9/2, 2018 at 21:38

@Hydrazine are you saying that doubling the number of malloc and free calls in the program will not cause a noticeable change in speed? – Centenarian 9/2, 2018 at 21:39

flexible array member is the way to go as long as only one such member is needed at the end (and at least one other member exist). – Aikens 9/2, 2018 at 21:45

Now if someone could just convince the developers of glibc that that is now the proper way -- we could all get on the same sheet of paper and dispense with the struct-hack forever, e.g. glibc - struct dirent – Rustie 9/2, 2018 at 23:53

Even if that works, why would you do something complicated and non-obvious, which as far as I can see offers no advantage over the normal, everyday method of simply assigning allocated memory to a pointer? – Quartziferous 10/2, 2018 at 3:50

Modern C officially supports flexible array members. So you can define your structure as follows:

struct array {
    long length;
    int mem[];
};

And allocate it as you do now, without the added hassle of dubious pointer manipulation. It will work out of the box, all the access will be properly aligned and you won't have to worry about dark corners of the language. Though, naturally, it's only viable if you have a single such member you need to allocate.

As for what you have now, since allocated storage doesn't have a declared type (it's a blank slate), you aren't breaking strict aliasing, since you haven't given that memory an effective type. The only issue is with possible mess-up of alignment. Though that's unlikely with the types in your structure.

Dunt answered 9/2, 2018 at 21:28 Comment(1)

Instead of saying “Modern”, you should specify that it’s been available since C99. – Ema 10/2, 2018 at 2:55

I believe the code as written does violate strict aliasing rules, when standard read in the strictest sense.

You are accessing an object of type int through a pointer to unrelated type array. I believe, that an easy way out would be to use starting address of the struct, and than convert it char*, and perform a pointer arithmetic on it. Example:

void* alloc = malloc(...);
array = alloc;
int* p_int = (char*)alloc + sizeof(array);

Centenarian answered 9/2, 2018 at 21:37 Comment(9)

Is (char*)alloc + sizeof(array); certainly aligned for int? alloc is OK, yet sizeof(array) is not specified to be a multiple of int. I'd expect that to fail though only an a hostile or unicorn platform. – Aikens 9/2, 2018 at 21:43

@chux I have all but forgotten about alignment. Haven't dealt with platforms which care about it for a while. I was focusing on aliasing question, but of course, alignment could be very important. – Centenarian 9/2, 2018 at 21:48

Same for me about alignment until a picky platform and a dozen bus-faults later. – Aikens 9/2, 2018 at 21:50

@chux which one it was if you do not mind me asking? Last align-conscious platform I dealt with was Sparc around 8 years ago or so. – Centenarian 9/2, 2018 at 21:53

Various PICs these days that do not like 2-byte int on odd boundary. Since then I have become more alignment-aware. – Aikens 9/2, 2018 at 21:54

IAC, even on platforms that tolerate unusual alignments (do not bus-fault), native aliment for the type can result in better performance. – Aikens 9/2, 2018 at 21:59

@chux can? Sure. However, the platform I am using now exclusively, x86_64, does not care about alignment, and I have grown lax. – Centenarian 9/2, 2018 at 22:4

@Centenarian Yes it does, unaligned accesses are slower. However on ARM (the now-most-popular general purpose computing platform?) unaligned accesses do raise an exception. – Bedew 10/2, 2018 at 2:54

Sergey and @immibis: breaking C alignment rules on x86-64 can result in correctness problems, not just performance, when gcc auto-vectorizes: #47511283. And BTW, unaligned access is only slower on modern x86-64 if it crosses a cache-line boundary (or maybe a 32-byte boundary on AMD CPUs). Also, if you misalign an atomic_int across a cache-line boundary, it won't be atomic anymore (for load or store, and for atomic RMW it will be very slow.) – Jaf 10/2, 2018 at 3:5

Recommended topics

Hot tags