memory alignment within gcc structs

Asked 31/3, 2010 at 15:27 Answered 30/8, 2018 at 20:51

I am porting an application to an ARM platform in C, the application also runs on an x86 processor, and must be backward compatible.

I am now having some issues with variable alignment. I have read the gcc manual for __attribute__((aligned(4),packed)) I interpret what is being said as the start of the struct is aligned to the 4 byte boundry and the inside remains untouched because of the packed statement.

originally I had this but occasionally it gets placed unaligned with the 4 byte boundary.

typedef struct  
{  
 unsigned int code;  
 unsigned int length;  
 unsigned int seq;  
 unsigned int request;  
 unsigned char nonce[16];  
 unsigned short  crc;  
} __attribute__((packed)) CHALLENGE;

so I change it to this.

typedef struct  
{  
 unsigned int code;  
 unsigned int length;  
 unsigned int seq;  
 unsigned int request;  
 unsigned char nonce[16];  
 unsigned short  crc;  
} __attribute__((aligned(4),packed)) CHALLENGE;

The understand I stated earlier seems to be incorrect as both the struct is now aligned to a 4 byte boundary, and and the inside data is now aligned to a four byte boundary, but because of the endianess, the size of the struct has increased in size from 42 to 44 bytes. This size is critical as we have other applications that depend on the struct being 42 bytes.

Could some describe to me how to perform the operation that I require. Any help is much appreciated.

Kissee answered 31/3, 2010 at 15:27 Comment(0)

If you're depending on sizeof(yourstruct) being 42 bytes, you're about to be bitten by a world of non-portable assumptions. You haven't said what this is for, but it seems likely that the endianness of the struct contents matters as well, so you may also have a mismatch with the x86 there too.

In this situation I think the only sure-fire way to cope is to use unsigned char[42] in the parts where it matters. Start by writing a precise specification of exactly what fields are where in this 42-byte block, and what endian, then use that definition to write some code to translate between that and a struct you can interact with. The code will likely be either all-at-once serialisation code (aka marshalling), or a bunch of getters and setters.

Doggett answered 31/3, 2010 at 16:2 Comment(7)

While I agree with everything else, I'm not sure why you recommend using a char array. – Elam 31/3, 2010 at 16:52

@Roger: I'm presuming that the OP needs to hold the struct in-memory in the mandated form as well as in a form they can more easily manipulate - unless you're making some other point which I've missed? – Doggett 31/3, 2010 at 17:12

@crazy: The OP is apparently fine with using a packed struct for the in-memory representation of the data file, which makes using a char array equivalent to using &struct_obj as a char array (by casting it to a char pointer) and only using the first 42 bytes. If he wants to abandon the packing, then there might be a need---temporarily. But even in that case, I'd just use buffered operations (e.g. FILE) and read each member. – Elam 31/3, 2010 at 18:3

The data struct is essentially a data packet, just before sending I ensure htonl/htons are used on the relevent members, I think that marshalling will ne the right option. I will look at how easy it is to implement as there are about 100 structs that are similar. Thank you very much for you reply – Kissee 31/3, 2010 at 21:3

@Mumbles: If you can use C++ instead of C, you can get it done by writing just a tiny bit of code for each struct (similar to how boost::serialize works). Otherwise (or even in C++, depending), I'd generate the code for your structs so you can use the same input file to generate the serialization functions and always know they're in sync. – Elam 31/3, 2010 at 21:28

@Roger: Unfortunately this part of the code has to stay in C, I think I am going to have to create serialization functions for my structs, Thank you very much. – Kissee 1/4, 2010 at 7:37

@Mumbles: With 100 structs you'd be well advised to automate that process. At a previous workplace we had a very powerful perl-based code generator which did this for us. It was fiendishly complex, but output for multiple languages and allowed structs to contain other structs... – Doggett 1/4, 2010 at 8:9

This is one reason why reading whole structs instead of memberwise fails, and should be avoided.

In this case, packing plus aligning at 4 means there will be two bytes of padding. This happens because the size must be compatible for storing the type in an array with all items still aligned at 4.

I imagine you have something like:

read(fd, &obj, sizeof obj)

Because you don't want to read those 2 padding bytes which belong to different data, you have to specify the size explicitly:

read(fd, &obj, 42)

Which you can keep maintainable:

typedef struct {
  //...
  enum { read_size = 42 };
} __attribute__((aligned(4),packed)) CHALLENGE;

// ...

read(fd, &obj, obj.read_size)

Or, if you can't use some features of C++ in your C:

typedef struct {
  //...
} __attribute__((aligned(4),packed)) CHALLENGE;
enum { CHALLENGE_read_size = 42 };

// ...

read(fd, &obj, CHALLENGE_read_size)

At the next refactoring opportunity, I would strongly suggest you start reading each member individually, which can easily be encapsulated within a function.

Elam answered 31/3, 2010 at 16:4 Comment(0)

I've been moving structures back and forth from Linux, Windows, Mac, C, Swift, Assembly, etc.

The problem is NOT that it can't be done, the problem is that you can't be lazy and must understand your tools.

I don't see why you can't use:

typedef struct  
{  
 unsigned int code;  
 unsigned int length;  
 unsigned int seq;  
 unsigned int request;  
 unsigned char nonce[16];  
 unsigned short  crc;  
} __attribute__((packed)) CHALLENGE;

You can use it and it doesn't require any special or clever code. I write a LOT of code that communicates to ARM. Structures are what make things work. __attribute__ ((packed)) is my friend.

The odds of being in a "world of hurt" are nil if you understand what is going on with both.

Finally, I can't for the life make out how you get 42 or 44. Int is either 4 or 8 bytes (depending on the compiler). That puts the number at either 16+16+2=34 or 32+16+2=50 -- assuming it is truly packed.

As I say, knowing your tools is part of your problem.

Capsaicin answered 30/8, 2018 at 20:51 Comment(4)

its safer to use uint32_t for unsigned ints and uint32_t for unsigned shorts. – Papism 30/10, 2018 at 6:36

No doubt you meant uint16_t for unsigned shorts. – Capsaicin 1/4, 2019 at 19:58

Puzzled by "safer". What you mean is that it is not confusing as to the number of bytes. The bottom line is if you do not know your tools, do not know the number of bytes, etc. You will crash and burn. As for int32_t, yes, that is better than int. As int16_t is better than short. (or uintxx_t depending if sign is an issue) – Capsaicin 10/12, 2019 at 21:52

Yeah, I meant uint16_t for unsigned shorts. What I meant by safer, is that if you are passing this struct between several different computers ( say a 16 bit machine, a 32 bit machine and a 64 bit machine), Each of them could have a different length for an unsigned int/ unsigned short. Because c++ doesn't make any guarantees to the size. Which make the struct useless between multiple machines. – Papism 21/1, 2020 at 5:3

What is your true goal?

If it's to deal with data that's in a file or on the wire in a particular format what you should do is write up some marshaling/serialization routines that move the data between the compiler struct that represents how you want to deal with the data inside the program and a char array that deals with how the data looks on the wire/file.

Then all that needs to be dealt with carefully and possibly have platform specific code is the marshaling routines. And you can write some nice-n-nasty unit tests to ensure that the marshaled data gets to and from the struct properly no matter what platform you might have to port to today and in the future.

Maloy answered 31/3, 2010 at 19:45 Comment(1)

The goal of this struct is to be network packet. I very much like the idea of having an internal structure that is aligned by the compiler so that it fits correctly, and then only construct this packet as and when needed. – Kissee 31/3, 2010 at 21:14

I would guess that the problem is that 42 isn't divisible by 4, and so they get out of alignment if you put several of these structs back to back (e.g. allocate memory for several of them, determining the size with sizeof). Having the size as 44 forces the alignment in these cases as you requested. However, if the internal offset of each struct member remains the same, you can treat the 44 byte struct as though it was 42 bytes (as long as you take care to align any following data at the correct boundary).

One trick to try might be putting both of these structs inside a single union type and only use 42-byte version from within each such union.

Externality answered 31/3, 2010 at 15:55 Comment(1)

Note that this "back to back" allocation happens automatically in arrays, which is why the size of the type must include those padding bytes to maintain alignment. You can't change array layout with any tricks, and I would not suggest using them anyway. – Elam 31/3, 2010 at 16:18

-4

As I am using linux, I have found that by echo 3 > /proc/cpu/alignment it will issue me with a warning, and fix the alignment issue. This is a work around but it is very helpful with locating where the structures are failing to be misaligned.

Kissee answered 1/4, 2010 at 14:47 Comment(0)

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Recommended topics

Hot tags