I have trouble understanding this passage from Bjarne Stroustrup's FAQ:
However, most modern processors cannot read or write a single character, it must read or write a whole word, so the assignment to c really is ``read the word containing c, replace the c part, and write the word back again.'' Since the assignment to b is similar, there are plenty of opportunities for the two threads to clobber each other even though the threads do not (according to their source text) share data!
So how can char arrays exist without 3 (or 7?) bytes of padding between elements?
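To make the quoted description concrete, here is a minimal sketch of how I read it, assuming a hypothetical machine that can only load and store whole 32-bit words. The names `store_byte` and `lost_update_demo` are made up for illustration, and the "threads" are replayed by hand in one unlucky order rather than run concurrently. Note that the four chars stay packed in one word with no padding; the masking is what replaces "the c part".

```cpp
#include <cstdint>

// Hypothetical helper: emulate storing one byte on a machine that can only
// load and store whole 32-bit words. The caller passes the word it loaded
// and gets back the word it must write.
std::uint32_t store_byte(std::uint32_t word, unsigned index, std::uint8_t value) {
    const unsigned shift = index * 8;                         // bit offset of the byte in the word
    const std::uint32_t mask = std::uint32_t{0xFF} << shift;  // selects only that byte
    return (word & ~mask) | (std::uint32_t{value} << shift);  // keep the other bytes, replace one
}

// Replay one unlucky interleaving of two "threads" that each write a
// different byte of the same word. No real threads are used here, so the
// result is deterministic.
std::uint32_t lost_update_demo() {
    std::uint32_t memory = 0;                  // four packed chars, all zero

    std::uint32_t seen_by_a = memory;          // thread A loads the word
    std::uint32_t seen_by_b = memory;          // thread B loads the same word

    memory = store_byte(seen_by_a, 0, 0x11);   // A writes back: 0x00000011
    memory = store_byte(seen_by_b, 1, 0x22);   // B writes back its stale copy: 0x00002200

    return memory;                             // A's byte has been clobbered
}
```

With this interleaving, `lost_update_demo()` returns `0x00002200` rather than `0x00002211`: the second write-back restores the old value of byte 0 even though neither "thread" ever named the other's byte in its source.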