Assignment operator sequencing in C11 expressions
Asked Answered
C

3

12

Introduction

The C11 standard (ISO/IEC 9899:2011) has introduced a new definition of side effect sequencing within an expression (see related question). The sequence point concept has been complemented with sequenced before and sequenced after relations which are now the basis for all definitions.

Section 6.5 "Expressions", point 2 says:

If a side effect on a scalar object is unsequenced relative to either a different side effect on the same scalar object or a value computation using the value of the same scalar object, the behavior is undefined. If there are multiple allowable orderings of the subexpressions of an expression, the behavior is undefined if such an unsequenced side effect occurs in any of the orderings.

Later on, section 6.5.16 "Assignment operators", point 3 states:

The side effect of updating the stored value of the left operand is sequenced after the value computations of the left and right operands. The evaluations of the operands are unsequenced.

Problem

The first quoted paragraph (6.5/2) is supported by two examples (same as in the C99 standard):

First example

a[i++] = i;  //! undefined
a[i] = i;    //  allowed

This can be easily explained with the definitions:

  1. If a side effect on a scalar object is unsequenced relative to (...) a value computation using the value of the same scalar object, the behavior is undefined. (6.5/2),
  2. The evaluations of the operands are unsequenced. [within an assignment] (6.5.16/3).

So, the side effect of i++ (LHS) is unsequenced with i (RHS), which gives undefined behaviour.

Second example

i = ++i + 1; //! undefined
i = i + 1;   //  allowed

This code, however, seems to result in a defined behaviour in both given cases as:

  1. the side effect of updating the stored value of the left operand is sequenced after the value computations of the left and right operands.

So, the execution of ++i + 1 shall precede the side effect of updating i, which means that there is not a side effect on a scalar object unsequenced relative to either a different side effect on the same scalar object or a value computation using the value of the same scalar object.

Question

It is easy to explain these examples with the terms and definitions presented by the C99 standard (see related question). But why is i = ++i + 1 undefined according to C11's terminology?

Cantus answered 24/3, 2014 at 17:9 Comment(0)
R
7

Update

I am changing my answer here, this is not well defined in C11 although it is in C++11. The key here is that the result of ++i is not an lvalue and therefore does not require an lvalue-to-rvalue conversion after ++i is evaluated and so we can not be assured that the result of ++i will be read afterwards. Which is different than C++ and so the defect report I originally linked to hinges on this critical fact:

[...] the lvalue expression ++i and then do an lvalue-to-rvalue conversion on the result. guarantees that the incrementation side-effect is sequenced before the computation of the addition operation[...]

we can see this by going to the C11 draft standard section 6.5.3.1 Prefix increment and decrement operators which says:

[...]The expression ++E is equivalent to (E+=1).[...]

and then section 6.5.16 Assignment operators which says (emphasis mine going forward):

An assignment operator stores a value in the object designated by the left operand. An assignment expression has the value of the left operand after the assignment,111 but is not an lvalue.[...]

and footnote 111 says:

The implementation is permitted to read the object to determine the value but is not required to, even when the object has volatile-qualified type.

There is no requirement to read the object to determine it's value even if it is volatile.

Original Answer

As far as I can tell this is actually well defined and this example was removed from the C++ draft standard which uses similar language. We can see this in 637. Sequencing rules and example disagree which says:

the following expression is still listed as an example of undefined behavior:

i = ++i + 1;

However, it appears that the new sequencing rules make this expression well-defined:

and the resolution was to strike the prefix example and use the postfix example instead which is clearly undefined:

Change the example in 1.9 [intro.execution] paragraph 16 as follows:

i = ++i i++ + 1; // the behavior is undefined

Rim answered 24/3, 2014 at 17:24 Comment(8)
++i has different meaning in C and C++.Roborant
@Roborant it looks like I was not correct, since the result of ++i is not lvalue that changes the semantics.Rim
Yes. That's what I wanted to say :)Roborant
Does it mean that i = ++i + 1 is well-defined in C++, while still having undefined behaviour in C11? After rewriting it to i = (i+=1) + 1 we end up with two sequenced assignments, or a sub-expression sequenced before addition, right? So it should be well-defined also in C11.Cantus
@KrzysztofAbramowicz the major difference is that in C++ assignment returns an lvalue while in C11 it specifically says but is not an lvalue.. So in C++ the fact that it returns an lvalue requires an lvalue-to-rvalue conversion of the result which requires a read of the the result which creates the sequenced before relationship that does not exist in C11.Rim
@KrzysztofAbramowicz defect report 222 goes into these differences in more detail. I would not call this intuitive reading.Rim
In the C case I don't see how the lvalue conversion is relevant - Jens' answer seems to me to hit the key pointConfute
"not well defined in C11 although it is in C++11" In retrospect this wasn't the case - this mess wasn't clarified until way later in C++17.Businesslike
V
4

The standard stipulates for assignment (6.5.16) as you are citing correctly

The side effect of updating the stored value of the left operand is sequenced after the value computations of the left and right operands.

(The increment operator is not different, it is just an assignment in disguise)

This means that there are two value computations (left and right) and the side effect of the assignment is then sequenced after these. But it is only sequenced against the value computations, not against the side effects that these may have. So at the end we are faced with two side effects (of the = operator and the ++ operator) that are not sequence with respect to each other.

Vannesavanness answered 27/3, 2014 at 19:32 Comment(2)
My last thoughts on that topic went exactly in that direction... but I wasn't sure if the two-assignments interpretation is actually an argument for or against undefined behaviour :) Are you sure that value computation does not include any side effects implied by used operators? Are operand evaluation, value computation and side effect execution three different aspects of expression evaluation, which are subject to independent sequencing rules?Cantus
@KrzysztofAbramowicz if it is not explicitly specified, then it is undefined.Rickirickie
R
3

But why is i = ++i + 1 undefined according to C11's terminology?

C11 says that the side effect on left i is sequenced but not the value computations (evaluations) of left and right i.
It is obvious that the side effect on LHS will take place after the evaluation of expressions on LHS and RHS.
To explain this a better example could be

int i = 1;
i = i++ + 3;

(First let's assume that this example will not invoke UB). Now the final value of i could be 4 or 2.
Case 1.
Left i is fetched and then it is incremented and 3 is added to it and finally 4 is assigned to i.
Case 2.
Left i is fetched and then 3 is added to it and then 4 is assigned to i and finally i is incremented. In this case the final value of i is 2.
Although the side effect on left i is sequenced the final value stored to i is not defined, i.e it is not necessarily by the assignment and hence the side effect on i is unsequenced.

Roborant answered 24/3, 2014 at 17:16 Comment(10)
But why should I count LHS's i as a value computation? The current value doesn't matter and is not used anywhere.Cantus
The LHS i will be evaluated before assigning the final value to it.Roborant
What does evaluation actually mean in this context? Within the execution environment, i is just an arithmetic object in memory, or even a register -- what else can be done here beyond just assigning the new value (which is the side effect)?Cantus
You need to fetch its address before assigning a value to it.Roborant
I updated my answer to clear this concept a bit more.Roborant
That's right, but fetching its address does not touch its value, so the fetch may interfere with the incrementation. Is it a standard requirement to count any identifier appearance as evaluation?Cantus
@KrzysztofAbramowicz The quote you gave says that the assignment is sequenced after the value computations, not side effects of the expressions. This allows calculating the result of ++i without performing the side effect of storing the calculated value until later.Electroplate
Sorry, I don't know about that. AFAIK, evaluation of left operand of = means fetching its address.Roborant
++i is defined as i += 1, whereas i++ has no such equivalence, so I don't think it is useful to explain ++i by analogy to a case involving i++Confute
@M.M; It will have the same effect in case of i = ++i + 3 too. Side effect on i is unsequenced.Roborant

© 2022 - 2024 — McMap. All rights reserved.