Code with undefined behavior in C#
Asked Answered
K

8

54

In C++ there are a lot of ways that you can write code that compiles, but yields undefined behavior (Wikipedia). Is there something similar in C#? Can we write code in C# that compiles, but has undefined behavior?

Keavy answered 7/12, 2009 at 15:15 Comment(0)
J
43

As others have mentioned, pretty much anything in the "unsafe" block can yield implementation-defined behaviour; abuse of unsafe blocks allows you to change the bytes of code that make up the runtime itself, and therefore all bets are off.

The division int.MinValue/-1 has an implementation-defined behaviour.

Throwing an exception and never catching it causes implementation-defined behaviour -- terminate the process, start a debugger, and so on.

There are a number of other situations in C# where we are forced to emit code which has implementation-determined behaviour. For example, this situation:

https://learn.microsoft.com/en-us/archive/blogs/ericlippert/odious-ambiguous-overloads-part-two

However, the situations in which a safe, well-behaved C# program has implementation-defined behaviour should be quite rare.

Joni answered 7/12, 2009 at 16:7 Comment(11)
Ouch, that overload issue is really weird. And I thought DivisionByZeroException was the expected behavior on 1/0. Anyway, I'll +1 this tomorrow...Aconcagua
Who said anything about division by zero?Joni
@Eric: I thought the corner case you mentioned was the usual 1/0. I searched around and I found it's actually int.MinValue/-1.Aconcagua
Does anyone know or can anyone speculate why int.MinValue/-1 isn't or couldn't be defined in the language specification?Morelli
@NickStrupat: This is a question and answer site; if you have a question, post a question!Joni
@NickStrupat: See explanation by Hans PassantEgor
Are you aware, that implementation defined and undefined behavior are two totally different things in c++?Houck
@MikeMB: I am. It's a subtle distinction that I usually don't bother making in the context of C# unless I need to be extraordinarily precise. The fact that the C++ specification authors felt the need to make a taxonomy of ways in which the language can trip you up without telling you is not something I consider to be a strength of the language.Joni
The reason why I'm asking is, because - while answering another questions - I was wondering whether it is possible to cause UB by non-unsafe C#-code-blocks. In c++ it makes actually an enormous difference whether something is implementation defined or undefined behavior. The former just results in non-portable code, the latter results in - by definition - completely unpredictable behavior of your program. But from your answer, it isn't obvious, whether the other examples are IB or genuinely (c++ like) UB. And yes, all the UB clauses in the standard are definitively not my favorite part of c++.Houck
@MikeMB: Ah, that is a different kettle of fish altogether. Yes, there are ways you can mess stuff up without having any marked-as-unsafe code in your program, but this comment is maybe not a great place to enumerate them. \Joni
NP. I think the other answers (especially by Henk and Brett) cover my particular question.Houck
S
35

Yes! There is, even in a safe context! (Well, it's implementation defined to be undefined, at least)

Here's one from Marek Safar and VSadov in the Roslyn issues.There is a mismatch between C# and the CLI in regards to bool.

C# believes that there is only one kind of true, and one kind of false.

CLI believes that false is a byte containing 0, and all other values are true.

This discrepancy means we can coerce C# to do some a (marginally) interesting things thing:

//non-standard bool
//We're setting a bool's value to a byte value of 5.
var a = new bool[1];
Buffer.SetByte(a, 0, 5);

//non-standard bool
//We're setting a bool's value to a byte value of 10.
var b = new bool[1];
Buffer.SetByte(b, 0, 10);

//Both are true.
Console.WriteLine(a[0]);
Console.WriteLine(b[0]);

//But they are not the same true.
Console.WriteLine(a[0] == b[0]);

The above outputs:

true

true

false

Interestingly, the debugger disagrees (must evaluate truth differently?)

enter image description here

Anyways, the conclusion the C# team appears to have come to is (emphasis added):

I.E. the language will stay entirely unconcerned about nonstandard bools. The particular implementation (as in MS C# on CIL) will acknowledge the existence of nonstandard bools and specify their behavior as undefined

Saprolite answered 3/10, 2014 at 5:28 Comment(1)
This actually made me laugh out loud.Thrombophlebitis
H
12

Looking at the Wikipedia article on undefined behaviour, the situations in which undefined behavior happens are either not allowed or throw an exception in C#.

However in Unsafe code, undefined behavior I believe is possible, as that allows you to use pointers etc.

Edit: It looks like I'm right: http://msdn.microsoft.com/en-us/library/aa664771%28VS.71%29.aspx

Has an example of undefined behavior in c#

Heather answered 7/12, 2009 at 15:17 Comment(4)
When you state "the Wiki", to what Wiki are you referring?Sombrous
@Jeff Yates: The "undefined behavior" link up in the question goes to Wikipedia.Caldera
@Jeff Yates: Yeah, I was looking to his wikipedia link to determine what situations cause undefined behaviors.Heather
Link is dead, this answer does not say anything.Rescript
H
11

According to the ECMA-334 document (p. 473):

A program that does not contain any occurrences of the unsafe modifier cannot exhibit any undefined behavior.

That promotes 'implementation-defined' to the worst case, see Eric Lippert's answer.

Holusbolus answered 7/12, 2009 at 22:11 Comment(1)
Unspecified, rather than Implementation-Defined, at least as the term is used in the C spec. If the spec regards an implementation as Implementation-Defined, then conforming implementations are required to provide sufficient documentation to let a programmer know exactly how it will behave. An implementation could legitimately specify that (signed char)128 would yield -126 during the first half of each phase of the moon, and -127 during the second half, but only if the implementation actually ascertains the phase of the Moon. Implementations cannot simply say "yields arbitrary value".Hoffert
H
1

Many and subprograms have requirements that can be summarized as:

  1. When given valid data, produce valid output.

  2. Refrain from launching nuclear missiles or negating the laws of time and causality, even when given invalid input.

One of the major design goals of Java and .NET languages is that unless code makes use of certain which are marked as "unsafe", no particular effort is generally required to meet the second constraint above [though some behaviors related to garbage collection and Finalize can be a little weird from the time/causality standpoint, those can be described as exceptions to normal rules of causality, rather than a total revocation of them]. That situation is very different from the situation in C, where many kinds of data-dependent errors (e.g. integer overflow) may result in compilers behaving in arbitrary fashion including making whatever assumptions would be necessary to avoid overflow. The truly horrible kinds of Undefined Behavior which are encouraged in hypermodern C philosophy do not exist in C# or other .NET languages outside of "unsafe" blocks.

Hoffert answered 22/6, 2015 at 20:20 Comment(0)
M
0

Not really in the exactly Wiki sense but I suppose the most obvious example that comes to my mind is simply writing some threaded code, but then it's like that in any language.

Metralgia answered 7/12, 2009 at 15:19 Comment(2)
That's just nondeterministic, not undefined. You're still given a guarantee of exactly what each thread will be doing. You just don't know the exact order in which things happen, and so you might get some surprising results.Dalliance
@mfloryan: unfortunately, whenever you spawn a new thread in C# you can't expect a game of NetHack to be started. Ah, good days when you could play it by running gcc...Aconcagua
D
0

The C# spec lists a few more undefined, implementation-defined, and unspecified behaviors: https://learn.microsoft.com/en-us/dotnet/csharp/language-reference/language-specification/portability-issues ... these are defined in the spec as follows:

behavior, implementation-defined – unspecified behavior where each implementation documents how the choice is made

behavior, undefined – behavior, upon use of a non-portable or erroneous construct or of erroneous data, for which this specification imposes no requirements

behavior, unspecified – behavior where this specification provides two or more possibilities and imposes no further requirements on which is chosen in any instance

https://learn.microsoft.com/en-us/dotnet/csharp/language-reference/language-specification/terms-and-definitions

The spec evolves and the list is too long for a StackOverflow answer, plus it's not all-encompassing. As the spec only has one term for "undefined behavior" ("Undefined behavior is indicated in this specification only by the words ‘undefined behavior.’") googling "undefined" site:https://learn.microsoft.com/en-us/dotnet/csharp/language-reference/" reveals some interesting tidbits:

Field Initialization across Partial Classes is Undefined

Field initialization order can be significant within C# code, and some guarantees are provided, as defined in §15.5.6.1. Otherwise, the ordering of members within a type is rarely significant, but may be significant when interfacing with other languages and environments. In these cases, the ordering of members within a type declared in multiple parts is undefined.

Pattern Evaluation Order is Undefined

The order in which patterns are checked is undefined. At run time, the right-hand nested patterns of or and and patterns can be checked first.

Furthermore, regarding list patterns:

The following assumptions are made on the members being used:

The property that makes the type countable is assumed to always return a non-negative value, if and only if the type is indexable. For instance, the pattern { Length: -1 } can never match an array.

The member that makes the type sliceable is assumed to be well-behaved, that is, the return value is never null and that it is a proper subslice of the containing list.

The behavior of a pattern-matching operation is undefined if any of the above assumptions doesn't hold.

https://learn.microsoft.com/en-us/dotnet/csharp/language-reference/proposals/csharp-11.0/list-patterns

Likewise, Collection Literal behavior is undefined if collections are not well-behaved:

Collections are assumed to be well-behaved. For example:

It is assumed that the value of Count on a collection will produce that same value as the number of elements when enumerated.

The types used in this spec defined in the System.Collections.Generic namespace are presumed to be side-effect free. As such, the compiler can optimize scenarios where such types might be used as intermediary values, but otherwise not be exposed.

It is assumed that a call to some applicable .AddRange(x) member on a collection will result in the same final value as iterating over x and adding all of its enumerated values individually to the collection with .Add.

The behavior of collection literals with collections that are not well-behaved is undefined.

https://learn.microsoft.com/en-us/dotnet/csharp/language-reference/proposals/csharp-12.0/collection-expressions

default(Nullable).Value is implementation-defined (not undefined)

The elements of an array can be initialized to known values when the array is created. Beginning with C# 12, all of the collection types can be initialized using a Collection expression. Elements that aren't initialized are set to the default value. The default value is the 0-bit pattern. All reference types (including non-nullable types), have the values null. All value types have the 0-bit patterns. That means the Nullable.HasValue property is false and the Nullable.Value property is undefined. In the .NET implementation, the Value property throws an exception.

Deliquescence answered 7/1, 2024 at 12:3 Comment(0)
G
-2

In general I would say no.

Use Automatic variable before it’s initialized.

All variables must be initialized. If not an exception occurs.

Division by zero

Exception is thrown.

Indexing an array out of bounds

Exception is thrown

As Aequitarum Custos pointed out you can use unsafe code. Then again this isn't really C#, you are explicitly opting out of the C# environment.

Gory answered 7/12, 2009 at 15:26 Comment(3)
Unsafe code is C#. What you're opting out of is the managed code guarantees.Aconcagua
This is not C# unsafe { int* px1; int* px2 = &i; F(out px1, ref px2); Console.WriteLine("*px1 = {0}, *px2 = {1}", *px1, *px2); // undefined behavior } Yes it's written with in C#.Gory
Sure, that's perfectly legal C#. An implementation of C# that implements the optional "unsafe" subset is still an implementation of C#.Joni

© 2022 - 2025 — McMap. All rights reserved.