Why do structs need to be boxed?

Asked 30/12, 2009 at 5:43 Answered 23/2, 2013 at 3:33

In C#, any user-defined struct is automatically a subclass of ~~System.Struct~~ System.ValueType and ~~System.Struct~~ System.ValueType is a subclass of System.Object.

But when we assign some struct to object-type reference it gets boxed. For example:

struct A
{
    public int i;
}

A a;
object obj = a;  // boxing takes place here

So my question is: if A is an descendant of System.Object, can't the compiler up-cast it to object type instead of boxing?

Bradybradycardia answered 30/12, 2009 at 5:43 Comment(0)

A struct is a value type. System.Object is a reference type. Value types and reference types are stored and treated differently by the runtime. For a value type to be treated as a reference type, it's necessary for it to be boxed. From a low level perspective, this includes copying the value from the stack where it originally lives to the newly allocated memory on the heap, which also contains an object header. Additional headers are necessary for reference types to resolve their vtables to enable virtual method dispatches and other reference type related features (remember that a struct on stack is just a value and it has zero type information; it doesn't contain anything like vtables and can't be directly used to resolve dynamically dispatched methods). Besides, to treat something as a reference type, you have to have a reference (pointer) to it, not the raw value of it.

So my question is - if A is an descendant of System.Object, can't compiler upcast it to object type instead of boxing?

At a lower level, a value does not inherit anything. Actually, as I said before, it's not really an object. The fact that A derives from System.ValueType which in turn derives from System.Object is something defined at the abstraction level of your programming language (C#) and C# is indeed hiding the boxing operation from you pretty well. You don't mention anything explicitly to box the value so you can simply think the compiler has "upcasted" the structure for you. It's making the illusion of inheritance and polymorphism for values while none of the tools required for polymorphic behavior is directly provided by them.

Fertilization answered 30/12, 2009 at 5:47 Comment(6)

Good answer. Couple minor problems with it though. First, the stack vs heap is irrelevant to boxing; value types need not be on the stack, and they are boxed even if they are on the heap. Second, virtual methods are irrelevant; boxing is never needed to dispatch a virtual method on a struct! Since all structs are sealed, the jitter has sufficient information to exactly determine which method is called at jit time. – Fanciful 30/12, 2009 at 7:20

Eric: I knew you're going to comment on that. I mentioned the stack and heap metaphor mainly to point out you'll need to have some kind of pointer to it. Regarding your second point, I think you're referring to the constrained IL instruction. What I meant though is calling something like ToString on a struct casted to System.Object or say, IComparable.CompareTo on a boxed integer statically typed as IComparable. I think vtable lookup is required here, isn't it? – Fertilization 30/12, 2009 at 7:30

An invocation of a method on a boxed value type is treated as a "vtable" call, yes; the jitter has no reason to believe that it's anything special. (The question of whether in practice calls to interface methods are what a C++ compiler writer would strictly speaking think of as a "vtable" call is an interesting one but not that germane to this question.) But many people incorrectly believe that calling interface methods on an unboxed struct actually boxes the struct and then does the virtual call; why would the jitter go to all that trouble when the method is already named in the metadata? – Fanciful 30/12, 2009 at 7:37

True. I've had this discussion with people who think 2.ToString() will box 2. By the way, is it possible to demonstrate this fact with C# code only? I mean, short of disassembly or digging through WinDbg... System.Object does not provide a method that mutates a boxed value and I don't know a way to prove this. – Fertilization 30/12, 2009 at 7:43

(And yes, the .constrained prefix instruction helps hint to the jitter that a particular invocation can skip the boxing. If you're interested in how interface dispatch does not actually work the same as virtual method calls, here's an old article that explains it: msdn.microsoft.com/en-us/magazine/cc163791.aspx#S12) – Fanciful 30/12, 2009 at 7:43

Hmm, interesting question. Nothing immediately comes to mind. – Fanciful 30/12, 2009 at 7:46

Here's how I prefer to think about it. Consider the implementation of a variable containing a 32 bit integer. When treated as a value type, the entire value fits into 32 bits of storage. That's what a value type is: the storage contains just the bits that make up the value, nothing more, nothing less.

Now consider the implementation of a variable containing an object reference. The variable contains a "reference", which could be implemented in any number of ways. It could be a handle into a garbage collector structure, or it could be an address on the managed heap, or whatever. But it's something which allows you to find an object. That's what a reference type is: the storage associated with a variable of reference type contains some bits that allow you to reference an object.

Clearly those two things are completely different.

Now suppose you have a variable of type object, and you wish to copy the contents of a variable of type int into it. How do you do it? The 32 bits that make up an integer aren't one of these "reference" things, it's just a bucket that contains 32 bits. References could be 64 bit pointers into the managed heap, or 32 bit handles into a garbage collector data structure, or any other implementation you can think of, but a 32 bit integer can only be a 32 bit integer.

So what you do in that scenario is you box the integer: you make a new object that contains storage for an integer, and then you store a reference to the new object.

Boxing is only necessary if you want to (1) have a unified type system, and (2) ensure that a 32 bit integer consumes 32 bits of memory. If you're willing to reject either of those then you don't need boxing; we are not willing to reject those, and so boxing is what we're forced to live with.

Fanciful answered 30/12, 2009 at 7:32 Comment(6)

Eric, as always, very nice explanation! Though could you please expand a little bit on what you mean when you say that 'boxing is necessary if you want to have a unified type system.' I am failing to grasp how 'boxing' unifies the type system. Thanks! – Aile 30/12, 2009 at 18:2

Thinking about it a bit more, are you suggesting that C# prefers a system where the developer can 'treat' value types similar to a reference type without needing to understand how they are actually implemented by .NET CLR? And to achieve this, 'boxing' becomes necessary evil? What would it look like if you chose to avoid 'boxing', how would a developer interact with value types/reference types? – Aile 30/12, 2009 at 18:6

Let me rephrase. Three desirable things: (1) value types only contain their data and therefore have a different representation than reference types, (2) all values can be converted to a common unified type, object, and (3) value types never need to be "boxed". Those three desirable things are mutually impossible; you can have at most two of them. We've chosen to have (1) and (2); not having (3) is the price you pay. – Fanciful 30/12, 2009 at 21:58

Similarly, you probably want your camera to be (1) cheap, (2) lightweight, and (3) take good pictures. You only get two out of the three; which two you choose is up to you, but you don't get all three. – Fanciful 30/12, 2009 at 22:0

A couple of nits: (1) The necessity of boxing depends upon whether one wants value types with a number of significant bits close to the number of bits in an object reference, to be passed as an object. If an object reference is 64 bits but will never need more than 2^48 distinct object instances, one could if desired avoid boxing for all pre-defined value types 32 bits and smaller, as well as all doubles within the range +/- 2^512, all Int64's in the range +/- 2^55, and all UInt64's in the range 0..2^56-1, and perhaps some other types as well. Bigger numbers would still need to be boxed. – Tungus 18/11, 2011 at 22:35

Also, (2) While it would be possible to define a framework in which standalone heap objects could provide either value-type or reference-type semantics, the .net framework uses a somewhat simplified model in which all heap objects have reference-type semantics. This greatly simplifies some operations (e.g. it means a structure assignment or MemberwiseClone can perform a simple bit-for-bit copy) but compels many classes to use reference-type semantics even when value semantics would be a more natural fit for what the programmer is doing. – Tungus 18/11, 2011 at 22:45

While the designers of .NET certainly didn't need to include boxing section 4.3 of the C# Language Specification explains the intent behind it quite well, IMO:

Boxing and unboxing enables a unified view of the type system wherein a value of any type can ultimately be treated as an object.

Because value types are not reference types (which System.Object ultimately is), the act of boxing exists in order to have a unified type system where the value of anything can be represented as an object.

This is different from say, C++ where the type system isn't unified, there isn't a common base type for all types.

Untold answered 30/12, 2009 at 5:53 Comment(5)

Strictly speaking, not everything derives from object. On the type side, pointer types are neither convertible to nor derived from object. Type parameter types and interface types do not derive from object but are always convertible to object. It is values of non-pointer types which always derive from object. Except for the null value of reference types, which derives from nothing, not being an object. A reference that refers to nothing does not refer to an object; such a reference is convertible to object but does not derive from object. – Fanciful 30/12, 2009 at 15:46

@Eric Lippert: Changed the answer to reflect your concerns. – Untold 30/12, 2009 at 18:49

According to the C# specification, all value types are derived from reference types (System.ValueType and System.Enum), but are not reference types. Which is nonsense & makes me suspect that the C# specification is inaccurate. @EricLippert: I'd be interested in your opinion of my answer. – Seaplane 22/2, 2013 at 19:54

@stakx: I think your answer confuses more than it explains. Of course all value types are derived from a reference type; there is no contradiction there. The derivation relationship does not require that there be any commonality in the representation. Yes, the CLI specification is concerned with representation, but that's an irrelevant implementation detail from the C# programmer's perspective. – Fanciful 22/2, 2013 at 20:13

@EricLippert: The fact that if T genuinely derives from U, a conversion from T to U is identity-preserving is not an "irrelevant implementation detail". The C# spec may use a definition of "derives" which does includes types whose conversions are not identity preserving, but that doesn't mean such a definition is helpful. – Tungus 22/2, 2013 at 21:31

struct is a value-type by design, hence it needs to be boxed when turned into a reference type. struct derives from System.ValueType, which in term derives from System.Object.

The mere fact that struct is a descendant of object, does not mean much..since the CLR deals with structs differently at runtime than a reference type.

Adeline answered 30/12, 2009 at 5:50 Comment(0)

After the question has been answered I'll present a little "trick" related to that topic:

structs can implement interfaces. If you pass a value type to a function that expects an interface that this value type implements the value normally gets boxed. Using generics you can avoid the boxing:

interface IFoo {...}
struct Bar : IFoo {...}

void boxing(IFoo x) { ... }
void byValue<T>(T x) : where T : IFoo { ... }

var bar = new Bar();
boxing(bar);
byValue(bar);

Gio answered 30/12, 2009 at 9:47 Comment(0)

"If struct A is an descendant of System.Object, can't the compiler up-cast it instead of boxing?"

No, simply because according to the definition of the C# language, "up-casting" in this case is boxing.

The language specification for C# contains (in chapter 13) a catalogue of all possible type conversions. All these conversions are categorized in a specific fashion (e.g. numeric conversions, reference conversions, etc.).

There are implicit type conversions from a type S to its super-type T, but these are only defined for the pattern "from a class type S to a reference type T". Because your struct A is not a class type, these conversions cannot be applied in your example.

That is, the fact that A is (indirectly) derived from object (while correct) is simply irrelevant here. What is relevant is that A is a struct value type.
The only existing conversion that matches the pattern "from a value type A to its reference super-type object" is categorized as a boxing conversion. Thus every conversion from a struct to object is by definition considered boxing.

Seaplane answered 23/2, 2013 at 3:33 Comment(0)

Hot tags

Godot Unity Godot Help Programming Godot 4.X GUI GDScript 3D 2D Physics CSharp Godot 3.X VR XR Projects C++

Recommended topics

Hot tags