Slicing a Span<T> row from a 2D matrix - not sure why this works

Asked 3/1, 2018 at 0:24 Answered 3/1, 2018 at 1:43

I've been looking for a way to extract slices from a 2D matrix without having to actually reallocate-copy the contents, and

public static Span<float> Slice([NotNull] this float[,] m, int row)
{
    if (row < 0 || row > m.GetLength(0) - 1) throw new ArgumentOutOfRangeException(nameof(row), "The row index isn't valid");
    return Span<float>.DangerousCreate(m, ref m[row, 0], m.GetLength(1));
}

I've checked this method with this simple Unit tests and apparently it works:

[TestMethod]
public void Foo()
{
    float[,] m =
    {
        { 1, 2, 3, 4 },
        { 5, 6, 7, 8 },
        { 9, 9.5f, 10, 11 },
        { 12, 13, 14.3f, 15 }
    };
    Span<float> s = m.Slice(2);
    var copy = s.ToArray();
    var check = new[] { 9, 9.5f, 10, 11 };
    Assert.IsTrue(copy.Select((n, i) => Math.Abs(n - check[i]) < 1e-6f).All(b => b));
}

This doesn't seem right to me though. I mean, I'd like to understand what's exactly happening behind the scenes here, as that ref m[x, y] part doesn't convince me.

How is the runtime getting the actual reference to the value at that location inside the matrix, since the this[int x, int y] method in the 2D array is just returning a value and not a reference?

Shouldn't the ref modifier only get a reference to the local copy of that float value returned to the method, and not a reference to the actual value stored within the matrix? I mean, otherwise having methods/parameters with ref returns would be pointless, and that's not the case.

I took a peek into the IL for the test method and noticed this:

Now, I'm not 100% sure since I'm not so great at reading IL, but isn't the ref m[x, y] call being translated to a call to that other Address method, which I suppose just returns a ref value on its own?

If that's the case, is there a way to directly use that method from C# code?

And is there a way to discover methods like this one, when available?

I mean, I just noticed that by looking at the IL and I had no idea it existed or why was the code working before, at this point I wonder how much great stuff is there in the default libs without a hint it's there for the average dev.

Thanks!

Province answered 3/1, 2018 at 0:24 Comment(7)

In your unit test I do not see you calling Slice so I do not understand how you are testing it. – Sarmentum 3/1, 2018 at 0:34

Why do you assume this[int x, int y] returns a value? You can do m[2, 1] = 7;, correct? – Crosslet 3/1, 2018 at 0:39

@Crosslet True, but since I'm using the getter part of the this[int, int] method on a 2D Array, that returns an int value, I'm not sure I understand your question. My point here is that the getter in question is only supposed to return a value, and I wasn't excepting the compiler to replace that call with an entirely different method (apparently), and I'd like to know the implementation details or what's exactly happening. – Province 3/1, 2018 at 0:46

@Sarmentum It's the exact same code of the extension method, I've used the code in the Unit test to inspect the IL because it was just easier to read. But sure, I'll go ahead and refactor the question using the first method, you're right. – Province 3/1, 2018 at 0:48

How are you using the "getter part" of the indexer? – Crosslet 3/1, 2018 at 0:52

Wondering where you found "Span<float>.DangerousCreate" because I am on 7.3 and downloaded the System.Memory and still don't have it. – Masonry 19/6, 2018 at 21:58

@Masonry IIRC they've since moved that method in another class, can't recall which one right now. if you go to the official repo you should be able to just look for that name and check where it's currently located. – Province 19/6, 2018 at 22:2

It seems to me that the crux of your confusion is here:

Shouldn't the ref modifier only get a reference to the local copy of that float value returned to the method, and not a reference to the actual value stored within the matrix?

You seem to be under the mistaken impression that the indexer syntax for an array works exactly the same as for other types. But it doesn't. An indexer for an array is a special case in .NET, and treated as a variable, not a property or pair of methods.

For example:

void M1()
{
    int[] a = { 1, 2, 3 };

    M2(ref a[1]);
    Console.WriteLine(string.Join(", ", a);
}

void M2(ref int i)
{
    i = 17;
}

yields:

1, 17, 3

This works because the expression a[1] is not a call to some indexer getter, but rather describes a variable that is physically located in the second element of the given array.

Likewise, when you call DangerousCreate() and pass ref m[row, 0], you are passing the reference to the variable that is exactly the element of the m array at [row, 0].

Since a reference to the actual memory location is what's being passed, the rest should be no surprise. That is, that the Span<T> class is able to then use that address to wrap a specific subset of the original array, without allocating any extra memory.

Lhary answered 3/1, 2018 at 1:2 Comment(0)

Standard 1D (SZ) arrays have three opcodes to work with them - ldelem, stelem, and ldelema. They represent the actions that can be performed on a variable - getting its value, setting its value, and obtaining a reference to it. a[i] syntax is just translated to whatever represents what you do with the element. Other variables have similar opcodes (ldloc, stloc, ldloca; ldfld, stfld, ldflda etc.)

However, these opcodes cannot be used with multidimensional arrays. Quoting ECMA-335:

For one-dimensional arrays that aren’t zero-based and for multidimensional arrays, the array class provides a Get method.

For one-dimensional arrays that aren’t zero-based and for multidimensional arrays, the array class provides a StoreElement [sic] method

For one-dimensional arrays that aren’t zero-based and for multidimensional arrays, the array class provides an Address method.

The StoreElement method has been since renamed to Set, but this still holds. Accesing elements of a multidimensional array is translated to whatever action you perform on them.

This triplet of methods have these signatures:

instance int32 int32[0...,0...]::Get(int32, int32)
instance void int32[0...,0...]::Set(int32, int32, int32)
instance int32& int32[0...,0...]::Address(int32, int32)

These intrinsic methods are implemented by the CLR. Notice the reference returned by the last method. While the ability to return a reference has been added to C# quite recently, CLI supported it from the beginning.

Also notice that at no point an indexer is involved. In fact, arrays don't even have an indexer, because that is a C# thing and it is not sufficient to implement all actions for a variable, because the get reference accessor is missing.

To sum things up, a[x] on an array and a[x] on a non-array (any object with an indexer) are massively different things.

By the way, DangerousCreate also works thanks to this statement (ECMA-335 again):

Array elements shall be laid out within the array object in row-major order (i.e., the elements associated with the rightmost array dimension shall be laid out contiguously from lowest to highest index). The actual storage allocated for each array element can include platform-specific padding.

Internship answered 3/1, 2018 at 1:43 Comment(0)