most efficient way of swapping values c++
Asked Answered
H

5

7

I was wondering what the most efficient, in terms of operations, way of swapping integers is in c++, and why? Is something like:

int a =..., b = ...;
a = a + b;
b = a - b;
a = a - b;

more efficient than using a temporary? Are there any other more efficient ways? (not asking for just other ways to swap the ints) and why would they be more efficient?

Helbona answered 2/2, 2016 at 13:12 Comment(2)
I would suggest std::swapDailey
On a modern machine, that's possibly the slowest way to swap integers. If you had a machine with two registers it could be a good idea, particulary if it had a drum memory.Gaona
B
11

Assigning values is always faster than doing arithmetic operations.

C++ implementation for std::swap is

template<typename T> void swap(T& t1, T& t2) {
    T temp = std::move(t1); // or T temp(std::move(t1));
    t1 = std::move(t2);
    t2 = std::move(temp);
}

So to use a temporary variable is better than doing arithmetic trick.
And to use std::swap is even better because To reinvent the wheel in programming is never a good idea

Bolinger answered 2/2, 2016 at 13:21 Comment(2)
It's a possible implementation, yes. But not necessarily what will be called for integers. It's just a reasonable default.Cavitation
It may also be done as t1 = std::exchange(t2, t1);Cavitation
I
7

The best way is to trust your compiler and use the C++ standard library functions. They are designed for each other.

std::swap will win.

You could use an XOR swap for an int (which doesn't require a temporary), but these days it would still perform less well than std::swap.

Is answered 2/2, 2016 at 13:14 Comment(5)
Ok thanks, didn't realize standard functions would be faster than a few lines of code.Helbona
I would add that it will perform less well than std::swap, because std::swap may do the swap with a single machine instruction on certain architectures.Cavitation
@MaraJade My rule of thumb is try it with the standard provided functions/constructs. If you profile and find that they are not performant enough then look for a replacement.Dailey
Also note that in the rare case where handwritten code performs better than a standard library function that does the same thing, it is likely that you have found a performance bug. So don't be afraid to contact your compiler writer/standard library maintainer in such cases.Mispleading
And XOR swap fails if you accidentally try to swap a value with itself.Runlet
C
5

In my case, std::swap is 5% slower than the following (both with O3 optimization). In general, std::swap() function calls copy constructor that will be probably always slower than just copy part of memory.

#include <cstring>

size_t objectSize = sizeof(Object);
char temp[objectSize];

loop {
    loop {
        memcpy(temp, a, objectSize);
        memcpy(a, b, objectSize);
        memcpy(b, temp, objectSize);
    }
}

Edit: Using stack instead of heap memory allocation.

Chan answered 7/9, 2018 at 19:6 Comment(4)
Can I also use this to swap uint64_t several million times, or is it only beneficial for large object elements?Demmer
I think, standard swap of values will be faster in this case. But you have to try it.Chan
But memcpy can break object consistency in c++.Ensiform
@Ensiform Could you please explain how object consistency will be broken?Chan
M
1

Most efficient way is to NOT try to do it yourself. It really depends on why/were you want to do this. Trying to be clever and writing obscure code in C++ only reduces the chance of the compiler to optimize it correctly.

Lets say we use the ±-way you wrote: First the values a and b have to be loaded from memory. Then you are doing 3 arithmetic-operations to "swap" their content. And lastly the 2 values have to be stored in memory again. (Not gonna use actual assembly-code as i am not well versed with it and this pseudo-assembly is easier to get the concept accross )

load a into register rA
load b into register rB
add rB to rA and store in rA
subtract rB from rA and stor in rB
subtract rB from rA and store in rA
store register rA to memory b
store register rB to memory a

If the compiler would do exactly what you wanted (likely he will ignore it and make it better) that would be: 2 loads, 3 simple math-funtions, 2 stores - 7 operations.

It could also do slightly better as addition/subtraction can be done with 1 value from memory.

load 'a' into register rA
add b to rA and store in rA
subtract b from rA and store in rB
subtract rB from rA and store in rA
store rA to a
store rB to b

If we use an extra tmp-variable:

int a =..., b = ...;
int tmp = a;
a = b;
b = tmp;

The compiler will likely recognize that "tmp" is only a temporary variable only used for swapping the 2 values so it would not assign it a memory location btu only use registers. In that case what it would do is something along the lines of:

load a into register rA
load b into register rB
store register rA to memory b
store register rB to memory a

Only 4 operations - Basically the fastest it can do it as you need to load 2 values and you need to store 2 values and nothing else. (for moder nx86_64 processors there is no command that would just swap 2 values in memory - other architectures might have it and be even faster in that case).

Doing those arithmetic operations (or the xor-trick) is a nice excercise but on modern x86 CPUs with all but the most basic compilers it will not be "more efficient" in any form. It will user just as many registers, the same amount of memory for the variables, but require more instructions to do the same job. In general you should not attempt to outsmart the compiler unless you have checked your code, tested and benchmarked it and found that the generated assembly is not as good as it can be.

But it is nearly never needed to go to that level for optimisation and your time is better spent looking at the larger picture.

Mckenzie answered 15/12, 2020 at 13:37 Comment(0)
L
0
#include <iostream>
using namespace std;

void swap(int &a, int &b){
    b = (a+b) - (a=b);
}

int main() {
    int a=1,b=6;
    swap(a,b);
    cout<<a<<b;
    return 0;
}
Laburnum answered 25/8, 2018 at 5:17 Comment(1)
That's undefined behavior.Ensiform

© 2022 - 2025 — McMap. All rights reserved.