How do you reverse a string in place in C or C++?

M

21

197

How do you reverse a string in C or C++ without requiring a separate buffer to hold the reversed string?

Microcopy answered 13/10, 2008 at 16:36 Comment(0)

P

134

The standard algorithm is to use pointers to the start / end, and walk them inward until they meet or cross in the middle. Swap as you go.

Reverse ASCII string, i.e. a 0-terminated array where every character fits in 1 char. (Or other non-multibyte character sets).

void strrev(char *head)
{
  if (!head) return;
  char *tail = head;
  while(*tail) ++tail;    // find the 0 terminator, like head+strlen
  --tail;               // tail points to the last real char
                        // head still points to the first
  for( ; head < tail; ++head, --tail) {
      // walk pointers inwards until they meet or cross in the middle
      char h = *head, t = *tail;
      *head = t;           // swapping as we go
      *tail = h;
  }
}

// test program that reverses its args
#include <stdio.h>

int main(int argc, char **argv)
{
  do {
    printf("%s ",  argv[argc-1]);
    strrev(argv[argc-1]);
    printf("%s\n", argv[argc-1]);
  } while(--argc);

  return 0;
}

The same algorithm works for integer arrays with known length, just use tail = start + length - 1 instead of the end-finding loop.

(Editor's note: this answer originally used XOR-swap for this simple version, too. Fixed for the benefit of future readers of this popular question. XOR-swap is highly not recommended; hard to read and making your code compile less efficiently. You can see on the Godbolt compiler explorer how much more complicated the asm loop body is when xor-swap is compiled for x86-64 with gcc -O3.)

Ok, fine, let's fix the UTF-8 chars...

(This is XOR-swap thing. Take care to note that you must avoid swapping with self, because if *p and *q are the same location you'll zero it with a^a==0. XOR-swap depends on having two distinct locations, using them each as temporary storage.)

Editor's note: you can replace SWP with a safe inline function using a tmp variable.

#include <bits/types.h>
#include <stdio.h>

#define SWP(x,y) (x^=y, y^=x, x^=y)

void strrev(char *p)
{
  char *q = p;
  while(q && *q) ++q; /* find eos */
  for(--q; p < q; ++p, --q) SWP(*p, *q);
}

void strrev_utf8(char *p)
{
  char *q = p;
  strrev(p); /* call base case */

  /* Ok, now fix bass-ackwards UTF chars. */
  while(q && *q) ++q; /* find eos */
  while(p < --q)
    switch( (*q & 0xF0) >> 4 ) {
    case 0xF: /* U+010000-U+10FFFF: four bytes. */
      SWP(*(q-0), *(q-3));
      SWP(*(q-1), *(q-2));
      q -= 3;
      break;
    case 0xE: /* U+000800-U+00FFFF: three bytes. */
      SWP(*(q-0), *(q-2));
      q -= 2;
      break;
    case 0xC: /* fall-through */
    case 0xD: /* U+000080-U+0007FF: two bytes. */
      SWP(*(q-0), *(q-1));
      q--;
      break;
    }
}

int main(int argc, char **argv)
{
  do {
    printf("%s ",  argv[argc-1]);
    strrev_utf8(argv[argc-1]);
    printf("%s\n", argv[argc-1]);
  } while(--argc);

  return 0;
}

Why, yes, if the input is borked, this will cheerfully swap outside the place.
Useful link when vandalising in the UNICODE: http://www.macchiato.com/unicode/chart/
Also, UTF-8 over 0x10000 is untested (as I don't seem to have any font for it, nor the patience to use a hexeditor)

Examples:

$ ./strrev Räksmörgås ░▒▓○◔◑◕●

░▒▓○◔◑◕● ●◕◑◔○▓▒░

Räksmörgås sågrömskäR

./strrev verrts/.

Pairs answered 13/10, 2008 at 16:58 Comment(19)

Why not "*p ^= *q, *q ^= *p, *p ^= *q"? – Flophouse 13/10, 2008 at 17:19

I'd say that if you are going to ask for "In place" without being more specific, it HAS to be the xor thing. Anything else isn't in-place. That said, this has no business being in production code anywhere ever. if you're ever even tempted to use it, quit engineering now. – Meadowsweet 13/10, 2008 at 17:24

You think "in-place" means "no extra memory", not even O(1) memory for temporaries? What about the space on the stack for str and the return address? – Flophouse 13/10, 2008 at 17:30

@Bill, that's not what the common definition of “in-place” means. In-place algorithms may use additional memory. However, the amount of this additional memory must not depend on the input – i.e. it must be constant. Therefore, swapping of values using additional storage is completely in-place. – Berlinda 13/10, 2008 at 17:31

Oh no, Räksmörgås! Now I have to go to the fridge and make me one. :P – Nissy 13/10, 2008 at 22:4

Perhaps you'd like to check this question I've asked specifically about how to handle this task with UTF8 strings (not an universal solution, though). #199760 – Sorites 13/10, 2008 at 22:41

XOR swapping is slower than swapping via register on modern out-of-order processors. – Augmented 15/8, 2011 at 17:42

I googled "in place string reversal" specifically to find an implementation of the XOR trick. Haters gonna hate, but I found what I was looking for so +1 from me. – Haft 2/10, 2012 at 22:6

I ran some tests, and it seems that the "while (q && *q)" test should be changed. It should be "while (*q)" or changed to a separate "if (!q) { return; }" (if the intention was to catch NULL pointers). The problem is in the NULL case, the next statement, for(--q; p < q; ++p, --q), causes q to roll over to 0xFFFFFFFFFFFFFFFF (at least on the 64-bit CentOS machine I was testing on), which will cause the for loop to execute (since 0 is less than 0xFFFFFFFFFFFFFFFF, and the first "*p = *p ^ *q;" will then cause a de-reference of a null pointer, and a de-reference of 0xFFFFFFFFFFFFFFFF... – Rico 13/7, 2013 at 20:6

Guys, most of you are wrong about definition of in-place. It has a strong definition: in-place algorithm uses O(logn) memory. Fox example do you consider quicksort in-place? It is in place and it uses logarithmic memory for storing data in recursion stack. – Tara 18/7, 2013 at 20:40

@Tara You are wrong. In-place means one thing, and one thing only: constant memory, not O(logn). And you got it: quicksort isn’t in-place. Whoever claims that is wrong. In fact, many (ostensibly “in-place”) implementations of quicksort even use O(n) additional memory in the worst case (since they don’t guarantee logarithmic recursion depth. But like I said, quicksort isn’t really in-place to begin with. – Berlinda 16/10, 2013 at 17:25

@KonradRudolph, yeah, I got it. Perhaps some time ago I read this article en.wikipedia.org/wiki/In-place_algorithm (section "In computational complexity") and forgot that this log n was applied there in a little bit different context. – Tara 16/10, 2013 at 19:26

@AndersEurenius can you clarify on why do use use while(q && *q) ++q; to find the eos? Isn't it the same as while(*q) ++q;, since q is never going to be 0? How can you be sure that q or *q are going to be 0? – Knitted 17/1, 2014 at 11:9

"you must avoid swapping with self, because a^a==0" - wrong. a ^ a == 0, but that's not a problem, because then you will do a ^ (a ^ a) which is a ^ 0 which is a. So the XOR swap works even if the two swappees (is there such a word) are equal. – Felicafelicdad 21/1, 2014 at 15:52

char *q = p; while(q && *q) ++q; for(--q; p < q; ++p, --q) to do NULL detection is problematic. For the code progress to --q when q is NULL and at best that is the largest legal pointer, at worst it may be UB. Then the for() loop would iterate for a long time. Suggest a simple if (q != NULL) { do the rest of code }. – Dorweiler 13/6, 2014 at 22:8

As an interviewer, I dock points for use of the old xor-swap trick. It's less efficient on modern processors and anybody who uses it because it's "neat" is not a great programmer -- "good", possibly, but not "great". – Uncaused 11/9, 2015 at 1:39

Wow, every time I think I can't find an uglier code than last time, I find an uglier one. – Caundra 12/9, 2018 at 22:3

@ChrisConway Yes and as a participant and winner also I can say that it's not going to confuse many people. It might have its uses there but it's not confusing either. I can think of numerous ways it might be useful in an entry but overall it's so unoriginal that it would be not relevant. They want much more balanced entries. Still there are times outside the contest where I might consider it though that's extremely limited too - and not in regular code. – Dilley 21/2, 2020 at 16:50

As for while(q && *q) ++q; why not just use strchr(q, '\0'); ? Probably doesn't matter but it seems more natural. Of course there are other ways too. Maybe you just didn't want to include string.h even. – Dilley 21/2, 2020 at 17:0

G

494

#include <algorithm>
std::reverse(str.begin(), str.end());

This is the simplest way in C++.

Gaut answered 13/10, 2008 at 16:39 Comment(4)

In C++ a string is represented by the string class. He didn't asked for "char star" or a "char brackets". Stay classy, C. – Odlo 30/8, 2011 at 21:53

@fredsbend, the "ridiculously long" version of the selected answer handles a case which this simple answer doesn't - UTF-8 input. It shows the importance of fully specifying the problem. Besides the question was about code that would work in C as well. – Electrical 16/8, 2013 at 14:45

This answer does handle the case if you use a UTF-8 aware string class (or possibly a utf-8 character class with std::basic_string). Besides, the question said "C or C++", not "C and C++". C++ only is "C or C++". – Osmometer 5/7, 2016 at 18:54

This will completely break UTF-8 multibyte strings and in this day and age a total non-starter. – Chiarra 27/1, 2021 at 19:18

A

169

Read Kernighan and Ritchie

#include <string.h>

void reverse(char s[])
{
    int length = strlen(s) ;
    int c, i, j;

    for (i = 0, j = length - 1; i < j; i++, j--)
    {
        c = s[i];
        s[i] = s[j];
        s[j] = c;
    }
}

Anstice answered 14/10, 2008 at 3:9 Comment(13)

Tested on my iphone this is slower than using raw pointer addresses by about 15% – Tko 29/4, 2013 at 20:16

Shouldn't variable "c" be a char instead of an int? – Blooded 17/11, 2013 at 17:25

@Blooded - In C, whenever a character constant or variable is used in an expression in C, it is automatically converted & treated as an integer. If you have a linux termminal, you can see the ascii codes by typing man ascii – Soso 9/4, 2014 at 1:48

Its important to note in this example that the string s must be declared in an array form. In other words, char s[] = "this is ok" rather than char *s="cannot do this" because the latter results in a string constant which cannot be modified – Soso 9/4, 2014 at 1:50

length, i, j should be size_t, but then would have trouble when length == 0. – Dorweiler 13/6, 2014 at 22:16

@PsychoDad Did you enable optimizations when you did the iPhone benchmark of array access vs pointer?? – Foah 1/9, 2014 at 6:27

@brandin Hopefully I did it in release mode, but I don't remember. – Tko 1/9, 2014 at 14:40

With apologizes to "The Godfather" .... "Leave the guns, bring the K&R". As a C bigot, I'd use pointers, as they're simpler and more straight-forward for this problem, but the code would be less portable to C#, Java, etc. – Cholecystectomy 27/11, 2014 at 2:14

I don't understand why this isn't the accepted answer. It's the simplest, most robust solution that performs in O(1) space. It also performs in O(log(n)) time, which is awesome. – Oscillation 7/12, 2016 at 19:5

@Eric This does not run in O(log(n)) time. It runs in O(n) if you're referring to the number of character swaps the code performs, for n of string length, then n swaps are performed. If you we're talking about the amount of loops performed then its still O(n) - albeit O(n/2) but you drop the constants in Big O notation. – Nmr 18/12, 2016 at 18:43

@Eric In addition to what Stephen says it - as user1527227 also says - requires it to be in array form. That would be another good reason it's not accepted. – Dilley 21/2, 2020 at 17:4

@Tko Then you have a bad compiler – Evars 25/1, 2021 at 13:41

Sorry, but this solution is not "in place". When I used to teach C programming, one of the problems I always gave my students was to reverse the string in place using only two registers and no stack variables to hold any of the string characters. The trick was using exclusive or. It was easier back then though because we could easily write past the \0 and do a buffer overrun. This was allowed as long as they only overwrote the \0, giving them a byte to play with. – Hesperus 27/7, 2023 at 4:0

P

134