Converting byte array to string in javascript
Asked Answered
P

16

102

How do I convert a byte array into a string?

I have found these functions that do the reverse:

function string2Bin(s) {
    var b = new Array();
    var last = s.length;
    
    for (var i = 0; i < last; i++) {
        var d = s.charCodeAt(i);
        if (d < 128)
            b[i] = dec2Bin(d);
        else {
            var c = s.charAt(i);
            alert(c + ' is NOT an ASCII character');
            b[i] = -1;
        }
    }
    return b;
}

function dec2Bin(d) {
    var b = '';
    
    for (var i = 0; i < 8; i++) {
        b = (d%2) + b;
        d = Math.floor(d/2);
    }
    
    return b;
}

But how do I get the functions working the other way?

Polluted answered 7/7, 2010 at 14:46 Comment(2)
Do you want to convert a byte array to a string, or an array of bits to a string?Grieco
See also proper solution for utf8 array: Uint8Array to string in JavascriptMcglothlin
R
98

You need to parse each octet back to number, and use that value to get a character, something like this:

function bin2String(array) {
  var result = "";
  for (var i = 0; i < array.length; i++) {
    result += String.fromCharCode(parseInt(array[i], 2));
  }
  return result;
}

bin2String(["01100110", "01101111", "01101111"]); // "foo"

// Using your string2Bin function to test:
bin2String(string2Bin("hello world")) === "hello world";

Edit: Yes, your current string2Bin can be written more shortly:

function string2Bin(str) {
  var result = [];
  for (var i = 0; i < str.length; i++) {
    result.push(str.charCodeAt(i).toString(2));
  }
  return result;
}

But by looking at the documentation you linked, I think that the setBytesParameter method expects that the blob array contains the decimal numbers, not a bit string, so you could write something like this:

function string2Bin(str) {
  var result = [];
  for (var i = 0; i < str.length; i++) {
    result.push(str.charCodeAt(i));
  }
  return result;
}

function bin2String(array) {
  return String.fromCharCode.apply(String, array);
}

string2Bin('foo'); // [102, 111, 111]
bin2String(string2Bin('foo')) === 'foo'; // true
Roberts answered 7/7, 2010 at 14:55 Comment(8)
Thanks for the super speedy response. Couple of questions... 1) Your bin2String function is impressive - only 5 lines of code. Can the string2bin function be changed to make use of more Javascript functions to shorten the function and sub-function? .....Polluted
2) The reason I need these conversions is because I am capturing a signature and I have to convert it to populate a BLOB field in the database. Problem is, whilst these 2 functions work, something else is going wrong. The main thing is that when I retrieve a BLOB from the database it goes into a bytes array object. However, when I am writing the BLOB to the database after running it through the original function it is not a bytes array object. This might be what's causing the problem. Any ideas?Polluted
dcx.sybase.com/index.html#1101en/ulmbus_en11/… This is the syntax I use to set the data.Polluted
Hi Christian. You're an absolute genius! :) Thanks for that - it works perfectly! Many thanks!Polluted
String.fromCharCode.apply(String, array) is unsafe for very long strings in Safari. There's an issue in JavaScriptCore which means that functions can't take more than 65536 arguments, or a RangeError will be thrown. It also locks up the browser on arrays slightly smaller than that. See bugs.webkit.org/show_bug.cgi?id=80797Carleton
Fails for multi-byte utf-8 characters ie: bin2String([0xE2, 0x98, 0xB9])Alkene
Isn't that result += by one char in a loop killing the performance?Mcglothlin
@BradKent string2Bin(bin2String([0xE2, 0x98, 0xB9])) == [0xE2, 0x98, 0xB9], so it doesnt fail.Paletot
A
75

ES6 update

Now, string 'foo' also equals String.fromCharCode(...[102, 111, 111])

Original answer

Simply apply your byte array to String.fromCharCode. For example

String.fromCharCode.apply(null, [102, 111, 111]) equals 'foo'.

MDN docs here.

Caveat: works for arrays shorter than 65535 - MDN docs here.

Alexander answered 31/5, 2016 at 10:11 Comment(3)
This has already been demonstrated by the accepted answer 6 years ago.Ison
aah, indeed, I missed that line. Basically I was looking for a short one-liner and I dismissed that long and edited answer (maybe too hasty).Alexander
Even though a re-iteration, its brevity makes it better than the accepted answer.Constant
U
43

Try the new Text Encoding API:

// create an array view of some valid bytes
let bytesView = new Uint8Array([104, 101, 108, 108, 111]);

console.log(bytesView);

// convert bytes to string
// encoding can be specfied, defaults to utf-8
let str = new TextDecoder().decode(bytesView); 

console.log(str);

// convert string to bytes
// encoding can be specfied, defaults to utf-8
let bytes2 = new TextEncoder().encode(str);

// look, they're the same!
console.log(bytes2);
console.log(bytesView);
Ulberto answered 9/8, 2018 at 17:1 Comment(4)
Unfortunately IE doesn't support it.Urethra
If you need UTF-8 and IE support, you can you use the FastestSmallestTextEncoderDecoder polyfill, recommended by the MDN website.Zachary
TextDecoder fails for non printable ASCII chars. Test new TextEncoder().encode(new TextDecoder().decode(new Uint8Array([255, 255, 255, 255]))) is not [255, 255, 255, 255]. To convert byte array to string you use String.fromCharCode(), then the inverse operation is possible.Paletot
Use TextDecoder("UTF-16") to half the string. In 16 bit javascript memory the upper byte is now used, so it halves the memory usage. But files is best with the default "UTF-8" so it got bigger when I tried to download.Dorothi
S
18

This should work:

String.fromCharCode(...array);

Or

String.fromCodePoint(...array)
Seraphic answered 21/2, 2019 at 20:41 Comment(1)
short and sweet ;)Deconsecrate
G
7

That string2Bin can be written even more succinctly, and without any loops, to boot!

function string2Bin ( str ) {
    return str.split("").map( function( val ) { 
        return val.charCodeAt( 0 ); 
    } );
}
Gleiwitz answered 10/8, 2012 at 2:12 Comment(2)
Would be curious to see if added function calls slow this down.Anticlastic
It still has a loop, it's just hidden within map().Obie
C
5

String to byte array: "FooBar".split('').map(c => c.charCodeAt(0));

Byte array to string: [102, 111, 111, 98, 97, 114].map(c => String.fromCharCode(c)).join('');

Cholinesterase answered 17/10, 2018 at 15:11 Comment(1)
be careful, this is not supported by IE!Ebb
P
4

I think this would be more efficient:

function toBinString (arr) {
    var uarr = new Uint8Array(arr.map(function(x){return parseInt(x,2)}));
    var strings = [], chunksize = 0xffff;
    // There is a maximum stack size. We cannot call String.fromCharCode with as many arguments as we want
    for (var i=0; i*chunksize < uarr.length; i++){
        strings.push(String.fromCharCode.apply(null, uarr.subarray(i*chunksize, (i+1)*chunksize)));
    }
    return strings.join('');
}
Paisley answered 13/6, 2014 at 10:34 Comment(0)
I
4

Even if I'm a bit late, I thought it would be interesting for future users to share some one-liners implementations I did using ES6.

One thing that I consider important depending on your environment or/and what you will do with with the data is to preserve the full byte value. For example, (5).toString(2) will give you 101, but the complete binary conversion is in reality 00000101, and that's why you might need to create a leftPad implementation to fill the string byte with leading zeros. But you may not need it at all, like other answers demonstrated.

If you run the below code snippet, you'll see the first output being the conversion of the abc string to a byte array and right after that the re-transformation of said array to it's corresponding string.

// For each byte in our array, retrieve the char code value of the binary value
const binArrayToString = array => array.map(byte => String.fromCharCode(parseInt(byte, 2))).join('')

// Basic left pad implementation to ensure string is on 8 bits
const leftPad = str => str.length < 8 ? (Array(8).join('0') + str).slice(-8) : str

// For each char of the string, get the int code and convert it to binary. Ensure 8 bits.
const stringToBinArray = str => str.split('').map(c => leftPad(c.charCodeAt().toString(2)))

const array = stringToBinArray('abc')

console.log(array)
console.log(binArrayToString(array))
Ison answered 8/7, 2016 at 18:59 Comment(0)
Z
2

If your array is encoded in UTF-8 and you can't use the TextDecoder API because it is not supported on IE:

  1. You can use the FastestSmallestTextEncoderDecoder polyfill recommended by the Mozilla Developer Network website;
  2. You can use this function also provided at the MDN website:

function utf8ArrayToString(aBytes) {
    var sView = "";
    
    for (var nPart, nLen = aBytes.length, nIdx = 0; nIdx < nLen; nIdx++) {
        nPart = aBytes[nIdx];
        
        sView += String.fromCharCode(
            nPart > 251 && nPart < 254 && nIdx + 5 < nLen ? /* six bytes */
                /* (nPart - 252 << 30) may be not so safe in ECMAScript! So...: */
                (nPart - 252) * 1073741824 + (aBytes[++nIdx] - 128 << 24) + (aBytes[++nIdx] - 128 << 18) + (aBytes[++nIdx] - 128 << 12) + (aBytes[++nIdx] - 128 << 6) + aBytes[++nIdx] - 128
            : nPart > 247 && nPart < 252 && nIdx + 4 < nLen ? /* five bytes */
                (nPart - 248 << 24) + (aBytes[++nIdx] - 128 << 18) + (aBytes[++nIdx] - 128 << 12) + (aBytes[++nIdx] - 128 << 6) + aBytes[++nIdx] - 128
            : nPart > 239 && nPart < 248 && nIdx + 3 < nLen ? /* four bytes */
                (nPart - 240 << 18) + (aBytes[++nIdx] - 128 << 12) + (aBytes[++nIdx] - 128 << 6) + aBytes[++nIdx] - 128
            : nPart > 223 && nPart < 240 && nIdx + 2 < nLen ? /* three bytes */
                (nPart - 224 << 12) + (aBytes[++nIdx] - 128 << 6) + aBytes[++nIdx] - 128
            : nPart > 191 && nPart < 224 && nIdx + 1 < nLen ? /* two bytes */
                (nPart - 192 << 6) + aBytes[++nIdx] - 128
            : /* nPart < 127 ? */ /* one byte */
                nPart
        );
    }
    
    return sView;
}

let str = utf8ArrayToString([50,72,226,130,130,32,43,32,79,226,130,130,32,226,135,140,32,50,72,226,130,130,79]);

// Must show 2H₂ + O₂ ⇌ 2H₂O
console.log(str);
Zachary answered 5/12, 2019 at 1:29 Comment(0)
R
2

If you are using node.js you can do this:

yourByteArray.toString('base64');
Rounds answered 22/2, 2020 at 3:10 Comment(0)
A
1

Too late to answer but if your input is in form of ASCII bytes, then you could try this solution:

function convertArrToString(rArr){
 //Step 1: Convert each element to character
 let tmpArr = new Array();
 rArr.forEach(function(element,index){
    tmpArr.push(String.fromCharCode(element));
});
//Step 2: Return the string by joining the elements
return(tmpArr.join(""));
}

function convertArrToHexNumber(rArr){
  return(parseInt(convertArrToString(rArr),16));
}
Asymmetric answered 22/1, 2019 at 15:19 Comment(0)
U
0

UPDATE

@rosberg-linhares posted best solution so far to handle UTF8.


Didn't find any solution that would work with UTF-8 characters. String.fromCharCode is good until you meet 2 byte character.

For example word Hüser can come over the wire in form of arraybuffer as [0x48,0xc3,0xbc,0x73,0x65,0x72] (e.g. through websocket connection)

But if you go through it with String.fromCharCode you will have Hüser as each byte will be converted to a char separately, and letter ü is encoded in two bytes.

Solution

Currently I'm using following solution:

function pad(n) { return (n.length < 2 ? '0' + n : n); }
function decodeUtf8(data) {
  return decodeURIComponent(
    data.map(byte => ('%' + pad(byte.toString(16)))).join('')
  );
}
Urethra answered 11/10, 2018 at 14:1 Comment(2)
Probably you confused plain 8-bit byte array with "UTF-8" byte array. If you want to deal with UTF-8 byte arrays just use the TextEncoder and TextDecoder. If you deal with plain byte arrays where values are from 0 to 255 you can use String.fromCharCode() and String.charCodeAt() fine.Paletot
@Paletot probably I did not express myself clear enough. In the world where you still need to support browsers that do not have support of TextEncoder (e.g. Edge 12-18) so you have to think of some solutions that would work for the client over the wire, where you receive surprisingly 8-bit byte arrays and you just know that they are UTF-8 encoded. Solution I provided is about using different decoding method in the browser. Thank you for pointing out on some flaws in my answer :)Urethra
S
0

I had some decrypted byte arrays with padding characters and other stuff I didn't need, so I did this (probably not perfect, but it works for my limited use)

var junk = String.fromCharCode.apply(null, res).split('').map(char => char.charCodeAt(0) <= 127 && char.charCodeAt(0) >= 32 ? char : '').join('');
Shingle answered 15/1, 2019 at 20:34 Comment(0)
I
0
> const stringToBin = (str) => [...str].map(item=>item.charCodeAt())
> undefined
> stringToBin('hello')
> (5) [104, 101, 108, 108, 111]
> const binToString = (array) => String.fromCharCode(...array) 
> undefined
> binToString(stringToBin('hello'))
> 'hello'
Incarnate answered 22/12, 2021 at 4:55 Comment(0)
D
0

What you are looking for is String.fromCharCode

What you want to do is loop through the array of bytes (represented as integers), create the string equivalent and add it to the result:

function bin2String(array) {
  var result = "";
  for (const char of array) {
    result += String.fromCharCode(char);
  }
  return result;
}

console.log(bin2String([116, 104, 101, 32, 114, 101, 115, 117, 108, 116]));

You can also use the Array.Map function to convert the array of bytes into an array of strings, then join them all.

function string2Bin(array) {
  return array.map(byte => String.fromCharCode(byte)).join("");
}

console.log(string2Bin([116, 104, 101, 32, 114, 101, 115, 117, 108, 116]));
Darlinedarling answered 4/6, 2022 at 18:38 Comment(0)
V
-3

The simplest solution I've found is:

var text = atob(byteArray);
Viccora answered 30/9, 2020 at 15:38 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.