utf-32 Questions

14

Solved

What are the differences between UTF-8, UTF-16, and UTF-32? I understand that they will all store Unicode, and that each uses a different number of bytes to represent a character. Is there an adva...
Malvinamalvino asked 30/1, 2009 at 17:5

2

Solved

utf16 or utf32? I'm trying to store content in a lot of languages. Some of the languages use double-wide fonts (for example, Japanese fonts are frequently twice as wide as English fonts). I'm not s...
Swarthy asked 18/7, 2012 at 2:19

1

Solved

My main platform is Windows which is the reason why I use internally UTF-16 (mostly BMP strings). I would like to use console output for these strings. Unfortunately there is no std::u16cout or st...
Minnich asked 20/4, 2020 at 13:19

2

Solved

How can I print (cout / wcout / ...) char32_t to console in C++11? The following code prints hex values: u32string s2 = U"Добрый день"; for(auto x:s2){ wcout<<(char32_t)x<<endl; }
Schafer asked 7/4, 2013 at 0:57

2

Solved

In C#, Encoding.UTF32 is UTF-32 little-endian, Encoding.BigEndianUnicode is UTF-16 big-endian, Encoding.Unicode is UTF-16 little-endian. But I can't find any for UTF-32 big-endian. I'm developing ...
Gotthelf asked 6/10, 2015 at 15:23

3

Solved

I have read many articles in order to know what is the maximum number of the Unicode code points, but I did not find a final answer. I understood that the Unicode code points were minimized to mak...
Colorist asked 11/12, 2014 at 5:26

2

Solved

I'm working on a terminal based program that has unicode support. There are certain cases where I need to determine how many terminal columns a string will consume before I print it. Unfortunately ...
Addendum asked 23/5, 2016 at 17:30

2

Solved

I'm learning about Unicode and have a few questions that I'm hoping to get answered. 1) I've read that on Linux, a std::wstring is 4-bytes, while on Windows, it's 2-bytes. Does this mean that Linu...
Nmr asked 19/9, 2014 at 16:23

1

Solved

I'd like to write unicode literal U+10428 in Java. http://www.marathon-studios.com/unicode/U10428/Deseret_Small_Letter_Long_I I tried with '\u10428' and it doesn't compile.
Karelia asked 8/7, 2014 at 13:35

1

Solved

I need to escape unicode characters within a input string to either UTF-16 or UTF-32 escape sequences. For example, the input string literal "Eat, drink, 愛" should be escaped as "Eat, drink, \u611b...
Stockbreeder asked 24/5, 2014 at 10:10

5

Solved

I've never understood the point of UTF-16 encoding. If you need to be able to treat strings as random access (i.e. a code point is the same as a code unit) then you need UTF-32, since UTF-16 is sti...
Rasmussen asked 13/3, 2011 at 20:28

2

Solved

I have been exploring C++11's new Unicode functionality, and while other C++11 encoding questions have been very helpful, I have a question about the following code snippet from cppreference. The c...
Undulate asked 18/3, 2013 at 9:10

2

Solved

What open source C or C++ libraries can convert arbitrary UTF-32 to NFC? Libraries that I think can do this so far: ICU, Qt, GLib (not sure?). I don't need any other complex Unicode support...
Dubuffet asked 24/11, 2011 at 6:35

2

Solved

I saw this post on Jon Skeet's blog where he talks about string reversing. I wanted to try the example he showed myself, but it seems to work... which leads me to believe that I have no idea how to...
Brosy asked 15/1, 2013 at 22:6

1

Solved

Possible Duplicate: Why UTF-32 exists whereas only 21 bits are necessary to encode every character? The maximum Unicode code point is 0x10FFFF in UTF-32. UTF-32 has 21 information bit...
Agha asked 13/4, 2012 at 15:32

5

Solved

So I've got some troubles with character encoding. When I put the following two characters into a UTF32 encoded text file: 𩸕 鸕 and then run this code on them: System.IO.StreamReader streamReader ...
Newel asked 3/4, 2012 at 5:44

2

Solved

Google App Engine uses Python 2.5.2, apparently with UCS4 enabled. But the GAE datastore uses UTF-8 internally. So if you store u'\ud834\udd0c' (length 2) to the datastore, when you retrieve it, yo...
Pyosis asked 3/8, 2011 at 6:26

1

Solved

i'm trying to print out a string of UTF-16 characters. i posted this question a while back and the advice given was to convert to UTF-32 using iconv and print it as a string of wchar_t. i've done ...
Amann asked 11/12, 2011 at 17:24

3

Solved

If you had a website that was to be translated into every language in the world and therefore had a database with all these translations what character encoding would be best? UTF-128? If so do al...
Lavish asked 20/4, 2011 at 15:43
1

© 2022 - 2024 — McMap. All rights reserved.