codepoint - McMap

3

Solved

Difference between codePointAt and charCodeAt

What is the difference between String.prototype.codePointAt() and String.prototype.charCodeAt() in JavaScript?
'A'.codePointAt(); // 65
'A'.charCodeAt(); // 65

javascript string codepoint

Interlanguage asked 10/4, 2016 at 8:40
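
For the codePointAt/charCodeAt question above: both calls return 65 for 'A' because that character fits in a single UTF-16 code unit. The two only diverge for characters outside the Basic Multilingual Plane. A minimal Python sketch of the distinction, with Python standing in for JavaScript and `utf16_units` being a helper name made up for illustration:

```python
import struct

def utf16_units(text: str) -> list[int]:
    """Return the UTF-16 code units of text (hypothetical helper)."""
    data = text.encode("utf-16-be")
    return list(struct.unpack(f">{len(data) // 2}H", data))

# 'A' fits in one code unit, so code unit and code point agree.
assert utf16_units("A") == [65]
assert ord("A") == 65

# U+1F600 needs a surrogate pair: two code units, one code point.
emoji = "\U0001F600"
print([hex(u) for u in utf16_units(emoji)])  # ['0xd83d', '0xde00']
print(hex(ord(emoji)))                       # 0x1f600
```

In JavaScript terms, charCodeAt(0) corresponds to the first code unit (0xD83D here), while codePointAt(0) corresponds to the whole code point (0x1F600).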

3

Isn't a 2-byte char datatype insufficient to deal with the concept of "characters" in a Unicode string?

Various programming languages use a 2-byte char datatype (not to be confused with C/C++'s char, which is just one byte) out of which strings are constructed. Various utility functions will try to f...

string unicode char unicode-string codepoint

Brandi asked 15/10, 2020 at 18:18

3

Solved

Convert from hex character to Unicode character in python

The hex string '\xd3' can also be represented as: Ó. The easiest way I've found to get the character representation of the hex string to the console is: print unichr(ord('\xd3')) Or in English...

python unicode hex codepoint

Chavira asked 9/8, 2011 at 16:44
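
The excerpt above uses Python 2's `unichr`, which does not exist in Python 3. A rough Python 3 equivalent might look like this, assuming the byte 0xD3 is meant as Latin-1 (where byte values coincide with code points U+0000 to U+00FF):

```python
# Assumption: the byte 0xD3 is Latin-1 encoded, so its value equals the code point.
raw = b"\xd3"
as_text = raw.decode("latin-1")

# Equivalent route through the integer code point.
from_code_point = chr(0xD3)

assert as_text == from_code_point == "\u00d3"
print(hex(ord(as_text)))  # 0xd3
```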

4

Solved

What exactly does String.codePointAt do?

Recently I ran into the codePointAt method of String in Java. I also found a few other codePoint methods: codePointBefore, codePointCount etc. They definitely have something to do with Unicode but I do...

java string unicode codepoint

Liegnitz asked 5/9, 2012 at 11:51

4

Solved

Java unicode where to find example N-byte unicode characters

I'm looking for sample 1-byte, 2-byte, 3-byte, 4-byte, 5-byte, and 6-byte unicode characters. Any links to some sort of reference of all the different unicode characters out there and how big they ...

java unicode sample-data codepoint

Polemoniaceous asked 19/5, 2011 at 18:23
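
As a rough illustration for the question above (in Python rather than Java, which is a substitution made here), the UTF-8 length of a few sample characters can be checked directly. UTF-8 as defined in RFC 3629 stops at 4 bytes per code point, so 5-byte and 6-byte samples do not exist in valid modern UTF-8.

```python
# Sample characters mapped to the UTF-8 length each one should have.
samples = {
    "A": 1,            # U+0041
    "\u00e9": 2,       # U+00E9, e with acute accent
    "\u20ac": 3,       # U+20AC, euro sign
    "\U0001F600": 4,   # U+1F600, an emoji outside the BMP
}

for ch, expected in samples.items():
    encoded = ch.encode("utf-8")
    assert len(encoded) == expected
    print(f"U+{ord(ch):04X} -> {len(encoded)} byte(s): {encoded.hex()}")
```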

1

Solved

Identify if a Unicode code point represents a character from a certain script such as the Latin script?

Unicode categorizes characters as belonging to a script, such as the Latin script. How do I test whether a particular character (code point) is in a particular script?

java unicode character codepoint

Distiller asked 30/5, 2020 at 23:18

2

Solved

Are all Unicode Emoji ZWJ Sequences valid?

When creating an emoji font, is any sequence of ZERO WIDTH JOINER valid? For instance: can I use 🏳‍★‍🟩 (Waving White Flag + zwj + Black Star + zwj + Green Square) to represent a white flag with...

unicode fonts emoji codepoint

Gold asked 1/5, 2020 at 8:0

4

Solved

Split JavaScript string into array of codepoints? (taking into account "surrogate pairs" but not "grapheme clusters")

Splitting a JavaScript string into "characters" can be done trivially but there are problems if you care about Unicode (and you should care about Unicode). JavaScript natively treats characters as...

javascript string unicode codepoint surrogate-pairs

Alleged asked 28/1, 2014 at 5:9
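
The surrogate-pair handling that the question above asks about can be sketched language-neutrally. The Python function below (a hypothetical helper, not taken from any answer) walks a list of UTF-16 code units and merges each high/low surrogate pair into one code point. It makes no attempt to group grapheme clusters.

```python
def code_points_from_utf16(units):
    """Combine UTF-16 code units into code points (hypothetical helper)."""
    result = []
    i = 0
    while i < len(units):
        unit = units[i]
        is_high = 0xD800 <= unit <= 0xDBFF
        has_low = i + 1 < len(units) and 0xDC00 <= units[i + 1] <= 0xDFFF
        if is_high and has_low:
            # Pair: 10 bits from each surrogate, offset by 0x10000.
            result.append(0x10000 + ((unit - 0xD800) << 10) + (units[i + 1] - 0xDC00))
            i += 2
        else:
            # BMP unit, or a lone surrogate passed through unchanged.
            result.append(unit)
            i += 1
    return result

units = [0x0061, 0xD83D, 0xDE00, 0x0062]  # "a", U+1F600, "b" as UTF-16
print([hex(cp) for cp in code_points_from_utf16(units)])  # ['0x61', '0x1f600', '0x62']
```

Passing lone surrogates through unchanged is one possible design choice; a stricter version could raise an error instead.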

5

Solved

Get unicode code point of a character using Python

In Python API, is there a way to extract the unicode code point of a single character? Edit: In case it matters, I'm using Python 2.7.

python python-2.7 unicode codepoint

Soluk asked 3/9, 2011 at 4:12
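
For the question above, the built-in `ord` is the usual starting point. The sketch below is written for Python 3, not the Python 2.7 mentioned in the question, and `code_point` is a helper name invented for illustration:

```python
def code_point(ch: str) -> str:
    """Format the code point of a single character as U+XXXX (hypothetical helper)."""
    return f"U+{ord(ch):04X}"

print(code_point("A"))           # U+0041
print(code_point("\u00d3"))      # U+00D3
print(code_point("\U0001D11E"))  # U+1D11E
```

`ord` raises a TypeError when given a string that is not exactly one character long.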

1

Solved

kotlin split utf string into single length sub strings using codepoint

I'm just starting Kotlin so I'm sure there is an easy way to do this but I don't see it. I want to split a string into single-length substrings using codepoints. In Java 8, this works: public class UtfS...

unicode kotlin codepoint

Slap asked 16/12, 2018 at 3:12

1

Solved

Why is Unicode restricted to 0x10FFFF?

Why is the maximum Unicode code point restricted to 0x10FFFF? Is it possible to represent Unicode above this code point, e.g. 0x10FFFF + 0x000001 = 0x110000, through any encoding schemes like...

unicode character-encoding range limit codepoint

Filip asked 6/9, 2018 at 11:43
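
One way to see where 0x10FFFF comes from is the UTF-16 surrogate arithmetic: each surrogate carries 10 bits, and pairs address code points starting at 0x10000. A small Python sketch of that calculation (the constant names are illustrative only):

```python
# Each surrogate contributes 10 bits; pairs address code points from 0x10000 upward.
HIGH_MAX = 0xDBFF - 0xD800   # 0x3FF, largest high-surrogate payload
LOW_MAX = 0xDFFF - 0xDC00    # 0x3FF, largest low-surrogate payload

max_code_point = 0x10000 + (HIGH_MAX << 10) + LOW_MAX
print(hex(max_code_point))   # 0x10ffff

# Python enforces the same ceiling on chr().
try:
    chr(0x110000)
except ValueError as exc:
    print("rejected:", exc)
```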

3

Solved

Retrieve Unicode code points > U+FFFF from QChar

I have an application that is supposed to deal with all kinds of characters and at some point display information about them. I use Qt and its inherent Unicode support in QChar, QString etc. Now I...

qt unicode astral-plane codepoint qchar

Element asked 7/8, 2011 at 12:41

2

Solved

Why does the red heart emoji require two code points, but the other colored hearts require one?

It appears that the red heart emoji (❤️) "\u2764\uFE0F" requires two Unicode codepoints, specifically Heavy Black Heart followed by a Variation Selector. However, blue 💙, green 💚, yellow 💛, and ...

unicode emoji codepoint

Retrenchment asked 8/3, 2017 at 19:0
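
The difference described in the question above can be inspected with Python's standard `unicodedata` module. This is only a way to look at the sequences, not an explanation of why Unicode encodes them differently:

```python
import unicodedata

red_heart = "\u2764\ufe0f"   # the sequence quoted in the question
blue_heart = "\U0001F499"

for label, text in (("red", red_heart), ("blue", blue_heart)):
    # List every code point in the string with its official name.
    names = [f"U+{ord(c):04X} {unicodedata.name(c)}" for c in text]
    print(label, len(text), names)
```

The red heart prints as two code points (HEAVY BLACK HEART followed by VARIATION SELECTOR-16), while the blue heart is a single code point.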

3

Solved

Does Unicode have a defined maximum number of code points?

I have read many articles in order to know what is the maximum number of the Unicode code points, but I did not find a final answer. I understood that the Unicode code points were minimized to mak...

unicode utf-8 utf-16 codepoint utf-32

Colorist asked 11/12, 2014 at 5:26
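
The counts involved in the question above follow from the 17-plane layout of the code space. A short Python sketch of the arithmetic (variable names are illustrative):

```python
import sys

PLANES = 17
total_code_points = PLANES * 0x10000            # 1,114,112 (U+0000 to U+10FFFF)
surrogates = 0xDFFF - 0xD800 + 1                # 2,048 code points reserved for UTF-16
scalar_values = total_code_points - surrogates  # 1,112,064 encodable values

# Python's own upper bound matches the last code point.
assert total_code_points == sys.maxunicode + 1
print(total_code_points, surrogates, scalar_values)  # 1114112 2048 1112064
```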

4

Solved

How to output unicode string to RTF (using C#)

I'm trying to output unicode string into RTF format. (using c# and winforms) From wikipedia: If a Unicode escape is required, the control word \u is used, followed by a 16-bit signed decimal in...

c# unicode rtf codepoint

Garfield asked 2/9, 2009 at 14:23
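
The "16-bit signed decimal" rule quoted above can be sketched as follows. This is a Python illustration, not the C# the question asks for, and it assumes the common convention of following each `\u` control word with a `?` fallback character for readers that do not understand Unicode. The helper name `rtf_escape` is made up, and it does not escape backslashes or braces.

```python
import struct

def rtf_escape(text: str) -> str:
    """Escape non-ASCII text with RTF \\uN control words (hypothetical helper)."""
    out = []
    for ch in text:
        if ord(ch) < 128:
            out.append(ch)  # caveat: '\\', '{' and '}' are not escaped here
            continue
        data = ch.encode("utf-16-le")
        # "<h" reads each UTF-16 code unit as a signed 16-bit integer.
        for (unit,) in struct.iter_unpack("<h", data):
            out.append(f"\\u{unit}?")  # '?' is the assumed fallback character
    return "".join(out)

print(rtf_escape("a\u00d3"))      # a\u211?
print(rtf_escape("\U0001F600"))   # \u-10179?\u-8704?
```

Code units above 32767, including all surrogates, come out negative because of the signed interpretation.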

2

Solved

Creating a UTF-8 string from hexadecimal code

In C++, it's possible to create a UTF-8 string using this kind of notation: "\uD840\uDC50". However this doesn't work in PHP. Is there a similar notation? If not, is there any built-in way to create...

php unicode utf-8 hex codepoint

Pluton asked 19/4, 2013 at 6:44

2

Solved

Comparing characters in Rebol 3

I am trying to compare characters to see if they match. I can't figure out why it doesn't work. I'm expecting true on the output, but I'm getting false. character: "a" word: "aardvark" (first wor...

unicode character rebol rebol3 codepoint

Pottle asked 31/1, 2014 at 23:15

4

Solved

Convert UTF8 string into numeric values in Perl

For example,
my $str = '中國c'; # Chinese language of china
I want to print out the numeric values 20013,22283,99

perl unicode utf-8 codepoint

Fourinhand asked 22/8, 2010 at 17:19
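
The expected output in the question above can be reproduced in Python (used here in place of Perl) with `ord` applied to each character of an already-decoded string:

```python
text = "\u4e2d\u570bc"  # the string '中國c' from the question, written with escapes
values = [ord(ch) for ch in text]
print(",".join(map(str, values)))  # 20013,22283,99
```

The key step in any language is decoding the UTF-8 bytes into characters first; iterating over raw bytes would yield byte values, not code points.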

1

Solved

How to cast a QChar to int

In C++ there is a way to cast a char to int and get the ascii value in return. Is there such a way to do the same with a qchar? Since unicode supports so many characters and some of them are actual...

qt casting int codepoint qchar

Albric asked 21/8, 2013 at 17:56

3

Solved

What are the most common non-BMP Unicode characters in actual use? [closed]

In your experience which Unicode characters, codepoints, ranges outside the BMP (Basic Multilingual Plane) are the most common so far? These are the ones which require 4 bytes in UTF-8 or sur...

unicode cjk codepoint surrogate-pairs astral-plane

Bowery asked 6/4, 2011 at 13:36

2

Solved

How to read non-BMP (astral) Unicode supplementary characters (code points)

The G-Clef (U+1D11E) is not part of the Basic Multilingual Plane (BMP), which means that it requires more than 16 bits. Almost all of Java's read functions return only a char or an int containing als...

java unicode codepoint surrogate-pairs supplementary

Ostracod asked 28/6, 2013 at 9:14

2

Solved

What is exactly an overlong form/encoding?

Reading the Wikipedia article on UTF-8, I've been wondering about the term overlong. This term is used various times but the article doesn't provide a definition or reference for its meaning. I wo...

unicode utf-8 character-encoding sequences codepoint

Tabathatabb asked 18/8, 2011 at 19:37
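
A concrete case may help with the term in the question above. The bytes 0xC0 0xAF are a 2-byte UTF-8 pattern whose payload is U+002F ('/'), a character that needs only 1 byte; that is an overlong form. The Python sketch below shows what a naive decoder would recover and that Python's strict decoder refuses the sequence:

```python
overlong_slash = bytes([0xC0, 0xAF])  # 2-byte form of U+002F, which needs only 1 byte

# A naive decoder that only strips the marker bits recovers '/'.
naive = ((overlong_slash[0] & 0x1F) << 6) | (overlong_slash[1] & 0x3F)
assert naive == 0x2F

# A conforming decoder must reject the sequence instead.
try:
    overlong_slash.decode("utf-8")
    rejected = False
except UnicodeDecodeError:
    rejected = True

print("rejected by strict decoder:", rejected)  # rejected by strict decoder: True
```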

2

Solved

What are the consequences of storing a C# string (UTF-16) in a SQL Server nvarchar (UCS-2) column?

It seems that SQL Server uses Unicode UCS-2, a 2-byte fixed-length character encoding, for nchar/nvarchar fields. Meanwhile, C# uses Unicode UTF-16 encoding for its strings (note: Some people don't...

sql-server character-encoding utf-16 ucs2 codepoint

Satirical asked 13/4, 2011 at 20:36

2

Finding Unicode character name with Javascript

I need to find out the names for Unicode characters when the user enters the number for it. An example would be to enter 0041 and get given "Latin Capital Letter A" as the result.

javascript unicode character codepoint

Verleneverlie asked 26/9, 2010 at 16:53
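
The lookup described in the question above is sketched here in Python, not JavaScript, because Python's standard `unicodedata` module ships the character names. The helper name `name_from_hex` and the `<unnamed>` fallback are assumptions for illustration:

```python
import unicodedata

def name_from_hex(hex_digits: str) -> str:
    """Look up the Unicode name for a hex code point string (hypothetical helper)."""
    return unicodedata.name(chr(int(hex_digits, 16)), "<unnamed>")

print(name_from_hex("0041"))  # LATIN CAPITAL LETTER A
```

Names come back in upper case, so matching the title-case form from the question would need an extra formatting step.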

2

Solved

Writing a better natural sort (than mine)

I added an answer to this question here: Sorting List<String> in C# which calls for a natural sort order, one that handles embedded numbers. My implementation, however, is naive, and in lieu...

.net unicode natural-sort codepoint

Affront asked 15/9, 2010 at 11:26
