character-properties

6

Solved

How to know the preferred display width (in columns) of Unicode characters?

In different encodings of Unicode, for example UTF-16le or UTF-8, a character may occupy 2 or 3 bytes. Many Unicode applications doesn't take care of display width of Unicode chars just like they a...

unicode text-formatting character-properties mbcs

Bayern asked 3/9, 2010 at 9:54

5

Solved

Unicode block of a character in python

Is there a way to get the Unicode Block of a character in python? The unicodedata module doesn't seem to have what I need, and I couldn't find an external library for it. Basically, I need the sam...

python unicode character-properties

Arber asked 28/10, 2008 at 15:56

11

Solved

How to match Cyrillic characters with a regular expression

How do I match French and Russian Cyrillic alphabet characters with a regular expression? I only want to do the alpha characters, no numbers or special characters. Right now I have [A-Za-z]

regex unicode character-properties

Darvon asked 11/11, 2009 at 17:1

9

Solved

Python: Split unicode string on word boundaries

I need to take a string, and shorten it to 140 characters. Currently I am doing: if len(tweet) > 140: tweet = re.sub(r"\s+", " ", tweet) #normalize space footer = "… " + utils.shorten_urls(p...

python unicode internationalization character-properties

Stagnant asked 15/11, 2009 at 20:53

5

Solved

Matching Unicode letter characters in PCRE/PHP

I'm trying to write a reasonably permissive validator for names in PHP, and my first attempt consists of the following pattern: // unicode letters, apostrophe, hyphen, space $namePattern = "/^([\\...

php regex unicode pcre character-properties

Dedrick asked 13/2, 2011 at 9:17

2

Solved

Match any unicode letter?

In .net you can use \p{L} to match any letter, how can I do the same in Python? Namely, I want to match any uppercase, lowercase, and accented letters.

python regex character-properties

Everara asked 11/6, 2011 at 7:5

11

How can I use Unicode-aware regular expressions in JavaScript?

There should be something akin to \w that can match any code-point in Letters or Marks category (not just the ASCII ones), and hopefully have filters like [[P*]] for punctuation, etc.

javascript regex unicode character-properties

Slattern asked 11/11, 2008 at 12:0

11

How can I use Unicode-aware regular expressions in JavaScript?

There should be something akin to \w that can match any code-point in Letters or Marks category (not just the ASCII ones), and hopefully have filters like [[P*]] for punctuation, etc.

javascript regex unicode character-properties

Survive asked 11/11, 2008 at 12:0

5

Solved

How to validate both Chinese (unicode) and English name?

I have a multilingual website (Chinese and English). I like to validate a text field (name field) in javascript. I have the following code so far. var chkName = /^[characters]{1,20}$/; if( chkN...

javascript regex unicode character-properties

Breechloader asked 16/6, 2011 at 19:25

7

Solved

Regex for names with special characters (Unicode)

Okay, I have read about regex all day now, and still don't understand it properly. What i'm trying to do is validate a name, but the functions i can find for this on the internet only use [a-zA-Z],...

php javascript regex character-properties

Detour asked 11/5, 2011 at 11:8

3

Solved

Scanning for Unicode Numbers in a string with \d

According to the Oniguruma documentation, the \d character type matches: decimal digit char Unicode: General_Category -- Decimal_Number However, scanning for \d in a string with all the Decim...

ruby regex unicode character-properties

Whichever asked 9/8, 2011 at 15:28

4

regular expression containing unicode words

I'd like to match all strings containing a certain word. like: String regex = (?:\P{L}|\W|^)(ベスパ)(?:\b|$) however, the Pattern class doesn't compile it: java.util.regex.PatternSyntaxException:...

java regex unicode character-properties

Dichromic asked 12/4, 2011 at 21:14

3

Solved

Latin Characters check

there are some similar questions out there, but none that are quite the same or that have an answer that works for me. I need a javascript function which validates whether a text field contains al...

javascript regex unicode character-properties

Bikini asked 3/4, 2013 at 10:59

4

Spilt String using Unicode delimiter

I need to split a string with "-" as delimiter in java. Ex: "Single Room - Enjoy your stay" I have the same data coming in english and german depending on locale . Hence I cannot use the usual st...

java string unicode character-properties

German asked 8/3, 2012 at 4:25

1

Solved

Regular expression to match boundary between different Unicode scripts

Regular expression engines have a concept of "zero width" matches, some of which are useful for finding edges of words: \b - present in most engines to match any boundary between word and non-wor...

regex unicode character-properties word-boundary word-boundaries

Grose asked 11/5, 2013 at 1:39

2

Solved

Perl: How to match FULLWIDTH LATIN SMALL

I am using listadmin to manage many mailman-based mailing lists. I have a long list of subjects and from addresses set up to block spam. Recently, I received smarter spam in the sense that it uses ...

regex perl unicode character-properties

Kelso asked 9/5, 2013 at 20:17

3

Solved

Does \w match all alphanumeric characters defined in the Unicode standard?

Does Perl's \w match all alphanumeric characters defined in the Unicode standard? For example, will \w match all (say) Chinese and Russian alphanumeric characters? I wrote a simple test script (s...

regex perl unicode internationalization character-properties

Volturno asked 5/4, 2011 at 17:4

1

Efficiently list all characters in a given Unicode category

Often one wants to list all characters in a given Unicode category. For example: List all Unicode whitespace, How can I get all whitespaces in UTF-8 in Python? Characters with the property Alphab...

python unicode character-properties

Northeaster asked 9/1, 2013 at 20:30

3

Solved

matching unicode characters in python regular expressions

I have read thru the other questions at Stackoverflow, but still no closer. Sorry, if this is allready answered, but I didn`t get anything proposed there to work. >>> import re >>&g...

python regex unicode non-ascii-characters character-properties

Bodine asked 17/2, 2011 at 12:8

3

Solved

Match C# Unicode Identifier using Regex

What is the right way to match a C# identifier, specifically a property or field name, using .Net Regex patterns? Background. I used to use the ASCII centric @"[_a-zA-Z][_a-zA-Z0-9]*" But now unic...

c#regex unicode character-properties

Beekeeping asked 9/12, 2010 at 16:8

3

How to mark all CJK text in a document?

I have a file, file1.txt, containing text in English, Chinese, Japanese, and Korean. For use in ConTeXt, I need to mark each region of text within the file according to language, except for English...

unicode multilingual cjk character-properties

Townswoman asked 7/5, 2012 at 13:23

2

Solved

Regex - Unicode Properties Reference and Examples

I feel lost with the Regex Unicode Properties presented by RegexBuddy, I cannot distinguish between any of the Number properties and the Math symbol property only seems to match + but not -, *, /, ...

php regex unicode pcre character-properties

Middleclass asked 14/1, 2010 at 6:17

6

Solved

Python regex matching Unicode properties

Perl and some other current regex engines support Unicode properties, such as the category, in a regex. E.g. in Perl you can use \p{Ll} to match an arbitrary lower-case letter, or p{Zs} for any spa...

python regex unicode ucd character-properties

Sinclair asked 2/12, 2009 at 13:25

1

Solved

Matching only a unicode letter in Python re

I have a string from which i want to extract 3 groups: '19 janvier 2012' -> '19', 'janvier', '2012' Month name could contain non ASCII characters, so [A-Za-z] does not work for me: >>&...

python regex unicode character-properties

Stilbestrol asked 19/1, 2012 at 9:49

2

Solved

Iterating through Unicode codepoints character by character

I've got a series of Unicode codepoints. What I really need to do is iterate through these codepoints as a series of characters, not a series of codepoints, and determine properties of each individ...

c++unicode character-properties

Stile asked 26/11, 2011 at 22:5

character-properties Questions

Recommended topics

Hot tags