Unicode characters


Post by ap86 » Mon Sep 24, 2012 2:16 am

I searched first, and some threads suggested that regex can't handle Unicode characters (I'm trying to use it with Korean characters in particular). Those threads were all at least a few years old, though, and since this is the Unicode version of BRU, I'm asking again: is it possible to use regex with Unicode characters? If not, I suppose I'll just use the "crop first X chars" function.

Here's my example.
Regex (1)
Match:     (\d{4}\.\d{2}\.\d{2}\] )(.*)
Replace:   \2


[2012.09.20] ABC (???) - XYZ
is renamed to
ABC (
Edit: the ??? was Korean text when I submitted this post.

but
[2012.09.20] ABC (DEF) - XYZ
is renamed to
ABC (DEF) - XYZ

I mean, clearly the problem is the Korean chars here, so maybe I've answered my own question, but perhaps there's another way?
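
For comparison, here is a minimal Python sketch (not BRU itself) that runs the same pattern through a Unicode-aware regex engine. The helper name `strip_date_prefix` and the Korean text `한글` are placeholders of mine, since the original characters were lost when the post was submitted. If an engine that treats the name as Unicode text keeps everything after the date, that would suggest the pattern is fine and the truncation comes from how the characters are handled in the renamer, though I can't confirm that for BRU.

```python
import re

# The pattern from the post, unchanged: a "YYYY.MM.DD] " prefix, then the rest.
PATTERN = re.compile(r"(\d{4}\.\d{2}\.\d{2}\] )(.*)")


def strip_date_prefix(name: str) -> str:
    """Return the text after the date prefix, or the name unchanged if no match.

    Hypothetical helper for illustration; it mimics "Replace: \\2" only for
    the part of the name that the pattern matches.
    """
    match = PATTERN.search(name)
    return match.group(2) if match else name


ascii_name = "[2012.09.20] ABC (DEF) - XYZ"
# Placeholder Korean text; the real characters from the post are unknown.
korean_name = "[2012.09.20] ABC (한글) - XYZ"

print(strip_date_prefix(ascii_name))   # ABC (DEF) - XYZ
print(strip_date_prefix(korean_name))  # ABC (한글) - XYZ
```

In Python 3 a `str` pattern works on Unicode code points, so `.*` consumes the Korean characters like any others and the second name is not cut off at the opening parenthesis.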