Unicode Support Required!!

Postby lorenzr » Fri Feb 10, 2006 12:23 am

That is why I suggested another option for removing those characters. There are already options to remove every other type, including a-z. Suggestions:
Non-ASCII
Non-codepage (current).
or just a NOT option for the whole box.
Ray
lorenzr
 
Posts: 3
Joined: Tue Feb 07, 2006 10:50 pm

Unicode support in regex?

Postby osterlitz » Tue Feb 21, 2006 4:06 pm

Hi,

I just tried removing Unicode characters (Polish letters) from some filenames. This did not work, though I tried several ways to match them.

I could not match them using the \p{M} or the \p{Cyrillic} syntax.
\p{Lu}, which I used as a test, did not work either, though it should have matched all uppercase characters.

Are you sure that your PCRE lib is Unicode-compliant?

One of the filenames which failed was "9. Różnymi drogami.tif" - I hope it gets through here - there should be an accent on the o and a dot on the z.

Regards,
Stefan
osterlitz
 
Posts: 1
Joined: Tue Feb 21, 2006 4:00 pm
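
For reference, here is a rough sketch of what the property classes mentioned above would be expected to select in that filename. It uses Python's standard unicodedata module as a stand-in, not PCRE or Bulk Rename Utility, and the helper name categories is made up for illustration. It assumes the filename is stored in precomposed form, in which case the accented letters are lowercase letters (Ll) rather than marks, so \p{M} would only find something after decomposition. \p{Cyrillic} would not be expected to match at all, since Polish letters belong to the Latin script.

```python
import unicodedata

# Filename quoted in the post above, written with escapes so the
# accented letters are guaranteed to be precomposed (assumption).
name = "9. R\u00f3\u017cnymi drogami.tif"

def categories(text):
    """Return (character, general category) pairs, e.g. ('R', 'Lu')."""
    return [(ch, unicodedata.category(ch)) for ch in text]

# Characters a working \p{Lu} should select: uppercase letters only.
uppercase = [ch for ch, cat in categories(name) if cat == "Lu"]

# Precomposed letters are category Ll, not M, so nothing is selected here.
marks_composed = [ch for ch, cat in categories(name) if cat.startswith("M")]

# After canonical decomposition (NFD) the accents become separate marks.
decomposed = unicodedata.normalize("NFD", name)
marks_decomposed = [ch for ch, cat in categories(decomposed) if cat.startswith("M")]

print(uppercase)                                       # ['R']
print(marks_composed)                                  # []
print([f"U+{ord(ch):04X}" for ch in marks_decomposed]) # ['U+0301', 'U+0307']
```

So on this filename \p{Lu} should match only the "R", and a character class such as [^\x00-\x7F] is probably the more direct way to target the two Polish letters.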

Postby Admin » Tue Feb 21, 2006 4:14 pm

Hi Stefan,

No, my PCRE file is definitely NOT Unicode compliant, and this was stated in the notes when I released the Unicode version. The DLL I use is pre-compiled, but is not compiled in Unicode format. There might be a precompiled DLL out there somewhere but I haven't found it.


Jim
Admin
Site Admin
 
Posts: 2343
Joined: Tue Mar 08, 2005 8:39 pm

Re: Unicode Support Required!!

Postby Black_Mark » Sat Jan 12, 2008 3:04 am

Any progress on making your program fully Unicode-compliant?
Black_Mark
 
Posts: 20
Joined: Fri Jun 24, 2005 3:43 pm

Re: Unicode Support Required!!

Postby Admin » Sat Jan 12, 2008 8:47 am

Afraid not. PCRE isn't Unicode-compliant, and probably never will be, so I suspect BRU will suffer the same limitation.


Jim
Admin
Site Admin
 
Posts: 2343
Joined: Tue Mar 08, 2005 8:39 pm

Re: Unicode Support Required!!

Postby alex.forencich » Tue Jan 15, 2008 10:48 am

PCRE does support Unicode. You just have to compile it with Unicode support. See this site: http://www.regular-expressions.info/pcre.html .
alex.forencich
 
Posts: 1
Joined: Tue Jan 15, 2008 1:03 am

Re: Unicode Support Required!!

Postby Admin » Tue Jan 15, 2008 1:19 pm

Unfortunately, the version I am using doesn't support these flags, and upgrading will not be straightforward.


Jim
Admin
Site Admin
 
Posts: 2343
Joined: Tue Mar 08, 2005 8:39 pm

Re: Unicode Support Required!!

Postby Admin » Mon Jan 28, 2008 9:00 am

I now have a version of PCRE which I can compile with Unicode support. However, I do not have a clue how to test it!

Can someone give me some examples please? Filenames, match-strings and replace-strings for cases which should match and also for cases which should not match.

Many thanks.



Jim
Admin
Site Admin
 
Posts: 2343
Joined: Tue Mar 08, 2005 8:39 pm
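
As a possible starting point for the test cases requested above, here is a small sketch of filenames with the results one might expect from two operations: deleting non-ASCII characters and stripping accents. It is written in Python as a stand-in, not against PCRE or Bulk Rename Utility itself, and the function names and the list of cases are illustrative assumptions rather than anything from the program. The third case is a deliberate "should not change" example: the letter Ł has no canonical decomposition, so accent stripping leaves it in place.

```python
import re
import unicodedata

def remove_non_ascii(name):
    """Delete every character outside the ASCII range."""
    return re.sub(r"[^\x00-\x7F]", "", name)

def strip_marks(name):
    """Decompose accented letters (NFD), then drop the combining marks."""
    decomposed = unicodedata.normalize("NFD", name)
    kept = [ch for ch in decomposed
            if not unicodedata.category(ch).startswith("M")]
    return unicodedata.normalize("NFC", "".join(kept))

# (filename, expected after remove_non_ascii, expected after strip_marks)
CASES = [
    ("9. R\u00f3\u017cnymi drogami.tif",
     "9. Rnymi drogami.tif",
     "9. Roznymi drogami.tif"),
    # Pure ASCII: neither operation should change anything.
    ("plain_ascii.txt", "plain_ascii.txt", "plain_ascii.txt"),
    # U+0141 (L with stroke) does not decompose, so strip_marks keeps it.
    ("\u0141\u00f3d\u017a.jpg", "d.jpg", "\u0141odz.jpg"),
]

def check_cases():
    """Return the filenames whose results differ from the expectations."""
    return [name for name, no_ascii, no_marks in CASES
            if remove_non_ascii(name) != no_ascii
            or strip_marks(name) != no_marks]

print(check_cases())  # [] when every case behaves as listed
```

The same filenames could then be tried in the program with a match-string such as [^\x00-\x7F] and an empty replace-string, comparing the result against the second column.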
