Unicode characters and the "|" separator

Postby MKCA » Thu Nov 25, 2021 4:04 pm

There seems to be a bug affecting the "|" separator and Unicode characters.

When I try to do multiple replacements with alphanumeric characters everything works properly:
Replace:     po|re
With:        ka|su

port.txt     >      kart.txt
retf.txt     >      sutf.txt
??.txt      >      ??.txt
??.txt      >      ??.txt
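
For reference, the pairing the "|" separator is expected to perform can be sketched in Python. This is only a model of the behaviour shown above, not Bulk Rename Utility's actual code: the function name `multi_replace` is hypothetical, and applying the pairs one after another to the name without its extension is an assumption.

```python
import os


def multi_replace(filename, replace, with_):
    """Hypothetical model of a '|'-separated multi-replace on a file name."""
    # Work on the name only and leave the extension untouched (assumption).
    stem, ext = os.path.splitext(filename)
    searches = replace.split("|")
    replacements = with_.split("|")
    # Pair the n-th search term with the n-th replacement term.
    for search, replacement in zip(searches, replacements):
        if search:  # ignore empty search terms in this model
            stem = stem.replace(search, replacement)
    return stem + ext


print(multi_replace("port.txt", "po|re", "ka|su"))  # kart.txt
print(multi_replace("retf.txt", "po|re", "ka|su"))  # sutf.txt
```

Under this model, a name that contains none of the search terms comes back unchanged, which matches the last two rows of the example.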


When I try to do a single replacement with Japanese characters, it also works properly:
Replace:     ?
With:        ka

port.txt     >      port.txt
retf.txt     >      retf.txt
??.txt      >      ka?.txt
??.txt      >      ??.txt


However, when I try to use the "|" separator with Japanese characters, it bugs out:
Replace:     ?|??
With:        ka|sute

port.txt     >      kapkaokarkatka.txt
retf.txt     >      karkaekatkafka.txt
??.txt      >      ka.txt
??.txt      >      ka.txt
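
As an observation only (not a confirmed diagnosis), these outputs look exactly like what a replacement with an empty search string produces: the replacement text is inserted before every character and at the end of the name. The Python sketch below reproduces that pattern. The helper name `replace_empty_search` is hypothetical, and the idea that the Japanese search term (and the Japanese-only file name) is being read as empty is an assumption drawn from the results above.

```python
def replace_empty_search(stem, replacement):
    """Replace the empty string in `stem`, as Python's str.replace does.

    With an empty search term, the replacement is inserted before every
    character and once more at the end of the string.
    """
    return stem.replace("", replacement)


print(replace_empty_search("port", "ka"))  # kapkaokarkatka
print(replace_empty_search("retf", "ka"))  # karkaekatkafka
print(replace_empty_search("", "ka"))      # ka
```

The last call corresponds to a name made up only of Japanese characters, if those characters are also being dropped from the name before the replacement runs.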


It actually happens as soon as the "|" separator is entered:
Replace:     ?|
With:        ka|

port.txt     >      kapkaokarkatka.txt
retf.txt     >      karkaekatkafka.txt
??.txt      >      ka.txt
??.txt      >      ka.txt



But it works correctly if additional alphanumeric characters are introduced:
Replace:     ?a|??b
With:        ka|sute

port.txt     >      port.txt
retf.txt     >      retf.txt
?a?.txt     >      ka?.txt
??b.txt     >      sute.txt


The issue occurs with the "With" field too, except here it only causes the characters to be read as empty:
Replace:     po|re
With:        ?|?

port.txt     >      rt.txt
retf.txt     >      tf.txt


Replace:     po|re
With:        ?a|b?

port.txt     >      ?art.txt
retf.txt     >      b?tf.txt


It affects other types of Unicode characters too:
Replace:     po|
With:        ????|

port.txt     >      rt.txt


Replace:     po|
With:        ??|

port.txt     >      rt.txt


Replace:     po|
With:        ?|

port.txt     >      rt.txt


But not all; for example, Cyrillic and Greek work fine:
Replace:     po|
With:        ??????|

port.txt     >      ??????rt.txt

Replace:     po|
With:        ??????|

port.txt     >      ??????rt.txt


I tested this on multiple computers, with different system locales, with both the 32-bit and 64-bit versions, and with both the portable and installed editions.
MKCA
 
Posts: 4
Joined: Thu Nov 25, 2021 1:36 am

Re: Unicode characters and the "|" separator

Postby therube » Fri Nov 26, 2021 5:41 pm

The characters you intended aren't displaying correctly, at least for me (they simply show as ? [question marks]), and I'd think they could (as in, the board doesn't look like it should be affecting what is seen).
?    ?    ?    ?    ?    ?    ?

Ah, they were fine in the Preview, but once the post was posted, they were replaced by ?, so it is the board.
So maybe you could host the sample file names elsewhere?

It looks like a site such as snippet.host should work.
(Above link valid for 1 month only.)
therube
 
Posts: 1050
Joined: Mon Jan 18, 2016 6:23 pm

Re: Unicode characters and the "|" separator

Postby MKCA » Sat Nov 27, 2021 1:43 am

? ?? ?? ????
MKCA
 
Posts: 4
Joined: Thu Nov 25, 2021 1:36 am

Re: Unicode characters and the "|" separator

Postby MKCA » Sat Nov 27, 2021 1:49 am

LMAO
The board can't even handle the characters that actually worked in the application in the first place! It even gave me an error when submitting! :lol:
I will try posting links then.
MKCA
 
Posts: 4
Joined: Thu Nov 25, 2021 1:36 am

Re: Unicode characters and the "|" separator

Postby MKCA » Sat Nov 27, 2021 2:24 am

Attempt 2: Unicode boogaloo
(Thank you therube, for suggesting snippet.host)
/////////////////////////////////////////////////////////////////////////////////////
There seems to be a bug affecting the "|" separator and Unicode characters.

When I try to do multiple replacements with alphanumeric characters everything works properly:
https://snippet.host/voap

When I try to do a single replacement with Japanese characters, it also works properly:
https://snippet.host/awgy

However, when I try to use the "|" separator with Japanese characters, it bugs out:
https://snippet.host/knvw

It actually happens as soon as the "|" separator is entered:
https://snippet.host/omea

But it works correctly if additional alphanumeric characters are introduced:
https://snippet.host/zabw

The issue occurs with the "With" field too, except here it only causes the characters to be read as empty:
https://snippet.host/bzxr

It affects other types of Unicode characters too:
https://snippet.host/buur

But not all; for example, Cyrillic and Greek work fine:
https://snippet.host/zgqy

I tested this on multiple computers, with different system locales, with both the 32-bit and 64-bit versions, and with both the portable and installed editions.
MKCA
 
Posts: 4
Joined: Thu Nov 25, 2021 1:36 am

Re: Unicode characters and the "|" separator

Postby Admin » Sat Nov 27, 2021 9:50 am

We need to check this issue for the next update!
Admin
Site Admin
 
Posts: 2085
Joined: Tue Mar 08, 2005 8:39 pm

