by Luuk » Thu Mar 04, 2021 5:36 am
This very good to know about "Simple" also granting case conversions, because its much easier for the eyesight!
During the experiments, it seems that RegEx(1) fails whenever trying to lowercase the "«" or "»" characters.
It must consider them both uppercases, because the \U conversion does always succeed with both of them.
But its strange, because why two characters getting assigned to the same lowercase conversion?
So then testing with more unicodes, it seem that "rhombus/question" is some "default character" for "unknown".
This another way:.. Convert "«" into "<" and "«" into "<"... Then lowercase... Then convert them back...
So this is needing five different Match/Replace, each being separated by (?X) like ...
«/g(?X)»/g(?X)(\(.*?\))/g(?X)</g(?X)>/g
<(?X)>(?X)\L$1(?X)«(?X)»
But if lowercasing only at the very (End Of Name), it would instead be like ...
«/g(?X)»/g(?X)(\([^)]*\))$(?X)</g(?X)>/g
<(?X)>(?X)\L$1(?X)«(?X)»