From: adacrypt on
On Jan 1, 9:27 pm, Globemaker <alanfolms...(a)cabanova.com> wrote:
> On Jan 1, 12:44 am, adacrypt
>
> > Just remembered - all the most recently reviewed programming language
> > compilers come with a full library of Unicode languages pre-installed
>
> Hello adacrypt,
> I have already studied Unicode, so here is how I take English Unicode
> in a text file and make a Greek file in .rtf rich text format for
> Microsoft Wordpad. The program is in the Perl language
>
> # baffle_63.pl August 7, 2009 Alan Folmsbee, Kona, Hawaii
> # .RTF Spec requires decimal Unicode in Wordpad Rich
> # Text Format, after backslash u
> # Unicode spec has Greek tables in hex so convert to decimal for .rtf
> use strict;
> use warnings;
>
> my $paragraph = "C:/import/writings/greek3/english_50.txt";
> my  $code_out = "C:/import/writings/greek3/greek_63.rtf";
> open(ENGLISH, "$paragraph") or die "Can't open $paragraph: $!";
> open(GREEK, ">$code_out") or die "Can't open $code_out: $!";
>
> # 26 small letters assigned for Unicode Greek version of English
> sentence.
> # The letters c and j are left as in English so the 24 Greek letters
> have 2 added.
> my @greek_small_letters = (945, 946, 99, 948, 949, 966, 947, 951, 953,
> 106,
> 954, 955, 956, 957, 959, 960, 952, 961, 963, 964, 965, 968, 969, 967,
> 958, 950);
> # The RTF header is sent to the file.
> print GREEK "\{\\rtf1\\ansi\\ansicpg1252\\deff0\\deflang1033\{\\fonttbl
> \{\\f0\\froman\\fcharset0 Times New Roman\;\}\{\\f1\\fswiss\\fcharset0
> Arial\;\}\}
> \{\\\*\\generator alan Msftedit 5\.41\.15\.1515\;\}\\viewkind4\\uc1\
> \pard\\sb100\\sa100\\f0\\fs24";
> my $k;
> my $offset;
> for ($k=0; $k<26; $k++)
>     {
>      print GREEK "\\"."u".$greek_small_letters[$k].",";
>     }
> # Next, show capital letters numbered 32 less than small letters
> for ($k=0; $k<26; $k++)
>     {
>      $offset = $greek_small_letters[$k] - 32;
>      print GREEK "\\"."u".$offset.",";
>     }
>
> print GREEK "\\"."line";
>
> my $line;
> my $size;
> my @english_sentence;
> my $letter;
> my $greek_letter;
> my $flag_match = 0;
>
> while ($line = <ENGLISH>)
>    {
>    print "\n$line";
>    chomp $line;
>    @english_sentence = split (//,$line);
>    $size = @english_sentence;
>    for ($letter=0; $letter<$size; $letter++)
>        {
>        $flag_match = 0;
>        for ($k=0; $k<26; $k++)
>           {
>           # small letters
>           if ($flag_match !=1)
>              {
>              if (ord($english_sentence[$letter]) == ($k+97))
>                 {
>                 $greek_letter = $greek_small_letters[$k];
>                 $flag_match = 1;
>                 }
>              else
>                 {
>                 $greek_letter = ord($english_sentence[$letter]);
>                 }
>              }
>           # capital letters
>           if ($flag_match !=1)
>              {
>              if (ord($english_sentence[$letter]) == ($k+65))
>                 {
>                 $greek_letter = $greek_small_letters[$k] - 32;
>                 $flag_match = 1;
>                 }
>              else
>                 {
>                 $greek_letter = ord($english_sentence[$letter]);
>                 }
>              }
>
>           }
> # handle newline code, the invisible code used as /line or /par
> in .rtf
>        if ($letter == ($size-1))
>           {
>           print GREEK "\\"."u".$greek_letter."\\"."line";
>           }
>        else
>           {
>           print GREEK "\\"."u".$greek_letter.",";;
>           }
>        }
>    }
> print "\nThe size of the last sentence was $size characters";
> print GREEK "\}";
> close (ENGLISH);
> close (GREEK);
> exit();

Hi again,
I'm a bit slow latching on to your drift - are you simply doing a
straight conversion of languages say, as you describe - converting
English to Greek for reproducing in Wordpad (the inference of Word pad
being that there is no help from Microsoft Word ?) but you are using
Perl as the converting medium to do that. Does this mean no
cryptography is involved ? - If this is a piece of research fine but
it seems to me that there are easier ways of doing that - also it
seems to be counter the object of Unicode ?.

I slap my soul on the table here and tell you I am not the best person
to debate this project - my efforts have been all to do with the
cryptography of integers in Unicode as the solution to not knowing any
Cantonese, Mongolian, Arabic or other - even then I zoom in to the
heart of the matter and leave a huge amount of periphereal management
stuff to better informed people - computer scientists, hands-on
cryptographers, programming language specialists in sci crypt and so
on. I admire research tremenduously but if it means swimming aginst
the tide of established methods in the Unicode Consortium then its no
- there can only be curio value to anything I find - I suspect I may
be reading you all wrong.

The point that I am emphasizing in my notes is that a western operator
does not need to know anything more than how to encrypt integers so as
to have secure communications with somebody in any other part of the
world - this will always require an intermediary human being to
provide the interface between the two cultures involved because Alice
brought up in the Bronx knows no CJK (Chinese, Japanese, Korean to
other general readership readers) nor should she either - this is the
essence of Unicode's usefulness - cross-referencing is provided within
Unicode but trying to do this from a private bare board platform is
counter productive ? What say you O' scribe - Cheers - adacrypt