From: RF on
Hi Experts :-)

I have a PDF version of a language dictionary that has two different fonts.

It would be much faster to access the information if I had it in the
electronic form - just type in a word and get the info related to that
word.

Apart from a smart 2-font OCR, what else would I need?

Suggestions greatly appreciated.
From: Craig on
On 09/16/2009 10:22 AM, RF wrote:
> Hi Experts :-)
>
> I have a PDF version of a language dictionary that has two different fonts.
>
> It would be much faster to access the information if I had it in the
> electronic form - just type in a word and get the info related to that
> word.
>
> Apart from a smart 2-font OCR, what else would I need?
>
> Suggestions greatly appreciated.

Gosh. That could be a rat's-nest of a project... Listen, before
attempting OCR, how about cut 'n paste? I mean, unless the doc itself
is an image, the text is still accessible.

Interesting... Please post /the/ solution when you find it.

thx,
--
-Craig
From: RF on
Craig wrote:
> On 09/16/2009 10:22 AM, RF wrote:
>> Hi Experts :-)
>>
>> I have a PDF version of a language dictionary that has two different
>> fonts.
>>
>> It would be much faster to access the information if I had it in the
>> electronic form - just type in a word and get the info related to that
>> word.
>>
>> Apart from a smart 2-font OCR, what else would I need?
>>
>> Suggestions greatly appreciated.
>
> Gosh. That could be a rat's-nest of a project... Listen, before
> attempting OCR, how about cut 'n paste? I mean, unless the doc itself
> is an image, the text is still accessible.
>
> Interesting... Please post /the/ solution when you find it.
>
> thx,

No progress yet. The book has a few hundred pages so cutting and pasting
is likely to be very tedious. The doc is a pdf, so it is an image but I
have heard of PDF readers that had an OCR facility.
From: Johnw on
RF formulated on Friday :

> No progress yet. The book has a few hundred pages so cutting and pasting is
> likely to be very tedious. The doc is a pdf, so it is an image but I have
> heard of PDF readers that had an OCR facility.

Putting the keywords > pdf ocr into Search, gets this.

http://primewares.com/links.php?search=pdf+ocr&la=en


From: RF on
Johnw wrote:
> RF formulated on Friday :
>
>> No progress yet. The book has a few hundred pages so cutting and
>> pasting is likely to be very tedious. The doc is a pdf, so it is an
>> image but I have heard of PDF readers that had an OCR facility.
>
> Putting the keywords > pdf ocr into Search, gets this.
>
> http://primewares.com/links.php?search=pdf+ocr&la=en

Thanks Johnw.

That is an interesting find. I guess the question now is, can the prog
deal with two languages mixed in the same document.

The following caught my eye in the "Scan and Sort it Freeware" :
"Archives in Scan and Sort it are easy to backup. Just save a copy in a
safe place. Nothing will get lost in a digital archive."

Would you believe?????