From: zigzagdna on

I am on Oracle 11g on HP-UX 11i.
Can one create Oracle Text indexes on Microsoft Word and PDF files? I
can understand text indexes on text files, but how do text indexes
work on “binary” files such as Word and PDF files? I have been told
they work on these files as well; I am just curious how Oracle manages to
parse such files.


From: Sybrand Bakker on
On Sat, 8 May 2010 20:14:15 -0700 (PDT), zigzagdna
<zigzagdna(a)yahoo.com> wrote:

>
>I am on Oracle 11 g using hp unix 11i.
>Can one set Oracle text Indexes on Microsoft WORD and pdf files. I
>can understand text indexes on text files, but how do test indexes
>work on “binary” files such as WORD and pdf files. I have been told
>they work on these files as well, just curious how oracle manages to
>parse such files.
>

Ahh, here is the master of asking doc questions on *BOTH*
comp.oracle.databases.oracle.server and http://forums.oracle.com
again!

Question: does http://tahiti.oracle.com work for you?
So why are you so fundamentally lazy, *never* being prepared to do
your own research, and you require to be *spoon-fed* like a baby?
Did your mother forget to beat some industry into you?


Aren't you ashamed of yourself?

-----
Sybrand Bakker
Senior Oracle DBA


From: zigzagdna on
On May 8, 11:14 pm, zigzagdna <zigzag...(a)yahoo.com> wrote:
> I am on Oracle 11 g using hp unix 11i.
> Can one set Oracle text Indexes on Microsoft WORD and  pdf files. I
> can understand text indexes on text files, but how do test indexes
> work on “binary” files such as WORD and pdf files. I have been told
> they work on these files as well, just curious how oracle manages to
> parse such files.

I am still using Oracle 9i Text because that is what my vendor
supports, even though I am on Oracle 11g.


From: Vladimir M. Zakharychev on
On May 9, 7:14 am, zigzagdna <zigzag...(a)yahoo.com> wrote:
> I am on Oracle 11 g using hp unix 11i.
> Can one set Oracle text Indexes on Microsoft WORD and  pdf files. I
> can understand text indexes on text files, but how do test indexes
> work on “binary” files such as WORD and pdf files. I have been told
> they work on these files as well, just curious how oracle manages to
> parse such files.

They use external converters (filters) to parse them and convert them to
HTML or XML. Extproc functionality is heavily used for that. Since
file formats evolve over time, so do these external filters, so if
you're on 11 you'd better use the filters native to this version (actually I
can't see how you could use 9i filters with 11 anyway).
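
To make the filtering step concrete, here is a minimal sketch of the SQL involved, wrapped in Python only so the statements can be generated and checked. It assumes a hypothetical table "docs" with a BLOB column "doc" holding the Word/PDF files and a key column "id"; the index name "docs_idx" is made up as well. It relies on the predefined CTXSYS.AUTO_FILTER preference of 10g and later (as far as I recall, older releases called the equivalent filter INSO_FILTER), so treat it as an illustration rather than a recipe for your vendor's setup.

```python
# Sketch only: builds the SQL a DBA might run; it does not connect to a database.
# Table, column, key, and index names are hypothetical placeholders.


def context_index_ddl(index_name: str, table: str, column: str) -> str:
    """Return DDL for a CONTEXT index that filters binary documents."""
    # AUTO_FILTER asks Oracle Text to detect the document format
    # (Word, PDF, ...) and extract indexable text before tokenizing.
    return (
        f"CREATE INDEX {index_name} ON {table} ({column}) "
        "INDEXTYPE IS CTXSYS.CONTEXT "
        "PARAMETERS ('FILTER CTXSYS.AUTO_FILTER')"
    )


def contains_query(table: str, column: str, key: str = "id") -> str:
    """Return a query that searches the filtered text via CONTAINS."""
    # :term is a bind variable to be supplied by the caller at execution time.
    return f"SELECT {key} FROM {table} WHERE CONTAINS({column}, :term) > 0"


if __name__ == "__main__":
    print(context_index_ddl("docs_idx", "docs", "doc"))
    print(contains_query("docs", "doc"))
```

The point is that the query side never sees the binary format: the filter runs at index time, and CONTAINS searches only the tokens extracted from the converted text.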

Regards,
Vladimir M. Zakharychev
N-Networks, makers of Dynamic PSP(tm)
http://www.dynamicpsp.com


From: zigzagdna on
On May 9, 1:42 pm, "Vladimir M. Zakharychev"
<vladimir.zakharyc...(a)gmail.com> wrote:
> On May 9, 7:14 am, zigzagdna <zigzag...(a)yahoo.com> wrote:
>
> > I am on Oracle 11 g using hp unix 11i.
> > Can one set Oracle text Indexes on Microsoft WORD and  pdf files. I
> > can understand text indexes on text files, but how do test indexes
> > work on “binary” files such as WORD and pdf files. I have been told
> > they work on these files as well, just curious how oracle manages to
> > parse such files.
>
> They are using external converters to parse them and convert them to
> HTML or XML. Extproc functionality is heavily used for that. Since
> file formats evolve over time so do these external filters, so if
> you're on 11 you better use filters native to this version (actually I
> can't see how you could use 9i filters with 11 anyway.)
>
> Regards,
>    Vladimir M. Zakharychev
>    N-Networks, makers of Dynamic PSP(tm)
>    http://www.dynamicpsp.com

Vladimir:
Thanks a lot. Oracle 11g still provides the older Oracle Text; that's
what I installed inside the Oracle 11g database instance.