From: Craig on
Soz for the long quote but;

Thought the write-up was well-done, the feature-list compelling &,
finally, I don't have any experience with this project. For the
anti-Java warriors, yup, it's a Java app.

> For speedy, cross-platform desktop search, try DocFetcher
> Posted on Wednesday, January 13th, 2010 by leeschlesinger
>
> �It�s basically for the desktop what Google is for the Web.� That�s
> how Tran Nam Quang describes DocFetcher, an application that allows
> you to search the contents of documents on your computer.
>
> DocFetcher has a few characteristics that make it unique among
> comparable alternatives, Tran says. Most desktop search apps are
> either Windows-only (Google Desktop Search, Copernic) or Linux-only
> (Beagle, Tracker). DocFetcher is cross-platform and portable. You can
> use the same DocFetcher instance on your Windows and Linux machine,
> and you can put it on a USB stick, hide it in an encrypted volume, or
> burn it onto a CD-ROM � along with your documents.
>
> In addition, it�s fast at indexing. �People often think that Java, in
> which DocFetcher was written, automatically means slow, but in the
> case of indexing, it�s really not,� Tran says. �In my experience, the
> way to get the biggest performance boost is to give users full
> control over what is indexed. I believe we did a better job with that
> than, say, Google Desktop, where it is a lot more difficult to narrow
> down the search scope.�

<http://sourceforge.net/blog/for-speedy-cross-platform-desktop-search-try-docfetcher/>
<http://docfetcher.sourceforge.net/en/index.html>

--
-Craig
From: Why Tea on
On Jan 14, 5:50 am, Craig <netburg...(a)REMOVEgmail.com> wrote:
> Soz for the long quote but;
>
> Thought the write-up was well-done, the feature-list compelling &,
> finally, I don't have any experience with this project.  For the
> anti-Java warriors, yup, it's a Java app.
>
>
>
> > For speedy, cross-platform desktop search, try DocFetcher
> > Posted on Wednesday, January 13th, 2010 by leeschlesinger
>
> > It s basically for the desktop what Google is for the Web. That s
> > how Tran Nam Quang describes DocFetcher, an application that allows
> > you to search the contents of documents on your computer.
>
> > DocFetcher has a few characteristics that make it unique among
> > comparable alternatives, Tran says. Most desktop search apps are
> > either Windows-only (Google Desktop Search, Copernic) or Linux-only
> > (Beagle, Tracker). DocFetcher is cross-platform and portable. You can
> > use the same DocFetcher instance on your Windows and Linux machine,
> > and you can put it on a USB stick, hide it in an encrypted volume, or
> > burn it onto a CD-ROM along with your documents.
>
> > In addition, it s fast at indexing. People often think that Java, in
> > which DocFetcher was written, automatically means slow, but in the
> > case of indexing, it s really not, Tran says. In my experience, the
> > way to get the biggest performance boost is to give users full
> > control over what is indexed. I believe we did a better job with that
> > than, say, Google Desktop, where it is a lot more difficult to narrow
> > down the search scope.
>
> <http://sourceforge.net/blog/for-speedy-cross-platform-desktop-search-...>
> <http://docfetcher.sourceforge.net/en/index.html>
>
> --
> -Craig

Yup, I'm a bit anti-Java. But I have just downloaded it and it
seems to run well. Will try to index a mixture of documents
to give it a real test later.
From: Craig on
On 01/13/2010 11:24 PM, Why Tea wrote:
> On Jan 14, 5:50 am, Craig<netburg...(a)REMOVEgmail.com> wrote:

>> <http://sourceforge.net/blog/for-speedy-cross-platform-desktop-search-...>
>> <http://docfetcher.sourceforge.net/en/index.html>
>>
>
> Yup, I'm a bit anti-Java. But I have just downloaded it and it
> seems to run well. Will try to index a mixture of documents
> to give it a real test later.

Thanks!

--
-Craig
From: Why Tea on
On Jan 14, 6:24 pm, Why Tea <ytl...(a)gmail.com> wrote:
> On Jan 14, 5:50 am, Craig <netburg...(a)REMOVEgmail.com> wrote:
>
>
>
> > Soz for the long quote but;
>
> > Thought the write-up was well-done, the feature-list compelling &,
> > finally, I don't have any experience with this project.  For the
> > anti-Java warriors, yup, it's a Java app.
>
> > > For speedy, cross-platform desktop search, try DocFetcher
> > > Posted on Wednesday, January 13th, 2010 by leeschlesinger
>
> > > It s basically for the desktop what Google is for the Web. That s
> > > how Tran Nam Quang describes DocFetcher, an application that allows
> > > you to search the contents of documents on your computer.
>
> > > DocFetcher has a few characteristics that make it unique among
> > > comparable alternatives, Tran says. Most desktop search apps are
> > > either Windows-only (Google Desktop Search, Copernic) or Linux-only
> > > (Beagle, Tracker). DocFetcher is cross-platform and portable. You can
> > > use the same DocFetcher instance on your Windows and Linux machine,
> > > and you can put it on a USB stick, hide it in an encrypted volume, or
> > > burn it onto a CD-ROM along with your documents.
>
> > > In addition, it s fast at indexing. People often think that Java, in
> > > which DocFetcher was written, automatically means slow, but in the
> > > case of indexing, it s really not, Tran says. In my experience, the
> > > way to get the biggest performance boost is to give users full
> > > control over what is indexed. I believe we did a better job with that
> > > than, say, Google Desktop, where it is a lot more difficult to narrow
> > > down the search scope.
>
> > <http://sourceforge.net/blog/for-speedy-cross-platform-desktop-search-....>
> > <http://docfetcher.sourceforge.net/en/index.html>
>
> > --
> > -Craig
>
> Yup, I'm a bit anti-Java. But I have just downloaded it and it
> seems to run well. Will try to index a mixture of documents
> to give it a real test later.

Have just tried it with ~40,000 files of html, pdf and .doc.
Indexing speed was reasonable, searching was fast. The
internal viewer works well for html, but displays garbled text
for pdf. The search results do not agree with swish-e on the
same set of files. I don't know which one is at fault yet, but
I believe it's DocFetcher.
From: Mike Mills on
Why Tea <ytlim1(a)gmail.com> wrote in news:61cb8789-7285-4c7a-826d-
e41801d6a99b(a)21g2000yqj.googlegroups.com:

> http://docfetcher.sourceforge.net/en/index.html

I tested DocFetcher briefly using a part of a bird project I am
working on.

<!--runs on java , starts slowly but works quickly.-->
This installation chosen is "portable"
The indexes can be created and then later burned to cd.

<!--free-->

<!--highlighted found terms-->

Only shows highlighted found terms when using text type html display
without pictures, this is the quick display, the other display type
shows pictures, but does not highlight found terms. [click on button
top of bottom window to toggle].

Indexes ~222 species [>1200 files] & creates ~1Mb index fairly
quickly, slows down a little while indexing .pdf files.

Searches in .doc .docx .htm .html .xls .txt .info etc. files as
selected.
directories to search and index are individually selected.
Indexing a *whole* drive is not encouraged. This is the opposite of
the Google Desktop Search approach.

I intend to experiment further with this application, it seems
worthwhile. I usually use indexyourfiles to create an index and to
later burn the index to a dvd where it will run. This is nearly
always adequate for my needs. This docFetcher will apparently do more
which I may find useful. It is slower than IndexYourFiles because it
indexes every word rather than just the titles, of course the indexes
are *much* larger than the filename indexes in IndexYourFiles.

http://www.indexyourfiles.com