Prev: Freeware needlepoint design software?
Next: protect..
From: MitchellWmA on 9 Apr 2007 20:35 On Sun, 8 Apr 2007 15:53:55 -0500, �Q� <boxcars(a)gmx.net> wrote: >In <news:klqf13tsc90dk1kd1pj712ude1aphd3grn(a)4ax.com>, >MitchellWmA <nospammail(a)nonsense.com> wrote: > >> But now that I know what to look for, I'll google through all the >> regex stuff again online to see if something, somehwere doesn't point >> what really shows the beginning of a word. Agent Ransack just doesn't >> seem to like "\b" which is only other "beginning of word" thing to use >> and I must be missing out on a lot of words embedded in the info files >> vs the word lists. > >AFAICT, there is no good way to make Agent Ransack's regex recognize >word boundaries. It seems to be very limited, with none of the special >expressions of most regex schemes. Phew, I'm glad it just wasn't me. I searched and searched and searched and all the tutorials and such didn't point to anything that worked with AR for finding words amongst text. I suspect that it's a built-in limitation in AR as it's a lite version of a shareware "pro" app. Betcha the $ware one does the job completely. ..Again, I needed the confirmation from someone else because I'd begun to suspect that as good as it was, AR is not the app for this job. Probably the only reason it's been so good is because I'm dealing mainly with word lists which have one word per line so it's just coincidence that I find as many words as I do. But since fully 1/3 of my information text files are multi-lined documents chock-full of data, I'm now completely sure that I'm missing out on fully mining them all properly. Ya know, I know that I don't have as much knowledge as other people, but I tell you one thing. If there is a weakness in an app, I'm the one most likely to find it. AR has proven that yet again ... *g* I guess I just trust when a developer says something is good for something and I use it to the hilt only to find out their claims are not as comprehensive as one would believe ... >Here's a rather ugly workaround, in case you want to stick with Agent >Ransack; for me it would be too annoying, and I'd look for some other >app with better regex support. (I like and use AR, but mostly just for >searching filenames without much need for regular expressions.) You >can define a character class consisting of all the characters that >might indicate word boundaries, or at least the more common ones, like >whitespaces and punctuation. I don't see any way at all to match tab >characters, though. > > (^|[ .;:,'"])foo([ .;:,'"]|$) > >will match > > bar foo bar > foo bar > bar foo. Bar > >but not > > bar foomatic bar Interesting. I'll give it a try. But I think I'll keep searching at the same time for a regex search app that follows more conventional search parameters. AR is good but it won't fit my needs long-term. Thanks for your help. I wasn't trusting my results due to my inexperience with regex and regexp. Appreciate it.
From: MitchellWmA on 10 Apr 2007 04:44 On Mon, 09 Apr 2007 20:35:02 -0400, MitchellWmA <nospammail(a)nonsense.com> wrote: [snip] >Interesting. I'll give it a try. But I think I'll keep searching at >the same time for a regex search app that follows more conventional >search parameters. AR is good but it won't fit my needs long-term. [snip] Funny how once you know what to search for you eventually you find something. After only a bit of searching I found Vgrep here http://www.english.aionel.net/. It looks promising. Not as nice an interface after AR but it did find my earlier text string search that AR didn't - "joy on the burn" - easily with both of these buried in and amongst a bunch of other text : \bjoy.on.the.burn\b \bjoy.*on.*the.*burn\b I think I'll probably always use the second way of searching, if I've understood regexp properly, in case there accidentally is more than one space between words. Anyway, V-Grep is a contender. Very importantly, it allows one to save the search parameters. Another bonus to offset the clunkiness of the interface for me is the fact that it is that its folder is 329kb vs AR's 1.16 megs. A consideration on a small thumb drive. Anyway, I'll keep looking but wanted to post findings to the thread. thanx
From: jmatt on 10 Apr 2007 10:42 On Apr 10, 4:44 pm, MitchellWmA <nospamm...(a)nonsense.com> wrote: > On Mon, 09 Apr 2007 20:35:02 -0400, MitchellWmA Here are some more. http://www.all4you.dk/FreewareWorld/links.php?search=grep&la=en http://win.softpedia.com/progSearch/grep,1,1,1,1,1,0,0,0,1,0 http://www.all4you.dk/FreewareWorld/links.php?search=Regex&la=en http://win.softpedia.com/progSearch/regex,1,1,1,1,1,0,0,0,1,0
From: MitchellWmA on 11 Apr 2007 06:55
On 10 Apr 2007 07:42:18 -0700, jmatt(a)webace.com.au wrote: >On Apr 10, 4:44 pm, MitchellWmA <nospamm...(a)nonsense.com> wrote: >> On Mon, 09 Apr 2007 20:35:02 -0400, MitchellWmA > >Here are some more. > >http://www.all4you.dk/FreewareWorld/links.php?search=grep&la=en > >http://win.softpedia.com/progSearch/grep,1,1,1,1,1,0,0,0,1,0 > >http://www.all4you.dk/FreewareWorld/links.php?search=Regex&la=en > >http://win.softpedia.com/progSearch/regex,1,1,1,1,1,0,0,0,1,0 Wow, excellent. Thanks. I'll most definitely take a look. *g* |