From: Killer on
Hi Team,

I have an excel sheet with a list of 50 page URLs for which I am
trying to extract the page titles from the HTML code of each page.
This is a secure site and I would have to pass it through
authentication. Can I get all the titles in one shot?

filename blah url 'http://www.xyz.com/' user="myuserid"
pass="mypassword" prompt;;

data blah;
infile blah;
input @"<title>";
rxid1 = prxparse('/<title>(.*)<\/title>/');
if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1,
_infile_);
keep title;
run;
proc print;
run;

On running this code I am getting "Cannot load SSL support".

Is there a way to get around this? Or is there a different approach to
this, so that I could extrat the page titles.

Please help!

Regards,
KK



From: Arthur Tabachneck on
Take a look at:

http://www.listserv.uga.edu/cgi-bin/wa?A2=ind0506d&L=sas-l&P=48385

HTH,
Art
-----------
On Feb 6, 1:12 pm, Killer <kk.ma...(a)gmail.com> wrote:
> Hi Team,
>
> I have an excel sheet with a list of 50 page URLs for which I am
> trying to extract the page titles from the HTML code of each page.
> This is a secure site and I would have to pass it through
> authentication. Can I get all the titles in one shot?
>
> filename blah url 'http://www.xyz.com/'user="myuserid"
> pass="mypassword" prompt;;
>
> data blah;
> infile blah;
> input @"<title>";
> rxid1 = prxparse('/<title>(.*)<\/title>/');
> if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1,
> _infile_);
> keep title;
> run;
> proc print;
> run;
>
> On running this code I am getting "Cannot load SSL support".
>
> Is there a way to get around this? Or is there a different approach to
> this, so that I could extrat the page titles.
>
> Please help!
>
> Regards,
> KK
From: Killer on
On Feb 7, 12:17 am, art...(a)NETSCAPE.NET (Arthur Tabachneck) wrote:
> Take a look at:
>
> http://www.listserv.uga.edu/cgi-bin/wa?A2=ind0506d&L=sas-l&P=48385
>
> HTH,
> Art
> -----------
> On Feb 6, 1:12 pm, Killer <kk.ma...(a)gmail.com> wrote:
>
>
>
> > Hi Team,
>
> > I have an excel sheet with a list of 50 page URLs for which I am
> > trying to extract the page titles from the HTML code of each page.
> > This is a secure site and I would have to pass it through
> > authentication. Can I get all the titles in one shot?
>
> >  filename blah url 'http://www.xyz.com/'user="myuserid"
> > pass="mypassword" prompt;;
>
> >   data blah;
> >     infile blah;
> >     input @"<title>";
> >     rxid1 = prxparse('/<title>(.*)<\/title>/');
> >     if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1,
> > _infile_);
> >     keep title;
> >   run;
> >   proc print;
> >   run;
>
> > On running this code I am getting  "Cannot loadSSLsupport".
>
> > Is there a way to get around this? Or is there a different approach to
> > this, so that I could extrat the page titles.
>
> > Please help!
>
> > Regards,
> > KK- Hide quoted text -
>
> - Show quoted text -


Hi ,

I tried that, now I am getting a message "ERROR: Hostname kchttps not
found". any ideas?
From: Arthur Tabachneck on
KK,

I'd think that you will have to post your code, and the resulting log, for
anyone to see what might be happening.

Could be something as simple as having made a typo of some sort in
identifying the hostname.

Art
---------
On Tue, 9 Feb 2010 10:09:04 -0800, Killer <kk.majji(a)GMAIL.COM> wrote:

>On Feb 7, 12:17 am, art...(a)NETSCAPE.NET (Arthur Tabachneck) wrote:
>> Take a look at:
>>
>> http://www.listserv.uga.edu/cgi-bin/wa?A2=ind0506d&L=sas-l&P=48385
>>
>> HTH,
>> Art
>> -----------
>> On Feb 6, 1:12 pm, Killer <kk.ma...(a)gmail.com> wrote:
>>
>>
>>
>> > Hi Team,
>>
>> > I have an excel sheet with a list of 50 page URLs for which I am
>> > trying to extract the page titles from the HTML code of each page.
>> > This is a secure site and I would have to pass it through
>> > authentication. Can I get all the titles in one shot?
>>
>> > filename blah url 'http://www.xyz.com/'user="myuserid"
>> > pass="mypassword" prompt;;
>>
>> > data blah;
>> > infile blah;
>> > input @"<title>";
>> > rxid1 = prxparse('/<title>(.*)<\/title>/');
>> > if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1,
>> > _infile_);
>> > keep title;
>> > run;
>> > proc print;
>> > run;
>>
>> > On running this code I am getting "Cannot loadSSLsupport".
>>
>> > Is there a way to get around this? Or is there a different approach to
>> > this, so that I could extrat the page titles.
>>
>> > Please help!
>>
>> > Regards,
>> > KK- Hide quoted text -
>>
>> - Show quoted text -
>
>
>Hi ,
>
>I tried that, now I am getting a message "ERROR: Hostname kchttps not
>found". any ideas?