From: Killer on 6 Feb 2010 13:12 Hi Team, I have an excel sheet with a list of 50 page URLs for which I am trying to extract the page titles from the HTML code of each page. This is a secure site and I would have to pass it through authentication. Can I get all the titles in one shot? filename blah url 'http://www.xyz.com/' user="myuserid" pass="mypassword" prompt;; data blah; infile blah; input @"<title>"; rxid1 = prxparse('/<title>(.*)<\/title>/'); if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1, _infile_); keep title; run; proc print; run; On running this code I am getting "Cannot load SSL support". Is there a way to get around this? Or is there a different approach to this, so that I could extrat the page titles. Please help! Regards, KK
From: Arthur Tabachneck on 6 Feb 2010 14:17 Take a look at: http://www.listserv.uga.edu/cgi-bin/wa?A2=ind0506d&L=sas-l&P=48385 HTH, Art ----------- On Feb 6, 1:12 pm, Killer <kk.ma...(a)gmail.com> wrote: > Hi Team, > > I have an excel sheet with a list of 50 page URLs for which I am > trying to extract the page titles from the HTML code of each page. > This is a secure site and I would have to pass it through > authentication. Can I get all the titles in one shot? > > filename blah url 'http://www.xyz.com/'user="myuserid" > pass="mypassword" prompt;; > > data blah; > infile blah; > input @"<title>"; > rxid1 = prxparse('/<title>(.*)<\/title>/'); > if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1, > _infile_); > keep title; > run; > proc print; > run; > > On running this code I am getting "Cannot load SSL support". > > Is there a way to get around this? Or is there a different approach to > this, so that I could extrat the page titles. > > Please help! > > Regards, > KK
From: Killer on 9 Feb 2010 13:09 On Feb 7, 12:17 am, art...(a)NETSCAPE.NET (Arthur Tabachneck) wrote: > Take a look at: > > http://www.listserv.uga.edu/cgi-bin/wa?A2=ind0506d&L=sas-l&P=48385 > > HTH, > Art > ----------- > On Feb 6, 1:12 pm, Killer <kk.ma...(a)gmail.com> wrote: > > > > > Hi Team, > > > I have an excel sheet with a list of 50 page URLs for which I am > > trying to extract the page titles from the HTML code of each page. > > This is a secure site and I would have to pass it through > > authentication. Can I get all the titles in one shot? > > > filename blah url 'http://www.xyz.com/'user="myuserid" > > pass="mypassword" prompt;; > > > data blah; > > infile blah; > > input @"<title>"; > > rxid1 = prxparse('/<title>(.*)<\/title>/'); > > if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1, > > _infile_); > > keep title; > > run; > > proc print; > > run; > > > On running this code I am getting "Cannot loadSSLsupport". > > > Is there a way to get around this? Or is there a different approach to > > this, so that I could extrat the page titles. > > > Please help! > > > Regards, > > KK- Hide quoted text - > > - Show quoted text - Hi , I tried that, now I am getting a message "ERROR: Hostname kchttps not found". any ideas?
From: Arthur Tabachneck on 9 Feb 2010 17:49 KK, I'd think that you will have to post your code, and the resulting log, for anyone to see what might be happening. Could be something as simple as having made a typo of some sort in identifying the hostname. Art --------- On Tue, 9 Feb 2010 10:09:04 -0800, Killer <kk.majji(a)GMAIL.COM> wrote: >On Feb 7, 12:17 am, art...(a)NETSCAPE.NET (Arthur Tabachneck) wrote: >> Take a look at: >> >> http://www.listserv.uga.edu/cgi-bin/wa?A2=ind0506d&L=sas-l&P=48385 >> >> HTH, >> Art >> ----------- >> On Feb 6, 1:12 pm, Killer <kk.ma...(a)gmail.com> wrote: >> >> >> >> > Hi Team, >> >> > I have an excel sheet with a list of 50 page URLs for which I am >> > trying to extract the page titles from the HTML code of each page. >> > This is a secure site and I would have to pass it through >> > authentication. Can I get all the titles in one shot? >> >> > filename blah url 'http://www.xyz.com/'user="myuserid" >> > pass="mypassword" prompt;; >> >> > data blah; >> > infile blah; >> > input @"<title>"; >> > rxid1 = prxparse('/<title>(.*)<\/title>/'); >> > if prxmatch(rxid1, _infile_) then title = prxposn(rxid1, 1, >> > _infile_); >> > keep title; >> > run; >> > proc print; >> > run; >> >> > On running this code I am getting "Cannot loadSSLsupport". >> >> > Is there a way to get around this? Or is there a different approach to >> > this, so that I could extrat the page titles. >> >> > Please help! >> >> > Regards, >> > KK- Hide quoted text - >> >> - Show quoted text - > > >Hi , > >I tried that, now I am getting a message "ERROR: Hostname kchttps not >found". any ideas?
|
Pages: 1 Prev: Repeat Steps >1 Data Set Next: Gap statistic for clusters |