htdig: [PATCH] for <META HTTP-EQUIV=REFRESH ...


Aidas Kasparas (kaspar@dobilas.infosistema.lt)
Fri, 13 Nov 1998 20:24:02 +0100 (CET)


Hi,

Please find attached a patch which allows to index links where actual
link is pointed out via <META HTTP-EQUIV=REFRESH
CONTENT="0;URL=some_file.html">

        Sorry, it's relative to 3.1.0b1 and should be applied in htdig
directory.

Regards,

Aidas

*** HTML.cc.R Fri Nov 13 19:05:25 1998
--- HTML.cc Fri Nov 13 20:03:02 1998
***************
*** 609,614 ****
--- 609,633 ----
                      w = strtok(0, " \t\r\n");
                  }
              }
+
+ // <META HTTP-EQUIV=REFRESH case
+ if (conf["http-equiv"]){// && conf["content"]){
+ if (mystrcasecmp(conf["http-equiv"], "refresh") == 0){
+ char *content=conf["content"];
+ char *q = mystrcasestr(content, "url=");
+ if (*q){
+ q+=4; // skiping "URL="
+ char *qq = q;
+ while (*qq && (*qq!=';') && (*qq!='"') &&
+ !isspace(*qq))qq++;
+ *qq=0;
+ URL *href = new URL(q, *base);
+ retriever.got_href(*href, "");
+ delete href;
+
+ }
+ }
+ }
  
              //
              // Now check for <meta name=... content=...> tags that
Nry^bf)+-jgb-.yۜ{Zx^)(ۧrܨ~^,



This archive was generated by hypermail 2.0b3 on Sat Jan 02 1999 - 16:28:48 PST