
From nickt@powys.gov.uk Mon Aug  3 09:13:59 1998
Date: Mon, 3 Aug 1998 08:24:36 +0000
From: Nick Talbott <nickt@powys.gov.uk>
To: Brad Shelton <brad.shelton@centurytel.com>
Cc: HTDIG mailing list <htdig@sdsu.edu>
Subject: Re: htdig: htdig mishandles commas in META tag keyword list?

    [The following text is in the "ISO-8859-1" character set]
    [Your display is set for the "US-ASCII" character set]
    [Some characters may be displayed incorrectly]

Brad

> The problem is, htdig doesn't merely ignore or strip any commas that are
> there, but rather lumps them in as part of the keyword (according to the
> debug output we've seen). That is, the tag
> 
> <META NAME="keywords" CONTENT="guestbook, register, newsletter">   
> 
> produces the four words
> 
> guestbook,
> register,
> newsletter
> ...

> ...
> Is there a patch for this, or any other option to get this corrected?
> Any thoughts? Thanks in advance for any information!
> 

I noticed this behaviour and a quick scan through the source reveals a
solution.
Patch the source file htdig/HTML.cc as follows:.
Add a comma to the second parameter of the strtok call in each of lines 
547, 552, 567 and 572 of the file.

For example in line 547:
char    *w = strtok(keywords, " ,\t\r\n");
                                =
If you make this patch, run "make depend" to update dependencies before
re-compiling

Regards

Nick Talbott
Corporate IT Unit, Powys County Council, UK

----------------------------------------------------------------------
To unsubscribe from the htdig mailing list, send a message to
htdig-request@sdsu.edu containing the single word "unsubscribe" in
the body of the message.

