[LA.pm] HTML page word count/density module?

Robert Spier rspier at pobox.com
Thu Feb 19 19:42:24 CST 2004


> Here is an example output.
> perl wordcount.pl ls1.list 
> 
> Word    Word    
> Count   Percent Word
> -----   ------- -----------------
> 559     4.232   the
> 320     2.423   to
> 211     1.597   for
> 209     1.582   of
> 205     1.552   a
> 199     1.507   and

Taking HTML out of the picture, this is _trivial_.  (And could be done
in awk.)
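
For what it's worth, here is a minimal sketch of the counting step. It is in Python purely for illustration (the same logic is a handful of lines in Perl or awk), and it is not the original wordcount.pl. The names word_counts and format_report are made up for this example, and treating a "word" as a run of letters and apostrophes is an assumption, not something specified in the thread.

```python
import re
from collections import Counter


def word_counts(text):
    """Return (word, count, percent) rows, most frequent first.

    Assumption: a "word" is a run of ASCII letters/apostrophes,
    compared case-insensitively. Digits and punctuation are ignored.
    """
    words = re.findall(r"[a-z']+", text.lower())
    total = len(words)
    counts = Counter(words)
    # percent = share of all words in the text, as in the sample output
    return [(w, n, 100.0 * n / total) for w, n in counts.most_common()]


def format_report(rows, limit=6):
    """Lay the rows out in columns similar to the quoted sample output."""
    lines = [
        "Count   Percent Word",
        "-----   ------- -----------------",
    ]
    for word, n, pct in rows[:limit]:
        lines.append(f"{n:<7} {pct:<7.3f} {word}")
    return "\n".join(lines)


print(format_report(word_counts("the cat and the hat")))
```

Empty input yields an empty list of rows, so the division by the total never runs with a zero count.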

Changing HTML to text in a way suitable for this program is trivial.
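
A rough sketch of that conversion, again in Python for illustration and using only the standard library's html.parser module. TextExtractor and html_to_text are hypothetical names, and dropping the contents of script and style elements is an assumption about what "suitable for this program" means.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""

    SKIP = {"script", "style"}  # assumption: these never count as page text

    def __init__(self):
        super().__init__(convert_charrefs=True)  # decode &amp; and friends
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def html_to_text(html):
    """Return the page text with tags removed and whitespace collapsed."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return " ".join(" ".join(parser.parts).split())


print(html_to_text("<p>Hello <b>world</b> &amp; friends</p>"))
```

Joining the pieces with a space means a word split by inline markup (say, one bold letter inside a word) is counted as two words. That is usually acceptable for density statistics, but it is a choice made here, not something from the thread.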

Ergo, 1+1=2: convert the HTML to text, then feed the text to the word counter.

-R


