[Chicago-talk] Please test my new speedy ack-1.75_01.tar.gz

Andy Lester andy at petdance.com
Thu Dec 6 18:46:46 PST 2007



I've released 1.75_01 which is a big speedup for what I suspect is the  
most common case of scanning large directories of many relatively  
(<100K) small files and not finding matches in all of them.

Here's what it does:  It checks the size of each file, and if it's  
<100K, slurps up the entire file.  Then, it scans the entire 100K file  
at once, and if there are no matches, then there's no need to do the  
line-by-line more expensive searching.

In my test scanning a couple of big source directories like ~/parrot,  
it saved about 40% run-time.  If I had -C or -B to store context, it  
saved about 70% run-time because the saving of context is more  
expensive.

So, please, check it out and make sure all is still working.  Also,  
any eyes on the algorithm I'm using would be appreciated, too.  I'm  
sure I can do things cleaner or a bit more speedily, but right now I'm  
mostly worried about accuracy.

xoxo,
Andy


Begin forwarded message:

> From: PAUSE <upload at pause.perl.org>
> Date: December 6, 2007 8:41:37 PM CST
> To: "Andy Lester" <andy at petdance.com>
> Subject: CPAN Upload: P/PE/PETDANCE/ack-1.75_01.tar.gz
> Reply-To: cpan-testers at perl.org
>
> The uploaded file
>
>   ack-1.75_01.tar.gz
>
> has entered CPAN as
>
> file: $CPAN/authors/id/P/PE/PETDANCE/ack-1.75_01.tar.gz
> size: 51701 bytes
>  md5: 43673ac643d9d58df24f4e349640791d
>
> No action is required on your part
> Request entered by: PETDANCE (Andy Lester)
> Request entered on: Fri, 07 Dec 2007 02:41:10 GMT
> Request completed:  Fri, 07 Dec 2007 02:41:37 GMT
>
> Thanks,
> -- 
> paused, v937
>

--
Andy Lester => andy at petdance.com => www.petdance.com => AIM:petdance






More information about the Chicago-talk mailing list