[DFW.pm] Deduplication Hackathon: Formal Output Specification

Tommy Butler dfwpm at internetalias.net
Tue Dec 31 12:37:12 PST 2013


You may use any hashing/mapping algorithm you like; xxhash was just a
personal choice given it's emphasis on speed.

The "robot" format of output is what must match up with your own
output.  The "human" format is strictly optional, and if you choose to
create a human-readable output format option for your own code, it can
look however you want it to.

Remember to email your ssh key to me if you want server access and the
ability to test your code against the reference data.

--Tommy Butler

On 12/31/2013 04:19 AM, Joakim Lagerqvist wrote:
> Hello Tommy,
>
> On Tue, Dec 31, 2013 at 1:40 PM, Tommy Butler <dfwpm at internetalias.net
> <mailto:dfwpm at internetalias.net>> wrote:
>
>     */Please take time to compare your code output to the output of
>     the "reference design" code on github. If your output is not
>     identical, then you will be disqualified for producing incorrect
>     results. /*
>
>
> In your reference design "human" output, you have included the xxHash
> value, is this needed for the contest? If another digest/method has
> been used to identify the duplicates, it will not match up.
>
> Cheers and happy new year,
> Joakim

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.pm.org/pipermail/dfw-pm/attachments/20131231/8b67d019/attachment.html>


More information about the Dfw-pm mailing list