<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<br>
Thanks so much for looking into that Robert.<br>
<br>
I was trying to download this information by doing a <br>
LWP::Simple <br>
my $retcode1 = getstore( $second, "$dir/$first" );<br>
on links like this:<br>
<a class="moz-txt-link-freetext" href="https://web.archive.org/web/20131005142948/http://freepages.genealogy.rootsweb.ancestry.com:80/~caulleyfamilyinfo/MissouriMarriages/Franklin18451864BookBConsolidatedIndex.txt">https://web.archive.org/web/20131005142948/http://freepages.genealogy.rootsweb.ancestry.com:80/~caulleyfamilyinfo/MissouriMarriages/Franklin18451864BookBConsolidatedIndex.txt</a><br>
<br>
Which gives me a text file similar to the attached HTM file.<br>
That file has a bunch of HTML in it that produces the data in a <br>
text scroll if you open it in a browser. I am embarrassed to <br>
say that all my efforts to obtain the data straight away were <br>
unsuccessful. I expected <br>
use LWP::UserAgent;<br>
my $ua = LWP::UserAgent->new;<br>
$ua->agent("Mozilla/8.0"); # pretend we are very capable browser<br>
<br>
or<br>
<br>
use LWP::UserAgent;<br>
my $ua = LWP::UserAgent->new;<br>
$ua->agent("$0/0.1 " . $ua->agent);<br>
<br>
to work, but they download the same HTML file <br>
as the attached one.<br>
<br>
I'll probably figure this out someday, but for the moment I am <br>
trying to limit the time I spend on this embarrassing situation :-)<br>
But I just can't help myself. I am still working on it a little.<br>
<br>
This is not a lot of data, so I can certainly get it, but I am <br>
more interested in fixing the problem I have obtaining the data <br>
than actually getting the data. I don't really care too much <br>
about the data, but others do.<br>
<br>
<br>
<br>
Mike<br>
<br>
<br>
<br>
<div class="moz-cite-prefix">On 1/17/2018 1:21 PM, Robert Stone
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CAAvSPpTnE1+_creuLtvBb3yeWH=-U8qV1cJ1Lw2LvW0yyoxJAQ@mail.gmail.com">
<div dir="ltr">(resending this without images due to mailing list
size limit)
<div><br>
</div>
<div><span style="font-size:12.8px">Greetings,</span>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">tl;dr - While the form likely
connected to a database/datastore and there is no way to
retrieve that, the wayback machine archived a lot (but not
all) of the data in another format.</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px"><b>The Bad News</b></div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">So for funsies I took a look at
this form and the HTML for it. Turns out<span
style="font-size:12.8px"> that the information entered is
POST'ed back to the server at yearlastwild.asp to handle
the request. Just to be absolutely certain, I went ahead
and submitted a request monitoring the network traffic and
confirmed the POST request. </span><span
style="font-size:12.8px">That ASP script was likely
connecting to some sort of database to retrieve and then
format the data for presentation.</span></div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">Just to be SUPER certain there
wasn't a whole huge blob of javascript representing the
dataset (which would be incredibly unlikely, but you never
know...) and the <span style="font-size:12.8px">largest
request is 27.7 KB, and it's for a font.</span></div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px"><b>The Good News</b></div>
<div style="font-size:12.8px"><b><br>
</b></div>
<div style="font-size:12.8px">Well, then, let's see if the
marriage data is presented in any other format on the site,
like a big huge list. Crazier things have happened...</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px"><a
href="https://web.archive.org/web/20030208012802/http://vienici.com:80/abmomarr.html"
target="_blank" moz-do-not-send="true">https://web.archive.org/web/<wbr>20030208012802/http://vienici.<wbr>com:80/abmomarr.html</a><br>
</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">If we scroll down we can see
Washington County a<span style="font-size:12.8px">nd if we
select the He- we can see the same entry for Henry S:</span></div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px"><a
href="https://web.archive.org/web/20030219131906/http://vienici.com:80/moabs/xmarrwash/xhe-j.html"
target="_blank" moz-do-not-send="true">https://web.archive.org/web/<wbr>20030219131906/http://vienici.<wbr>com:80/moabs/xmarrwash/xhe-j.<wbr>html</a></div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">Which actually matches the data
from yearlastwild.asp (although, only the name and date are
contained here and not the description).</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">So it seems for washing county
there is some data and possibly more from the Washing County
GenWeb. I do see for other counties there is much more
data, such as Franklin County:</div>
<div style="font-size:12.8px"><a
href="https://web.archive.org/web/20030407195843/http://www.vienici.com:80/mofran/vB/p201225.html"
target="_blank" moz-do-not-send="true">https://web.archive.org/web/<wbr>20030407195843/http://www.<wbr>vienici.com:80/mofran/vB/<wbr>p201225.html</a></div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">With some work and a whole bunch
of parsing you could recreate a good chunk! Of course, I'd
probably hunt high and low to see if someone else had this
dataset I could use (or buy) but nice to know at least parts
of it live on.</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">Hopefully you find the above
helpful.</div>
<div style="font-size:12.8px"><br>
</div>
<div style="font-size:12.8px">Best Regards,</div>
<div style="font-size:12.8px">Robert Stone</div>
</div>
</div>
<div class="gmail_extra"><br>
<div class="gmail_quote">On Tue, Jan 16, 2018 at 8:54 PM, Mike
Flannigan <span dir="ltr"><<a
href="mailto:mikeflan@att.net" target="_blank"
moz-do-not-send="true">mikeflan@att.net</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0
.8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
This is an archive of a website that went dead in 2011:<br>
<a
href="https://web.archive.org/web/20090609191130/http://www.vienici.com:80/moabs/lookups.html"
rel="noreferrer" target="_blank" moz-do-not-send="true">https://web.archive.org/web/20<wbr>090609191130/http://www.vienic<wbr>i.com:80/moabs/lookups.html</a><br>
<br>
The 3rd search box (link) takes you to:<br>
<a
href="https://web.archive.org/web/20090306211924/http://www.vienici.com:80/moabs/yearlastwild.asp"
rel="noreferrer" target="_blank" moz-do-not-send="true">https://web.archive.org/web/20<wbr>090306211924/http://www.vienic<wbr>i.com:80/moabs/yearlastwild.<wbr>asp</a><br>
<br>
The search does not work on that page, for obvious reasons.
I have looked at<br>
the page source and decided the search was run by
javascript, but I could be<br>
wrong about that. If you are snowed in and have some time
to devote to this,<br>
what I want to know is what format was the marriage license
data in on this<br>
guys server. I don't think that can be told from the page
source, but I thought<br>
I would ask you guys. Perhaps you would need the ASP file
to tell that??<br>
It was not a huge amount of data, so it could have been in
almost any format.<br>
<br>
The reason I am asking is because we are trying to find that
data 6 years<br>
after the guy died.<br>
<br>
I'm pretty sure he had an account at the Wayback Machine,
and he may have stored<br>
the data there, in addition to other places.<br>
<br>
<br>
Mike<br>
______________________________<wbr>_________________<br>
Houston mailing list<br>
<a href="mailto:Houston@pm.org" target="_blank"
moz-do-not-send="true">Houston@pm.org</a><br>
<a href="http://mail.pm.org/mailman/listinfo/houston"
rel="noreferrer" target="_blank" moz-do-not-send="true">http://mail.pm.org/mailman/lis<wbr>tinfo/houston</a><br>
Website: <a href="http://houston.pm.org/" rel="noreferrer"
target="_blank" moz-do-not-send="true">http://houston.pm.org/</a><br>
</blockquote>
</div>
<br>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Houston mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Houston@pm.org">Houston@pm.org</a>
<a class="moz-txt-link-freetext" href="http://mail.pm.org/mailman/listinfo/houston">http://mail.pm.org/mailman/listinfo/houston</a>
Website: <a class="moz-txt-link-freetext" href="http://houston.pm.org/">http://houston.pm.org/</a>
</pre>
</blockquote>
<br>
</body>
</html>