[Purdue-pm] challenge problem: sentiment analysis
dsk
zeewfo at gmail.com
Sun Jan 27 06:31:47 PST 2019
Interesting challenge. A quick search on CPAN led me to the Text::Mining
package, has anyone used it for this type of project?
https://metacpan.org/pod/Text::Mining
With the government shutdown, regulations.gov may not be approving new API
keys. If anyone needs some example comments, I can put together a small
archive.
Thanks,
dsk
On Thu, Jan 24, 2019 at 1:27 PM Mark Senn <mark at purdue.edu> wrote:
> Purdue Perl Mongers,
>
> A person (I don't know if they want to be identified offhand)
> demonstrated how to get information from federalregistrar.gov and/or (I
> forget for sure) regulations.gov during our last meeting using an API.
>
> From
> https://www.regulations.gov/document?D=EPA-HQ-OAR-2017-0355-21117
> EPA received more than 270,000 comments on the ANPRM, which have
> informed this proposed rulemaking.
>
> From
> https://www.wolframalpha.com/input/?i=270000+seconds
> [270000 seconds is] 3.3 days
>
> Challenge problem: figure out how to use the API for regulations.gov and
> "sentiment analysis" (google it) to automatically classify comments. I
> understand regulations.gov limits the rate at which one can download
> information but if some "sentiment analysis" software can automatically
> classify comments faster/better/cheaper that humans or other existing
> software on a small trial, regulations.gov may be interested in that. I
> certainly wouldn't want to read 270K comments and summarize them.
>
> -mark
> _______________________________________________
> Purdue-pm mailing list
> Purdue-pm at pm.org
> https://mail.pm.org/mailman/listinfo/purdue-pm
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mail.pm.org/pipermail/purdue-pm/attachments/20190127/a5e0fcc2/attachment.html>
More information about the Purdue-pm
mailing list