[Purdue-pm] challenge problem: sentiment analysis

Mark Senn mark at purdue.edu
Thu Jan 24 10:27:36 PST 2019


Purdue Perl Mongers,

A person (I don't know if they want to be identified offhand)
demonstrated how to get information from federalregistrar.gov and/or (I
forget for sure) regulations.gov during our last meeting using an API.

From
https://www.regulations.gov/document?D=EPA-HQ-OAR-2017-0355-21117
    EPA received more than 270,000 comments on the ANPRM, which have
    informed this proposed rulemaking.

From
https://www.wolframalpha.com/input/?i=270000+seconds
    [270000 seconds is] 3.3 days

Challenge problem: figure out how to use the API for regulations.gov and
"sentiment analysis" (google it) to automatically classify comments.  I
understand regulations.gov limits the rate at which one can download
information but if some "sentiment analysis" software can automatically
classify comments faster/better/cheaper that humans or other existing
software on a small trial, regulations.gov may be interested in that.  I
certainly wouldn't want to read 270K comments and summarize them.

-mark


More information about the Purdue-pm mailing list