From xerofoify at gmail.com Fri Jun 2 15:09:55 2017 From: xerofoify at gmail.com (nick) Date: Fri, 2 Jun 2017 18:09:55 -0400 Subject: [tpm] Looking for Cpan Module for Parsing Metadata of Ebooks, Audiobooks and Comics Message-ID: Greetings all, I am looking for a CPAN module to help build a search tool for my collection of ebooks,comics and audiobooks. I was wondering what modules there are to read metadata as I would like this tool to parse based on opinions passed to it for metadata information like author,published date, file type i.e. It does not need to work for media codecs for TV shows as that is just going to be stored in another program to search like my music. The module must support metadata parsing for mp3s,aacs,cbr formats,mobi,epub, and pdfs. I also plan to use it on a rather large set of directories with thousands or maybe even tens of thousands of files and therefore it needs to parse rather fast with efficient use of my resources. Thanks for the help, Nick From talexb at gmail.com Sat Jun 3 03:17:06 2017 From: talexb at gmail.com (Alex Beamish) Date: Sat, 3 Jun 2017 06:17:06 -0400 Subject: [tpm] Looking for Cpan Module for Parsing Metadata of Ebooks, Audiobooks and Comics In-Reply-To: References: Message-ID: You could start with a module like MP3::Info for mp3 files and just store everything that the get_mp3info method returns. On Jun 2, 2017 6:11 PM, "nick" wrote: > Greetings all, > > I am looking for a CPAN module to help build a search tool for my > collection of ebooks,comics and > audiobooks. I was wondering what modules there are to read metadata as I > would like this tool to > parse based on opinions passed to it for metadata information like > author,published date, file > type i.e. It does not need to work for media codecs for TV shows as that > is just going to be > stored in another program to search like my music. The module must support > metadata parsing > for mp3s,aacs,cbr formats,mobi,epub, and pdfs. I also plan to use it on a > rather large set > of directories with thousands or maybe even tens of thousands of files and > therefore it needs > to parse rather fast with efficient use of my resources. > > Thanks for the help, > Nick > _______________________________________________ > toronto-pm mailing list > toronto-pm at pm.org > http://mail.pm.org/mailman/listinfo/toronto-pm > -------------- next part -------------- An HTML attachment was scrubbed... URL: From xerofoify at gmail.com Sat Jun 3 07:01:04 2017 From: xerofoify at gmail.com (Nicholas Krause) Date: Sat, 03 Jun 2017 10:01:04 -0400 Subject: [tpm] Looking for Cpan Module for Parsing Metadata of Ebooks, Audiobooks and Comics In-Reply-To: References: Message-ID: <6EC4C96B-6929-4E22-A3D7-222C9CE97EDA@gmail.com> On June 3, 2017 6:17:06 AM EDT, Alex Beamish wrote: >You could start with a module like MP3::Info for mp3 files and just >store >everything that the get_mp3info method returns. > Thanks is there anything for the other file formats. >On Jun 2, 2017 6:11 PM, "nick" wrote: > >> Greetings all, >> >> I am looking for a CPAN module to help build a search tool for my >> collection of ebooks,comics and >> audiobooks. I was wondering what modules there are to read metadata >as I >> would like this tool to >> parse based on opinions passed to it for metadata information like >> author,published date, file >> type i.e. It does not need to work for media codecs for TV shows as >that >> is just going to be >> stored in another program to search like my music. The module must >support >> metadata parsing >> for mp3s,aacs,cbr formats,mobi,epub, and pdfs. I also plan to use it >on a >> rather large set >> of directories with thousands or maybe even tens of thousands of >files and >> therefore it needs >> to parse rather fast with efficient use of my resources. >> >> Thanks for the help, >> Nick >> _______________________________________________ >> toronto-pm mailing list >> toronto-pm at pm.org >> http://mail.pm.org/mailman/listinfo/toronto-pm >> From jkeenan at pobox.com Sat Jun 3 07:16:01 2017 From: jkeenan at pobox.com (James E Keenan) Date: Sat, 3 Jun 2017 10:16:01 -0400 Subject: [tpm] Looking for Cpan Module for Parsing Metadata of Ebooks, Audiobooks and Comics In-Reply-To: <6EC4C96B-6929-4E22-A3D7-222C9CE97EDA@gmail.com> References: <6EC4C96B-6929-4E22-A3D7-222C9CE97EDA@gmail.com> Message-ID: <48692432-7f22-e7e5-c3fc-672dfa5ef429@pobox.com> On 06/03/2017 10:01 AM, Nicholas Krause wrote: > > > On June 3, 2017 6:17:06 AM EDT, Alex Beamish wrote: >> You could start with a module like MP3::Info for mp3 files and just >> store >> everything that the get_mp3info method returns. >> > Thanks is there anything for the other file formats. Why not start by going to search.cpan.org and typing in "PDF" in the search box? From liam at holoweb.net Sat Jun 3 18:24:44 2017 From: liam at holoweb.net (Liam R E Quin) Date: Sat, 03 Jun 2017 21:24:44 -0400 Subject: [tpm] Looking for Cpan Module for Parsing Metadata of Ebooks, Audiobooks and Comics In-Reply-To: References: Message-ID: <1496539484.21198.31.camel@holoweb.net> On Sat, 2017-06-03 at 06:17 -0400, Alex Beamish wrote: > [...] > > metadata parsing > > for mp3s,aacs,cbr formats,mobi,epub, and pdfs. I also plan to use > > it on a > > rather large set > > of directories with thousands or maybe even tens of thousands of > > files and > > therefore it needs > > to parse rather fast with efficient use of my resources. If it's too slow you could use a database - for a fast tree store I use BaseX with the Perl API, although that does mean learning the XQuery language - which is at least fairly standard across databases from IBM DB2 and MS SQL Server all the way to NoSQL MarkLogic or (as here) open source BaseX. In other words the way I'd approach this would depend on the uses I was going to make of the metadata. Liam From xerofoify at gmail.com Sun Jun 4 17:40:47 2017 From: xerofoify at gmail.com (nick) Date: Sun, 4 Jun 2017 20:40:47 -0400 Subject: [tpm] Looking for Cpan Module for Parsing Metadata of Ebooks, Audiobooks and Comics In-Reply-To: <1496604422.21198.44.camel@holoweb.net> References: <1496539484.21198.31.camel@holoweb.net> <4239791e-d9fd-1112-9052-90152704d417@gmail.com> <1496604422.21198.44.camel@holoweb.net> Message-ID: <94d76c16-a1fd-209d-d433-42e94a11b4bf@gmail.com> On 2017-06-04 03:27 PM, Liam R E Quin wrote: > On Sun, 2017-06-04 at 11:32 -0400, nick wrote: >> >> On 2017-06-03 09:24 PM, Liam R E Quin wrote: >>> If it's too slow you could use a database - for a fast tree store I >>> use >>> BaseX with the Perl API, > [...] >> >> Good point Liam. I do have a copy of Programming Perl so I will start >> looking through that again >> after I refresh my knowledge with Learning Perl. It's been a few >> years so I would like a refresher:). >> However on that note coming from a C++/C/Java background why not just >> run a profiler and see what >> is causing slowdowns and than decide based on that what optimizations >> to take. > > If you're doing (say) a text search with a regular expression over > 100MBytes of text, on a system less than (say) 5 or 10 years old, and > not running so many other things it's going as slowly as a blind slave > in a sulphur mine, you should get an answer in well under a second. > > If you have 100,000 files it will take longer because of the system > overhead of opening 10,000 files. > > Profiling alone won't show you when to choose a different strategy. > > At any rate it'll probably be fine; I just wanted to give you something > to keep in mind. > > Liam > > Liam, Profiling is a tool like any tool it's only useful to something who understands its use and limitations. Profiling only gives you an idea of where possible slowdowns are, all programming tools give false warnings. It's the programmer's job to find out which are actually valuable and which are not. My point was using a profiler is useful but not useful by itself. Perl does have a debugger too but it's up to the programmer to use it effectively not the tool itself. Even in C++/C with large programs its up to the programmer to decide what warnings are useful when the compiler complains. Regards, Nick P.S. I am ccing the list as I forget to so others can join in if they wish. From dave.s.doyle at gmail.com Mon Jun 19 08:10:01 2017 From: dave.s.doyle at gmail.com (Dave Doyle) Date: Mon, 19 Jun 2017 11:10:01 -0400 Subject: [tpm] Meeting June 28 (Wed) Message-ID: My apologies for the Wednesday but for the summer, can't do Thursdays. Please RSVP here: https://www.meetup.com/Toronto-Perl-Mongers/events/240909896/ Doug Hoyte is coming in and giving us a double header of talks: REST APIs with Chouette Chouette is a perl framework for building REST APIs . It combines many of my modules together into an integrated system and is the distillation of all the best-practices I've discovered from building numerous REST API services over the years. With Chouette you can build an asynchronous server that uses AnyEvent::Task to delegate blocking tasks such as DB access and password hashing, Callback::Frame for error handling, and Log::Defer for structured logging. In this talk we'll build an authentication service from scratch with Chouette and discuss some of its cool features along the way. Better test suites with Log Periodic Does your test suite take too long? Are you plagued by flaky, unreliable tests? Feeling guilty about those sleep commands everywhere? My company Log Periodic is rolling out a new product that helps solve these problems and more. This talk will show a demo of the technology and discuss the magic sauce that makes it work. -- dave.s.doyle at gmail.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From arocker at Vex.Net Mon Jun 19 12:17:48 2017 From: arocker at Vex.Net (arocker at Vex.Net) Date: Mon, 19 Jun 2017 15:17:48 -0400 Subject: [tpm] Does this affect us? Message-ID: <1f5be87d0993da24f6ac6be02a376234.squirrel@webmail.vybenetworks.com> http://log.perl.org/2017/06/pmorg-group-hosting-shutting-down.html From dave.s.doyle at gmail.com Mon Jun 19 12:21:48 2017 From: dave.s.doyle at gmail.com (Gmail) Date: Mon, 19 Jun 2017 15:21:48 -0400 Subject: [tpm] Does this affect us? In-Reply-To: <1f5be87d0993da24f6ac6be02a376234.squirrel@webmail.vybenetworks.com> References: <1f5be87d0993da24f6ac6be02a376234.squirrel@webmail.vybenetworks.com> Message-ID: <439D12DF-D3A9-4DDA-9C3D-DD73507C91CD@gmail.com> Shouldn't. We're not hosting on pm.org's infra. DNS is pointing to Indy's servers. -- dave.s.doyle at gmail.com Sent from my iPhone > On Jun 19, 2017, at 3:17 PM, arocker at Vex.Net wrote: > > > http://log.perl.org/2017/06/pmorg-group-hosting-shutting-down.html > _______________________________________________ > toronto-pm mailing list > toronto-pm at pm.org > http://mail.pm.org/mailman/listinfo/toronto-pm From abuzar at abuzar.com Sat Jun 24 05:42:10 2017 From: abuzar at abuzar.com (Abuzar) Date: Sat, 24 Jun 2017 08:42:10 -0400 Subject: [tpm] OT: Downloading Microsoft Leaks... anyone doing this? Able to do this? Message-ID: Hey all, I saw this on slashdot this morning: https://tech.slashdot.org/story/17/06/23/2126239/32tb-of-windows-10-internal-builds-core-source-code-leak-online Anyone have the computing resources (hard drive space) to download this? I'd be really interested to see the source code for windows. Unfortunately, I don't have an 8TB drive or the bandwidth, nor have I done something like this... is it just like downloading movies or something, but a far larger file?! If anyone knows about this stuff and/or has the ability to grab it, I'd love to hear from you! Abuzar -------------- next part -------------- An HTML attachment was scrubbed... URL: From legrady at gmail.com Sat Jun 24 07:58:32 2017 From: legrady at gmail.com (Tom Legrady) Date: Sat, 24 Jun 2017 10:58:32 -0400 Subject: [tpm] OT: Downloading Microsoft Leaks... anyone doing this? Able to do this? In-Reply-To: References: Message-ID: How to get your ISP to throttle your bandwidth, or shut you down completely :-) On 2017-06-24 08:42 AM, Abuzar wrote: > Hey all, > > I saw this on slashdot this morning: > https://tech.slashdot.org/story/17/06/23/2126239/32tb-of-windows-10-internal-builds-core-source-code-leak-online > > Anyone have the computing resources (hard drive space) to download > this? I'd be really interested to see the source code for windows. > Unfortunately, I don't have an 8TB drive or the bandwidth, nor have I > done something like this... is it just like downloading movies or > something, but a far larger file?! > > If anyone knows about this stuff and/or has the ability to grab it, > I'd love to hear from you! > > Abuzar > > > _______________________________________________ > toronto-pm mailing list > toronto-pm at pm.org > http://mail.pm.org/mailman/listinfo/toronto-pm -------------- next part -------------- An HTML attachment was scrubbed... URL: From shlomif at shlomifish.org Sat Jun 24 11:33:09 2017 From: shlomif at shlomifish.org (Shlomi Fish) Date: Sat, 24 Jun 2017 21:33:09 +0300 Subject: [tpm] OT: Downloading Microsoft Leaks... anyone doing this? Able to do this? In-Reply-To: References: Message-ID: <20170624213309.2a10d32c@telaviv1.shlomifish.org> Hi! On Sat, 24 Jun 2017 10:58:32 -0400 Tom Legrady wrote: > How to get your ISP to throttle your bandwidth, or shut you down > completely :-) > In addition, see https://www.reddit.com/r/explainlikeimfive/comments/2j7mil/eli5_if_someone_leaked_the_entirety_of_windows/ - there was also a slashdot notice about the older windows 2000 source code leak, which open source developers didn't look at for fear of becoming tainted. so I Suggest you avoid reading it. https://en.wikipedia.org/wiki/IANAL and all that jazz. -- Shlomi > > On 2017-06-24 08:42 AM, Abuzar wrote: > > Hey all, > > > > I saw this on slashdot this morning: > > https://tech.slashdot.org/story/17/06/23/2126239/32tb-of-windows-10-internal-builds-core-source-code-leak-online > > > > Anyone have the computing resources (hard drive space) to download > > this? I'd be really interested to see the source code for windows. > > Unfortunately, I don't have an 8TB drive or the bandwidth, nor have I > > done something like this... is it just like downloading movies or > > something, but a far larger file?! > > > > If anyone knows about this stuff and/or has the ability to grab it, > > I'd love to hear from you! > > > > Abuzar > > > > > > _______________________________________________ > > toronto-pm mailing list > > toronto-pm at pm.org > > http://mail.pm.org/mailman/listinfo/toronto-pm > -- ----------------------------------------------------------------- Shlomi Fish http://www.shlomifish.org/ http://www.shlomifish.org/lecture/C-and-CPP/bad-elements/ Chuck Norris gave Richard III a horse in exchange for his kingdom. Norris is now the king of England. ? http://www.shlomifish.org/humour/bits/facts/Chuck-Norris/ Please reply to list if it's a mailing list post - http://shlom.in/reply . From tpm.ats at spamgourmet.com Mon Jun 26 09:49:04 2017 From: tpm.ats at spamgourmet.com (Antonio Sun) Date: Mon, 26 Jun 2017 12:49:04 -0400 Subject: [tpm] OT: Downloading Microsoft Leaks... anyone doing this? Able to do this? In-Reply-To: References: Message-ID: > > It isn't that interesting. Save the space. They are mostly just builds > from the Insider Program, according to the folks with the actual data. > There doesn't actually appear to be any source code in the files. Multiple > parties have gone over it. It's just Insider Program builds, some tools > (that may be handy - as they are special debuggers I guess), and a whole > mess of internal nightly builds. > I haven't downloaded the files, but those who have checked it out are > people that I'm inclined to trust - based on history. It's largely nothing. On Sat, Jun 24, 2017 at 8:42 AM, Abuzar - abuzar at abuzar.com < tpm.ats.f916139d4d.abuzar#abuzar.com at ob.0sg.net> wrote: > Hey all, > > I saw this on slashdot this morning: > https://tech.slashdot.org/story/17/06/23/2126239/32tb- > of-windows-10-internal-builds-core-source-code-leak-online > > Anyone have the computing resources (hard drive space) to download this? > I'd be really interested to see the source code for windows. > Unfortunately, I don't have an 8TB drive or the bandwidth, nor have I done > something like this... is it just like downloading movies or something, but > a far larger file?! > > If anyone knows about this stuff and/or has the ability to grab it, I'd > love to hear from you! > > Abuzar > > _______________________________________________ > toronto-pm mailing list > toronto-pm at pm.org > http://mail.pm.org/mailman/listinfo/toronto-pm > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dave.s.doyle at gmail.com Wed Jun 28 15:42:05 2017 From: dave.s.doyle at gmail.com (Dave Doyle) Date: Wed, 28 Jun 2017 18:42:05 -0400 Subject: [tpm] Meeting June 28 (Wed) In-Reply-To: References: Message-ID: Tonights hangout link: https://hangouts.google.com/hangouts/_/ytl/fvqtkkfsEBYGJNLduxBTJpYRN9Um7jlgazxKRw4ZoQU=?hl=en_US -- dave.s.doyle at gmail.com On 19 June 2017 at 11:10, Dave Doyle wrote: > My apologies for the Wednesday but for the summer, can't do Thursdays. > > Please RSVP here: > > https://www.meetup.com/Toronto-Perl-Mongers/events/240909896/ > > Doug Hoyte is coming in and giving us a double header of talks: > > REST APIs with Chouette > > Chouette is a perl framework for building REST APIs . It combines many of > my modules together into an integrated system and is the distillation of > all the best-practices I've discovered from building numerous REST API > services over the years. With Chouette you can build an asynchronous server > that uses AnyEvent::Task to delegate blocking tasks such as DB access and > password hashing, Callback::Frame for error handling, and Log::Defer for > structured logging. In this talk we'll build an authentication service from > scratch with Chouette and discuss some of its cool features along the way. > > Better test suites with Log Periodic > > Does your test suite take too long? Are you plagued by flaky, unreliable > tests? Feeling guilty about those sleep commands everywhere? My company Log > Periodic is rolling out a new product that helps solve these problems and > more. This talk will show a demo of the technology and discuss the magic > sauce that makes it work. > -- > dave.s.doyle at gmail.com > -------------- next part -------------- An HTML attachment was scrubbed... URL: From jkeenan at pobox.com Wed Jun 28 18:22:36 2017 From: jkeenan at pobox.com (James E Keenan) Date: Wed, 28 Jun 2017 21:22:36 -0400 Subject: [tpm] Meeting June 28 (Wed) In-Reply-To: References: Message-ID: On 06/28/2017 06:42 PM, Dave Doyle wrote: > Tonights hangout link: > > https://hangouts.google.com/hangouts/_/ytl/fvqtkkfsEBYGJNLduxBTJpYRN9Um7jlgazxKRw4ZoQU=?hl=en_US > Damn! I had it on my calendar but (was so busy debugging that I) completely forgot about this. Any slides or youtube? Thanks. jimk From dave.s.doyle at gmail.com Wed Jun 28 19:01:38 2017 From: dave.s.doyle at gmail.com (Gmail) Date: Wed, 28 Jun 2017 22:01:38 -0400 Subject: [tpm] Meeting June 28 (Wed) In-Reply-To: References: Message-ID: It will be posted to YouTube! As usual, Doug delivered some great talks. -- dave.s.doyle at gmail.com Sent from my iPhone > On Jun 28, 2017, at 9:22 PM, James E Keenan wrote: > >> On 06/28/2017 06:42 PM, Dave Doyle wrote: >> Tonights hangout link: >> https://hangouts.google.com/hangouts/_/ytl/fvqtkkfsEBYGJNLduxBTJpYRN9Um7jlgazxKRw4ZoQU=?hl=en_US > > Damn! I had it on my calendar but (was so busy debugging that I) completely forgot about this. > > Any slides or youtube? > > Thanks. > jimk > _______________________________________________ > toronto-pm mailing list > toronto-pm at pm.org > http://mail.pm.org/mailman/listinfo/toronto-pm From legrady at gmail.com Wed Jun 28 19:41:45 2017 From: legrady at gmail.com (Tom Legrady) Date: Wed, 28 Jun 2017 22:41:45 -0400 Subject: [tpm] Reading config data from shell files In-Reply-To: References: Message-ID: Several places I've worked have had really old code that was a mix of Perl & shell scripts, and they used a shell file to do configuration. You can set files to be run prior to each autosys command, but in cron it doesn't. While some people use '. /path/to/configfile;' as the prefix to every command, that's rather cumbersome. Some people always invoke a shell wrapper script which loads the config, and then runs the program. I was thinking of parsing the config file with Perl, but that seemed too much work. Then it occured to me I could run the file in a sub-shell, finishing with the command 'env', and it would report everything it had found. I'm considering packaging this up as Config::Shell, since it's a Config reader. Does anyone know any existing solutions that would save my typing? Tom From dave.s.doyle at gmail.com Thu Jun 29 06:37:25 2017 From: dave.s.doyle at gmail.com (Dave Doyle) Date: Thu, 29 Jun 2017 09:37:25 -0400 Subject: [tpm] Meeting June 28 (Wed) In-Reply-To: References: Message-ID: All the talks from last night (With a bonus talk!) Chouette https://www.youtube.com/watch?v=ZFjszODsKJw Log Periodic https://www.youtube.com/watch?v=ZFjszODsKJw&t25m50s The Evolution of the Telegraph key https://www.youtube.com/watch?v=ZFjszODsKJw&t=47m18s -- dave.s.doyle at gmail.com On 28 June 2017 at 22:01, Gmail wrote: > It will be posted to YouTube! As usual, Doug delivered some great talks. > > -- > dave.s.doyle at gmail.com > > Sent from my iPhone > > > On Jun 28, 2017, at 9:22 PM, James E Keenan wrote: > > > >> On 06/28/2017 06:42 PM, Dave Doyle wrote: > >> Tonights hangout link: > >> https://hangouts.google.com/hangouts/_/ytl/ > fvqtkkfsEBYGJNLduxBTJpYRN9Um7jlgazxKRw4ZoQU=?hl=en_US > > > > Damn! I had it on my calendar but (was so busy debugging that I) > completely forgot about this. > > > > Any slides or youtube? > > > > Thanks. > > jimk > > _______________________________________________ > > toronto-pm mailing list > > toronto-pm at pm.org > > http://mail.pm.org/mailman/listinfo/toronto-pm > -------------- next part -------------- An HTML attachment was scrubbed... URL: