[Madrid-pm] Duda con un parsing

PEGG preiddy en gmail.com
Sab Mar 14 02:19:37 PDT 2009


Hola, en el trabajo me han pedido que haga un parsing para un fichero de
texto:
<RECORD 1>
Accession number:2862793
Title:Qualification of switchable metal-oxide arresters for a protective
level of 1.6 p.u. on Hydro-Quebec's 735 kV system
Authors:Latour, Y. (1); St-Jean, G. (1); Petit, A.; Huynh, N.
Author affiliation:(1) Inst. de Res. d'Hydro-Quebec, Varennes, Que., Canada
Serial title:IEEE Transactions on Power Delivery
Abbreviated serial title:IEEE Trans. Power Deliv. (USA)
Volume:PWRD-1
Issue:4
Publication date:Oct. 1986
Pages:184-93
Language:English
ISSN:0885-8977
CODEN:ITPDE5
Document type:Journal article (JA)
Country of publication:USA
Material Identity Number:J605-1986-004
Abstract:Results of a series of tests on 39 sections of metal-oxide surge
arresters from six manufacturers are reported. Ranging from 85 kV to 160 kV
in rated voltage, these sections were similar to the devices to be installed
on Hydro-Quebec's 735 kV power transmission system. The purpose of the
qualification tests was to assess the merits of this type of arrester as a
solution to the overvoltage problem encountered on the James Bay part of the
power grid following load-shedding operations. The test program was
therefore oriented in terms of demonstrating that the surge arresters
produce the following three features: they may fail by short circuit in a
totally safe manner, i.e. without explosion or fracture from thermal shock
to their housing; they will produce a switching surface protective level of
1.6 p.u. (per unit of rated voltage of a 735 kV system, e.g., 1 p.u.=425
kV<sub>rms</sub>, or 1 p.u.=600 kV<sub>crest</sub>) of crest system voltage
(960 kV at 5 kA crest); they will withstand a 60 Hz voltage of 1.2 p.u. (510
kV) for the maximum period of 30 s during which they may be switched onto
the power system
Number of references:2
Inspec controlled terms:power system protection† - †surge protection
Uncontrolled terms:switchable metal-oxide arresters† - †surge arresters† -
†power transmission system† - †qualification tests† - †overvoltage problem†
- †power grid† - †load-shedding† - †short circuit† - †switching surface
protective level† - †735 kV † - †85 to 160 kV
Inspec classification codes:B8140C Protection apparatus
Numerical data indexing:voltage 7.35E+05 V;voltage 8.5E+04 1.6E+05 V
Treatment:Practical (PRA)
Discipline:Electrical/Electronic engineering (B)

Me piden que el mismo se pueda tener en csv o ingresarlo directamente a una
BD relacional, he buscado documentación para no reinventar la rueda y he
visto que estan:

   - Text::CSV=> Pasa de csv a txt, imagino que se pueda hacer a la inversa.
   - Tie::CSV_File
   - Tie::Handle::CSV
   - DBD::CSV

Mi duda / problema está en que son más de 6 millones de registros en
ficheros de cientos de megas, no se exactamente el número de campos, cuando
un autor tenga un numero, ese se corresponde con su afiliación por lo que
deberian ir juntos en una tabla o fichero nuevo, pero siempre manteniendo
todo en una tabla principal.

¿Cual de estos creen que sea el más optimo para trabajar?
------------ próxima parte ------------
Se ha borrado un adjunto en formato HTML...
URL: <http://mail.pm.org/pipermail/madrid-pm/attachments/20090314/182211f5/attachment.html>


Más información sobre la lista de distribución Madrid-pm