[Chicago-talk] Building a hierarchy

Brian Katzung briank at kappacs.com
Sat Mar 19 07:42:11 PDT 2011


Yes, I would probably parse UL#foldinglist (all of them! I guess the 
author doesn't quite understand the id attribute) from 
http://codelists.wpc-edi.com/wpc_taxonomy.asp.

   - Brian

On 2011-03-19 09:00, Jay Strauss wrote:
> Hi sorry for the delay.  I know everyone is hanging on this thread 
> with baited breath.
>
> I was trying to parse the data on: http://www.wpc-edi.com/codes/taxonomy
>
> I initially cut/paste into word to change the hierarchy, so I can 
> probably reformat into something without the potential of dupes.  I 
> looked at the source of the webpage.  Maybe there is a way I can parse 
> the page directly for the hierarchy.
>
> Thanks
> Jay
>
> On Wed, Mar 16, 2011 at 8:31 PM, Brian Katzung <briank at kappacs.com 
> <mailto:briank at kappacs.com>> wrote:
>
>     I would try very hard to find an alternative source data format
>     (perhaps with level-dependent indentation?) if at all possible
>     before spending a lot of time trying to parse this one because
>     it's ambiguous.
>
>     Consider an item on line "i." after an item on line "h.".
>     Depending on the type of the subsequent line, you may ("ii.") or
>     may not ("V.") be able to determine what type of line the "i."
>     item is.
>
>       - Brian
>
>
>     On 2011-03-16 11:31, Jay Strauss wrote:
>>     Hi,
>>
>>     I need to build a hierarchy out of some data to load into a
>>     RDBMS.  The data looks like below.  I need to convert it to more
>>     like:
>>
>>     code, desc, parent_code
>>
>>     (where code is like "193200000X")
>>
>>     I'm struggling.
>>
>>     I think I could do this in a rigid manner by saying I have 4
>>     indexes or levels:
>>     upper case roman
>>     lower case alpha
>>     lower case roman
>>     numeric
>>
>>     and keeping track where I am, and I the parent one level above.
>>
>>     I'd like to do it flexibly, without having to know how many
>>     levels in advance (I get similarly structured data with # of
>>     levels and info from time to time).
>>
>>     But I don't know:
>>
>>     1) whats the best structure for this (I'm thinking an array of
>>     arrays)
>>     2) how to traverse the array without knowing my indexes, i.e. go
>>     one level up, go one level down
>>
>>     Can anyone suggest ways to skin this cat?
>>
>>     Thanks
>>     Jay
>>
>>
>>     I.Individual or Groups (of Individuals)
>>     a.Group
>>     i.Multi-Specialty  - 193200000X
>>     ii.
>>     Single Specialty  - 193400000X
>>     b.Allopathic & Osteopathic Physicians
>>     i.Allergy & Immunology - 207K00000X
>>     1.Allergy - 207KA0200X
>>     2.Clinical & Laboratory Immunology - 207KI0005X
>>     ii.Anesthesiology - 207L00000X
>>     1.Addiction Medicine - 207LA0401X
>>     2.Critical Care Medicine - 207LC0200X
>>     3.Hospice and Palliative Medicine - 207LH0002X
>>     4.Pain Medicine - 207LP2900X
>>     5.Pediatric Anesthesiology - 207LP3000X
>>     iii.
>>     Clinical Pharmacology - 208U00000X
>>     iv.Colon & Rectal Surgery - 208C00000X
>>     v.Dermatology - 207N00000X
>>     1.Clinical & Laboratory Dermatological Immunology - 207NI0002X
>>     2.Dermatopathology - 207ND0900X
>>     3.MOHS-Micrographic Surgery - 207ND0101X
>>     4.Pediatric Dermatology - 207NP0225X
>>     5.Procedural Dermatology - 207NS0135X
>>     vi.
>>     Electrodiagnostic Medicine - 204R00000X
>>     vii.
>>     Emergency Medicine - 207P00000X
>>
>>
>>     _______________________________________________
>>     Chicago-talk mailing list
>>     Chicago-talk at pm.org <mailto:Chicago-talk at pm.org>
>>     http://mail.pm.org/mailman/listinfo/chicago-talk
>
>     -- 
>     Brian Katzung, Kappa Computer Solutions, LLC
>     Leveraging UNIX, GNU/Linux, open source, and custom
>     software solutions for business and beyond
>     Phone: 877.367.8837 x1http://www.kappacs.com
>
>
>     _______________________________________________
>     Chicago-talk mailing list
>     Chicago-talk at pm.org <mailto:Chicago-talk at pm.org>
>     http://mail.pm.org/mailman/listinfo/chicago-talk
>
>

-- 
Brian Katzung, Kappa Computer Solutions, LLC
Leveraging UNIX, GNU/Linux, open source, and custom
software solutions for business and beyond
Phone: 877.367.8837 x1  http://www.kappacs.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.pm.org/pipermail/chicago-talk/attachments/20110319/0b328f67/attachment.html>


More information about the Chicago-talk mailing list