# getting-started
Hi all - I'm attempting to work with a fairly large XML file (here, "fairly large" means approximately 48 GB), and I would appreciate some help with terminology and approaches. Since the XML is so large, I was hoping to use a StAX processor to handle it in chunks and serialize/write out sub-trees, e.g.
<root>
  <item>
    <thing>abc</thing>
    <part>def</part>
  </item>
  <item>
    <thing>fgh</thing>
    <part>ijk</part>
  </item>
  <!-- ... repeat -->
</root>
Edit: Apologies, I hit Enter too quickly. When processing the stream, I would like to serialize each <item> to a separate file. My questions revolve around marshalling/unmarshalling XML: is that something I need to work into my approach, or would Jackson/Woodstox be sufficient on its own? Thanks in advance for your time! Edit, part II: 🤦 the context is Kotlin on the JVM, on desktop.
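To make the goal concrete, here is a minimal sketch of the splitting step using only the JDK's built-in StAX event API (javax.xml.stream). It copies each <item> sub-tree event by event into its own file, so no marshalling or object binding is involved. The function name splitItems and the item-N.xml file naming are made up for illustration. As far as I understand, if Woodstox is on the classpath the same factory lookups should pick it up as the StAX provider, but treat that as an assumption to verify. Namespaces declared on <root> are not re-declared on the split-out elements in this sketch.

```kotlin
import java.io.InputStream
import java.nio.file.Files
import java.nio.file.Path
import javax.xml.stream.XMLInputFactory
import javax.xml.stream.XMLOutputFactory

// Hypothetical helper: streams the input and writes every <elementName> sub-tree
// to its own file in outDir. Returns the number of files written.
fun splitItems(input: InputStream, outDir: Path, elementName: String = "item"): Int {
    val inputFactory = XMLInputFactory.newInstance().apply {
        // Turn off DTDs and external entities; not needed for plain splitting.
        setProperty(XMLInputFactory.SUPPORT_DTD, false)
        setProperty(XMLInputFactory.IS_SUPPORTING_EXTERNAL_ENTITIES, false)
    }
    val outputFactory = XMLOutputFactory.newInstance()
    Files.createDirectories(outDir)

    val reader = inputFactory.createXMLEventReader(input)
    var count = 0
    try {
        while (reader.hasNext()) {
            val event = reader.nextEvent()
            // Ignore everything until the next opening tag with the wanted name.
            if (!event.isStartElement || event.asStartElement().name.localPart != elementName) continue

            count++
            Files.newOutputStream(outDir.resolve("$elementName-$count.xml")).use { out ->
                val writer = outputFactory.createXMLEventWriter(out, "UTF-8")
                writer.add(event)
                var depth = 1 // we are inside the element that was just opened
                while (depth > 0) {
                    val inner = reader.nextEvent()
                    if (inner.isStartElement) depth++ else if (inner.isEndElement) depth--
                    writer.add(inner)
                }
                writer.close() // flushes the writer; `use` closes the underlying stream
            }
        }
    } finally {
        reader.close()
    }
    return count
}
```

Only one sub-tree is held by the reader/writer pair at a time, so memory use should depend on the size of a single <item> rather than on the 48 GB total. For a real file, the call would look something like `Files.newInputStream(path).buffered().use { splitItems(it, outDir) }`.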