Deserializing a single element in a large XML document: xmlSerializer.Deserialize(xmlReader.ReadSubtree()) fails due to namespace issues

I am attempting to process a large XML document in a single pass with an XmlReader, and to deserialize only certain elements in it using an XmlSerializer.

Below is some code and a tiny mock XML document showing how I have attempted to do this.

Rationale for using XmlReader:

1. I am dealing with very large XML documents (10–250 MB) that I do not want to load into memory, so XmlDocument is out of the question.
2. I want to extract only certain elements and will typically be able to ignore most other content. XmlReader appears to give me an efficient means of skipping irrelevant content.
3. I do not know in advance which of the elements I can handle will actually be present. Because of the file sizes I want to make only a single pass over each file, which rules out running a series of XPath/XQuery or LINQ to XML queries.

public class ElementOfInterest { }
…

var xml = @"<?xml version='1.0' encoding='utf-8' ?>
            <Root xmlns:ex='urn:stakx:example'
                  xmlns:xsi='http://www.w3.org/2001/XMLSchema-instance'>
              <ElementOfInterest xsi:type='ex:ElementOfInterest' />
            </Root>";

var reader = System.Xml.XmlReader.Create(new System.IO.StringReader(xml));
reader.ReadToFollowing("ElementOfInterest");

var serializer = new System.Xml.Serialization.XmlSerializer(typeof(ElementOfInterest));
serializer.Deserialize(reader.ReadSubtree());

The last line of code throws the following inner exception:

InvalidOperationException: "Namespace prefix ex is not defined."

Obviously, the XmlSerializer doesn't recognise the ex namespace prefix inside the xsi:type attribute's value.
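One way to narrow down where the prefix stops resolving is to ask both readers directly. The following is only a diagnostic sketch, not a fix; the class name PrefixCheck is made up for illustration, and it relies on XmlReader.LookupNamespace returning null when a prefix is not in scope:

```csharp
using System;
using System.IO;
using System.Xml;

public static class PrefixCheck
{
    public static void Main()
    {
        // Same mock document as in the question.
        var xml = @"<?xml version='1.0' encoding='utf-8' ?>
                    <Root xmlns:ex='urn:stakx:example'
                          xmlns:xsi='http://www.w3.org/2001/XMLSchema-instance'>
                      <ElementOfInterest xsi:type='ex:ElementOfInterest' />
                    </Root>";

        using (var reader = XmlReader.Create(new StringReader(xml)))
        {
            reader.ReadToFollowing("ElementOfInterest");

            // Ask the outer reader whether "ex" is in scope at this element.
            Console.WriteLine("outer:   " + (reader.LookupNamespace("ex") ?? "(not in scope)"));

            using (var subtree = reader.ReadSubtree())
            {
                // A subtree reader starts in the Initial state; move onto the element first.
                subtree.MoveToContent();
                Console.WriteLine("subtree: " + (subtree.LookupNamespace("ex") ?? "(not in scope)"));
            }
        }
    }
}
```

If both lines print the namespace URI, the prefix itself is visible to the reader, and the problem more likely lies in how the serializer maps the xsi:type name to a .NET type.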

This is just one error I am running into; the larger problem is that I have no idea how to handle namespaces in this scenario. I am simply looking for a convenient way to deserialize a single node out of the XML document, but that seems to entail manually registering and managing namespaces, and somehow forwarding them from the XmlReader to the XmlSerializer.

Can someone demonstrate how to deserialize a single node from an XML document read with an XmlReader, either by pointing out the error in my code, or by showing an alternative approach?

asked Jan 27 '15 23:01

stakx - no longer contributing

1 Answer

The following works:

using System.IO;
using System.Xml;
using System.Xml.Serialization;

static void Main()
{
    var xml = @"<?xml version='1.0' encoding='utf-8' ?>
                <Root
                  xmlns:xsi='http://www.w3.org/2001/XMLSchema-instance'
                  xmlns:ex='urn:stakx:example'
                >
                  <ex:ElementOfInterest xsi:type='ex:ElementOfInterest' />
                </Root>";

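    // Pre-register the "ex" prefix in a parser context, so that the reader can
    // resolve it when the serializer reads xsi:type='ex:ElementOfInterest'.
    var nt = new NameTable();
    var mgr = new XmlNamespaceManager(nt);
    mgr.AddNamespace("ex", "urn:stakx:example");
    var ctxt = new XmlParserContext(nt, mgr, "", XmlSpace.Default);
    var reader = XmlReader.Create(new StringReader(xml), null, ctxt);
    var serializer = new XmlSerializer(typeof(ElementOfInterest));

    // Match on local name and namespace URI, since the element is namespace-qualified.
    reader.ReadToFollowing("ElementOfInterest", "urn:stakx:example");
    // Hand only the current element's subtree to the serializer.
    var eoi = (ElementOfInterest)serializer.Deserialize(reader.ReadSubtree());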
}

[XmlRoot(Namespace = "urn:stakx:example")]
public class ElementOfInterest { }

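Note that the element in the input is now namespace-qualified: <ex:ElementOfInterest>. Its namespace matches both the [XmlRoot] namespace declared on the class and the namespace URI passed to ReadToFollowing.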
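Since the question is about a single pass over a large file, the same setup can be put in a loop. This is a minimal sketch rather than a tested solution for the asker's real documents: it assumes every element of interest is named ElementOfInterest in the urn:stakx:example namespace, and SinglePassExample and ReadAll are hypothetical names introduced here.

```csharp
using System.Collections.Generic;
using System.IO;
using System.Xml;
using System.Xml.Serialization;

[XmlRoot(Namespace = "urn:stakx:example")]
public class ElementOfInterest { }

public static class SinglePassExample
{
    private const string Ns = "urn:stakx:example";

    // Reads the input once, front to back, and deserializes every matching element.
    public static List<ElementOfInterest> ReadAll(TextReader input)
    {
        // Same parser context as above: the "ex" prefix is registered up front.
        var nt = new NameTable();
        var mgr = new XmlNamespaceManager(nt);
        mgr.AddNamespace("ex", Ns);
        var ctxt = new XmlParserContext(nt, mgr, "", XmlSpace.Default);

        var serializer = new XmlSerializer(typeof(ElementOfInterest));
        var results = new List<ElementOfInterest>();

        using (var reader = XmlReader.Create(input, null, ctxt))
        {
            // ReadToFollowing advances before it tests, so each call moves on
            // to the next matching element and skips everything in between.
            while (reader.ReadToFollowing("ElementOfInterest", Ns))
            {
                // Disposing the subtree reader leaves the outer reader at the
                // end of the element that was just deserialized.
                using (var subtree = reader.ReadSubtree())
                {
                    results.Add((ElementOfInterest)serializer.Deserialize(subtree));
                }
            }
        }

        return results;
    }
}
```

For a file on disk, something like ReadAll(new StreamReader(path)) would keep memory use bounded by the size of one element rather than the whole document. Collecting into a list is only for illustration; for very many elements it may be preferable to process each object inside the loop instead.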

answered Oct 19 '22 21:10

Tomalak

Tags: c#, xml, xml-namespaces, xmlserializer, xmlreader