Consider a user who needs the text of a docx document, without the headers and footers, for processing in R.
If file.docx is renamed to file.zip and the document.xml file inside it is examined, it turns out to be a well-formed XML document containing the text.
Did Microsoft (or another developer) publish a schema for this document.xml subfile in the ZIP package of a docx file?
The file looks like this:
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<w:document xmlns:wpc="http://schemas.microsoft.com/office/word/2010/wordprocessingCanvas" xmlns:mc="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships" xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math" xmlns:v="urn:schemas-microsoft-com:vml" xmlns:wp14="http://schemas.microsoft.com/office/word/2010/wordprocessingDrawing" xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main" xmlns:w14="http://schemas.microsoft.com/office/word/2010/wordml" xmlns:wpg="http://schemas.microsoft.com/office/word/2010/wordprocessingGroup" xmlns:wpi="http://schemas.microsoft.com/office/word/2010/wordprocessingInk" xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml" xmlns:wps="http://schemas.microsoft.com/office/word/2010/wordprocessingShape" mc:Ignorable="w14 wp14">
  <w:body>
    <w:p w:rsidR="00F447D7" w:rsidRPr="00C63308" w:rsidRDefault="00F447D7">
      <w:pPr>
SchemaViewer is a free program that displays XSD files in a tree format, which makes them easier to read than in a plain text editor such as Notepad. XSD files can also be opened with Microsoft Visual Studio, XML Notepad, EditiX XML Editor, Progress Stylus Studio, and XMLSpy.
What is XML Schema Definition (XSD)? XML Schema Definition or XSD is a recommendation by the World Wide Web Consortium (W3C) to describe and validate the structure and content of an XML document. It is primarily used to define the elements, attributes and data types the document can contain.
From Wikipedia:
The format was initially standardised by Ecma (as ECMA-376) and, in later versions, by ISO and IEC (as ISO/IEC 29500).
You can find various versions of the XSD in the ECMA-376 downloads.
document.xml conforms to the WordprocessingML part of the schemas (look for wml.xsd).
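For the original goal (body text without headers and footers), the schema is not strictly required. Headers and footers are stored in separate parts of the package (for example word/header1.xml and word/footer1.xml), so reading only the main document part already leaves them out. The following is a minimal sketch in Python using only the standard library; the function name `docx_body_text` is made up for illustration, and it assumes the main part sits at the conventional path word/document.xml. The same approach should carry over to R with any ZIP and XML reader.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace, as declared by xmlns:w in document.xml
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_body_text(docx_file):
    """Return the body text of a .docx as a list, one string per paragraph.

    Hypothetical helper for illustration. Assumes the main document part
    is stored at word/document.xml (the usual location, though the package
    relationships are what formally define it).
    """
    with zipfile.ZipFile(docx_file) as package:
        xml_bytes = package.read("word/document.xml")

    root = ET.fromstring(xml_bytes)
    paragraphs = []
    # Every <w:p> is a paragraph; its visible text lives in <w:t> elements.
    for paragraph in root.iter(f"{{{W_NS}}}p"):
        pieces = [t.text or "" for t in paragraph.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(pieces))
    return paragraphs
```

Calling `docx_body_text("file.docx")` returns the paragraphs in document order. This is a simplification: tabs, line breaks, footnotes, and paragraphs nested inside text boxes are not treated specially, so a real document may need extra handling.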
I think this might be the location: http://msdn.microsoft.com/en-us/library/hh643329(v=office.12).aspx
This is version 5.2. On this page you can find a link to version 5.1.