I have a xml like below <pre class="prettyprint"><code><root> <FIToFICstmrDrctDbt> <GrpHdr> <MsgId>A</MsgId> <CreDtTm>2001-12-17T09:30:47</CreDtTm> <NbOfTxs>0</NbOfTxs> <TtlIntrBkSttlmAmt Ccy="EUR">0.0</TtlIntrBkSttlmAmt> <IntrBkSttlmDt>1967-08-13</IntrBkSttlmDt> <SttlmInf> <SttlmMtd>CLRG</SttlmMtd> <ClrSys> <Prtry>xx</Prtry> </ClrSys> </SttlmInf> <InstgAgt> <FinInstnId> <BIC>AAAAAAAAAAA</BIC> </FinInstnId> </InstgAgt> </GrpHdr> </FIToFICstmrDrctDbt> </root> </code></pre> I need to extract the value of each tag value in separate variables using awk command. how to do it?

You can use <code>awk</code> as shown below, however, this is NOT a robust solution and will fail if the xml is not formatted correctly e.g. if there are multiple elements on the same line. <pre class="prettyprint"><code>$ dt=$(awk -F '[<>]' '/IntrBkSttlmDt/{print $3}' file) $ echo $dt 1967-08-13 </code></pre> I suggest you use a proper xml processing tool, like <code>xmllint</code>. <pre class="prettyprint"><code>$ dt=$(xmllint --shell file <<< "cat //IntrBkSttlmDt/text()" | grep -v "^/ >") $ echo $dt 1967-08-13 </code></pre>

Extract xml tag value using awk command

Tags:

shell

unix

aix

xml

awk

I have a xml like below

<root>    
<FIToFICstmrDrctDbt>
            <GrpHdr>
                <MsgId>A</MsgId>
                <CreDtTm>2001-12-17T09:30:47</CreDtTm>
                <NbOfTxs>0</NbOfTxs>
                <TtlIntrBkSttlmAmt Ccy="EUR">0.0</TtlIntrBkSttlmAmt>
                <IntrBkSttlmDt>1967-08-13</IntrBkSttlmDt>
                <SttlmInf>
                    <SttlmMtd>CLRG</SttlmMtd>
                    <ClrSys>
                        <Prtry>xx</Prtry>
                    </ClrSys>
                </SttlmInf>
                <InstgAgt>
                    <FinInstnId>
                        <BIC>AAAAAAAAAAA</BIC>
                    </FinInstnId>
                </InstgAgt>
            </GrpHdr>
    </FIToFICstmrDrctDbt>
</root>

I need to extract the value of each tag value in separate variables using awk command. how to do it?

465

asked Dec 27 '12 11:12

user1929905

2 Answers

You can use awk as shown below, however, this is NOT a robust solution and will fail if the xml is not formatted correctly e.g. if there are multiple elements on the same line.

$ dt=$(awk -F '[<>]' '/IntrBkSttlmDt/{print $3}' file)
$ echo $dt
1967-08-13

I suggest you use a proper xml processing tool, like xmllint.

$ dt=$(xmllint --shell file <<< "cat //IntrBkSttlmDt/text()" | grep -v "^/ >")
$ echo $dt
1967-08-13

114

answered Sep 27 '22 20:09

dogbane

The following gawk command uses a record separator regex pattern to match the XML tags. Anything starting with a < followed by at least one non-> and terminated by a > is considered to be a tag. Gawk assigns each RS match into the RT variable. Anything between the tags will be parsed as the record text which gawk assigns to $0.

gawk 'BEGIN { RS="<[^>]+>" } { print RT, $0 }' myfile

answered Sep 27 '22 21:09

Michael Hamilton

Related questions
                            
                                Challenge: Can you make this simple function more elegant using C# 4.0
                            
                                getChildNodes giving unexpected result
                            
                                Find position of parent node using xpath
                            
                                using registerShutdownHook() in the Spring Framework
                            
                                How to inflate Android View in LinearLayout class?
                            
                                Python XML Parsing without root
                            
                                Find and Increment a Number in an XML File
                            
                                Way to parse XML (org.w3c.Document) on Android
                            
                                Any reason not to use XmlSerializer?
                            
                                Python: Unicode and ElementTree.parse
                            
                                Xml node reading for each loop
                            
                                Magento. Insert block into another without change template code
                            
                                Android Studio not identifying xml file as layout file
                            
                                Class not found when unmarshalling: android.support.v7.widget.Toolbar$SavedState
                            
                                How can I strip invalid XML characters from strings in Perl?
                            
                                Validating jdoconfig with incorrect url
                            
                                Spring xml problem
                            
                                How to check for string equality case insensitive in xsl
                            
                                XPath to locate a cell with specific text parsing HTML tables
                            
                                converting from xml name-values into simple hash

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With