Remove Duplicate Line in Vim?

Tags:

regex

vim

xml

I'm trying to use VIM to remove a duplicate line in an XML file I created. (I can't recreate the file because the ID numbers will change.)

The file looks something like this:

    <tag k="natural" v="water"/>
    <tag k="nhd:fcode" v="39004"/>
    <tag k="natural" v="water"/>

I'm trying to remove one of the duplicate k="natural" v="water" lines. When I try to use the \_ modifier to include newlines in my regex replaces, VIM doesn't seem to find anything.

Any tips on what regex or tool to use?

asked Dec 13 '09 15:12

magneticMonster

1 Answer

First of all, you can use awk to remove all duplicate lines, keeping their order.

:%!awk '\!_[$0]++'
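
To see what the filter does outside Vim, here is a minimal shell sketch of the same awk idiom. The function name `dedupe_lines` and the array name `seen` (in place of `_`) are illustrative choices, not part of the original command. The backslash before `!` is only needed on Vim's `:!` command line, where a bare `!` is replaced by the previous shell command; inside single quotes in a shell script it can be dropped.

```shell
# Print each distinct line once, in first-seen order.
# seen[$0]++ evaluates to 0 (false) the first time a line appears and to a
# positive count afterwards, so !seen[$0]++ is true only for the first copy.
dedupe_lines() {
  awk '!seen[$0]++'
}

# Sample input taken from the question: the second "water" line is dropped.
printf '%s\n' \
  '<tag k="natural" v="water"/>' \
  '<tag k="nhd:fcode" v="39004"/>' \
  '<tag k="natural" v="water"/>' | dedupe_lines
```

Because awk only remembers lines it has already seen, the first occurrence always survives and the relative order of the remaining lines is unchanged.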

If you're not sure whether the file contains other duplicate lines that you don't want to remove, just add conditions.

:%!awk '\!(_[$0]++ && /tag/ && /natural/ && /water/)'
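
The conditional version can be checked the same way. This is a sketch under the same assumptions as before: `dedupe_water_tags` and `seen` are hypothetical names, and the `<x/>` lines are made-up sample input standing in for any other repeated line.

```shell
# Drop a line only if it has been seen before AND matches all three patterns.
# Every other line, repeated or not, is printed unchanged.
dedupe_water_tags() {
  awk '!(seen[$0]++ && /tag/ && /natural/ && /water/)'
}

# The repeated "water" tag is removed; the repeated <x/> line is kept.
printf '%s\n' \
  '<tag k="natural" v="water"/>' \
  '<x/>' \
  '<tag k="natural" v="water"/>' \
  '<x/>' | dedupe_water_tags
```

Note that the three patterns match anywhere on the line, so any repeated line containing `tag`, `natural`, and `water` is affected, whatever attributes those words appear in.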

That said, parsing a nested structure like XML with regular expressions is a bad idea, IMHO: you have to take care every time that the markup doesn't get mangled. xmllint can list specific elements in the file:

:!echo "cat //tag[@k='natural' and @v='water']" | xmllint --shell %

You can then delete the duplicate lines one at a time.

answered Sep 19 '22 03:09

ernix
