Use sed or awk to fix date format

Q: Should I use sed or awk?

AWK, like sed, is a programming language that deals with large bodies of text. But while people use sed to process and modify text, people mostly use AWK as a tool for analysis and reporting. Like sed, AWK was first developed at Bell Labs in the 1970s.

Q: Which is faster, sed or awk?

Generally, I would say grep is the fastest and sed the slowest, although this depends on exactly what you are doing. I find awk much faster than sed. You can speed up grep with the -F option if you only need simple fixed strings rather than real regular expressions.
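As a small illustration of the -F point (a sketch only; the two input lines are made up for the example), the same pattern matches differently when it is read as a regular expression and when it is read as a fixed string:

```shell
# Made-up input: one line with a literal dot, one with an "x" in its place.
lines='30000.00
30000x00'

# As a regular expression, "." matches any character, so both lines match.
regex_hits=$(printf '%s\n' "$lines" | grep -c '30000.00')

# With -F the pattern is a literal string, so only the first line matches.
fixed_hits=$(printf '%s\n' "$lines" | grep -c -F '30000.00')

echo "regex: $regex_hits, fixed: $fixed_hits"
```

This shows only the difference in matching; any speed gain depends on the grep implementation and the input.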

Q: What do sed and awk do?

sed is a command-line utility that parses and transforms text using a simple, compact programming language. awk is a command-line utility designed for text processing; its programs are written as a series of pattern-action statements.
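To make the distinction concrete, here is a minimal sketch using two rows shaped like the sample data in the question below. The BATCH- replacement and the choice of column 4 for the total are assumptions made for illustration only:

```shell
# Two rows in the same shape as the question's sample data.
rows='500,2,13/09/2007,30000.00,12,B-1
501,2,15/09/2007,14000.00,8,B-2'

# sed: rewrite text as it streams through (swap the B- prefix for BATCH-).
edited=$(printf '%s\n' "$rows" | sed 's/B-/BATCH-/')

# awk: split each line into comma-separated fields and report a total of column 4.
total=$(printf '%s\n' "$rows" | awk -F, '{ sum += $4 } END { printf "%.2f\n", sum }')

printf '%s\n' "$edited"
echo "total: $total"
```

sed changes each line and passes it on; awk reads fields and produces a summary.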

Tags: regex, bash, sed, awk

I'm trying to convert an HTML file containing a table to a .csv file using a bash script.

So far I've accomplished the following steps:

1. Convert to Unix format (with dos2unix)
2. Remove all spaces and tabs (with sed 's/[ \t]//g')
3. Remove all the blank lines (with sed ':a;N;$!ba;s/\n//g') (this is necessary because the HTML file has a blank line for each cell of the table... that's not my fault)
4. Remove the unnecessary <td> and <tr> tags (with sed 's/<t.>//g')
5. Replace </td> with ',' (with sed 's/<\/td>/,/g')
6. Replace </tr> with end-of-line (\n) characters (with sed 's/<\/tr>/\n/g')
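Chained together, the steps above might look like the following sketch. It assumes GNU sed (for \t, for \n in the replacement, and for the semicolon-separated :a;N;$!ba loop). The function name html_table_to_csv is hypothetical, and tr -d '\r' stands in for dos2unix so the sketch has no extra dependency. The last two patterns match the full closing tags, including the >.

```shell
# Hypothetical helper: flatten a simple HTML table (only <tr>/<td> tags) to CSV.
# Assumes GNU sed; tr -d '\r' is used here in place of dos2unix.
html_table_to_csv() {
  tr -d '\r' < "$1" |            # 1. DOS line endings -> Unix
    sed 's/[ \t]//g' |           # 2. drop all spaces and tabs
    sed ':a;N;$!ba;s/\n//g' |    # 3. join every line into one (removes blank lines too)
    sed 's/<t.>//g' |            # 4. drop opening <td> and <tr> tags
    sed 's/<\/td>/,/g' |         # 5. closing </td> -> comma
    sed 's/<\/tr>/\n/g'          # 6. closing </tr> -> newline
}

# Usage: html_table_to_csv table.html > table.csv
```

As written, every row ends with a trailing comma, because each </td>, including the last one in a row, becomes a comma; the final </tr> also leaves an empty last line. Both would need an extra cleanup step if they matter for the import.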

Of course, I'm putting all this in a pipeline. So far, it's working great. There's one final step I'm stuck with: The table has a column with dates, which has the format dd/mm/yyyy, and I'd like to convert them to yyyy-mm-dd.

Is there a (simple) way to do it (with sed or awk)?

Data sample (after the whole sed pipe):

500,2,13/09/2007,30000.00,12,B-1
501,2,15/09/2007,14000.00,8,B-2

Expected result:

500,2,2007-09-13,30000.00,12,B-1
501,2,2007-09-15,14000.00,8,B-2

I need to do this because I want to import the data into MySQL. I could open the file in Excel and change the format by hand, but I would like to skip that.


asked Aug 26 '13 21:08

Barranka

1 Answer

sed -E 's,([0-9]{2})/([0-9]{2})/([0-9]{4}),\3-\2-\1,g'

answered Sep 30 '22 16:09

ash
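The sed command above captures the day, month, and year in three groups and writes them back in reverse order. It uses commas as the delimiter of the s command so the slashes need no escaping, and it rewrites every dd/mm/yyyy match anywhere on a line. If only the date column should change, an awk version can target that field. This is a minimal sketch, assuming the date is always field 3 of a comma-separated line; the function name fix_date_column is hypothetical:

```shell
# Hypothetical helper: rewrite only column 3 from dd/mm/yyyy to yyyy-mm-dd.
# Reads the files given as arguments, or standard input if there are none.
fix_date_column() {
  awk 'BEGIN { FS = OFS = "," }
       {
         # split() returns the number of pieces; expect day, month, year.
         if (split($3, d, "/") == 3) $3 = d[3] "-" d[2] "-" d[1]
         print
       }' "$@"
}

printf '%s\n' '500,2,13/09/2007,30000.00,12,B-1' \
              '501,2,15/09/2007,14000.00,8,B-2' | fix_date_column
```

Lines whose third field does not split into three parts are printed unchanged, so a header row would pass through as is.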
