Sort the file in unix using first six characters of a line

Tags: unix, sorting

I want to sort a file using only the first six characters of each line. The sort should ignore everything after the sixth character, but the command below still applies the default sort order to the rest of the line.

sort -k 1,6 filename.txt

Input file "filename.txt":

09289720150531N201505220820D20150514
09289720150531N201505220820A20150516
08806020150531N201505290810D20150526
08806020150531N201505290810A20150528

Output should be:

08806020150531N201505290810D20150526
08806020150531N201505290810A20150528
09289720150531N201505220820D20150514
09289720150531N201505220820A20150516

But my command output is:

08806020150531N201505290810A20150528
08806020150531N201505290810D20150526
09289720150531N201505220820A20150516
09289720150531N201505220820D20150514

asked Jun 01 '15 09:06

srisriv

1 Answer

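The option as shown uses field positions: -k 1,6 selects fields 1 through 6, not characters 1 through 6. Fields are separated by blanks by default, and your lines contain none, so the key is the whole line. If you change the option to something like -k1.1,1.6, sort will use character positions 1 through 6 within the first field. This is an extended POSIX feature, likely to be provided on most platforms.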
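As a sketch of that change, assuming the sample data from the question is saved as filename.txt (it is recreated here with printf only so that the example is self-contained):

```shell
# Recreate the sample input from the question.
printf '%s\n' \
  09289720150531N201505220820D20150514 \
  09289720150531N201505220820A20150516 \
  08806020150531N201505290810D20150526 \
  08806020150531N201505290810A20150528 > filename.txt

# Sort on characters 1 through 6 of the first field only.
sort -k1.1,1.6 filename.txt
```

This limits the key to the first six characters, so the 088060 lines come before the 092897 lines. It does not by itself control how lines with equal keys are ordered.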

However, your example has only two distinct values in character positions 1-6: 088060 and 092897. The standard sort command has options for choosing which columns to use as keys, but none for ignoring the remaining columns: once the sort keys have been taken into account, it orders lines with equal keys by comparing the entire lines. GNU sort provides an extension (-s, for "disabling last-resort comparison"), but Solaris sort has no such extension.

There is some vague wording in its manual which hints that -u will do what you want:

When there are multiple key fields, later keys are compared only after all earlier keys compare equal. Except when the -u option is specified, lines that otherwise compare equal are ordered as if none of the options -d, -f, -i, -n or -k were present (but with -r still in effect, if it was specified) and with all bytes in the lines significant to the comparison.

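However, on revisiting this, that wording is misleading: -u is used to filter duplicates, so it would drop all but one of the lines whose keys compare equal instead of keeping them in their original order.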
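To illustrate why -u does not help, here is a small sketch using the sample data (recreated with printf; filename.txt is the name used in the question). Because -u suppresses all but one line in each set of lines with equal keys, only two of the four lines survive, and which line of each pair is kept may depend on the implementation.

```shell
# Recreate the sample input from the question.
printf '%s\n' \
  09289720150531N201505220820D20150514 \
  09289720150531N201505220820A20150516 \
  08806020150531N201505290810D20150526 \
  08806020150531N201505290810A20150528 > filename.txt

# -u keeps only one line per distinct key (characters 1-6 here),
# so the four input lines collapse to two.
sort -u -k1.1,1.6 filename.txt
```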

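A comment suggests that -k1.1,1.6 could be shortened to -k1.6, and testing with Solaris 10 confirmed that would work. Note, though, that POSIX defines -k1.6 as a key that starts at the sixth character of the first field and runs to the end of the line, so the two are not equivalent in general; the equivalent shorter form is -k1,1.6. The test was with /usr/bin/sort, of course. On my copy of Solaris 10, there is an additional copy of sort, in /opt/sfw/bin/sort: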

$ /opt/sfw/bin/sort --version
sort (GNU coreutils) 5.97

and that program supports the -s option noted above. With that option, the program produces the output which was requested.
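A minimal sketch of that invocation, assuming the sort found on PATH is GNU sort (on the Solaris 10 system above it would be called as /opt/sfw/bin/sort) and using the sample data recreated with printf:

```shell
# Recreate the sample input from the question.
printf '%s\n' \
  09289720150531N201505220820D20150514 \
  09289720150531N201505220820A20150516 \
  08806020150531N201505290810D20150526 \
  08806020150531N201505290810A20150528 > filename.txt

# -s (a GNU extension) disables the last-resort comparison, so lines
# whose first six characters are equal keep their input order.
sort -s -k1.1,1.6 filename.txt
```

With the sample input, this prints the two 088060 lines followed by the two 092897 lines, each pair in its original D-then-A order.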

answered Nov 15 '22 07:11

Thomas Dickey
