The string of example is: <pre class="prettyprint"><code>abcdefghijklmno </code></pre> If I give in input: <pre class="prettyprint"><code>abc FALSE #at least 4 characters. abcd TRUE cdefg FALSE #because the match must start from the first character. abcde TRUE abcdeghi FALSE #because the characters must be contained consecutively. abcdefgh TRUE abcdefghi TRUE abcdefghijklmno TRUE abcdefghijklmnop FALSE #because it exceeds the example string. </code></pre> i have tried: <pre class="prettyprint"><code>set -- abc i=1 [[ abcdefghijklmno == ${!i}* ]] echo $? </code></pre> but <code>echo "$?"</code> returns <code>0</code> also with 3, 2, 1 or 0 characters. This other code is obviously wrong but it is to communicate what I would like to do: <pre class="prettyprint"><code>set -- abc i=1 [[ abcdefghijklmno == ${!i}{4}* ]] echo $? </code></pre> EDIT: The solution that suits me is the following: <pre class="prettyprint"><code>set -- abc i=1 [[ abcdefghijklmno == ${!i}* && $(expr length "${!i}") -ge 4 ]] echo $? </code></pre>

You may us this <code>awk</code>: <pre class="prettyprint lang-sh prettyprint-override"><code>awk -v s='abcdefghijklmno' '{ print $0, (length($1) > 3 && index(s, $1) == 1 ? "TRUE" : "FALSE")}' file | column -t abc FALSE abcd TRUE cdefg FALSE abcde TRUE abcdeghi FALSE abcdefgh TRUE abcdefghi TRUE abcdefghijklmno TRUE abcdefghijklmnop FALSE </code></pre> Explained: <ul> <li> <code>column</code> command has been used for tabular output only.</li> <li> <code>length($1) > 3 && index(s, $1) == 1</code>: Check condition that length of first field is greater than <code>3</code> and <code>$1</code> is found from first position in the given string <code>s</code>.</li> </ul> Alternatively, we can also use a regex to check presence of <code>$1</code> from start: <pre class="prettyprint"><code>awk -v s='abcdefghijklmno' '{ print $0, (length($1) > 3 && s ~ "^" $1 ? "TRUE" : "FALSE") }' file </code></pre>

The <code>index</code> function of Perl seems adapted: given two strings, it returns the index at which the second one occurs in the first one, or -1 if it does not occur. What you want to do is thus to check if the second string appears in the first one, at the index 0. Then, you can use the <code>length</code> function to make sure that the second string is more than 4 characters long For instance, <pre class="prettyprint"><code>length("abc") >= 4 && index("abcdefghijklmno", "abc") == 0 # true length("cdefg") >= 4 && index("abcdefghijklmno", "cdefg") == 0 # false length("abcdefghijklmno") >= 4 && index("abcdefghijklmno", "abcdefghijklmno") == 0 # true </code></pre> To use it in a one-liner, one way is to provide both strings on the command line. For instance: <pre class="prettyprint"><code>perl -e 'print length($ARGV[1]) >= 4 && index($ARGV[0], $ARGV[1]) == 0 ? "TRUE" : "FALSE"' abcdefghijklmno abc </code></pre> <hr> Alternatively, you can sacrifice readability for conciseness by using a regular expression: <pre class="prettyprint"><code>perl -e 'print $ARGV[0] =~ /^\Q$ARGV[1]\E(?<=.{4})/ ? "TRUE" : "FALSE"' abcdefghijklmno abcde </code></pre> Where the regex checks if the first string starts with the second one (<code>/^\Q$ARGV[1]\E</code>), and that the second one is 4 characters long or more (<code>(?<=.{4})</code>; see <code>perlre#lookaround-assertions</code>).

Check a substring is contained in a string and has at least the first 4 characters

Tags:

bash

awk

The string of example is:

abcdefghijklmno

If I give in input:

abc                 FALSE    #at least 4 characters.
abcd                TRUE
cdefg               FALSE    #because the match must start from the first character.
abcde               TRUE
abcdeghi            FALSE    #because the characters must be contained consecutively.
abcdefgh            TRUE
abcdefghi           TRUE
abcdefghijklmno     TRUE
abcdefghijklmnop    FALSE    #because it exceeds the example string.

i have tried:

set -- abc
i=1
[[ abcdefghijklmno == ${!i}* ]]
echo $?

but echo "$?" returns 0 also with 3, 2, 1 or 0 characters.

This other code is obviously wrong but it is to communicate what I would like to do:

set -- abc
i=1
[[ abcdefghijklmno == ${!i}{4}* ]]
echo $?

EDIT:

The solution that suits me is the following:

set -- abc
i=1
[[ abcdefghijklmno == ${!i}* && $(expr length "${!i}") -ge 4 ]]
echo $?

442

asked Mar 25 '21 09:03

Mario Palumbo

2 Answers

You may us this awk:

awk -v s='abcdefghijklmno' '{
print $0, (length($1) > 3 && index(s, $1) == 1 ? "TRUE" : "FALSE")}' file | column -t

abc               FALSE
abcd              TRUE
cdefg             FALSE
abcde             TRUE
abcdeghi          FALSE
abcdefgh          TRUE
abcdefghi         TRUE
abcdefghijklmno   TRUE
abcdefghijklmnop  FALSE

Explained:

column command has been used for tabular output only.
length($1) > 3 && index(s, $1) == 1: Check condition that length of first field is greater than 3 and $1 is found from first position in the given string s.

Alternatively, we can also use a regex to check presence of $1 from start:

awk -v s='abcdefghijklmno' '{
   print $0, (length($1) > 3 && s ~ "^" $1 ? "TRUE" : "FALSE")
}' file

172

answered Oct 21 '22 16:10

anubhava

The index function of Perl seems adapted: given two strings, it returns the index at which the second one occurs in the first one, or -1 if it does not occur. What you want to do is thus to check if the second string appears in the first one, at the index 0. Then, you can use the length function to make sure that the second string is more than 4 characters long

For instance,

length("abc") >= 4 && index("abcdefghijklmno", "abc") == 0                # true
length("cdefg") >= 4 && index("abcdefghijklmno", "cdefg") == 0            # false
length("abcdefghijklmno") >= 4 && index("abcdefghijklmno", "abcdefghijklmno") == 0    # true

To use it in a one-liner, one way is to provide both strings on the command line. For instance:

perl -e 'print length($ARGV[1]) >= 4 && index($ARGV[0], $ARGV[1]) == 0 ? "TRUE" : "FALSE"' abcdefghijklmno abc

Alternatively, you can sacrifice readability for conciseness by using a regular expression:

perl -e 'print $ARGV[0] =~ /^\Q$ARGV[1]\E(?<=.{4})/ ? "TRUE" : "FALSE"' abcdefghijklmno abcde

Where the regex checks if the first string starts with the second one (/^\Q$ARGV[1]\E), and that the second one is 4 characters long or more ((?<=.{4}); see perlre#lookaround-assertions).

answered Oct 21 '22 18:10

Dada

Related questions
                            
                                Text File data parsing lines and output as columns
                            
                                sed/awk - print text between patterns spanned across multiple lines
                            
                                Use Variable in Command Substitution
                            
                                Color escape codes in pretty printed columns
                            
                                How to export env variable in node.js
                            
                                Return variable from node.js to sh script
                            
                                check IPs if they exist in /etc/hosts
                            
                                Redirected output hangs when using tee
                            
                                Bash double square brackets regex match issue
                            
                                Why is this Bash function within a git alias executing twice, and why does adding `exit` fix it?
                            
                                Linux CLI - How to get substring from JSON jq + grep?
                            
                                Why does ADB commands break a bash script loop?
                            
                                How to set [Bash on Ubuntu on Windows] [environment variables] from [windows path]?
                            
                                Bash Script to Conda Install requirements.txt with PIP follow-up
                            
                                makefile run targets in parallel
                            
                                How to safely exit early from a bash script?
                            
                                Docker exec linux terminal create alias
                            
                                How to launch a new WSL bash window from an existing WSL bash window
                            
                                How to define bash as default Windows Terminal command processor?
                            
                                Nohup take no effect when to close the terminal end the process running in background?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With