All along, we have been looking at simple expressions when checking whether a condition has been met or not. What if you want to use more than one expression to check for a particular condition?
In this article, we shall take a look at how you can combine multiple expressions, referred to as compound expressions, to check for a condition when filtering text or strings.
In Awk, compound expressions are built using the && operator, referred to as (and), and the || operator, referred to as (or). Together they are known as the compound operators.
The general syntax for compound expressions is:
( first_expression ) && ( second_expression )
Here, both first_expression and second_expression must be true to make the whole expression true.
( first_expression ) || ( second_expression )
Here, at least one of the expressions, either first_expression or second_expression, must be true for the whole expression to be true.
Caution: Remember to always include the parentheses, so that each expression is grouped explicitly.
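To make the two operators concrete, here is a minimal sketch that runs both forms against the same input. The four sample lines and the threshold of 4 are made up for illustration and are not part of the deals file used later in this article.

```shell
# Made-up sample input: a label followed by two numbers on each line.
sample='a 5 10
b 5 3
c 1 10
d 1 3'

# && : print the label only when BOTH comparisons are true.
printf '%s\n' "$sample" | awk '($2 > 4) && ($3 > 4) { print $1 }'
# prints: a

# || : print the label when AT LEAST ONE comparison is true.
printf '%s\n' "$sample" | awk '($2 > 4) || ($3 > 4) { print $1 }'
# prints: a, b and c (one per line)
```

Line d is never printed because neither of its numbers is greater than 4.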
The expressions can be built using the comparison operators that we looked at in Part 4 of the awk series.
Let us now get a clear understanding using the example below:
In this example, I have a text file named tecmint_deals.txt, which contains a list of some amazing random Tecmint deals. It includes the name of each deal, its price and its type.
No  Name                             Price   Type
1   Mac_OS_X_Cleanup_Suite           $9.99   Software
2   Basics_Notebook                  $14.99  Lifestyle
3   Tactical_Pen                     $25.99  Lifestyle
4   Scapple                          $19.00  Unknown
5   Nano_Tool_Pack                   $11.99  Unknown
6   Ditto_Bluetooth_Altering_Device  $33.00  Tech
7   Nano_Prowler_Mini_Drone          $36.99  Tech
Say that we want to print only deals that are above $20 and of type “Tech”, and flag them using the (*) sign at the end of each line.
We shall need to run the command below.
# awk '($3 ~ /^\$[2-9][0-9]*\.[0-9][0-9]$/) && ($4=="Tech") { printf "%s\t%s\n",$0,"*"; } ' tecmint_deals.txt
6 Ditto_Bluetooth_Altering_Device $33.00 Tech *
7 Nano_Prowler_Mini_Drone $36.99 Tech *
In this example, we have used two expressions in a compound expression:
- The first expression, ($3 ~ /^\$[2-9][0-9]*\.[0-9][0-9]$/), checks for lines with deals priced above $20. It is only true if the value of $3, which is the price, matches the pattern /^\$[2-9][0-9]*\.[0-9][0-9]$/
- The second expression, ($4 == "Tech"), checks whether the deal is of type “Tech”. It is only true if the value of $4 equals “Tech”.
Remember, a line will only be flagged with the (*) sign if both the first expression and the second expression are true, as the && operator requires.
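The price check above depends on the shape of the price string. Because [0-9]* can match zero digits, a price such as $3.99 would also pass it, and a price starting with 1, such as $100.00, would not. As an alternative sketch, not the method used in this article, the price can be compared as a number instead. It assumes every price starts with a single "$" character, and it recreates the sample file so that it can be run on its own.

```shell
# Recreate the sample file from this article so the sketch is self-contained.
cat > tecmint_deals.txt <<'EOF'
No Name Price Type
1 Mac_OS_X_Cleanup_Suite $9.99 Software
2 Basics_Notebook $14.99 Lifestyle
3 Tactical_Pen $25.99 Lifestyle
4 Scapple $19.00 Unknown
5 Nano_Tool_Pack $11.99 Unknown
6 Ditto_Bluetooth_Altering_Device $33.00 Tech
7 Nano_Prowler_Mini_Drone $36.99 Tech
EOF

# substr($3, 2) drops the leading "$" (assumed to be present on every price);
# adding 0 forces awk to treat the remainder as a number.
# The header line yields 0 for its "price", so it is skipped automatically.
awk '(substr($3, 2) + 0 > 20) && ($4 == "Tech") { printf "%s\t%s\n", $0, "*" }' tecmint_deals.txt
```

On the sample file this flags the same two lines, 6 and 7, as the pattern-based command. The compound expression still has the same ( first_expression ) && ( second_expression ) form; only the first expression has changed.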
Summary
Some conditions require building compound expressions in order to match exactly what you want. Once you understand how to use the comparison and compound expression operators, filtering text or strings based on difficult conditions becomes easy.
We hope you find this guide useful. For any questions or additions, remember to leave a comment and your concern will be addressed accordingly.
Thanks, Ren!!
Correct answer. You hit the nail on the head.
Do not use parentheses when explaining you are “flagging the text with…” (**), when you are actually flagging it with just **. The special code formatting (the pink text box) already depicts that meaning. Putting extra parentheses just makes it more cumbersome.
The same goes for “(and)” and “(or)” in the third paragraph.
Point taken, will do as you have mentioned.
@Ren
The line is:
7 Nano_Prowler_Mini_Drone $36.99 Tech
Not:
7 Nano_Prowler_Mini_Drone $3.99 Tech
So it is supposed to be printed with a “*” since it is above $20
I think
$ awk '($3 ~ /^\$[2-9][0-9]*\.[0-9][0-9]$/) && ($4=="Tech") { printf "%s\t%s\n",$0,"*"; } ' tecmint_deals.txt
should change to
$ awk '($3 ~ /^\$[2-9][0-9]+\.[0-9][0-9]$/) && ($4=="Tech") { printf "%s\t%s\n",$0,"*"; } ' tecmint_deals.txt
otherwise, the following line will also be printed with “*”:
7 Nano_Prowler_Mini_Drone $3.99 Tech
P.S. Sorry for my English.