The case and select constructs are technically not loops, since they do not iterate the execution of a code block. Like loops, however, they direct program flow according to conditions at the top or bottom of the block.

Controlling program flow in a code block

case (in) / esac

The case construct is the shell equivalent of switch in C/C++. It permits branching to one of a number of code blocks, depending on condition tests. It serves as a kind of shorthand for multiple if/then/else statements and is an appropriate tool for creating menus.

case "$variable" in

"$condition1" )
command...
;;

"$condition2" )
command...
;;

esac

Quoting the variables is not mandatory, since word splitting does not take place.
Each test line ends with a right paren ).
Each condition block ends with a double semicolon ;;.
The entire case block terminates with an esac (case spelled backwards).

Example 10-24. Using case

#!/bin/bash
 # Testing ranges of characters.
 
 echo; echo "Hit a key, then hit return."
 read Keypress
 
 case "$Keypress" in
   [[:lower:]]   ) echo "Lowercase letter";;
   [[:upper:]]   ) echo "Uppercase letter";;
   [0-9]         ) echo "Digit";;
   *             ) echo "Punctuation, whitespace, or other";;
 esac      #  Allows ranges of characters in [square brackets],
           #+ or POSIX ranges in [[double square brackets.
 
 #  In the first version of this example,
 #+ the tests for lowercase and uppercase characters were
 #+ [a-z] and [A-Z].
 #  This no longer works in certain locales and/or Linux distros.
 #  POSIX is more portable.
 #  Thanks to Frank Wang for pointing this out.
 
 #  Exercise:
 #  --------
 #  As the script stands, it accepts a single keystroke, then terminates.
 #  Change the script so it accepts repeated input,
 #+ reports on each keystroke, and terminates only when "X" is hit.
 #  Hint: enclose everything in a "while" loop.
 
 exit 0

Example 10-25. Creating menus using case

#!/bin/bash
 
 # Crude address database
 
 clear # Clear the screen.
 
 echo "          Contact List"
 echo "          ------- ----"
 echo "Choose one of the following persons:" 
 echo
 echo "[E]vans, Roland"
 echo "[J]ones, Mildred"
 echo "[S]mith, Julie"
 echo "[Z]ane, Morris"
 echo
 
 read person
 
 case "$person" in
 # Note variable is quoted.
 
   "E" | "e" )
   # Accept upper or lowercase input.
   echo
   echo "Roland Evans"
   echo "4321 Floppy Dr."
   echo "Hardscrabble, CO 80753"
   echo "(303) 734-9874"
   echo "(303) 734-9892 fax"
   echo "revans@zzy.net"
   echo "Business partner & old friend"
   ;;
 # Note double semicolon to terminate each option.
 
   "J" | "j" )
   echo
   echo "Mildred Jones"
   echo "249 E. 7th St., Apt. 19"
   echo "New York, NY 10009"
   echo "(212) 533-2814"
   echo "(212) 533-9972 fax"
   echo "milliej@loisaida.com"
   echo "Ex-girlfriend"
   echo "Birthday: Feb. 11"
   ;;
 
 # Add info for Smith & Zane later.
 
           * )
    # Default option.	  
    # Empty input (hitting RETURN) fits here, too.
    echo
    echo "Not yet in database."
   ;;
 
 esac
 
 echo
 
 #  Exercise:
 #  --------
 #  Change the script so it accepts multiple inputs,
 #+ instead of terminating after displaying just one address.
 
 exit 0

An exceptionally clever use of case involves testing for command-line parameters.
#! /bin/bash case "$1" in "") echo "Usage: ${0##*/} <filename>"; exit $E_PARAM;; # No command-line parameters, # or first parameter empty. # Note that ${0##*/} is ${var##pattern} param substitution. Net result is $0. -*) FILENAME=./$1;; # If filename passed as argument ($1) starts with a dash, #+ replace it with ./$1 #+ so further commands don't interpret it as an option. * ) FILENAME=$1;; # Otherwise, $1. esac

Here is an more straightforward example of command-line parameter handling:
#! /bin/bash while [ $# -gt 0 ]; do # Until you run out of parameters . . . case "$1" in -d|--debug) # "-d" or "--debug" parameter? DEBUG=1 ;; -c|--conf) CONFFILE="$2" shift if [ ! -f $CONFFILE ]; then echo "Error: Supplied file doesn't exist!" exit $E_CONFFILE # File not found error. fi ;; esac shift # Check next set of parameters. done # From Stefano Falsetto's "Log2Rot" script, #+ part of his "rottlog" package. # Used with permission.

Example 10-26. Using command substitution to generate the case variable

#!/bin/bash
 # case-cmd.sh: Using command substitution to generate a "case" variable.
 
 case $( arch ) in   # "arch" returns machine architecture.
                     # Equivalent to 'uname -m' ...
 i386 ) echo "80386-based machine";;
 i486 ) echo "80486-based machine";;
 i586 ) echo "Pentium-based machine";;
 i686 ) echo "Pentium2+-based machine";;
 *    ) echo "Other type of machine";;
 esac
 
 exit 0

A case construct can filter strings for globbing patterns.

Example 10-27. Simple string matching

#!/bin/bash
 # match-string.sh: simple string matching
 
 match_string ()
 {
   MATCH=0
   NOMATCH=90
   PARAMS=2     # Function requires 2 arguments.
   BAD_PARAMS=91
 
   [ $# -eq $PARAMS ] || return $BAD_PARAMS
 
   case "$1" in
   "$2") return $MATCH;;
   *   ) return $NOMATCH;;
   esac
 
 }  
 
 
 a=one
 b=two
 c=three
 d=two
 
 
 match_string $a     # wrong number of parameters
 echo $?             # 91
 
 match_string $a $b  # no match
 echo $?             # 90
 
 match_string $b $d  # match
 echo $?             # 0
 
 
 exit 0

Example 10-28. Checking for alphabetic input

#!/bin/bash
 # isalpha.sh: Using a "case" structure to filter a string.
 
 SUCCESS=0
 FAILURE=-1
 
 isalpha ()  # Tests whether *first character* of input string is alphabetic.
 {
 if [ -z "$1" ]                # No argument passed?
 then
   return $FAILURE
 fi
 
 case "$1" in
 [a-zA-Z]*) return $SUCCESS;;  # Begins with a letter?
 *        ) return $FAILURE;;
 esac
 }             # Compare this with "isalpha ()" function in C.
 
 
 isalpha2 ()   # Tests whether *entire string* is alphabetic.
 {
   [ $# -eq 1 ] || return $FAILURE
 
   case $1 in
   *[!a-zA-Z]*|"") return $FAILURE;;
                *) return $SUCCESS;;
   esac
 }
 
 isdigit ()    # Tests whether *entire string* is numerical.
 {             # In other words, tests for integer variable.
   [ $# -eq 1 ] || return $FAILURE
 
   case $1 in
   *[!0-9]*|"") return $FAILURE;;
             *) return $SUCCESS;;
   esac
 }
 
 
 
 check_var ()  # Front-end to isalpha ().
 {
 if isalpha "$@"
 then
   echo "\"$*\" begins with an alpha character."
   if isalpha2 "$@"
   then        # No point in testing if first char is non-alpha.
     echo "\"$*\" contains only alpha characters."
   else
     echo "\"$*\" contains at least one non-alpha character."
   fi  
 else
   echo "\"$*\" begins with a non-alpha character."
               # Also "non-alpha" if no argument passed.
 fi
 
 echo
 
 }
 
 digit_check ()  # Front-end to isdigit ().
 {
 if isdigit "$@"
 then
   echo "\"$*\" contains only digits [0 - 9]."
 else
   echo "\"$*\" has at least one non-digit character."
 fi
 
 echo
 
 }
 
 a=23skidoo
 b=H3llo
 c=-What?
 d=What?
 e=`echo $b`   # Command substitution.
 f=AbcDef
 g=27234
 h=27a34
 i=27.34
 
 check_var $a
 check_var $b
 check_var $c
 check_var $d
 check_var $e
 check_var $f
 check_var     # No argument passed, so what happens?
 #
 digit_check $g
 digit_check $h
 digit_check $i
 
 
 exit 0        # Script improved by S.C.
 
 # Exercise:
 # --------
 #  Write an 'isfloat ()' function that tests for floating point numbers.
 #  Hint: The function duplicates 'isdigit ()',
 #+ but adds a test for a mandatory decimal point.

select

The select construct, adopted from the Korn Shell, is yet another tool for building menus.

select variable [in list]
do
command...
break
done

This prompts the user to enter one of the choices presented in the variable list. Note that select uses the PS3 prompt (#? ) by default, but that this may be changed.

Example 10-29. Creating menus using select

#!/bin/bash
 
 PS3='Choose your favorite vegetable: ' # Sets the prompt string.
 
 echo
 
 select vegetable in "beans" "carrots" "potatoes" "onions" "rutabagas"
 do
   echo
   echo "Your favorite veggie is $vegetable."
   echo "Yuck!"
   echo
   break  # What happens if there is no 'break' here?
 done
 
 exit 0

If in list is omitted, then select uses the list of command line arguments ($@) passed to the script or to the function in which the select construct is embedded.

Compare this to the behavior of a

for variable [in list]

construct with the in list omitted.

Example 10-30. Creating menus using select in a function

#!/bin/bash
 
 PS3='Choose your favorite vegetable: '
 
 echo
 
 choice_of()
 {
 select vegetable
 # [in list] omitted, so 'select' uses arguments passed to function.
 do
   echo
   echo "Your favorite veggie is $vegetable."
   echo "Yuck!"
   echo
   break
 done
 }
 
 choice_of beans rice carrots radishes tomatoes spinach
 #         $1    $2   $3      $4       $5       $6
 #         passed to choice_of() function
 
 exit 0

Internal Commands and Builtins

A builtin is a command contained within the Bash tool set, literally built in. This is either for performance reasons -- builtins execute faster than external commands, which usually require forking off a separate process -- or because a particular builtin needs direct access to the shell internals.

When a command or the shell itself initiates (or spawns) a new subprocess to carry out a task, this is called forking. This new process is the child, and the process that forked it off is the parent. While the child process is doing its work, the parent process is still executing.

Note that while a parent process gets the process ID of the child process, and can thus pass arguments to it, the reverse is not true. This can create problems that are subtle and hard to track down.

Example 11-1. A script that forks off multiple instances of itself

#!/bin/bash
 # spawn.sh
 
 
 PIDS=$(pidof sh $0)  # Process IDs of the various instances of this script.
 P_array=( $PIDS )    # Put them in an array (why?).
 echo $PIDS           # Show process IDs of parent and child processes.
 let "instances = ${#P_array[*]} - 1"  # Count elements, less 1.
                                       # Why subtract 1?
 echo "$instances instance(s) of this script running."
 echo "[Hit Ctl-C to exit.]"; echo
 
 
 sleep 1              # Wait.
 sh $0                # Play it again, Sam.
 
 exit 0               # Not necessary; script will never get to here.
                      # Why not?
 
 #  After exiting with a Ctl-C,
 #+ do all the spawned instances of the script die?
 #  If so, why?
 
 # Note:
 # ----
 # Be careful not to run this script too long.
 # It will eventually eat up too many system resources.
 
 #  Is having a script spawn multiple instances of itself
 #+ an advisable scripting technique.
 #  Why or why not?

Generally, a Bash builtin does not fork a subprocess when it executes within a script. An external system command or filter in a script usually will fork a subprocess.

A builtin may be a synonym to a system command of the same name, but Bash reimplements it internally. For example, the Bash echo command is not the same as /bin/echo, although their behavior is almost identical.
#!/bin/bash echo "This line uses the \"echo\" builtin." /bin/echo "This line uses the /bin/echo system command."

A keyword is a reserved word, token or operator. Keywords have a special meaning to the shell, and indeed are the building blocks of the shell's syntax. As examples, "for", "while", "do", and "!" are keywords. Similar to a builtin, a keyword is hard-coded into Bash, but unlike a builtin, a keyword is not by itself a command, but part of a larger command structure. [1]

I/O

echo

prints (to stdout) an expression or variable (see Example 4-1).
echo Hello echo $a

An echo requires the -e option to print escaped characters. See Example 5-2.

Normally, each echo command prints a terminal newline, but the -n option suppresses this.

An echo can be used to feed a sequence of commands down a pipe.

if echo "$VAR" | grep -q txt # if [[ $VAR = *txt* ]] then echo "$VAR contains the substring sequence \"txt\"" fi

An echo, in combination with command substitution can set a variable.

a=`echo "HELLO" | tr A-Z a-z`

Be aware that echo `command` deletes any linefeeds that the output of command generates.

The $IFS (internal field separator) variable normally contains \n (linefeed) as one of its set of whitespace characters. Bash therefore splits the output of command at linefeeds into arguments to echo. Then echo outputs these arguments, separated by spaces.

bash$ ls -l /usr/share/apps/kjezz/sounds -rw-r--r-- 1 root root 1407 Nov 7 2000 reflect.au -rw-r--r-- 1 root root 362 Nov 7 2000 seconds.au bash$ echo `ls -l /usr/share/apps/kjezz/sounds` total 40 -rw-r--r-- 1 root root 716 Nov 7 2000 reflect.au -rw-r--r-- 1 root root 362 Nov 7 2000 seconds.au

So, how can we embed a linefeed within an echoed character string?
# Embedding a linefeed? echo "Why doesn't this string \n split on two lines?" # Doesn't split. # Let's try something else. echo echo $"A line of text containing a linefeed." # Prints as two distinct lines (embedded linefeed). # But, is the "$" variable prefix really necessary? echo echo "This string splits on two lines." # No, the "$" is not needed. echo echo "---------------" echo echo -n $"Another line of text containing a linefeed." # Prints as two distinct lines (embedded linefeed). # Even the -n option fails to suppress the linefeed here. echo echo echo "---------------" echo echo # However, the following doesn't work as expected. # Why not? Hint: Assignment to a variable. string1=$"Yet another line of text containing a linefeed (maybe)." echo $string1 # Yet another line of text containing a linefeed (maybe). # ^ # Linefeed becomes a space. # Thanks, Steve Parker, for pointing this out.

This command is a shell builtin, and not the same as /bin/echo, although its behavior is similar.

bash$ type -a echo echo is a shell builtin echo is /bin/echo

printf

The printf, formatted print, command is an enhanced echo. It is a limited variant of the C language printf() library function, and its syntax is somewhat different.

printf format-string... parameter...

This is the Bash builtin version of the /bin/printf or /usr/bin/printf command. See the printf manpage (of the system command) for in-depth coverage.

Older versions of Bash may not support printf.

Example 11-2. printf in action

#!/bin/bash
 # printf demo
 
 PI=3.14159265358979
 DecimalConstant=31373
 Message1="Greetings,"
 Message2="Earthling."
 
 echo
 
 printf "Pi to 2 decimal places = %1.2f" $PI
 echo
 printf "Pi to 9 decimal places = %1.9f" $PI  # It even rounds off correctly.
 
 printf "\n"                                  # Prints a line feed,
                                              # Equivalent to 'echo' . . .
 
 printf "Constant = \t%d\n" $DecimalConstant  # Inserts tab (\t).
 
 printf "%s %s \n" $Message1 $Message2
 
 echo
 
 # ==========================================#
 # Simulation of C function, sprintf().
 # Loading a variable with a formatted string.
 
 echo 
 
 Pi12=$(printf "%1.12f" $PI)
 echo "Pi to 12 decimal places = $Pi12"
 
 Msg=`printf "%s %s \n" $Message1 $Message2`
 echo $Msg; echo $Msg
 
 #  As it happens, the 'sprintf' function can now be accessed
 #+ as a loadable module to Bash,
 #+ but this is not portable.
 
 exit 0

Formatting error messages is a useful application of printf

E_BADDIR=65 var=nonexistent_directory error() { printf "$@" >&2 # Formats positional params passed, and sents them to stderr. echo exit $E_BADDIR } cd $var || error $"Can't cd to %s." "$var" # Thanks, S.C.

read

"Reads" the value of a variable from stdin, that is, interactively fetches input from the keyboard. The -a option lets read get array variables (see Example 26-6).

Example 11-3. Variable assignment, using read

#!/bin/bash
 # "Reading" variables.
 
 echo -n "Enter the value of variable 'var1': "
 # The -n option to echo suppresses newline.
 
 read var1
 # Note no '$' in front of var1, since it is being set.
 
 echo "var1 = $var1"
 
 
 echo
 
 # A single 'read' statement can set multiple variables.
 echo -n "Enter the values of variables 'var2' and 'var3' (separated by a space or tab): "
 read var2 var3
 echo "var2 = $var2      var3 = $var3"
 # If you input only one value, the other variable(s) will remain unset (null).
 
 exit 0

A read without an associated variable assigns its input to the dedicated variable $REPLY.

Example 11-4. What happens when read has no variable

#!/bin/bash
 # read-novar.sh
 
 echo
 
 # -------------------------- #
 echo -n "Enter a value: "
 read var
 echo "\"var\" = "$var""
 # Everything as expected here.
 # -------------------------- #
 
 echo
 
 # ------------------------------------------------------------------- #
 echo -n "Enter another value: "
 read           #  No variable supplied for 'read', therefore...
                #+ Input to 'read' assigned to default variable, $REPLY.
 var="$REPLY"
 echo "\"var\" = "$var""
 # This is equivalent to the first code block.
 # ------------------------------------------------------------------- #
 
 echo
 
 exit 0

Normally, inputting a \ suppresses a newline during input to a read. The -r option causes an inputted \ to be interpreted literally.

Example 11-5. Multi-line input to read

#!/bin/bash
 
 echo
 
 echo "Enter a string terminated by a \\, then press <ENTER>."
 echo "Then, enter a second string, and again press <ENTER>."
 read var1     # The "\" suppresses the newline, when reading $var1.
               #     first line \
               #     second line
 
 echo "var1 = $var1"
 #     var1 = first line second line
 
 #  For each line terminated by a "\"
 #+ you get a prompt on the next line to continue feeding characters into var1.
 
 echo; echo
 
 echo "Enter another string terminated by a \\ , then press <ENTER>."
 read -r var2  # The -r option causes the "\" to be read literally.
               #     first line \
 
 echo "var2 = $var2"
 #     var2 = first line \
 
 # Data entry terminates with the first <ENTER>.
 
 echo 
 
 exit 0

The read command has some interesting options that permit echoing a prompt and even reading keystrokes without hitting ENTER.

# Read a keypress without hitting ENTER. read -s -n1 -p "Hit a key " keypress echo; echo "Keypress was "\"$keypress\""." # -s option means do not echo input. # -n N option means accept only N characters of input. # -p option means echo the following prompt before reading input. # Using these options is tricky, since they need to be in the correct order.

The -n option to read also allows detection of the arrow keys and certain of the other unusual keys.

Example 11-6. Detecting the arrow keys

#!/bin/bash
 # arrow-detect.sh: Detects the arrow keys, and a few more.
 # Thank you, Sandro Magi, for showing me how.
 
 # --------------------------------------------
 # Character codes generated by the keypresses.
 arrowup='\[A'
 arrowdown='\[B'
 arrowrt='\[C'
 arrowleft='\[D'
 insert='\[2'
 delete='\[3'
 # --------------------------------------------
 
 SUCCESS=0
 OTHER=65
 
 echo -n "Press a key...  "
 # May need to also press ENTER if a key not listed above pressed.
 read -n3 key                      # Read 3 characters.
 
 echo -n "$key" | grep "$arrowup"  #Check if character code detected.
 if [ "$?" -eq $SUCCESS ]
 then
   echo "Up-arrow key pressed."
   exit $SUCCESS
 fi
 
 echo -n "$key" | grep "$arrowdown"
 if [ "$?" -eq $SUCCESS ]
 then
   echo "Down-arrow key pressed."
   exit $SUCCESS
 fi
 
 echo -n "$key" | grep "$arrowrt"
 if [ "$?" -eq $SUCCESS ]
 then
   echo "Right-arrow key pressed."
   exit $SUCCESS
 fi
 
 echo -n "$key" | grep "$arrowleft"
 if [ "$?" -eq $SUCCESS ]
 then
   echo "Left-arrow key pressed."
   exit $SUCCESS
 fi
 
 echo -n "$key" | grep "$insert"
 if [ "$?" -eq $SUCCESS ]
 then
   echo "\"Insert\" key pressed."
   exit $SUCCESS
 fi
 
 echo -n "$key" | grep "$delete"
 if [ "$?" -eq $SUCCESS ]
 then
   echo "\"Delete\" key pressed."
   exit $SUCCESS
 fi
 
 
 echo " Some other key pressed."
 
 exit $OTHER
 
 #  Exercises:
 #  ---------
 #  1) Simplify this script by rewriting the multiple "if" tests
 #+    as a 'case' construct.
 #  2) Add detection of the "Home," "End," "PgUp," and "PgDn" keys.

The -n option to read will not detect the ENTER (newline) key.

The -t option to read permits timed input (see Example 9-4).

The read command may also "read" its variable value from a file redirected to stdin. If the file contains more than one line, only the first line is assigned to the variable. If read has more than one parameter, then each of these variables gets assigned a successive whitespace-delineated string. Caution!

Example 11-7. Using read with file redirection

#!/bin/bash
 
 read var1 <data-file
 echo "var1 = $var1"
 # var1 set to the entire first line of the input file "data-file"
 
 read var2 var3 <data-file
 echo "var2 = $var2   var3 = $var3"
 # Note non-intuitive behavior of "read" here.
 # 1) Rewinds back to the beginning of input file.
 # 2) Each variable is now set to a corresponding string,
 #    separated by whitespace, rather than to an entire line of text.
 # 3) The final variable gets the remainder of the line.
 # 4) If there are more variables to be set than whitespace-terminated strings
 #    on the first line of the file, then the excess variables remain empty.
 
 echo "------------------------------------------------"
 
 # How to resolve the above problem with a loop:
 while read line
 do
   echo "$line"
 done <data-file
 # Thanks, Heiner Steven for pointing this out.
 
 echo "------------------------------------------------"
 
 # Use $IFS (Internal Field Separator variable) to split a line of input to
 # "read", if you do not want the default to be whitespace.
 
 echo "List of all users:"
 OIFS=$IFS; IFS=:       # /etc/passwd uses ":" for field separator.
 while read name passwd uid gid fullname ignore
 do
   echo "$name ($fullname)"
 done </etc/passwd   # I/O redirection.
 IFS=$OIFS              # Restore originial $IFS.
 # This code snippet also by Heiner Steven.
 
 
 
 #  Setting the $IFS variable within the loop itself
 #+ eliminates the need for storing the original $IFS
 #+ in a temporary variable.
 #  Thanks, Dim Segebart, for pointing this out.
 echo "------------------------------------------------"
 echo "List of all users:"
 
 while IFS=: read name passwd uid gid fullname ignore
 do
   echo "$name ($fullname)"
 done </etc/passwd   # I/O redirection.
 
 echo
 echo "\$IFS still $IFS"
 
 exit 0

Piping output to a read, using echo to set variables will fail.

Yet, piping the output of cat seems to work.

cat file1 file2 | while read line do echo $line done

However, as Bj� Eriksson shows:

Example 11-8. Problems reading from a pipe

#!/bin/sh
 # readpipe.sh
 # This example contributed by Bjon Eriksson.
 
 last="(null)"
 cat $0 |
 while read line
 do
     echo "{$line}"
     last=$line
 done
 printf "\nAll done, last:$last\n"
 
 exit 0  # End of code.
         # (Partial) output of script follows.
         # The 'echo' supplies extra brackets.
 
 #############################################
 
 ./readpipe.sh 
 
 {#!/bin/sh}
 {last="(null)"}
 {cat $0 |}
 {while read line}
 {do}
 {echo "{$line}"}
 {last=$line}
 {done}
 {printf "nAll done, last:$lastn"}
 
 
 All done, last:(null)
 
 The variable (last) is set within the subshell but unset outside.

The gendiff script, usually found in /usr/bin on many Linux distros, pipes the output of find to a while read construct.
find $1 $ -name "*$2" -o -name ".*$2" $ -print | while read f; do . . .

Filesystem

cd

The familiar cd change directory command finds use in scripts where execution of a command requires being in a specified directory.

(cd /source/directory && tar cf - . ) | (cd /dest/directory && tar xpvf -)
[from the previously cited example by Alan Cox]

The -P (physical) option to cd causes it to ignore symbolic links.

cd - changes to $OLDPWD, the previous working directory.

The cd command does not function as expected when presented with two forward slashes.
bash$ cd // bash$ pwd //
The output should, of course, be /. This is a problem both from the command line and in a script.

pwd

Print Working Directory. This gives the user's (or script's) current directory (see Example 11-9). The effect is identical to reading the value of the builtin variable $PWD.

pushd, popd, dirs

This command set is a mechanism for bookmarking working directories, a means of moving back and forth through directories in an orderly manner. A pushdown stack is used to keep track of directory names. Options allow various manipulations of the directory stack.

pushd dir-name pushes the path dir-name onto the directory stack and simultaneously changes the current working directory to dir-name

popd removes (pops) the top directory path name off the directory stack and simultaneously changes the current working directory to that directory popped from the stack.

dirs lists the contents of the directory stack (compare this with the $DIRSTACK variable). A successful pushd or popd will automatically invoke dirs.

Scripts that require various changes to the current working directory without hard-coding the directory name changes can make good use of these commands. Note that the implicit $DIRSTACK array variable, accessible from within a script, holds the contents of the directory stack.

Example 11-9. Changing the current working directory

#!/bin/bash
 
 dir1=/usr/local
 dir2=/var/spool
 
 pushd $dir1
 # Will do an automatic 'dirs' (list directory stack to stdout).
 echo "Now in directory `pwd`." # Uses back-quoted 'pwd'.
 
 # Now, do some stuff in directory 'dir1'.
 pushd $dir2
 echo "Now in directory `pwd`."
 
 # Now, do some stuff in directory 'dir2'.
 echo "The top entry in the DIRSTACK array is $DIRSTACK."
 popd
 echo "Now back in directory `pwd`."
 
 # Now, do some more stuff in directory 'dir1'.
 popd
 echo "Now back in original working directory `pwd`."
 
 exit 0
 
 # What happens if you don't 'popd' -- then exit the script?
 # Which directory do you end up in? Why?

Variables

let

The let command carries out arithmetic operations on variables. In many cases, it functions as a less complex version of expr.

Example 11-10. Letting "let" do arithmetic.

#!/bin/bash
 
 echo
 
 let a=11            # Same as 'a=11'
 let a=a+5           # Equivalent to  let "a = a + 5"
                     # (Double quotes and spaces make it more readable.)
 echo "11 + 5 = $a"  # 16
 
 let "a <<= 3"       # Equivalent to  let "a = a << 3"
 echo "\"\$a\" (=16) left-shifted 3 places = $a"
                     # 128
 
 let "a /= 4"        # Equivalent to  let "a = a / 4"
 echo "128 / 4 = $a" # 32
 
 let "a -= 5"        # Equivalent to  let "a = a - 5"
 echo "32 - 5 = $a"  # 27
 
 let "a *=  10"      # Equivalent to  let "a = a * 10"
 echo "27 * 10 = $a" # 270
 
 let "a %= 8"        # Equivalent to  let "a = a % 8"
 echo "270 modulo 8 = $a  (270 / 8 = 33, remainder $a)"
                     # 6
 
 echo
 
 exit 0

eval

eval arg1 [arg2] ... [argN]

Combines the arguments in an expression or list of expressions and evaluates them. Any variables contained within the expression are expanded. The result translates into a command. This can be useful for code generation from the command line or within a script.

bash$ process=xterm bash$ show_process="eval ps ax | grep $process" bash$ $show_process 1867 tty1 S 0:02 xterm 2779 tty1 S 0:00 xterm 2886 pts/1 S 0:00 grep xterm

Example 11-11. Showing the effect of eval

#!/bin/bash
 
 y=`eval ls -l`  #  Similar to y=`ls -l`
 echo $y         #+ but linefeeds removed because "echoed" variable is unquoted.
 echo
 echo "$y"       #  Linefeeds preserved when variable is quoted.
 
 echo; echo
 
 y=`eval df`     #  Similar to y=`df`
 echo $y         #+ but linefeeds removed.
 
 #  When LF's not preserved, it may make it easier to parse output,
 #+ using utilities such as "awk".
 
 echo
 echo "==========================================================="
 echo
 
 # Now, showing how to "expand" a variable using "eval" . . .
 
 for i in 1 2 3 4 5; do
   eval value=$i
   #  value=$i has same effect. The "eval" is not necessary here.
   #  A variable lacking a meta-meaning evaluates to itself --
   #+ it can't expand to anything other than its literal self.
   echo $value
 done
 
 echo
 echo "---"
 echo
 
 for i in ls df; do
   value=eval $i
   #  value=$i has an entirely different effect here.
   #  The "eval" evaluates the commands "ls" and "df" . . .
   #  The terms "ls" and "df" have a meta-meaning,
   #+ since they are interpreted as commands,
   #+ rather than just character strings.
   echo $value
 done
 
 
 exit 0

Example 11-12. Forcing a log-off

#!/bin/bash
 # Killing ppp to force a log-off.
 
 # Script should be run as root user.
 
 killppp="eval kill -9 `ps ax | awk '/ppp/ { print $1 }'`"
 #                     -------- process ID of ppp -------  
 
 $killppp                  # This variable is now a command.
 
 
 # The following operations must be done as root user.
 
 chmod 666 /dev/ttyS3      # Restore read+write permissions, or else what?
 #  Since doing a SIGKILL on ppp changed the permissions on the serial port,
 #+ we restore permissions to previous state.
 
 rm /var/lock/LCK..ttyS3   # Remove the serial port lock file. Why?
 
 exit 0
 
 # Exercises:
 # ---------
 # 1) Have script check whether root user is invoking it.
 # 2) Do a check on whether the process to be killed
 #+   is actually running before attempting to kill it.   
 # 3) Write an alternate version of this script based on 'fuser':
 #+      if [ fuser -s /dev/modem ]; then . . .

Example 11-13. A version of "rot13"

#!/bin/bash
 # A version of "rot13" using 'eval'.
 # Compare to "rot13.sh" example.
 
 setvar_rot_13()              # "rot13" scrambling
 {
   local varname=$1 varvalue=$2
   eval $varname='$(echo "$varvalue" | tr a-z n-za-m)'
 }
 
 
 setvar_rot_13 var "foobar"   # Run "foobar" through rot13.
 echo $var                    # sbbone
 
 setvar_rot_13 var "$var"     # Run "sbbone" through rot13.
                              # Back to original variable.
 echo $var                    # foobar
 
 # This example by Stephane Chazelas.
 # Modified by document author.
 
 exit 0

Rory Winston contributed the following instance of how useful eval can be.

Example 11-14. Using eval to force variable substitution in a Perl script

In the Perl script "test.pl":
         ...		
         my $WEBROOT = <WEBROOT_PATH>;
         ...
 
 To force variable substitution try:
         $export WEBROOT_PATH=/usr/local/webroot
         $sed 's/<WEBROOT_PATH>/$WEBROOT_PATH/' < test.pl > out
 
 But this just gives:
         my $WEBROOT = $WEBROOT_PATH;
 
 However:
         $export WEBROOT_PATH=/usr/local/webroot
         $eval sed 's%\<WEBROOT_PATH\>%$WEBROOT_PATH%' < test.pl > out
 #        ====
 
 That works fine, and gives the expected substitution:
         my $WEBROOT = /usr/local/webroot;
 
 
 ### Correction applied to original example by Paulo Marcel Coelho Aragao.

The eval command can be risky, and normally should be avoided when there exists a reasonable alternative. An eval $COMMANDS executes the contents of COMMANDS, which may contain such unpleasant surprises as rm -rf *. Running an eval on unfamiliar code written by persons unknown is living dangerously.

set

The set command changes the value of internal script variables. One use for this is to toggle option flags which help determine the behavior of the script. Another application for it is to reset the positional parameters that a script sees as the result of a command (set `command`). The script can then parse the fields of the command output.

Example 11-15. Using set with positional parameters

#!/bin/bash
 
 # script "set-test"
 
 # Invoke this script with three command line parameters,
 # for example, "./set-test one two three".
 
 echo
 echo "Positional parameters before  set \`uname -a\` :"
 echo "Command-line argument #1 = $1"
 echo "Command-line argument #2 = $2"
 echo "Command-line argument #3 = $3"
 
 
 set `uname -a` # Sets the positional parameters to the output
                # of the command `uname -a`
 
 echo $_        # unknown
 # Flags set in script.
 
 echo "Positional parameters after  set \`uname -a\` :"
 # $1, $2, $3, etc. reinitialized to result of `uname -a`
 echo "Field #1 of 'uname -a' = $1"
 echo "Field #2 of 'uname -a' = $2"
 echo "Field #3 of 'uname -a' = $3"
 echo ---
 echo $_        # ---
 echo
 
 exit 0

Invoking set without any options or arguments simply lists all the environmental and other variables that have been initialized.
bash$ set AUTHORCOPY=/home/bozo/posts BASH=/bin/bash BASH_VERSION=$'2.05.8(1)-release' ... XAUTHORITY=/home/bozo/.Xauthority _=/etc/bashrc variable22=abc variable23=xzy

Using set with the -- option explicitly assigns the contents of a variable to the positional parameters. When no variable follows the --, it unsets the positional parameters.

Example 11-16. Reassigning the positional parameters

#!/bin/bash
 
 variable="one two three four five"
 
 set -- $variable
 # Sets positional parameters to the contents of "$variable".
 
 first_param=$1
 second_param=$2
 shift; shift        # Shift past first two positional params.
 remaining_params="$*"
 
 echo
 echo "first parameter = $first_param"             # one
 echo "second parameter = $second_param"           # two
 echo "remaining parameters = $remaining_params"   # three four five
 
 echo; echo
 
 # Again.
 set -- $variable
 first_param=$1
 second_param=$2
 echo "first parameter = $first_param"             # one
 echo "second parameter = $second_param"           # two
 
 # ======================================================
 
 set --
 # Unsets positional parameters if no variable specified.
 
 first_param=$1
 second_param=$2
 echo "first parameter = $first_param"             # (null value)
 echo "second parameter = $second_param"           # (null value)
 
 exit 0

See also Example 10-2 and Example 12-50.

unset

The unset command deletes a shell variable, effectively setting it to null. Note that this command does not affect positional parameters.

bash$ unset PATH bash$ echo $PATH bash$

Example 11-17. "Unsetting" a variable

#!/bin/bash
 # unset.sh: Unsetting a variable.
 
 variable=hello                       # Initialized.
 echo "variable = $variable"
 
 unset variable                       # Unset.
                                      # Same effect as:  variable=
 echo "(unset) variable = $variable"  # $variable is null.
 
 exit 0

export

The export command makes available variables to all child processes of the running script or shell. Unfortunately, there is no way to export variables back to the parent process, to the process that called or invoked the script or shell. One important use of the export command is in startup files, to initialize and make accessible environmental variables to subsequent user processes.

Example 11-18. Using export to pass a variable to an embedded awk script

#!/bin/bash
 
 #  Yet another version of the "column totaler" script (col-totaler.sh)
 #+ that adds up a specified column (of numbers) in the target file.
 #  This uses the environment to pass a script variable to 'awk' . . .
 #+ and places the awk script in a variable.
 
 
 ARGS=2
 E_WRONGARGS=65
 
 if [ $# -ne "$ARGS" ] # Check for proper no. of command line args.
 then
    echo "Usage: `basename $0` filename column-number"
    exit $E_WRONGARGS
 fi
 
 filename=$1
 column_number=$2
 
 #===== Same as original script, up to this point =====#
 
 export column_number
 # Export column number to environment, so it's available for retrieval.
 
 
 # -----------------------------------------------
 awkscript='{ total += $ENVIRON["column_number"] }
 END { print total }'
 # Yes, a variable can hold an awk script.
 # -----------------------------------------------
 
 # Now, run the awk script.
 awk "$awkscript" "$filename"
 
 # Thanks, Stephane Chazelas.
 
 exit 0

It is possible to initialize and export variables in the same operation, as in export var1=xxx.

However, as Greg Keraunen points out, in certain situations this may have a different effect than setting a variable, then exporting it.

bash$ export var=(a b); echo ${var[0]} (a b) bash$ var=(a b); export var; echo ${var[0]} a

declare, typeset

The declare and typeset commands specify and/or restrict properties of variables.

readonly

Same as declare -r, sets a variable as read-only, or, in effect, as a constant. Attempts to change the variable fail with an error message. This is the shell analog of the C language const type qualifier.

getopts

This powerful tool parses command-line arguments passed to the script. This is the Bash analog of the getopt external command and the getopt library function familiar to C programmers. It permits passing and concatenating multiple options [2] and associated arguments to a script (for example scriptname -abc -e /usr/local).

The getopts construct uses two implicit variables. $OPTIND is the argument pointer (OPTion INDex) and $OPTARG (OPTion ARGument) the (optional) argument attached to an option. A colon following the option name in the declaration tags that option as having an associated argument.

A getopts construct usually comes packaged in a while loop, which processes the options and arguments one at a time, then decrements the implicit $OPTIND variable to step to the next.

The arguments passed from the command line to the script must be preceded by a minus (-) or a plus (+). It is the prefixed - or + that lets getopts recognize command-line arguments as options. In fact, getopts will not process arguments without the prefixed - or +, and will terminate option processing at the first argument encountered lacking them.
The getopts template differs slightly from the standard while loop, in that it lacks condition brackets.
The getopts construct replaces the deprecated getopt external command.

while getopts ":abcde:fg" Option # Initial declaration. # a, b, c, d, e, f, and g are the options (flags) expected. # The : after option 'e' shows it will have an argument passed with it. do case $Option in a ) # Do something with variable 'a'. b ) # Do something with variable 'b'. ... e) # Do something with 'e', and also with $OPTARG, # which is the associated argument passed with option 'e'. ... g ) # Do something with variable 'g'. esac done shift $(($OPTIND - 1)) # Move argument pointer to next. # All this is not nearly as complicated as it looks <grin>.

Example 11-19. Using getopts to read the options/arguments passed to a script

#!/bin/bash
 # Exercising getopts and OPTIND
 # Script modified 10/09/03 at the suggestion of Bill Gradwohl.
 
 
 # Here we observe how 'getopts' processes command line arguments to script.
 # The arguments are parsed as "options" (flags) and associated arguments.
 
 # Try invoking this script with
 # 'scriptname -mn'
 # 'scriptname -oq qOption' (qOption can be some arbitrary string.)
 # 'scriptname -qXXX -r'
 #
 # 'scriptname -qr'    - Unexpected result, takes "r" as the argument to option "q"
 # 'scriptname -q -r'  - Unexpected result, same as above
 # 'scriptname -mnop -mnop'  - Unexpected result
 # (OPTIND is unreliable at stating where an option came from).
 #
 #  If an option expects an argument ("flag:"), then it will grab
 #+ whatever is next on the command line.
 
 NO_ARGS=0 
 E_OPTERROR=65
 
 if [ $# -eq "$NO_ARGS" ]  # Script invoked with no command-line args?
 then
   echo "Usage: `basename $0` options (-mnopqrs)"
   exit $E_OPTERROR        # Exit and explain usage, if no argument(s) given.
 fi  
 # Usage: scriptname -options
 # Note: dash (-) necessary
 
 
 while getopts ":mnopq:rs" Option
 do
   case $Option in
     m     ) echo "Scenario #1: option -m-   [OPTIND=${OPTIND}]";;
     n | o ) echo "Scenario #2: option -$Option-   [OPTIND=${OPTIND}]";;
     p     ) echo "Scenario #3: option -p-   [OPTIND=${OPTIND}]";;
     q     ) echo "Scenario #4: option -q-\
  with argument \"$OPTARG\"   [OPTIND=${OPTIND}]";;
     #  Note that option 'q' must have an associated argument,
     #+ otherwise it falls through to the default.
     r | s ) echo "Scenario #5: option -$Option-";;
     *     ) echo "Unimplemented option chosen.";;   # DEFAULT
   esac
 done
 
 shift $(($OPTIND - 1))
 #  Decrements the argument pointer so it points to next argument.
 #  $1 now references the first non option item supplied on the command line
 #+ if one exists.
 
 exit 0
 
 #   As Bill Gradwohl states,
 #  "The getopts mechanism allows one to specify:  scriptname -mnop -mnop
 #+  but there is no reliable way to differentiate what came from where
 #+  by using OPTIND."

Script Behavior

source, . (dot command)

This command, when invoked from the command line, executes a script. Within a script, a source file-name loads the file file-name. Sourcing a file (dot-command) imports code into the script, appending to the script (same effect as the #include directive in a C program). The net result is the same as if the "sourced" lines of code were physically present in the body of the script. This is useful in situations when multiple scripts use a common data file or function library.

Example 11-20. "Including" a data file

#!/bin/bash
 
 . data-file    # Load a data file.
 # Same effect as "source data-file", but more portable.
 
 #  The file "data-file" must be present in current working directory,
 #+ since it is referred to by its 'basename'.
 
 # Now, reference some data from that file.
 
 echo "variable1 (from data-file) = $variable1"
 echo "variable3 (from data-file) = $variable3"
 
 let "sum = $variable2 + $variable4"
 echo "Sum of variable2 + variable4 (from data-file) = $sum"
 echo "message1 (from data-file) is \"$message1\""
 # Note:                            escaped quotes
 
 print_message This is the message-print function in the data-file.
 
 
 exit 0

File data-file for Example 11-20, above. Must be present in same directory.

# This is a data file loaded by a script.
 # Files of this type may contain variables, functions, etc.
 # It may be loaded with a 'source' or '.' command by a shell script.
 
 # Let's initialize some variables.
 
 variable1=22
 variable2=474
 variable3=5
 variable4=97
 
 message1="Hello, how are you?"
 message2="Enough for now. Goodbye."
 
 print_message ()
 {
 # Echoes any message passed to it.
 
   if [ -z "$1" ]
   then
     return 1
     # Error, if argument missing.
   fi
 
   echo
 
   until [ -z "$1" ]
   do
     # Step through arguments passed to function.
     echo -n "$1"
     # Echo args one at a time, suppressing line feeds.
     echo -n " "
     # Insert spaces between words.
     shift
     # Next one.
   done  
 
   echo
 
   return 0
 }

It is even possible for a script to source itself, though this does not seem to have any practical applications.

Example 11-21. A (useless) script that sources itself

#!/bin/bash
 # self-source.sh: a script sourcing itself "recursively."
 # From "Stupid Script Tricks," Volume II.
 
 MAXPASSCNT=100    # Maximum number of execution passes.
 
 echo -n  "$pass_count  "
 #  At first execution pass, this just echoes two blank spaces,
 #+ since $pass_count still uninitialized.
 
 let "pass_count += 1"
 #  Assumes the uninitialized variable $pass_count
 #+ can be incremented the first time around.
 #  This works with Bash and pdksh, but
 #+ it relies on non-portable (and possibly dangerous) behavior.
 #  Better would be to initialize $pass_count to 0 before incrementing.
 
 while [ "$pass_count" -le $MAXPASSCNT ]
 do
   . $0   # Script "sources" itself, rather than calling itself.
          # ./$0 (which would be true recursion) doesn't work here. Why?
 done  
 
 #  What occurs here is not actually recursion,
 #+ since the script effectively "expands" itself, i.e.,
 #+ generates a new section of code
 #+ with each pass through the 'while' loop',
 #  with each 'source' in line 20.
 #
 #  Of course, the script interprets each newly 'sourced' "#!" line
 #+ as a comment, and not as the start of a new script.
 
 echo
 
 exit 0   # The net effect is counting from 1 to 100.
          # Very impressive.
 
 # Exercise:
 # --------
 # Write a script that uses this trick to actually do something useful.

exit

Unconditionally terminates a script. The exit command may optionally take an integer argument, which is returned to the shell as the exit status of the script. It is good practice to end all but the simplest scripts with an exit 0, indicating a successful run.

If a script terminates with an exit lacking an argument, the exit status of the script is the exit status of the last command executed in the script, not counting the exit. This is equivalent to an exit $?.

exec

This shell builtin replaces the current process with a specified command. Normally, when the shell encounters a command, it forks off a child process to actually execute the command. Using the exec builtin, the shell does not fork, and the command exec'ed replaces the shell. When used in a script, therefore, it forces an exit from the script when the exec'ed command terminates. [3]

Example 11-22. Effects of exec

#!/bin/bash
 
 exec echo "Exiting \"$0\"."   # Exit from script here.
 
 # ----------------------------------
 # The following lines never execute.
 
 echo "This echo will never echo."
 
 exit 99                       #  This script will not exit here.
                               #  Check exit value after script terminates
                               #+ with an 'echo $?'.
                               #  It will *not* be 99.

Example 11-23. A script that exec's itself

#!/bin/bash
 # self-exec.sh
 
 echo
 
 echo "This line appears ONCE in the script, yet it keeps echoing."
 echo "The PID of this instance of the script is still $$."
 #     Demonstrates that a subshell is not forked off.
 
 echo "==================== Hit Ctl-C to exit ===================="
 
 sleep 1
 
 exec $0   #  Spawns another instance of this same script
           #+ that replaces the previous one.
 
 echo "This line will never echo!"  # Why not?
 
 exit 0

An exec also serves to reassign file descriptors. For example, exec <zzz-file replaces stdin with the file zzz-file.

The -exec option to find is not the same as the exec shell builtin.

shopt

This command permits changing shell options on the fly (see Example 24-1 and Example 24-2). It often appears in the Bash startup files, but also has its uses in scripts. Needs version 2 or later of Bash.
shopt -s cdspell # Allows minor misspelling of directory names with 'cd' cd /hpme # Oops! Mistyped '/home'. pwd # /home # The shell corrected the misspelling.

Commands

true

A command that returns a successful (zero) exit status, but does nothing else.

# Endless loop while true # alias for ":" do operation-1 operation-2 ... operation-n # Need a way to break out of loop or script will hang. done

false

A command that returns an unsuccessful exit status, but does nothing else.

# Null loop while false do # The following code will not execute. operation-1 operation-2 ... operation-n # Nothing happens! done

type [cmd]

Similar to the which external command, type cmd gives the full path name to "cmd". Unlike which, type is a Bash builtin. The useful -a option to type identifies keywords and builtins, and also locates system commands with identical names.

bash$ type '[' [ is a shell builtin bash$ type -a '[' [ is a shell builtin [ is /usr/bin/[

hash [cmds]

Record the path name of specified commands -- in the shell hash table [4] -- so the shell or script will not need to search the $PATH on subsequent calls to those commands. When hash is called with no arguments, it simply lists the commands that have been hashed. The -r option resets the hash table.

bind

The bind builtin displays or modifies readline [5] key bindings.

help

Gets a short usage summary of a shell builtin. This is the counterpart to whatis, but for builtins.

bash$ help exit exit: exit [n] Exit the shell with a status of N. If N is omitted, the exit status is that of the last command executed.

Notes

[1]	An exception to this is the time command, listed in the official Bash documentation as a keyword.
[2]	A option is an argument that acts as a flag, switching script behaviors on or off. The argument associated with a particular option indicates the behavior that the option (flag) switches on or off.
[3]	Unless the exec is used to reassign file descriptors.
[4]	Hashing is a method of creating lookup keys for data stored in a table. The data items themselves are "scrambled" to create keys, using one of a number of simple mathematical algorithms. An advantage of hashing is that it is fast. A disadvantage is that "collisions" -- where a single key maps to more than one data item -- are possible. For examples of hashing see Example A-21 and Example A-22.
[5]	The readline library is what Bash uses for reading input in an interactive shell.

Basic Commands

The first commands a novice learns

ls

The basic file "list" command. It is all too easy to underestimate the power of this humble command. For example, using the -R, recursive option, ls provides a tree-like listing of a directory structure. Other useful options are -S, sort listing by file size, -t, sort by file modification time, and -i, show file inodes (see Example 12-4).

Example 12-1. Using ls to create a table of contents for burning a CDR disk

#!/bin/bash
 # ex40.sh (burn-cd.sh)
 # Script to automate burning a CDR.
 
 
 SPEED=2          # May use higher speed if your hardware supports it.
 IMAGEFILE=cdimage.iso
 CONTENTSFILE=contents
 DEVICE=cdrom
 # DEVICE="0,0"     For older versions of cdrecord
 DEFAULTDIR=/opt  # This is the directory containing the data to be burned.
                  # Make sure it exists.
                  # Exercise: Add a test for this.
 
 # Uses Joerg Schilling's "cdrecord" package:
 # http://www.fokus.fhg.de/usr/schilling/cdrecord.html
 
 #  If this script invoked as an ordinary user, may need to suid cdrecord
 #+ chmod u+s /usr/bin/cdrecord, as root.
 #  Of course, this creates a security hole, though a relatively minor one.
 
 if [ -z "$1" ]
 then
   IMAGE_DIRECTORY=$DEFAULTDIR
   # Default directory, if not specified on command line.
 else
     IMAGE_DIRECTORY=$1
 fi
 
 # Create a "table of contents" file.
 ls -lRF $IMAGE_DIRECTORY > $IMAGE_DIRECTORY/$CONTENTSFILE
 # The "l" option gives a "long" file listing.
 # The "R" option makes the listing recursive.
 # The "F" option marks the file types (directories get a trailing /).
 echo "Creating table of contents."
 
 # Create an image file preparatory to burning it onto the CDR.
 mkisofs -r -o $IMAGEFILE $IMAGE_DIRECTORY
 echo "Creating ISO9660 file system image ($IMAGEFILE)."
 
 # Burn the CDR.
 echo "Burning the disk."
 echo "Please be patient, this will take a while."
 cdrecord -v -isosize speed=$SPEED dev=$DEVICE $IMAGEFILE
 
 exit $?

cat, tac

cat, an acronym for concatenate, lists a file to stdout. When combined with redirection (> or >>), it is commonly used to concatenate files.
# Uses of 'cat' cat filename # Lists the file. cat file.1 file.2 file.3 > file.123 # Combines three files into one.
The -n option to cat inserts consecutive numbers before all lines of the target file(s). The -b option numbers only the non-blank lines. The -v option echoes nonprintable characters, using ^ notation. The -s option squeezes multiple consecutive blank lines into a single blank line.

See also Example 12-25 and Example 12-21.

In a pipe, it may be more efficient to redirect the stdin to a file, rather than to cat the file.

cat filename | tr a-z A-Z tr a-z A-Z < filename # Same effect, but starts one less process, #+ and also dispenses with the pipe.

tac, is the inverse of cat, listing a file backwards from its end.

rev

reverses each line of a file, and outputs to stdout. This does not have the same effect as tac, as it preserves the order of the lines, but flips each one around.

bash$ cat file1.txt This is line 1. This is line 2. bash$ tac file1.txt This is line 2. This is line 1. bash$ rev file1.txt .1 enil si sihT .2 enil si sihT

cp

This is the file copy command. cp file1 file2 copies file1 to file2, overwriting file2 if it already exists (see Example 12-6).

Particularly useful are the -a archive flag (for copying an entire directory tree) and the -r and -R recursive flags.

mv

This is the file move command. It is equivalent to a combination of cp and rm. It may be used to move multiple files to a directory, or even to rename a directory. For some examples of using mv in a script, see Example 9-18 and Example A-2.

When used in a non-interactive script, mv takes the -f (force) option to bypass user input.

When a directory is moved to a preexisting directory, it becomes a subdirectory of the destination directory.

bash$ mv source_directory target_directory bash$ ls -lF target_directory total 1 drwxrwxr-x 2 bozo bozo 1024 May 28 19:20 source_directory/

rm

Delete (remove) a file or files. The -f option forces removal of even readonly files, and is useful for bypassing user input in a script.

The rm command will, by itself, fail to remove filenames beginning with a dash.

bash$ rm -badname rm: invalid option -- b Try `rm --help' for more information.

One way to accomplish this is to preface the filename to be removed with a dot-slash .
bash$ rm ./-badname
Another method is to precede the filename with a " -- ".
bash$ rm -- -badname

When used with the recursive flag -r, this command removes files all the way down the directory tree from the current directory. A careless rm -rf * can wipe out a big chunk of a directory structure.

rmdir

Remove directory. The directory must be empty of all files -- including "invisible" dotfiles [1] -- for this command to succeed.

mkdir

Make directory, creates a new directory. For example, mkdir -p project/programs/December creates the named directory. The -p option automatically creates any necessary parent directories.

chmod

Changes the attributes of an existing file (see Example 11-12).

chmod +x filename # Makes "filename" executable for all users. chmod u+s filename # Sets "suid" bit on "filename" permissions. # An ordinary user may execute "filename" with same privileges as the file's owner. # (This does not apply to shell scripts.)

chmod 644 filename # Makes "filename" readable/writable to owner, readable to # others # (octal mode).

chmod 1777 directory-name # Gives everyone read, write, and execute permission in directory, # however also sets the "sticky bit". # This means that only the owner of the directory, # owner of the file, and, of course, root # can delete any particular file in that directory.

chattr

Change file attributes. This is analogous to chmod above, but with different options and a different invocation syntax, and it works only on an ext2 filesystem.

One particularly interesting chattr option is i. A chattr +i filename marks the file as immutable. The file cannot be modified, linked to, or deleted , not even by root. This file attribute can be set or removed only by root. In a similar fashion, the a option marks the file as append only.

root# chattr +i file1.txt root# rm file1.txt rm: remove write-protected regular file `file1.txt'? y rm: cannot remove `file1.txt': Operation not permitted

If a file has the s (secure) attribute set, then when it is deleted its block is zeroed out on the disk.

If a file has the u (undelete) attribute set, then when it is deleted, its contents can still be retrieved (undeleted).

If a file has the c (compress) attribute set, then it will automatically be compressed on writes to disk, and uncompressed on reads.

The file attributes set with chattr do not show in a file listing (ls -l).

ln

Creates links to pre-existings files. A "link" is a reference to a file, an alternate name for it. The ln command permits referencing the linked file by more than one name and is a superior alternative to aliasing (see Example 4-6).

The ln creates only a reference, a pointer to the file only a few bytes in size.

The ln command is most often used with the -s, symbolic or "soft" link flag. An advantage of using the -s flag is that it permits linking across file systems.

The syntax of the command is a bit tricky. For example: ln -s oldfile newfile links the previously existing oldfile to the newly created link, newfile.

If a file named newfile has previously existed, it will be deleted when the filename newfile is preempted as the name for a link.

Which type of link to use?

As John Macdonald explains it:

Both of these provide a certain measure of dual reference -- if you edit the contents of the file using any name, your changes will affect both the original name and either a hard or soft new name. The differences between them occurs when you work at a higher level. The advantage of a hard link is that the new name is totally independent of the old name -- if you remove or rename the old name, that does not affect the hard link, which continues to point to the data while it would leave a soft link hanging pointing to the old name which is no longer there. The advantage of a soft link is that it can refer to a different file system (since it is just a reference to a file name, not to actual data).

Links give the ability to invoke a script (or any other type of executable) with multiple names, and having that script behave according to how it was invoked.

Example 12-2. Hello or Good-bye

#!/bin/bash
 # hello.sh: Saying "hello" or "goodbye"
 #+          depending on how script is invoked.
 
 # Make a link in current working directory ($PWD) to this script:
 #    ln -s hello.sh goodbye
 # Now, try invoking this script both ways:
 # ./hello.sh
 # ./goodbye
 
 
 HELLO_CALL=65
 GOODBYE_CALL=66
 
 if [ $0 = "./goodbye" ]
 then
   echo "Good-bye!"
   # Some other goodbye-type commands, as appropriate.
   exit $GOODBYE_CALL
 fi
 
 echo "Hello!"
 # Some other hello-type commands, as appropriate.
 exit $HELLO_CALL

man, info

These commands access the manual and information pages on system commands and installed utilities. When available, the info pages usually contain a more detailed description than do the man pages.

Notes

Complex Commands

Commands for more advanced users

find

-exec COMMAND \;

Carries out COMMAND on each file that find matches. The command sequence terminates with ; (the ";" is escaped to make certain the shell passes it to find literally, without interpreting it as a special character).

bash$ find ~/ -name '*.txt' /home/bozo/.kde/share/apps/karm/karmdata.txt /home/bozo/misc/irmeyc.txt /home/bozo/test-scripts/1.txt

If COMMAND contains {}, then find substitutes the full path name of the selected file for "{}".

find ~/ -name 'core*' -exec rm {} \; # Removes all core dump files from user's home directory.

find /home/bozo/projects -mtime 1 # Lists all files in /home/bozo/projects directory tree #+ that were modified within the last day. # # mtime = last modification time of the target file # ctime = last status change time (via 'chmod' or otherwise) # atime = last access time DIR=/home/bozo/junk_files find "$DIR" -type f -atime +5 -exec rm {} \; # ^^ # Curly brackets are placeholder for the path name output by "find." # # Deletes all files in "/home/bozo/junk_files" #+ that have not been accessed in at least 5 days. # # "-type filetype", where # f = regular file # d = directory, etc. # (The 'find' manpage has a complete listing.)

find /etc -exec grep '[0-9][0-9]*[.][0-9][0-9]*[.][0-9][0-9]*[.][0-9][0-9]*' {} \; # Finds all IP addresses (xxx.xxx.xxx.xxx) in /etc directory files. # There a few extraneous hits. How can they be filtered out? # Perhaps by: find /etc -type f -exec cat '{}' \; | tr -c '.[:digit:]' '\n' \ | grep '^[^.][^.]*\.[^.][^.]*\.[^.][^.]*\.[^.][^.]*$' # # [:digit:] is one of the character classes #+ introduced with the POSIX 1003.2 standard. # Thanks, Stephan�Chazelas.

The -exec option to find should not be confused with the exec shell builtin.

Example 12-3. Badname, eliminate file names in current directory containing bad characters and whitespace.

#!/bin/bash
 # badname.sh
 # Delete filenames in current directory containing bad characters.
 
 for filename in *
 do
   badname=`echo "$filename" | sed -n /[\+\{\;\"\\\=\?~\(\)\<\>\&\*\|\$]/p`
 # badname=`echo "$filename" | sed -n '/[+{;"\=?~()<>&*|$]/p'`  also works.
 # Deletes files containing these nasties:     + { ; " \ = ? ~ ( ) < > & * | $
 #
   rm $badname 2>/dev/null
 #             ^^^^^^^^^^^ Error messages deep-sixed.
 done
 
 # Now, take care of files containing all manner of whitespace.
 find . -name "* *" -exec rm -f {} \;
 # The path name of the file that "find" finds replaces the "{}".
 # The '\' ensures that the ';' is interpreted literally, as end of command.
 
 exit 0
 
 #---------------------------------------------------------------------
 # Commands below this line will not execute because of "exit" command.
 
 # An alternative to the above script:
 find . -name '*[+{;"\\=?~()<>&*|$ ]*' -exec rm -f '{}' \;
 # (Thanks, S.C.)

Example 12-4. Deleting a file by its inode number

#!/bin/bash
 # idelete.sh: Deleting a file by its inode number.
 
 #  This is useful when a filename starts with an illegal character,
 #+ such as ? or -.
 
 ARGCOUNT=1                      # Filename arg must be passed to script.
 E_WRONGARGS=70
 E_FILE_NOT_EXIST=71
 E_CHANGED_MIND=72
 
 if [ $# -ne "$ARGCOUNT" ]
 then
   echo "Usage: `basename $0` filename"
   exit $E_WRONGARGS
 fi  
 
 if [ ! -e "$1" ]
 then
   echo "File \""$1"\" does not exist."
   exit $E_FILE_NOT_EXIST
 fi  
 
 inum=`ls -i | grep "$1" | awk '{print $1}'`
 # inum = inode (index node) number of file
 # ----------------------------------------------------------------------
 # Every file has an inode, a record that hold its physical address info.
 # ----------------------------------------------------------------------
 
 echo; echo -n "Are you absolutely sure you want to delete \"$1\" (y/n)? "
 # The '-v' option to 'rm' also asks this.
 read answer
 case "$answer" in
 [nN]) echo "Changed your mind, huh?"
       exit $E_CHANGED_MIND
       ;;
 *)    echo "Deleting file \"$1\".";;
 esac
 
 find . -inum $inum -exec rm {} \;
 #                           ^^
 #        Curly brackets are placeholder
 #+       for text output by "find."
 echo "File "\"$1"\" deleted!"
 
 exit 0

See Example 12-27, Example 3-4, and Example 10-9 for scripts using find. Its manpage provides more detail on this complex and powerful command.

xargs

A filter for feeding arguments to a command, and also a tool for assembling the commands themselves. It breaks a data stream into small enough chunks for filters and commands to process. Consider it as a powerful replacement for backquotes. In situations where command substitution fails with a too many arguments error, substituting xargs often works. Normally, xargs reads from stdin or from a pipe, but it can also be given the output of a file.

The default command for xargs is echo. This means that input piped to xargs may have linefeeds and other whitespace characters stripped out.
bash$ ls -l total 0 -rw-rw-r-- 1 bozo bozo 0 Jan 29 23:58 file1 -rw-rw-r-- 1 bozo bozo 0 Jan 29 23:58 file2 bash$ ls -l | xargs total 0 -rw-rw-r-- 1 bozo bozo 0 Jan 29 23:58 file1 -rw-rw-r-- 1 bozo bozo 0 Jan 29 23:58 file2

ls | xargs -p -l gzip gzips every file in current directory, one at a time, prompting before each operation.

An interesting xargs option is -n NN, which limits to NN the number of arguments passed.

ls | xargs -n 8 echo lists the files in the current directory in 8 columns.

Another useful option is -0, in combination with find -print0 or grep -lZ. This allows handling arguments containing whitespace or quotes.

find / -type f -print0 | xargs -0 grep -liwZ GUI | xargs -0 rm -f

grep -rliwZ GUI / | xargs -0 rm -f

Either of the above will remove any file containing "GUI". (Thanks, S.C.)

Example 12-5. Logfile: Using xargs to monitor system log

#!/bin/bash
 
 # Generates a log file in current directory
 # from the tail end of /var/log/messages.
 
 # Note: /var/log/messages must be world readable
 # if this script invoked by an ordinary user.
 #         #root chmod 644 /var/log/messages
 
 LINES=5
 
 ( date; uname -a ) >>logfile
 # Time and machine name
 echo --------------------------------------------------------------------- >>logfile
 tail -$LINES /var/log/messages | xargs |  fmt -s >>logfile
 echo >>logfile
 echo >>logfile
 
 exit 0
 
 #  Note:
 #  ----
 #  As Frank Wang points out,
 #+ unmatched quotes (either single or double quotes) in the source file
 #+ may give xargs indigestion.
 #
 #  He suggests the following substitution for line 15:
 #     tail -$LINES /var/log/messages | tr -d "\"'" | xargs | fmt -s >>logfile
 
 
 
 #  Exercise:
 #  --------
 #  Modify this script to track changes in /var/log/messages at intervals
 #+ of 20 minutes.
 #  Hint: Use the "watch" command.

As in find, a curly bracket pair serves as a placeholder for replacement text.

Example 12-6. Copying files in current directory to another

#!/bin/bash
 # copydir.sh
 
 #  Copy (verbose) all files in current directory ($PWD)
 #+ to directory specified on command line.
 
 E_NOARGS=65
 
 if [ -z "$1" ]   # Exit if no argument given.
 then
   echo "Usage: `basename $0` directory-to-copy-to"
   exit $E_NOARGS
 fi  
 
 ls . | xargs -i -t cp ./{} $1
 #            ^^ ^^      ^^
 #  -t is "verbose" (output command line to stderr) option.
 #  -i is "replace strings" option.
 #  {} is a placeholder for output text.
 #  This is similar to the use of a curly bracket pair in "find."
 #
 #  List the files in current directory (ls .),
 #+ pass the output of "ls" as arguments to "xargs" (-i -t options),
 #+ then copy (cp) these arguments ({}) to new directory ($1).  
 #
 #  The net result is the exact equivalent of
 #+   cp * $1
 #+ unless any of the filenames has embedded "whitespace" characters.
 
 exit 0

Example 12-7. Killing processes by name

#!/bin/bash
 # kill-byname.sh: Killing processes by name.
 # Compare this script with kill-process.sh.
 
 #  For instance,
 #+ try "./kill-byname.sh xterm" --
 #+ and watch all the xterms on your desktop disappear.
 
 #  Warning:
 #  -------
 #  This is a fairly dangerous script.
 #  Running it carelessly (especially as root)
 #+ can cause data loss and other undesirable effects.
 
 E_BADARGS=66
 
 if test -z "$1"  # No command line arg supplied?
 then
   echo "Usage: `basename $0` Process(es)_to_kill"
   exit $E_BADARGS
 fi
 
 
 PROCESS_NAME="$1"
 ps ax | grep "$PROCESS_NAME" | awk '{print $1}' | xargs -i kill {} 2&>/dev/null
 #                                                       ^^      ^^
 
 # -----------------------------------------------------------
 # Notes:
 # -i is the "replace strings" option to xargs.
 # The curly brackets are the placeholder for the replacement.
 # 2&>/dev/null suppresses unwanted error messages.
 # -----------------------------------------------------------
 
 exit $?

Example 12-8. Word frequency analysis using xargs

#!/bin/bash
 # wf2.sh: Crude word frequency analysis on a text file.
 
 # Uses 'xargs' to decompose lines of text into single words.
 # Compare this example to the "wf.sh" script later on.
 
 
 # Check for input file on command line.
 ARGS=1
 E_BADARGS=65
 E_NOFILE=66
 
 if [ $# -ne "$ARGS" ]
 # Correct number of arguments passed to script?
 then
   echo "Usage: `basename $0` filename"
   exit $E_BADARGS
 fi
 
 if [ ! -f "$1" ]       # Check if file exists.
 then
   echo "File \"$1\" does not exist."
   exit $E_NOFILE
 fi
 
 
 
 ########################################################
 cat "$1" | xargs -n1 | \
 #  List the file, one word per line. 
 tr A-Z a-z | \
 #  Shift characters to lowercase.
 sed -e 's/\.//g'  -e 's/\,//g' -e 's/ /\
 /g' | \
 #  Filter out periods and commas, and
 #+ change space between words to linefeed,
 sort | uniq -c | sort -nr
 #  Finally prefix occurrence count and sort numerically.
 ########################################################
 
 #  This does the same job as the "wf.sh" example,
 #+ but a bit more ponderously, and it runs more slowly (why?).
 
 exit 0

expr

All-purpose expression evaluator: Concatenates and evaluates the arguments according to the operation given (arguments must be separated by spaces). Operations may be arithmetic, comparison, string, or logical.

expr 3 + 5

returns 8

expr 5 % 3

returns 2

expr 5 \* 3

returns 15

The multiplication operator must be escaped when used in an arithmetic expression with expr.

y=`expr $y + 1`

Increment a variable, with the same effect as let y=y+1 and y=$(($y+1)). This is an example of arithmetic expansion.

z=`expr substr $string $position $length`

Extract substring of $length characters, starting at $position.

Example 12-9. Using expr

#!/bin/bash
 
 # Demonstrating some of the uses of 'expr'
 # =======================================
 
 echo
 
 # Arithmetic Operators
 # ---------- ---------
 
 echo "Arithmetic Operators"
 echo
 a=`expr 5 + 3`
 echo "5 + 3 = $a"
 
 a=`expr $a + 1`
 echo
 echo "a + 1 = $a"
 echo "(incrementing a variable)"
 
 a=`expr 5 % 3`
 # modulo
 echo
 echo "5 mod 3 = $a"
 
 echo
 echo
 
 # Logical Operators
 # ------- ---------
 
 #  Returns 1 if true, 0 if false,
 #+ opposite of normal Bash convention.
 
 echo "Logical Operators"
 echo
 
 x=24
 y=25
 b=`expr $x = $y`         # Test equality.
 echo "b = $b"            # 0  ( $x -ne $y )
 echo
 
 a=3
 b=`expr $a \> 10`
 echo 'b=`expr $a \> 10`, therefore...'
 echo "If a > 10, b = 0 (false)"
 echo "b = $b"            # 0  ( 3 ! -gt 10 )
 echo
 
 b=`expr $a \< 10`
 echo "If a < 10, b = 1 (true)"
 echo "b = $b"            # 1  ( 3 -lt 10 )
 echo
 # Note escaping of operators.
 
 b=`expr $a \<= 3`
 echo "If a <= 3, b = 1 (true)"
 echo "b = $b"            # 1  ( 3 -le 3 )
 # There is also a "\>=" operator (greater than or equal to).
 
 
 echo
 echo
 
 
 
 # String Operators
 # ------ ---------
 
 echo "String Operators"
 echo
 
 a=1234zipper43231
 echo "The string being operated upon is \"$a\"."
 
 # length: length of string
 b=`expr length $a`
 echo "Length of \"$a\" is $b."
 
 # index: position of first character in substring
 #        that matches a character in string
 b=`expr index $a 23`
 echo "Numerical position of first \"2\" in \"$a\" is \"$b\"."
 
 # substr: extract substring, starting position & length specified
 b=`expr substr $a 2 6`
 echo "Substring of \"$a\", starting at position 2,\
 and 6 chars long is \"$b\"."
 
 
 #  The default behavior of the 'match' operations is to
 #+ search for the specified match at the ***beginning*** of the string.
 #
 #        uses Regular Expressions
 b=`expr match "$a" '[0-9]*'`               #  Numerical count.
 echo Number of digits at the beginning of \"$a\" is $b.
 b=`expr match "$a" '\([0-9]*\)'`           #  Note that escaped parentheses
 #                   ==      ==              + trigger substring match.
 echo "The digits at the beginning of \"$a\" are \"$b\"."
 
 echo
 
 exit 0

The : operator can substitute for match. For example, b=`expr $a : [0-9]*` is the exact equivalent of b=`expr match $a [0-9]*` in the above listing.

#!/bin/bash echo echo "String operations using \"expr \$string : \" construct" echo "===================================================" echo a=1234zipper5FLIPPER43231 echo "The string being operated upon is \"`expr "$a" : '$.*$'`\"." # Escaped parentheses grouping operator. == == # *************************** #+ Escaped parentheses #+ match a substring # *************************** # If no escaped parentheses... #+ then 'expr' converts the string operand to an integer. echo "Length of \"$a\" is `expr "$a" : '.*'`." # Length of string echo "Number of digits at the beginning of \"$a\" is `expr "$a" : '[0-9]*'`." # ------------------------------------------------------------------------- # echo echo "The digits at the beginning of \"$a\" are `expr "$a" : '$[0-9]*$'`." # == == echo "The first 7 characters of \"$a\" are `expr "$a" : '$.......$'`." # ===== == == # Again, escaped parentheses force a substring match. # echo "The last 7 characters of \"$a\" are `expr "$a" : '.*$.......$'`." # ==== end of string operator ^^ # (actually means skip over one or more of any characters until specified #+ substring) echo exit 0

Time / Date Commands

Time/date and timing

date

Simply invoked, date prints the date and time to stdout. Where this command gets interesting is in its formatting and parsing options.

Example 12-10. Using date

#!/bin/bash
 # Exercising the 'date' command
 
 echo "The number of days since the year's beginning is `date +%j`."
 # Needs a leading '+' to invoke formatting.
 # %j gives day of year.
 
 echo "The number of seconds elapsed since 01/01/1970 is `date +%s`."
 #  %s yields number of seconds since "UNIX epoch" began,
 #+ but how is this useful?
 
 prefix=temp
 suffix=$(date +%s)  # The "+%s" option to 'date' is GNU-specific.
 filename=$prefix.$suffix
 echo $filename
 #  It's great for creating "unique" temp filenames,
 #+ even better than using $$.
 
 # Read the 'date' man page for more formatting options.
 
 exit 0

The -u option gives the UTC (Universal Coordinated Time).

bash$ date Fri Mar 29 21:07:39 MST 2002 bash$ date -u Sat Mar 30 04:07:42 UTC 2002

The date command has quite a number of output options. For example %N gives the nanosecond portion of the current time. One interesting use for this is to generate six-digit random integers.
date +%N | sed -e 's/000$//' -e 's/^0//' ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ # Strip off leading and trailing zeroes, if present.

There are many more options (try man date).
date +%j # Echoes day of the year (days elapsed since January 1). date +%k%M # Echoes hour and minute in 24-hour format, as a single digit string. # The 'TZ' parameter permits overriding the default time zone. date # Mon Mar 28 21:42:16 MST 2005 TZ=EST date # Mon Mar 28 23:42:16 EST 2005 # Thanks, Frank Kannemann and Pete Sjoberg, for the tip. SixDaysAgo=$(date --date='6 days ago') OneMonthAgo=$(date --date='1 month ago') # Four weeks back (not a month). OneYearAgo=$(date --date='1 year ago')

Text Processing Commands

Commands affecting text and text files

sort

File sorter, often used as a filter in a pipe. This command sorts a text stream or file forwards or backwards, or according to various keys or character positions. Using the -m option, it merges presorted input files. The info page lists its many capabilities and options. See Example 10-9, Example 10-10, and Example A-8.

tsort

Topological sort, reading in pairs of whitespace-separated strings and sorting according to input patterns.

uniq

This filter removes duplicate lines from a sorted file. It is often seen in a pipe coupled with sort.
cat list-1 list-2 list-3 | sort | uniq > final.list # Concatenates the list files, # sorts them, # removes duplicate lines, # and finally writes the result to an output file.

The useful -c option prefixes each line of the input file with its number of occurrences.

bash$ cat testfile This line occurs only once. This line occurs twice. This line occurs twice. This line occurs three times. This line occurs three times. This line occurs three times. bash$ uniq -c testfile 1 This line occurs only once. 2 This line occurs twice. 3 This line occurs three times. bash$ sort testfile | uniq -c | sort -nr 3 This line occurs three times. 2 This line occurs twice. 1 This line occurs only once.

The sort INPUTFILE | uniq -c | sort -nr command string produces a frequency of occurrence listing on the INPUTFILE file (the -nr options to sort cause a reverse numerical sort). This template finds use in analysis of log files and dictionary lists, and wherever the lexical structure of a document needs to be examined.

Example 12-11. Word Frequency Analysis

#!/bin/bash
 # wf.sh: Crude word frequency analysis on a text file.
 # This is a more efficient version of the "wf2.sh" script.
 
 
 # Check for input file on command line.
 ARGS=1
 E_BADARGS=65
 E_NOFILE=66
 
 if [ $# -ne "$ARGS" ]  # Correct number of arguments passed to script?
 then
   echo "Usage: `basename $0` filename"
   exit $E_BADARGS
 fi
 
 if [ ! -f "$1" ]       # Check if file exists.
 then
   echo "File \"$1\" does not exist."
   exit $E_NOFILE
 fi
 
 
 
 ########################################################
 # main ()
 sed -e 's/\.//g'  -e 's/\,//g' -e 's/ /\
 /g' "$1" | tr 'A-Z' 'a-z' | sort | uniq -c | sort -nr
 #                           =========================
 #                            Frequency of occurrence
 
 #  Filter out periods and commas, and
 #+ change space between words to linefeed,
 #+ then shift characters to lowercase, and
 #+ finally prefix occurrence count and sort numerically.
 
 #  Arun Giridhar suggests modifying the above to:
 #  . . . | sort | uniq -c | sort +1 [-f] | sort +0 -nr
 #  This adds a secondary sort key, so instances of
 #+ equal occurrence are sorted alphabetically.
 #  As he explains it:
 #  "This is effectively a radix sort, first on the
 #+ least significant column
 #+ (word or string, optionally case-insensitive)
 #+ and last on the most significant column (frequency)."
 ########################################################
 
 exit 0
 
 # Exercises:
 # ---------
 # 1) Add 'sed' commands to filter out other punctuation,
 #+   such as semicolons.
 # 2) Modify the script to also filter out multiple spaces and
 #    other whitespace.

expand, unexpand

The expand filter converts tabs to spaces. It is often used in a pipe.

The unexpand filter converts spaces to tabs. This reverses the effect of expand.

cut

A tool for extracting fields from files. It is similar to the print $N command set in awk, but more limited. It may be simpler to use cut in a script than awk. Particularly important are the -d (delimiter) and -f (field specifier) options.

Using cut to obtain a listing of the mounted filesystems:
cat /etc/mtab | cut -d ' ' -f1,2

Using cut to list the OS and kernel version:
uname -a | cut -d" " -f1,3,11,12

Using cut to extract message headers from an e-mail folder:
bash$ grep '^Subject:' read-messages | cut -c10-80 Re: Linux suitable for mission-critical apps? MAKE MILLIONS WORKING AT HOME!!! Spam complaint Re: Spam complaint

Using cut to parse a file:
# List all the users in /etc/passwd. FILENAME=/etc/passwd for user in $(cut -d: -f1 $FILENAME) do echo $user done # Thanks, Oleg Philon for suggesting this.

cut -d ' ' -f2,3 filename is equivalent to awk -F'[ ]' '{ print $2, $3 }' filename