Aesthetics

As mentioned in the Preface, this guide aims to be as objective as possible by providing well-founded reasons for each recommended practice. However, some practices are purely stylistic and may vary based on personal preference or specific project requirements.

This section addresses these aesthetic choices and offers guidelines for maintaining a consistent and visually appealing script. Where applicable, the guidelines will include explanations of their advantages over alternative practices.

For stylistic guidelines that impact a script's functionality, please refer to this guide's Style section.

Indentations

Indentations help structure code and improve readability by visually grouping related statements. The indentation method can greatly affect how well a script can be understood and maintained.

Guidelines

Type: ALWAYS use spaces for indentations.
- Reason: Spaces are universally supported and render consistently across different editors and environments. This ensures that the script's structure remains intact regardless of where it is viewed or edited.
Size: ALWAYS use four spaces per indentation level.
- Reason: Using four spaces is considered the standard and is widely accepted within the Bash community.
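
As a minimal sketch of the guidelines above, the hypothetical function below nests a loop and a conditional, adding four spaces at each level. The function name and messages are illustrative only.

```bash
report_sizes() {
    local word
    # Level 1 (four spaces): statements directly inside the function.
    for word in "$@"; do
        # Level 2 (eight spaces): statements inside the loop.
        if (( ${#word} > 4 )); then
            # Level 3 (twelve spaces): statements inside the conditional.
            echo "$word is long"
        else
            echo "$word is short"
        fi
    done
}

report_sizes "bash" "indentation"
```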

Characters Per Line

In the past, the maximum number of characters per line (CPL) was limited to 80 because of the constraints of older terminals. Even though modern terminals and editors can display more characters per line, it's still beneficial to stick to a reasonable limit to maintain readability and ease of maintenance across different environments and tools.

General Guidelines

CPL Limit: Set a maximum CPL of 88 characters.
- Reason: An 88-character limit balances the historical 80-character standard and the need for more descriptive code. It accommodates longer variable names, comments, and strings without sacrificing readability.
Exceptions: The CPL limit may be exceeded if line splitting significantly reduces readability or negatively impacts the script's structure.
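
One way to apply the limit is to scan a script for lines that exceed it. The sketch below is only an illustration: the function name find_long_lines is hypothetical, and it assumes awk is available and that its length() count is accurate enough for the script's encoding.

```bash
find_long_lines() {
    local file="$1"
    local max_cpl="${2:-88}"

    # Print "file:line: length" for every line longer than the limit.
    awk -v max="$max_cpl" \
        'length($0) > max { printf "%s:%d: %d\n", FILENAME, FNR, length($0) }' \
        "$file"
}
```

Calling find_long_lines with a script path prints nothing when every line fits. A second argument overrides the default limit of 88.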

Importance of CPL Limits

Readability: Shorter lines reduce the need for horizontal scrolling and minimize word-wrapping issues, which is especially helpful when working with several windows side by side.
Tool Compatibility: A shorter CPL ensures that code is displayed uniformly across different development environments, such as code review platforms, IDEs, and terminals. This prevents layout issues and preserves the intended structure.

Formatting Multi-lined Commands

With a CPL limit in place, commands will inevitably span multiple lines. As such, it's essential to format these multi-lined commands so that their structure stays easy to understand and follow.

Guidelines

The guidelines below cover three areas: indentation and alignment, logical operators, and piping operators.

Indentation and Alignment

Indentation: Use the standard four-space indentation for each continuation line.

Example

rsync -avz /source/directory/with/a/very/long/path/ \
    /destination/directory/with/another/long/path/

Logical Operators

Two or More Operators: If two or more logical operators are used in a command sequence, place each operator at the start of a new line, even if the CPL limit hasn't been reached.
- Reason: Multiple commands on a single line can obscure operations. Placing them onto their own lines improves readability by making the sequence more transparent. (1)
  1. Logical Operator Placement
    I have no objective reasoning for placing logical operators at the beginning of a new line. It's just a personal preference. I find it easier to read and understand the command sequence when each operator is at the beginning of a new line rather than at the end of the previous line.
Single Operator: If a single logical operator connects two commands and the length is within the CPL limit, both commands may remain on the same line.
- Reason: Keeping a single operator on the same line can be more concise and easier to read when the command sequence is short. Splitting them unnecessarily may introduce complexity without added clarity.
Special Case: Sometimes, a single || operator follows a command, and the subsequent operations span multiple lines. In such cases, placing || { at the end of the initial command is acceptable. The closing brace (}) should start a new line, with the operations enclosed between the braces.
- Reason: This format keeps the structure concise and easy to follow when handling a single failure condition with ||. Other formats can add complexity without improving clarity, making this approach more readable.
- Example of acceptable formatting:
  cp /some/config/file /some/config/file.bak || {
      echo "Failed to back up 'file'" >&2
      echo "Please create a backup of the original 'file' before continuing"
      exit 1
  }
- Example of formatting that is not acceptable:
  rm /some/config/file.bak && cp /some/config/file /some/config/file.bak || {
      echo "Failed to overwrite backup of 'file'" >&2
      exit 1
  }
  Explanation: Here, && and || are on the same line, violating the "Two or More Operators" guideline and making the sequence potentially harder to understand. The preferred format is:
  rm -rf /some/config/backup_dir \
      && mv -fT /some/other/config/backup_dir /some/config/backup_dir \
      || {
          echo "Failed to overwrite backup directory" >&2
          exit 1
      }

General Examples

Example 1:

[[ -f /some/location/of/file.txt ]] && rm -f /some/location/of/file.txt

Example 2:

mkdir -p /path/to/backup \
    && rsync -av --delete /path/to/source/ /path/to/backup/ \
    || echo "Backup failed" >> error.log
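
The three layouts described above can be tried safely with a runnable sketch such as the following. All paths are hypothetical and live in a throwaway directory created by mktemp. The variable copy_failed is an illustrative flag, not part of the guide.

```bash
# Hypothetical paths: everything lives in a throwaway directory from mktemp.
work_dir="$(mktemp -d)"
config_file="$work_dir/app.conf"
echo "setting=1" > "$config_file"

# Single operator within the CPL limit: both commands stay on one line.
[[ -f $config_file ]] && echo "Found $config_file"

# Two or more operators: each operator starts its own continuation line.
cp "$config_file" "$config_file.bak" \
    && echo "setting=2" >> "$config_file" \
    || echo "Failed to update config" >&2

# Special case: a single || followed by a multi-line failure handler.
cp "$config_file" "$work_dir/missing_dir/app.conf" 2>/dev/null || {
    echo "Failed to copy config into 'missing_dir'" >&2
    copy_failed=true
}
```

Note that in the second form, the || branch runs if either of the preceding commands fails, not only the first one.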

Piping Operators

Placement: As with logical operators, when a piped command sequence exceeds the CPL limit, place each pipe (|) at the start of a new line, followed by its command (| command).
Single-Line Placement: If piped commands fit within the 88-character limit, they may remain on the same line. Use this sparingly to avoid creating complex or hard-to-read sequences.

Examples

Piped command sequence on a single line:

grep -r "TODO" /path/to/project | grep -v "DONE" | sort | uniq > todo_list.txt

Multi-line command sequence:

grep -r "TODO" /path/to/project \
    | grep -v "DONE" \
    | awk '{print $1, ":", $2}' \
    | sort \
    | uniq > todo_list.txt
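
For a version that can be run without a project directory, the sketch below wraps a similar pipeline in a hypothetical function, list_open_todos, that reads from standard input. Every line except the last ends with a backslash. Without it, Bash treats a line that begins with | as a syntax error.

```bash
list_open_todos() {
    # Keep lines mentioning TODO, drop those marked DONE, then deduplicate.
    grep "TODO" \
        | grep -v "DONE" \
        | sort \
        | uniq
}

printf '%s\n' "b TODO" "a TODO DONE" "b TODO" "a TODO" | list_open_todos
```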

Formatting Control Structures

Control structures in Bash, such as if statements and for or while loops, can be formatted in several ways. Depending on the context, you can choose between a standard or single-line control structure. In either case, maintaining consistency and readability is key.

Guidelines

The guidelines below cover two forms: the standard control structure and the single-line control structure.

Standard Control Structure

Keyword Placement:
- Opening Keyword: Place then on the same line as the if statement and do on the same line as the for or while loop.
- Closing Keyword: End if statements with fi and loops with done on their own lines.
Continuation Line: If the condition or loop statement exceeds the CPL limit, you have two options for formatting continuation lines:
- Option 1 (Recommended): Place the opening keyword (then or do) on its own line directly after the condition or loop statement, rather than at the end of it.
- Option 2: Use eight spaces for each continuation line rather than the standard four spaces.
- Reason: Both options help differentiate the condition or loop statement from the actions or commands within the control structure. This separation enhances readability and maintains a clear structure. While Option 1 is the recommended approach, Option 2 is acceptable if you prefer a different style. However, it's essential to maintain consistency within the script.

Examples

Standard control structure formatting:

if [[ $1 -eq 1 ]]; then
    echo "You entered one."
else
    echo "You entered something else."
fi

for file in *.txt; do
    echo "Processing $file"
done

Continuation lines — Option 1:

if (! hash dotnet \
    || ! hash redis-server \
    || ! hash python3 \
    || ! "$ccze_installed" \
    || ! "$yt_dlp_installed" \
    || [[ ${dotnet_version:-false} != "$C_REQ_DOTNET_VERSION" ]]) &>/dev/null
then
    opt_one_dis=true
    opt_one_text="${E_GREY}${opt_one_text}${dis_option}${E_NC}"
fi

Continuation lines — Option 2:

if (! hash dotnet \
        || ! hash redis-server \
        || ! hash python3 \
        || ! "$ccze_installed" \
        || ! "$yt_dlp_installed" \
        || [[ ${dotnet_version:-false} != "$C_REQ_DOTNET_VERSION" ]]) &>/dev/null; then
    opt_one_dis=true
    opt_one_text="${E_GREY}${opt_one_text}${dis_option}${E_NC}"
fi

Single-Line Control Structure

Clarity and Maintainability: Single-line control structures must be clear and concise.
- Reason: This format is best suited for simple conditions or loops that can be expressed succinctly.
Avoid Complex Logic: Avoid adding multiple commands or complex logic to single-line structures.
- Reason: Including complex logic or multiple commands on a single line can reduce readability and make the code harder to maintain.
Character Limit: DO NOT use a single-line control structure if it exceeds the 88-character limit.

Example

Two single-line control structures:

[[ $1 -eq 1 ]] && echo "You entered one."

for file in *.txt; do echo "Processing $file"; done
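
To illustrate where the single-line form stops being appropriate, the sketch below contrasts a loop with a simple body against one whose body branches. Both function names are hypothetical.

```bash
greet_all() {
    local name
    # Simple body: the single-line form stays clear and well under 88 characters.
    for name in "$@"; do echo "Hello, $name"; done
}

classify_numbers() {
    local number
    # The body branches, so the standard multi-line form is used instead.
    for number in "$@"; do
        if (( number % 2 == 0 )); then
            echo "$number is even"
        else
            echo "$number is odd"
        fi
    done
}

greet_all "Ada" "Linus"
classify_numbers 1 2
```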

Vertical Spacing

Vertical spacing is often overlooked when formatting any programming language. However, it is crucial for enhancing readability and maintaining a clean and organized script structure.

Guidelines

Single Blank Line: Use a single blank line to separate logical blocks of code or functions.
Double Blank Lines: Use double blank lines sparingly to highlight new sections or distinct logical groups.

Why Vertical Spacing Matters

Readability: Proper vertical spacing balances readability and organization. Too many blank lines can make the script appear disjointed, while too few can make it look cluttered. Striking the right balance ensures the script is easily read and logically organized.

Example

add() {
    local sum=$(( $1 + $2 ))
    echo "Sum: $sum"
}

subtract() {
    local difference=$(( $1 - $2 ))
    echo "Difference: $difference"
}


echo "Starting the script..."
add 5 3
subtract 10 4
echo "Script finished."

Explanation: In this example, a single blank line separates the add and subtract functions. Two blank lines are used before the echo "Starting the script..." line, signifying a transition between the function definitions and the main script logic. This spacing enhances readability and visually separates the script's distinct sections.

Comments

Comments are essential for explaining script functionality and enhancing long-term usability. Effective commenting practices ensure scripts are easily understood by new developers or future maintainers. While similar to previous guidelines, comments have a few additional considerations depending on their context.

General Guidelines

Capitalization: Begin each comment with a capital letter, except for code elements like variables or functions.
Punctuation: Conclude comments with a period to indicate a complete thought.
Spacing: For inline comments, maintain two spaces between the code and the comment.
- Reason: While a single space is often sufficient, two spaces provide a clearer visual separation between the code and the comment.

Example

# This is a comment explaining the next line of code.
variable="value"  # Inline comment with two spaces before it.

Function Comments

Functions in Bash differ from those in other languages, especially regarding argument handling. Due to these differences, functions should be thoroughly documented to clarify their purpose, parameters, and expected behavior.

Guidelines

Purpose: Provide a clear and comprehensive description of the function's role within the script. This description should go beyond merely repeating the function's name; it should offer context and explain its intended use.
- Reason: Clearly documenting a function's purpose allows developers, including yourself, to quickly grasp what the function is meant to achieve. Even if the function name is descriptive, a brief explanation adds valuable context, facilitating a better understanding of its role.
- Exception: If the function is short AND its name is descriptive, the purpose may be left blank, or a short and simple description may be used instead. This exception should be used sparingly and only when the function's purpose is immediately evident from its name.
Global Variables: If the function introduces new global variables or modifies existing ones, document their purpose or how they are modified.
- Reason: Identifying how the function interacts with global variables helps developers, including yourself, understand the function's impact on the script's state. This is especially important in larger scripts where tracking variable scope can be challenging.
Parameters: Detail each parameter the function accepts, noting whether they are required or optional, and specify any default values.
- Reason: Documenting the parameters clarifies how the function should be called and what inputs it requires to operate correctly. This reduces the likelihood of errors or misuse and ensures that the function is used consistently with its intended design.
- Value Assignment: Assign parameter values to local variables within the function.
  - Reason: Assigning parameters to local variables enhances understanding by giving the parameters meaningful names within the function's scope. This practice also prevents accidental modification of the original parameters.
Exit and Return: Specify the function's exit and return values separately. Describe when and why the function exits, what it outputs, and which values it returns for use elsewhere in the script.
- Reason: Describing the function's exit and return values helps developers understand what to expect when calling the function and how to handle the results.
General Reasoning: Besides the reasons previously mentioned, documenting functions allows for quick reference and understanding of the script's structure and logic. This is especially beneficial when revisiting the script or collaborating with other developers after an extended period.

Format/Structure Example: Below is the recommended format and structure for documenting functions in Bash scripts. This format provides a clear outline for documenting functions effectively. Please pay attention to the annotations within the examples. They provide additional context and explanations for the documentation.

Important Note

While all of the fields are highly recommended, some may be omitted if they are not applicable to the function, or if they do not provide additional value. However, it is recommended to include as much information as possible to ensure the function is well-documented. Additionally, if information in one of the fields is already provided elsewhere or is self-evident from the function's name or logic, it may be omitted.

####
# Function description...
#
# NOTES:
#   Additional notes or considerations. This section may be omitted if there is no
#   additional information to provide.
#
# NEW GLOBALS:
#   - global_variable_one : Description of global_variable_one.
#
# MODIFIED GLOBALS:
#   - global_variable_two : Description of how global_variable_two is modified.
#
# PARAMETERS:
#   - $1: parameter_name (Required) (1)
#       - Description of the parameter.
#   - $2: parameter_name (Optional, Default: default_value)
#       - Description of the parameter.
#       - Acceptable values: (2)
#           - value_one: Description of value_one.
#           - value_two: Description of value_two.
#
# RETURNS:
#   return value: Description of when the return value is provided and its significance.
#
# EXITS:
#   exit value: Description of when the exit occurs and the reason for the exit.
function_name() {
    # Function logic here...
}

1. Parameter Requirement: Specify whether the parameter is "Required" or "Optional." If the parameter is optional, include the default value in the format of (Optional, Default: default_value).
2. Acceptable Values: If a parameter has specific values it accepts/expects, list them. This ensures that the function is used correctly and helps prevent errors. If the value's purpose is not immediately clear, provide a brief description.

Example

Four examples follow, each documenting a different function.

Example 1:

####
# Convert a given IP address into an integer, using bitwise operations.
#
# NOTES:
#   This allows for easier IP address comparison and calculation. Specifically, the
#   integer is used to calculate the range of IP addresses to scan, among other things.
#
# PARAMETERS:
#   - $1: ip (Required)
#       - The IP address to convert to an integer.
ip_to_int() {
    local ip="$1"
    local IFS='.'
    local octet1 octet2 octet3 octet4

    read -r octet1 octet2 octet3 octet4 <<< "$ip"
    echo "$(( (octet1 << 24) + (octet2 << 16) + (octet3 << 8) + octet4 ))"
}

Example 2:

####
# Verify that the provided IP address is valid, based on a regular expression pattern.
#
# PARAMETERS:
#   - $1: ip (Required)
#       - The IP address to verify.
verify_valid_ip() {
    local ip="$1"
    local valid_ip_regex="^((25[0-5]|2[0-4][0-9]|1?[0-9][0-9]?)\.){3}(25[0-5]|2[0-4][0-9]|1?[0-9][0-9]?)$"

    if [[ ! $ip =~ $valid_ip_regex ]]; then
        echo -e "${C_RED}ERROR:${C_NC} Invalid IP address: $ip" >&2
        clean_exit "1" "" "false"
    fi
}

Example 3:

####
# Perform cleanup operations when the script exits. This includes killing any background
# jobs and removing temporary files.
#
# PARAMETERS:
#   - $1: exit_code (Required)
#       - The type of exit that occurred.
#       - Acceptable values: (1)
#           - 0: Normal exit. The script completed its task successfully.
#           - 1: Exiting due to an error. An error occurred during the script execution.
#           - 130: User interruption. The user interrupted the script using Ctrl+C.
#           - 143: SIGTERM signal received. The script was terminated by a SIGTERM signal.
#   - $2: clean_up (Optional, Default: true)
#       - Whether to perform cleanup operations.
#       - Acceptable values: (2)
#           - true
#           - false
#   - $3: display_message (Optional, Default: true) (3)
#       - Acceptable values: true, false (4)
#
# EXITS:
#   exit_code: The exit code passed by the caller. Always executes once cleanup
#       operations are complete.
clean_exit() {
    local exit_code="$1"
    local clean_up="${2:-true}"
    local display_message="${3:-true}"

    if [[ $exit_code == "1" && $display_message == "true" ]]; then
        echo "${C_RED}==>${C_NC} A fatal error occurred." >&2
    elif [[ ($exit_code == "130" || $exit_code == "143")
            && $display_message == "true" ]]; then
        echo ""
        echo "${C_YELLOW}==>${C_NC} User interruption detected."
    fi

    if [[ $clean_up == "true" ]]; then
        echo "${C_CYAN}==>${C_NC} Cleaning up..."

        for job in "${background_jobs[@]}"; do
            kill -9 "$job" > /dev/null 2>&1
        done

        [[ -f $C_TMP_FILE ]] && rm "$C_TMP_FILE"
    fi

    exit "$exit_code"
}

1. Acceptable Values: As mentioned in the guidelines, if a parameter has a specific set of acceptable values, list them. This is an example of how to document such values.
2. Value Descriptions: Since the values' purpose is self-explanatory, no additional descriptions are necessary.
3. No Description: If the parameter's purpose is immediately evident from its name or the function's logic, a description may be omitted.
4. Alternative Formatting: When the "Acceptable values" section is short and without additional descriptions, the values may be placed on the same line rather than as separate bullet points.

Example 4:

####
# Given two IP addresses, determine the lower and upper bounds, and store them in two
# new global variables.
#
# NEW GLOBALS: (1)
#   - C_LOWER_BOUND: Indicates the *start* of the IP range to be scanned.
#   - C_UPPER_BOUND: Indicates the *end* of the IP range to be scanned.
#
# PARAMETERS:
#   - $1: bound_one (Required)
#       - The first IP address to compare.
#   - $2: bound_two (Required)
#       - The second IP address to compare.
check_lower_upper_bounds() {
    local bound_one="$1"
    local bound_two="$2"

    if (( $(ip_to_int "$bound_one") > $(ip_to_int "$bound_two") )); then
        C_LOWER_BOUND="$bound_two"
        C_UPPER_BOUND="$bound_one"
    elif (( $(ip_to_int "$bound_one") < $(ip_to_int "$bound_two") )); then
        C_LOWER_BOUND="$bound_one"
        C_UPPER_BOUND="$bound_two"
    else
        echo "${C_RED}ERROR:${C_NC} The lower and upper bounds are the same." >&2
        clean_exit "1" "" "false"
    fi
}

1. New Globals: As mentioned in the guidelines, if the function introduces new global variables, document them here. This section provides a clear overview of the new variables and their purpose.
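
The examples above document PARAMETERS, EXITS, and NEW GLOBALS, but none of them uses the RETURNS field. As a minimal sketch of how that field might be filled in, consider the hypothetical function is_valid_port below. It is not part of the original script, and the one-line-per-status layout under RETURNS is an assumption based on the template.

```bash
####
# Check whether the given value is a valid TCP/UDP port number.
#
# PARAMETERS:
#   - $1: port (Required)
#       - The value to check.
#
# RETURNS:
#   0: The value is an integer between 1 and 65535.
#   1: The value is not a valid port number.
is_valid_port() {
    local port="$1"

    # Reject anything that is not made up solely of digits.
    [[ $port =~ ^[0-9]+$ ]] || return 1

    # Force base 10 so leading zeros are not interpreted as octal.
    (( 10#$port >= 1 && 10#$port <= 65535 ))
}

if is_valid_port "8080"; then
    echo "Port accepted."
fi
```

Because the function returns a status instead of exiting, the caller decides how to react to an invalid value.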

Pound Signs in Comments

Traditionally, a single pound sign (#) is used to denote a comment in Bash scripts. However, using multiple pound signs can help differentiate the comments' purpose and scope.

Guidelines

The guidelines below cover four comment levels: single (#), double (##), quadruple (####), and triple (###) pound signs.

Single Pound Sign (#)

Usage: Use a single pound sign for general comments that explain a single line of code or provide context for a specific command.

Example

# This command lists all files in the directory.
ls -la

Double Pound Signs (##)

Usage: Use two pound signs for comments that describe the functionality of a block of code, such as a loop, conditional sequence, control structure, or a group of related variable declarations. Place these comments directly above the relevant code block.
Blank Lines: These comments apply only to the block of code immediately following them. A blank line ends that block, so code after the blank line is not covered by the comment.
- Exception: If a double pound sign comment is placed at the top of a control structure or loop, and a blank line is introduced within the structure, the comment still applies to the entire structure.

Examples

## Set variables for the script.
var_one="value_one"
var_two="value_two"
var_three="value_three"

## Loop through all ".txt" files in the directory. (1)
for file in *.txt; do
    echo "Processing $file"

    echo "Done"
done

1. Blank Line in Structure: While there is a blank line between the two echo commands, this comment still applies to the entire loop.

Quadruple Pound Signs (####)

Usage: Use four pound signs to separate distinct parts of the script, such as functions, variable declarations, or main script logic.
Sparingly: Use four pound signs sparingly, primarily when it’s necessary to visually distinguish major sections of code. Excessive use can clutter the script and make it harder to read.
Omitting ####: Quadruple-pound signs can be omitted if the script is relatively short, well-structured, and easy to navigate without additional sectioning.
Subsections: Quadruple-pound signs can also indicate subsections within a larger script section. However, this should be done VERY sparingly. Consider using triple-pound signs (###) before resorting to quadruple-pound signs for subsections.
Formatting: Quadruple-pound sign comments should be formatted to ensure the transition between sections is clear and visually distinct. Below are the recommended formatting guidelines:
- Section Naming: Append [ section_name ] to the end of ####, replacing section_name with a descriptive title for that section. The section name should clearly indicate the content or purpose of that section.
- Filler Characters: After the section name, append a series of # characters to fill the remaining space up to the 88-character limit.
- Section Comments: If necessary, add comments directly below the initial #### line, prefixing them with four # characters.
- Spacing: As mentioned in the vertical spacing guidelines, provide two blank lines before and after the quadruple-pound signs to enhance the visual separation between sections.
- Subsection Format: Subsections should follow the same format, with the number of brackets ([]) indicating the depth of the subsection within the script. The deeper the subsection, the more brackets should be used. Use filler characters to maintain a consistent 88-character width.

Example

####[ Global Variables ]################################################################
####[[ Modifiable Variables ]]########################################################## (1)

C_MODIFY_ME="value"

####[[ General Variables ]]#############################################################

general_var="value"

####[ Functions ]#######################################################################

####
# Function description...
process_files() {
    # Function logic here...
}

####[ Main Code ]#######################################################################
#### Main code description here, if necessary...

# Main execution starts here.
echo "Starting script execution..."

# Call a function to process files.
process_files "input.txt" "log.txt"

1. Triple Pound Signs: As mentioned in the guidelines, consider using triple-pound signs (###) before resorting to quadruple-pound signs for subsections.
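
Counting filler characters by hand is error-prone, so the section naming and filler rules for quadruple-pound sign headers can be automated. The helper below is a sketch under stated assumptions: the name section_header and its optional depth argument (the number of brackets) are hypothetical and not part of the guide.

```bash
section_header() {
    local title="$1"
    local depth="${2:-1}"
    local width=88
    local open="" close="" header i

    # One bracket pair per level of subsection depth.
    for (( i = 0; i < depth; i++ )); do
        open+="["
        close+="]"
    done

    header="####${open} ${title} ${close}"

    # Pad with filler characters up to the 88-character limit.
    while (( ${#header} < width )); do
        header+="#"
    done

    echo "$header"
}

section_header "Global Variables"
section_header "General Variables" 2
```

The output of each call can be pasted into a script as a section or subsection header.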

Triple Pound Signs (###)

Description: Triple-pound signs serve as a middle ground between double and quadruple-pound signs. They are used when blocks or lines of code require some distinction but do not necessitate a completely new section.
Usage: Use three pound signs where the code is different enough to warrant distinction but not significant enough to be placed in an entirely new section.
Formatting: Triple-pound sign comments should be formatted to ensure the transition between differing blocks of code is clear and visually distinct. Below are the recommended formatting guidelines:
- Section Naming: Append [ section_name ] to the end of ###, replacing section_name with a descriptive title for that section. The section name should clearly indicate the content or purpose of the code below it.
- Filler Characters: After the section name, append a series of # characters to fill the remaining space up to the 88-character limit. Additionally, place three # characters above and below the section name line.
- Spacing: Include a single blank line above and below the filler characters to separate the previous command(s), the triple pound sign comment, and the next command(s).
- Section Comments: If necessary, add comments to describe the section's content or purpose, prefixed with three # characters.

Example

####[ Global Variables ]################################################################


background_jobs=()

###
### [ Configurable Variables ]
### The following variables can be modified to suit your needs.
###

# The maximum number of concurrent pings to run.
readonly C_MAX_CONCURRENT_PINGS=255

###
### [ Constants ]
###

## Variables to colorize the output.
C_YELLOW="$(printf '\033[1;33m')"
C_GREEN="$(printf '\033[0;32m')"
C_BLUE="$(printf '\033[0;34m')"
C_CYAN="$(printf '\033[0;36m')"
C_RED="$(printf '\033[1;31m')"
C_NC="$(printf '\033[0m')"
C_CLRLN="$(printf '\r\033[K')"
readonly C_YELLOW C_GREEN C_BLUE C_CYAN C_RED C_NC C_CLRLN


####[ Functions ]#######################################################################


## Functions to perform various tasks...