Working with Regular Expressions

A regular expression defines a pattern. That pattern is then compared to a target string, and based on the rules, it either:

Why Use Regular Expressions?

Character	Meaning
`.`	Matches any character except newline
`\d`	Matches any digit (0-9)
`\w`	Matches any word character (alphanumeric + underscore)
`\s`	Matches any whitespace character
`\D`, `\W`, `\S`	Negated versions (non-digit, non-word, non-whitespace)

Square brackets [ ] define a character class.
Match any single character from the specified set.
Examples:
- [aeiou] - matches any vowel
- [0-9] - matches any digit (same as \d)
- [a-zA-Z] - matches any letter (upper or lowercase)
- [^0-9] - matches any character that’s NOT a digit

/yes|no/

Matches either "yes" or "no"

Flag	Description
`g`	Global search (find all matches)
`i`	Case-insensitive search
`m`	Multi-line mode (`^` and `$` match line start/end)
`s`	Allows `.` to match newline characters
`u`	Unicode mode
`y`	Sticky search (matches from lastIndex)

Example:

const regex = /hello/gi;

Parentheses ( ) create capture groups
Used to:
- Apply quantifiers to entire sequences
- Extract specific parts of the match
- Reference matched text with backreferences

# Example: Capturing name parts
pattern = r"(\w+)\s(\w+)"
text = "John Smith"
# Captures: Group 1 = "John", Group 2 = "Smith"

plaintext

| pipe symbol for alternation (OR operator)
(?:...) for non-capturing groups
Examples:
- cat|dog matches “cat” or “dog”
- I love (cats|dogs) matches “I love cats” or “I love dogs”
- (?:https?|ftp):// matches “http://”, “https://”, or “ftp://“

Lookahead: (?=...) positive, (?!...) negative
Lookbehind: (?<=...) positive, (?<!...) negative
Zero-width assertions (don’t consume characters)
Examples:
- \w+(?=\s) - word followed by whitespace
- (?<=\$)\d+ - digits preceded by dollar sign
- \b\w+\b(?!\s+and\b) - word NOT followed by “and”