Smarter Logging Middleware

smarter.lib.logging.middleware

Middleware for Per-Request Logging Context

This module provides Django middleware that injects a per-request logging context into Python’s contextvars system. This enables:

  • Request-scoped logging correlation: Each HTTP request receives a unique logging context, allowing logs to be traced per request.

  • Async-safe logging context propagation: Works seamlessly with both synchronous and asynchronous Django views, ensuring context is preserved across async boundaries.

  • Real-time log filtering: Supports real-time log filtering in the Smarter Terminal Emulator for improved debugging and monitoring.

  • User-scoped or anonymous request tracing: Authenticated users are identified in logs by their model and username; anonymous users receive a UUID-based identifier.

Features

  • Integrates with Django’s middleware stack.

  • Uses context variables for safe, per-request logging context.

  • Supports both sync and async Django request handling.

  • Controlled by a Waffle switch (SmarterWaffleSwitches.ENABLE_MIDDLEWARE_REQUEST_LOG_CONTEXT).

Usage

Add SmarterRequestLogContextMiddleware to your Django MIDDLEWARE settings to enable per-request logging context. Ensure the required Waffle switch is configured to activate the middleware.

MIDDLEWARE = [
    # ...
    'smarter.lib.logging.middleware.SmarterRequestLogContextMiddleware',
    # ...
]

Dependencies

  • Django

  • asgiref

  • smarter.common.helpers.logger_helpers

  • smarter.common.mixins

  • smarter.lib.logging.redis_log_handler

  • smarter.lib.django.waffle

class smarter.lib.logging.middleware.SmarterRequestLogContextMiddleware(get_response, *args, **kwargs)[source]

Bases: SmarterMiddlewareMixin

Middleware that injects request identity into logging contextvars.

Authenticated users receive:

“<ModelClass>.<username>”

Anonymous users receive:

UUID-based request identifiers.

__init__(get_response, *args, **kwargs)
_is_private_ip(ip)

Check if IP is in private/internal ranges.

property amnesty_urls: list[str]

Returns a list of URLs that are exempt from certain checks.

Returns:

List of URL path strings that are exempt.

Return type:

list[str]

async_capable = True
bool_environment_variable(var_name, default=False)

Retrieves a boolean value from an environment variable.

This method checks the specified environment variable and returns its value as a boolean. It recognizes common truthy values such as “true”, “1”, “yes”, and “on”. If the variable is not set or cannot be interpreted as a boolean, it returns the provided default value.

Parameters:
  • var_name (str) – The name of the environment variable to check.

  • default (bool) – The default boolean value to return if the environment variable is not set or invalid.

Returns:

The boolean value of the environment variable or the default.

Return type:

bool

data_to_dict(data)

Converts data to a dictionary, handling different types of input.

This method accepts either a dictionary or a string. If a string is provided, it will attempt to parse it as JSON first, and if that fails, as YAML. If parsing fails or the data type is unsupported, a SmarterValueError is raised.

Parameters:

data (Union[dict, str]) – The data to convert, either a dict or a JSON/YAML string.

Returns:

The data as a dictionary.

Return type:

dict

Raises:

SmarterValueError – If the data cannot be converted to a dictionary.

deserves_amnesty(slug)

Determines if a given URL deserves amnesty based on the amnesty URLs list. This excuses certain endpoints (like health checks) from select middleware checks.

Parameters:

slug (str) – The URL path to check.

Returns:

True if the URL deserves amnesty, False otherwise.

Return type:

bool

dict_is_contained_in(dict1, dict2)

Checks if one dictionary is contained within another. This method determines if all key-value pairs in dict1 are present in dict2.

Parameters:
  • dict1 (dict) – The dictionary to check for containment.

  • dict2 (dict) – The dictionary to check against for containment.

Returns:

True if dict1 is contained in dict2, False otherwise.

Return type:

bool

dict_is_subset(small, big)

Checks if one dictionary is a subset of another.

This method determines if all key-value pairs in the small dictionary are present in the big dictionary. It returns True if the small dictionary is a subset of the big dictionary, and False otherwise.

Parameters:
  • small (dict) – The dictionary to check as a subset.

  • big (dict) – The dictionary to check against as a superset.

Returns:

True if the small dictionary is a subset of the big dictionary, False otherwise.

Return type:

bool

property formatted_class_name: str

Returns the class name formatted for logging.

Returns:

The formatted class name as a string.

Return type:

str

property formatted_state_not_ready: str

Returns the not-ready state formatted for logging.

Returns:

The formatted not-ready state as a string.

Return type:

str

property formatted_state_ready: str

Returns the readiness state formatted for logging.

Returns:

The formatted readiness state as a string.

Return type:

str

generate_fernet_encryption_key()

Generates a Fernet encryption key.

This method creates a new Fernet encryption key, which can be used for secure encryption and decryption of data. The generated key is returned as a URL-safe base64-encoded string.

Returns:

A new Fernet encryption key.

Return type:

str

async get_async_context(request)[source]

Resolve logging context for async requests.

Return type:

str

get_client_ip(request)

Get client IP address from request.

This method attempts to determine the original client IP address, accounting for proxies, load balancers, and CDNs. It checks common headers set by proxies and falls back to REMOTE_ADDR.

Notes

  • In AWS CLB → Kubernetes Nginx setups, the client IP flow is: Client → CLB → Nginx Ingress → Django. - CLB adds X-Forwarded-For with the original client IP. - Nginx may add X-Real-IP or modify X-Forwarded-For. - Django sees REMOTE_ADDR as the Nginx IP (not the client IP).

  • If using Cloudflare, it adds the CF-Connecting-IP header.

  • Always validate IPs to avoid trusting spoofed headers.

Parameters:

request (HttpRequest) – The Django request object.

Returns:

The detected client IP address, or None if not found.

Return type:

Optional[str]

get_readonly_csv_file(file_path)

Retrieves a read-only file object for a CSV file.

This method opens the specified CSV file in read-only mode and returns a file object that can be used to read its contents. It ensures that the file is not modified during the reading process.

Parameters:

file_path (str) – The path to the CSV file to open.

Returns:

A read-only file object for the specified CSV file.

Return type:

file

get_readonly_yaml_file(file_path)

Retrieves a read-only file object for a YAML file.

This method opens the specified YAML file in read-only mode and returns a file object that can be used to read its contents. It ensures that the file is not modified during the reading process.

Parameters:

file_path (str) – The path to the YAML file to open.

Returns:

A read-only file object for the specified YAML file.

Return type:

file

get_sync_context(request)[source]

Resolve logging context for sync requests.

Return type:

str

has_auth_indicators(request)

Check if request has authentication indicators (cookies, headers, etc.).

This method inspects the request for common authentication signals, such as session cookies, CSRF tokens, authorization headers, API keys, or Django’s built-in authentication.

Parameters:

request (HttpRequest) – The Django request object.

Returns:

True if authentication indicators are present, False otherwise.

Return type:

bool

property health_check_urls: list[str]

Returns a list of URL paths that are considered health check endpoints.

Returns:

List of health check URL path strings.

Return type:

list[str]

static is_authenticated(user)[source]

Safely determine whether a user is authenticated.

Return type:

bool

mask_string(string='', mask_char='*', mask_length=4, string_length=8)

Masks a string for secure logging.

This utility function masks all but the last unmasked_chars characters of the input string, replacing them with asterisks. It is useful for logging sensitive information like API keys or passwords.

Parameters:
  • string (Optional[str]) – The string to be masked.

  • mask_char (str) – The character used for masking.

  • mask_length (int) – The number of characters to mask.

  • string_length (int) – The length of the string to consider for masking.

Returns:

The masked string.

Return type:

str

property ready: bool

Indicates whether the object is ready for use. This is a placeholder that should be overridden in subclasses.

Returns:

True if ready, False otherwise.

Return type:

bool

recursive_sort_dict(data)

Recursively sorts a dictionary by its keys.

This method takes a dictionary and returns a new dictionary with all keys sorted in ascending order. If any values are also dictionaries, they will be sorted recursively as well.

Parameters:

data (dict) – The dictionary to sort.

Returns:

A new dictionary with all keys sorted.

Return type:

dict

static reset_logging_context(token)[source]

Restore previous logging context.

Return type:

None

rfc1034_compliant_str(name)

Converts a string to an RFC 1034 compliant format.

This method takes a string and converts it to a format that complies with RFC 1034, which is commonly used for domain names. It replaces invalid characters with hyphens and ensures the resulting string is lowercase.

Parameters:

name (str) – The string to convert to RFC 1034 compliant format.

Returns:

The converted string in RFC 1034 compliant format.

Return type:

str

rfc1034_compliant_to_snake(name)

Converts an RFC 1034 compliant string to snake_case.

This method takes a string in RFC 1034 compliant format and converts it to snake_case. It replaces hyphens with underscores and ensures the resulting string is lowercase.

Parameters:

name (str) – The RFC 1034 compliant string to convert.

Returns:

The converted string in snake_case.

Return type:

str

static set_logging_context(context)[source]

Set request-scoped logging context.

Return type:

Token

set_smarter_process_id(request)

Set smarter_process_id on request if not already set. This has to consider the pipeline of middlewares. Each middleware class sets its own unique smarter_process_id, but if a previous middleware has already set the request object’s smarter_process_id then we should not overwrite it. This allows all middlewares in the pipeline to share the same process ID for logging and caching purposes.

Return type:

None

smarter_build_absolute_uri(request)

Attempts to get the absolute URI from a request object.

This utility function tries to retrieve the request URL from any valid child class of django.http.HttpRequest. It is especially useful in unit tests or scenarios where the request object may not implement build_absolute_uri().

Parameters:

request (HttpRequest) – The request object.

Returns:

The absolute request URL.

Return type:

Optional[str]

Raises:

SmarterValueError – If the URI cannot be built from the request.

smarter_process_id: int
sorted_dict(data)

Returns a new dictionary with keys sorted.

Parameters:

data (dict) – The dictionary to sort.

Returns:

A new dictionary with sorted keys.

Return type:

dict

sync_capable = True
to_camel_case(data, convert_values=False)

Converts a snake_case string to camelCase.

This method takes a string in snake_case format and converts it to camelCase. It is useful for standardizing naming conventions across different formats.

Parameters:
  • data (Union[str, dict[str, object], list[object], object]) – The snake_case string to convert.

  • convert_values (bool) – Whether to convert the values of dictionaries and lists recursively.

Returns:

The converted string in camelCase.

Return type:

Any

to_snake_case(data, convert_values=False)

Converts a camelCase or PascalCase string to snake_case.

This method takes a string in camelCase or PascalCase format and converts it to snake_case. It is useful for standardizing naming conventions across different formats.

Parameters:
  • data (Union[str, dict[str, object], list[object], object]) – The camelCase or PascalCase string to convert.

  • convert_values (bool) – Whether to convert the values of dictionaries and lists recursively.

Returns:

The converted string in snake_case.

Return type:

Any

property unformatted_class_name: str

Returns the raw class name without formatting.

Returns:

The unformatted class name as a string.

Return type:

str

This is useful for logging or serialization where the plain class name is needed.