[WITH CODE] RiskOps: Improving MAE/MFE
A methodical approach to trading with peaks, valleys, and robust models
Table of contents:
Introduction.
The problem with MAE/MFE.
An alternative to finding the optimal target value.
Step 1 - Identify peaks and bottoms.
Step 2 - Calculate differences.
Step 3 - RANSAC.
The exit strategy based on partial exits.
Introduction
Once upon a time, in the enchanted land of Data Science, there existed two popular wizards: Maximum Adverse Excursion and Maximum Favorable Excursion. These magical tools helped traders and engineers measure the extreme ups and downs of their trades. Think of them as the superhero gadgets of financial risk management—capable of measuring how far a price could fall—or rise—during a trade. But alas, like picky eaters who only relish a specific flavor, MAE and MFE insist that all data must be normal, resembling a perfect bell curve.
Imagine you’re measuring the heights of children in a class, and suddenly one kid shows up on stilts! The bell curve gets all confused, and our MAE/MFE tools cry, “This isn’t normal! Abort mission!” And so, our adventure begins: How do we deal with data that is more like a squashed cupcake or a rollercoaster ride than a neat, bell-shaped curve?
The problem with MAE/MFE
MAE and MFE have long been the darlings of traders who assume that price movements follow a bell-shaped normal distribution. In mathematical terms, when we say a variable X is normally distributed, we mean its probability density function is given by
f(x) = (1 / (σ√(2π))) · exp(−(x − μ)² / (2σ²))
where μ is the mean and σ is the standard deviation. This formula looks elegant and symmetric, much like a well-baked cake.
However, market data is messy. Prices can be influenced by countless unpredictable factors, leading to distributions that are skewed and heavy-tailed. Under these conditions, MAE and MFE, tools that rely on the assumption of normality, give misleading signals.
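To see how far market-like data can stray from the bell curve, here is a quick sketch using simulated heavy-tailed returns (a Student's t draw, purely an illustrative assumption) and scipy's normality test:
import numpy as np
from scipy import stats
# Simulate heavy-tailed "returns" with a Student's t distribution (illustrative)
np.random.seed(42)
returns = stats.t.rvs(df=3, size=1000)
# Skewness and excess kurtosis: a truly normal sample would show values near 0
print("Skewness:", stats.skew(returns))
print("Excess kurtosis:", stats.kurtosis(returns))
# D'Agostino-Pearson normality test: a tiny p-value rejects normality
stat, p_value = stats.normaltest(returns)
print("Normality test p-value:", p_value)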
Let’s examine the MAE. For a trade, we define:
MAE = Ppeak − Ptrough
where Ppeak is a local maximum (a peak) and Ptrough is the subsequent local minimum (a valley).
Under the assumption of normality, we expect the differences Ppeak − Ptrough to be symmetrically distributed about a central value. But when the data is non-normal, extreme values (outliers) may distort this picture, making MAE and MFE unreliable.
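As a quick aside, here is a minimal sketch of the conventional per-trade computation, measuring MAE and MFE relative to the entry price of a long trade; the price path is simulated, not real data:
import numpy as np
# Simulated price path for a single long trade (illustrative)
np.random.seed(0)
prices = np.cumsum(np.random.randn(100)) + 100
entry_price = prices[0]
# MAE: the worst excursion against the position during the trade
mae = max(0.0, entry_price - prices.min())
# MFE: the best excursion in favor of the position during the trade
mfe = max(0.0, prices.max() - entry_price)
print(f"MAE: {mae:.2f}, MFE: {mfe:.2f}")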
Thus, we require an approach that does not rely on the data being normal—a method that can handle irregularities with the grace of a superhero dodging meteors.
An alternative to finding the optimal target value
To tackle the problem of non-normal data, we propose an approach that is both intuitive and mathematically robust. Our method consists of three key steps:
Step 1 - Identify peaks and bottoms:
Imagine you’re hiking in the mountains: the peaks represent the highest points, and the valleys are the lowest dips. In our mathematical frame, peaks correspond to local maxima and valleys to local minima.
A peak is identified when the derivative (the slope) of the function changes sign from positive to negative. Mathematically, if f(x) is our function, then a point x = x0 is a local maximum or peak if:
f′(x0) = 0 and f″(x0) < 0
Similarly, a valley is found when:
f′(x0) = 0 and f″(x0) > 0
This is like saying: If the hill stops rising and starts falling, you’re at a peak! And if it stops falling and starts rising, you’re at a valley! Pretty neat, right?
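This sign-change rule can be checked directly on discrete data with np.diff; here is a minimal sketch on toy values (assuming no flat segments, where ties would need extra care):
import numpy as np
# Toy series (hypothetical values)
y = np.array([1.0, 3.0, 2.0, 0.5, 1.5, 2.5, 1.0])
# Sign of the discrete "derivative" between consecutive points
slope_sign = np.sign(np.diff(y))
# A peak: slope flips from positive to negative; a valley: negative to positive
peak_idx = np.where((slope_sign[:-1] > 0) & (slope_sign[1:] < 0))[0] + 1
valley_idx = np.where((slope_sign[:-1] < 0) & (slope_sign[1:] > 0))[0] + 1
print("Peaks at indices:", peak_idx)      # expect [1, 5]
print("Valleys at indices:", valley_idx)  # expect [3]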
In practice, we can lean on scipy for this. Consider the following code fragment, keeping in mind that it is used only for labeling, as an approximate basis for this kind of method, never to generate entry or exit signals:
import numpy as np
import matplotlib.pyplot as plt
from scipy.signal import find_peaks
# Simulate stock price data
np.random.seed(42)
time = np.arange(50) # Time steps
stock_price = np.cumsum(np.random.randn(50)) + 100 # Random walk around 100
# Find peaks (local maxima)
peaks, _ = find_peaks(stock_price)
# Find bottoms (local minima)
bottoms, _ = find_peaks(-stock_price)
# Plot the stock price with peaks and bottoms
plt.figure(figsize=(10, 5))
plt.plot(time, stock_price, label="Stock Price", linestyle="-", marker="o", markersize=4)
plt.scatter(time[peaks], stock_price[peaks], color="red", label="Peaks", zorder=3)
plt.scatter(time[bottoms], stock_price[bottoms], color="blue", label="Bottoms", zorder=3)
plt.xlabel("Time")
plt.ylabel("Stock Price")
plt.title("Simulated Stock Price with Peaks and Bottoms")
plt.legend()
plt.grid()
# Show the chart
plt.show()
The find_peaks function locates positions in the data array where a peak occurs. To find valleys, we simply invert the data (multiply by -1) and then find the peaks of the inverted series.
Step 2 - Calculate differences:
After identifying our peaks and valleys, the next logical step is to measure the height differences between them. These differences are crucial—they represent the excursion lengths, akin to the difference between the highest mountain and the deepest valley.
In simple math, suppose we have a sequence of peaks P1, P2, …, Pn and valleys B1, B2, …, Bn. The difference between a peak and the following valley is given by:
Di = Pi − Bi
and for a valley followed by the next peak, we can write:
D′i = Pi+1 − Bi
These differences help us quantify the magnitude of fluctuations in our data. We can now calculate the differences as follows:
# Assuming 'stock_price' is our array and 'peaks' and 'bottoms' are the identified indices
differences = []
# We take the difference between each peak and the subsequent valley.
# (This is a simplified illustration; in practice, one must align peaks and valleys correctly.)
for p, b in zip(peaks, bottoms):
    differences.append(stock_price[p] - stock_price[b])
For each pair, we subtract the valley’s value from the peak’s value. If you plot the resulting differences, you can see how the excursion magnitudes are distributed.
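Since zip simply pairs the i-th peak with the i-th valley, the pairing can drift when the series starts with a valley or when peaks and valleys are unbalanced. Here is a minimal sketch of one way to align them, pairing each peak with the first valley that actually follows it; the aligned_peak_valley_diffs helper is an assumption for illustration, not part of the original script:
def aligned_peak_valley_diffs(prices, peaks, bottoms):
    """Pair each peak with the first valley that follows it in time."""
    diffs = []
    b_iter = iter(sorted(bottoms))
    b = next(b_iter, None)
    for p in sorted(peaks):
        # Advance to the first valley occurring after this peak
        while b is not None and b <= p:
            b = next(b_iter, None)
        if b is None:
            break
        diffs.append(prices[p] - prices[b])
    return diffs

aligned_diffs = aligned_peak_valley_diffs(stock_price, peaks, bottoms)
print("Aligned peak-to-valley differences:", aligned_diffs)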
Step 3 - RANSAC:
Now comes the part where a robust algorithm called RANSAC (Random Sample Consensus) is combined with quantile analysis to deal with the data’s chaos.
RANSAC is a powerful method that fits a model to data while ignoring outliers. Imagine a robot that randomly picks points, draws a line through them, and then checks which points are in line—inliers. It repeats this process until it finds the best line that fits the majority of the data.
Mathematically, RANSAC minimizes the loss function:
L(m, b) = Σi ρ(yi − (m·xi + b))
where m and b are the slope and intercept of the line, respectively, and ρ is a robust loss that caps the influence of large residuals; in the classic formulation it simply counts a point as an inlier when its residual falls below a threshold. The goal is to find the line that best represents the normal behavior of the data, ignoring the extreme outliers.
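To demystify the procedure, here is a from-scratch sketch of the RANSAC loop (random point pairs, candidate lines, consensus counting); the threshold, iteration count, and data are illustrative assumptions, and the full example below uses scikit-learn's RANSACRegressor instead:
import numpy as np

def simple_ransac(x, y, n_iters=200, threshold=1.0, seed=0):
    """Toy RANSAC: fit y = m*x + b by maximizing the consensus set."""
    rng = np.random.default_rng(seed)
    best_inliers, best_params = None, None
    for _ in range(n_iters):
        # 1. Randomly sample two points and draw the candidate line through them
        i, j = rng.choice(len(x), size=2, replace=False)
        if x[i] == x[j]:
            continue  # vertical line, skip
        m = (y[j] - y[i]) / (x[j] - x[i])
        b = y[i] - m * x[i]
        # 2. Inliers are points whose residual is within the threshold
        inliers = np.abs(y - (m * x + b)) < threshold
        # 3. Keep the line with the largest consensus set
        if best_inliers is None or inliers.sum() > best_inliers.sum():
            best_inliers, best_params = inliers, (m, b)
    return best_params, best_inliers

# Illustrative data: a noisy line with a few gross outliers injected
rng = np.random.default_rng(1)
x = np.arange(30, dtype=float)
y = 0.5 * x + 2 + rng.normal(0, 0.3, size=30)
y[[5, 12, 25]] += 15.0
(m, b), mask = simple_ransac(x, y)
print(f"Estimated slope: {m:.2f}, intercept: {b:.2f}")
print("Outliers detected at indices:", np.where(~mask)[0])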
After applying RANSAC, we further analyze the distribution of our differences by computing quantiles.
Suppose X is a random variable with cumulative distribution function F(x) = Pr(X ≤ x). Then, for any probability level p ∈ [0,1], the p-th quantile Qp is defined as:
Qp = inf{x : F(x) ≥ p}
Where:
F(x) tells us the probability that X takes on a value less than or equal to x.
The set {x:F(x)≥p} consists of all values x for which the probability of X≤x is at least p.
The infimum of that set is the smallest such value, making it the p-th quantile.
For example:
Q0.90 = inf{x:F(x)≥0.9} is the 90th quantile.
Q0.10 is the 10th quantile, etc.
Because quantiles are non-parametric, they don't assume any specific underlying distribution shape—like the normal distribution. This makes them especially valuable for analyzing data that has been pre-filtered by RANSAC, ensuring that our analysis remains robust even in the presence of outliers.
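A quick empirical check of this definition, on an illustrative random sample: take the smallest sample value whose empirical CDF reaches p, and compare it with np.quantile:
import numpy as np
# Illustrative sample
np.random.seed(7)
sample = np.sort(np.random.randn(1000))
p = 0.90
# Empirical CDF at each sorted value: F(x_k) = k / n
ecdf = np.arange(1, len(sample) + 1) / len(sample)
# Smallest sample value whose empirical CDF reaches p: the inf in the definition
q_inf = sample[np.searchsorted(ecdf, p)]
# np.quantile interpolates by default, so the two agree only approximately
print("inf-based quantile:", q_inf)
print("np.quantile:", np.quantile(sample, p))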
Here, everyone should choose their stop-loss and take-profit based on their own strategy, but as I was telling a colleague the other day:
if your strategy doesn’t work well from the start, something is wrong, and it is better for the system to close with a small loss and look for alpha elsewhere.
Alternatively, you can choose multiple quantiles to have multiple targets. Let's see the whole example:
import numpy as np
import matplotlib.pyplot as plt
from scipy.signal import find_peaks
from sklearn.linear_model import RANSACRegressor
# ---------------------------------------------------------
# 1. Simulate stock price data
# ---------------------------------------------------------
np.random.seed(42)
time = np.arange(50) # Time steps
stock_price = np.cumsum(np.random.randn(50)) + 100 # Random walk around 100
# ---------------------------------------------------------
# 2. Identify peaks and bottoms
# ---------------------------------------------------------
peaks, _ = find_peaks(stock_price)
bottoms, _ = find_peaks(-stock_price)
# ---------------------------------------------------------
# 3. Calculate differences between matching peaks and bottoms
# ---------------------------------------------------------
differences = []
for p, b in zip(peaks, bottoms):
    diff = stock_price[p] - stock_price[b]
    differences.append(diff)
differences = np.array(differences) # Convert to NumPy array
X = np.arange(len(differences)).reshape(-1, 1) # Indices for RANSAC
# ---------------------------------------------------------
# 4. Apply RANSAC to the differences
# ---------------------------------------------------------
ransac = RANSACRegressor().fit(X, differences)
inlier_mask = ransac.inlier_mask_ # Boolean mask of inliers
outlier_mask = ~inlier_mask
# Extract RANSAC parameters (slope & intercept for the best-fit line)
slope = ransac.estimator_.coef_[0]
intercept = ransac.estimator_.intercept_
print("RANSAC slope:", slope)
print("RANSAC intercept:", intercept)
# ---------------------------------------------------------
# 5. Interpret RANSAC output as 'optimum target'
# ---------------------------------------------------------
# Option A: Predicted value at the last inlier index
pred_optimum_last = ransac.predict([[X[-1][0]]])
print("Predicted optimum target at last index:", pred_optimum_last)
# Option B: Average of inlier predictions as a single optimum target
pred_all = ransac.predict(X)
pred_optimum_avg = np.mean(pred_all[inlier_mask])
print("Predicted optimum target with average of inlier predictions:", pred_optimum_avg)
# ---------------------------------------------------------
# 6. Compute additional targets via quantiles (60th, 70th) on inliers
# ---------------------------------------------------------
inlier_differences = differences[inlier_mask]
q60 = np.quantile(inlier_differences, 0.60)
q70 = np.quantile(inlier_differences, 0.70)
print("60th quantile (inliers only):", q60)
print("70th quantile (inliers only):", q70)
# ---------------------------------------------------------
# 7. Visualize the results
# ---------------------------------------------------------
fig, ax = plt.subplots(1, 1, figsize=(10, 5))
ax.scatter(X[inlier_mask], differences[inlier_mask], c='blue', label='Inliers')
ax.scatter(X[outlier_mask], differences[outlier_mask], c='red', label='Outliers')
# RANSAC best-fit line
line_x = np.linspace(X.min(), X.max(), 100).reshape(-1, 1)
line_y = ransac.predict(line_x)
ax.plot(line_x, line_y, color='orange', label='RANSAC Fit')
ax.set_title("Differences (peak - bottom) with RANSAC regression")
ax.set_xlabel("Index")
ax.set_ylabel("Difference value")
ax.legend()
plt.tight_layout()
plt.show()
What is the output of this script? First, the chart: inlier differences in blue, outliers in red, and the RANSAC fit line in orange.
And the numerical data:
RANSAC slope: -0.022971148820770435
RANSAC intercept: 0.40051416530672224
Predicted optimum target at last index: [0.10188923]
Predicted optimum target with average of inlier predictions: 0.2659688650707811
60th quantile (inliers only): 0.28533330818357283
70th quantile (inliers only): 0.32653746974535236
We got two different outputs, both of them valid:
Option A: Predict at the final index (or any specific index) to get a direct target value. I prefer this one.
Option B: Average all inlier predictions for a stable single-number target.
Besides, we filter the data to inliers only (via inlier_mask) and take the 60th and 70th quantiles of these inlier differences, since they can serve as additional target thresholds. For instance, in trading logic, you might treat these as partial take-profit or stop-loss levels; because the differences are magnitudes, the same quantile can be used for both types of exit target.
The exit strategy based on partial exits
The exit strategy based on these new quantiles works as follows:
Establishing the exit baseline (optimum profit target):
Using RANSAC, we derive a robust baseline which represents the central tendency of our peak-to-valley differences. This baseline tells us the typical or expected magnitude of a price move without being skewed by outliers.
Defining secondary profit targets with quantiles:
From the inlier data we compute higher quantiles, like the 60th and 70th. You can use even higher quantiles, but for the sake of this example we will continue with those:
60th quantile: This level marks a moderately favorable price move. When the price reaches it, you might consider taking partial profits.
70th quantile: This higher threshold indicates an even more significant move. If the price hits this level, it suggests the market has moved very favorably, and you might take more profit or close the position.
Stop-loss consideration:
Similarly, higher quantiles can serve as stop-loss thresholds. If the price moves unfavorably beyond these levels (entry ± the RANSAC-derived values), it signals a potential reversal or significant downturn, prompting you to exit the position to minimize losses. We use higher quantiles because we have calculated the differences between peaks and bottoms. A minimal sketch of this partial-exit logic follows after the list.
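To make this concrete, here is a toy sketch of the partial-exit rules above, assuming a long position and reusing the q60/q70 values from the script’s printed output; the stop distance and the check_exit helper are illustrative assumptions, not a production exit engine:
def check_exit(entry_price, current_price, q60, q70, q_stop):
    """Toy partial-exit rules for a long position (illustrative only)."""
    move = current_price - entry_price
    if move <= -q_stop:
        return "stop-loss: close the full position"
    if move >= q70:
        return "take-profit 2: close the remaining position"
    if move >= q60:
        return "take-profit 1: take partial profits"
    return "hold"

# Quantile targets taken from the script's printed output above
q60, q70 = 0.2853, 0.3265
q_stop = 0.3265  # assumption: reuse the higher quantile as the stop distance
entry = 100.0
for price in [100.10, 100.30, 100.40, 99.60]:
    print(price, "->", check_exit(entry, price, q60, q70, q_stop))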
Pretty cool, right? Not only does this method provide a more reliable estimate of risk and reward for non-normal data, but it also opens up new avenues for research and practical applications.
Until next time—may your trades stay profitable, your backtests tell the truth, and your edge never fade! 📈
P.S. What is your level in quantitative finance? Let me know!