Indian-trading-skills backtest-expert

install

source · Clone the upstream repo

git clone https://github.com/ajeeshworkspace/indian-trading-skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/ajeeshworkspace/indian-trading-skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/backtest-expert" ~/.claude/skills/ajeeshworkspace-indian-trading-skills-backtest-expert && rm -rf "$T"

manifest: skills/backtest-expert/SKILL.md

source content

Backtest Expert — Indian Market Strategy Validation

Core Philosophy

"Find strategies that break the least, not profit the most."

A strategy that survives stress testing across multiple market regimes, transaction cost assumptions, and parameter perturbations is far more valuable than one that shows spectacular returns on a single optimized parameter set. Overfitting is the silent killer of trading accounts.

6-Step Backtesting Workflow

Step 1: State the Hypothesis (1 Sentence Edge)

Before writing a single line of code, articulate why the strategy should work in one clear sentence.

Good hypotheses:

"Stocks that gap up >3% on above-average volume after consolidation tend to continue higher for 2-5 days on NSE."
"Nifty 50 stocks that revert to their 20-day mean after RSI drops below 30 produce positive expectancy within 5 trading sessions."
"Selling strangles on Bank Nifty on Wednesday expiry with delta <0.15 captures time decay faster than gamma risk materializes."

Bad hypotheses:

"This indicator combination looks good on the chart." (no edge articulated)
"I saw someone on Twitter making money with this." (no reasoning)

Ask yourself:

What behavioral or structural edge am I exploiting?
Why would this edge persist? (Structural > Behavioral > Statistical)
Who is on the other side of this trade, and why are they losing?

Step 2: Codify Rules (No Ambiguity)

Every rule must be binary — a computer must be able to execute it without interpretation.

Rule Categories

Category	What to Define	Example
Universe	Which stocks/instruments	Nifty 200 constituents, F&O stocks only, market cap >5000 Cr
Entry	Exact trigger conditions	Close > 20 EMA AND RSI(14) crosses above 40 AND volume > 1.5x 20-day avg
Exit — Target	Profit-taking rule	Close 3% above entry OR trailing stop of 1.5 ATR
Exit — Stop	Loss-cutting rule	Close below entry-day low OR 2% fixed stop
Exit — Time	Maximum holding period	Exit after 10 trading sessions if neither target nor stop hit
Position Sizing	How much capital per trade	5% of equity per position, max 10 concurrent positions
Filters	When NOT to trade	Skip if stock is in F&O ban period, skip 2 days around results

India-Specific Rules to Consider

Circuit limits: Stocks hitting upper/lower circuit cannot be exited. Define handling.
F&O ban period: Stocks crossing 95% MWPL cannot add fresh F&O positions.
T+1 settlement: Cash equity settles next trading day (changed from T+2 in 2023).
Pre-open session: 9:00-9:08 AM orders, 9:08-9:15 AM matching. Define if you use pre-open.
Muhurat trading: Special Diwali session — include or exclude?
Corporate actions: Adjust for splits, bonuses, dividends, rights issues.

Step 3: Run Initial Backtest

Minimum Requirements

Parameter	Minimum	Recommended
Time period	5 years	8-10+ years
Number of trades	100	200+
Market regimes covered	2 (bull + bear)	4+ (bull, bear, sideways, high-vol)
Data quality	Adjusted for corporate actions	Survivorship-bias-free universe

Indian Market Regimes to Cover

Regime	Period Examples	Characteristics
Bull market	2014-2017, 2020-2021	Nifty trending up, broad participation
Bear market	2008, 2020 (Mar), 2022 (Jun)	Sharp drawdowns, high correlation
Sideways/Range	2018-2019, 2023 H1	Nifty in 10% range, stock-specific moves
High volatility	2008, 2020, Budget days	India VIX > 25
Low volatility	2017, 2021 H2	India VIX < 15
Pre/Post Budget	Every Feb 1	Gap moves, policy-driven sectors
Election cycle	2014, 2019, 2024	Uncertainty then rally pattern
Monsoon impact	Jun-Sep annually	Agri, FMCG, rural economy impact
RBI policy shifts	Rate hike/cut cycles	Banking, NBFC, rate-sensitive sectors
Global crude shock	2018, 2022	INR weakness, OMC impact, inflation

Key Metrics to Record

Returns: CAGR, total return, monthly returns distribution
Risk: Max drawdown, average drawdown, drawdown duration, Calmar ratio
Efficiency: Sharpe ratio (use 6% risk-free for India), Sortino ratio
Trade quality: Win rate, avg win/loss, profit factor, expectancy per trade
Consistency: % profitable months, worst month, longest losing streak

Step 4: Stress Test (Spend 80% of Your Time Here)

This is where most backtests fail — and where the real value lies.

4a. Parameter Sensitivity

Perturb every parameter by +/-20% and check if performance degrades gracefully or collapses.

Parameter	Base	-20%	-10%	+10%	+20%	Verdict
EMA period	20	16	18	22	24	Stable if all profitable
RSI threshold	40	32	36	44	48	Fragile if only 40 works
Stop loss %	2%	1.6%	1.8%	2.2%	2.4%	Check drawdown impact

Rule of thumb: If the strategy only works with exact parameter values, it is overfit. You want a "plateau" of profitability, not a "peak."

4b. Execution Friction (India-Specific Costs)

Apply realistic transaction costs:

Cost Component	Delivery (CNC)	Intraday (MIS)	F&O
Brokerage	~₹20/order or 0.03%	~₹20/order or 0.03%	~₹20/order
STT	0.1% (buy+sell)	0.025% (sell only)	0.0125% (sell, options)
Exchange charges	0.00345% (NSE)	0.00345% (NSE)	0.05% (options)
GST	18% on brokerage+exchange	18% on brokerage+exchange	18% on brokerage+exchange
Stamp duty	0.015% (buy)	0.003% (buy)	0.003% (buy)
SEBI charges	0.0001%	0.0001%	0.0001%
Slippage	0.05-0.1% large-cap	0.1-0.2% mid-cap	0.1-0.3% options

Total round-trip cost estimates:

Delivery large-cap: ~0.3-0.5%
Intraday large-cap: ~0.1-0.2%
F&O (options): ~0.15-0.4%
Small-cap delivery: ~0.5-1.0% (wider spreads)

4c. Time Robustness

Split data into 3-year rolling windows. Is the strategy profitable in each?
Check year-by-year returns. Is any single year driving total performance?
Remove the best month. Is the strategy still positive?

4d. Sample Size Validation

Minimum 30 trades for any statistical claim (even this is weak)
100+ trades: Moderate confidence
200+ trades: Good confidence
Use the t-test: Is average trade return significantly different from zero?

Step 5: Out-of-Sample Validation (Walk-Forward Analysis)

Never skip this step.

Walk-Forward Method for Indian Markets

In-sample period: Train on 5 years of data (e.g., 2015-2019)
Out-of-sample period: Test on next 1-2 years (e.g., 2020-2021)
Roll forward: Move window, retrain on 2016-2020, test on 2021-2022
Combine: Aggregate all out-of-sample periods for true performance estimate

Walk-Forward Efficiency (WFE):

WFE = Out-of-Sample Return / In-Sample Return

WFE > 50%: Good — strategy generalizes
WFE 30-50%: Acceptable — some overfitting present
WFE < 30%: Poor — likely overfit

Paper Trading Validation

Before deploying capital, paper trade for at least:

30 trades minimum
2 months minimum
Cover at least one volatile period (expiry week, results season, RBI policy)

Step 6: Evaluate Results (Deploy / Refine / Abandon)

Use the evaluation script to get an objective score:

python3 evaluate_backtest.py \
  --total-trades 150 \
  --win-rate 62 \
  --avg-win-pct 1.8 \
  --avg-loss-pct 1.2 \
  --max-drawdown-pct 15 \
  --years-tested 8 \
  --num-parameters 3 \
  --slippage-tested

Decision Framework

Score	Verdict	Action
80-100	Deploy	Size small initially (25% of intended), scale up over 50+ live trades
60-79	Refine	Identify weakest dimension, address it, re-test
40-59	Refine with caution	Multiple issues — may not be salvageable
0-39	Abandon	Fundamental edge likely does not exist. Document lessons and move on.

Before Deploying

Using Broker MCP Tools for Backtesting Support

While the MCP tools are not backtesting engines, they support the process. Use whichever broker is connected:

Groww MCP (if connected)

fetch_historical_candle_data
: Fetch OHLCV data for strategy development and spot-checking
get_historical_technical_indicators
: Calculate indicators (SMA, EMA, RSI, MACD, Bollinger, SuperTrend, etc.) on historical data
get_historical_candlestick_patterns
: Identify candle patterns in historical data
fetch_stocks_fundamental_data
: Screen for universe construction (PE, ROE, market cap filters)
fetch_fundamentals_screener
: Natural language screening for universe building
fetch_technical_screener
: Technical screening for strategy ideas
get_ltp
: Current price for live validation

fetch_market_movers_and_trending_stocks_funds
: Discover momentum and volume patterns

Zerodha Kite MCP (if connected)

get_historical_data
: Fetch OHLCV candle data for strategy development
get_ltp
/ get_quotes
: Current prices for live validation
search_instruments
: Find instruments for universe construction
get_holdings
/ get_positions
: Verify live portfolio against strategy signals

Quick Reference: Red Flags

Red Flag	Why It Matters
CAGR > 50% with no drawdowns	Too good to be true — check for look-ahead bias
Win rate > 80%	Likely not accounting for slippage or adverse fills
Only works on specific parameters	Overfitting — no edge, just noise
< 50 trades in backtest	Statistically meaningless
No losing months in 5+ years	Data error or survivorship bias
Strategy stops working after 2020	Market structure may have changed (T+1, algo proliferation)
Uses > 5 parameters	Degrees of freedom too high — curve-fitted
No transaction costs modeled	Real returns could be negative
Tested on Nifty 50 only	Survivorship bias in universe selection

Files in This Skill

```
scripts/evaluate_backtest.py
```
— CLI scoring tool for backtest evaluation
```
references/methodology.md
```
— Comprehensive backtesting methodology for Indian markets
```
references/failed_tests.md
```
— Common failure patterns and documentation framework