Purged cross-validation

4 days ago 7

Italic

← Previous revision		Revision as of 14:50, 5 July 2025
Line 2:		Line 2:
	{{Orphan\|date=June 2025}}		{{Orphan\|date=June 2025}}

	'''Purged cross-validation''' is a variant of k-fold [[Cross-validation (statistics)\|cross-validation]] designed to prevent look-ahead bias in time series and other structured data, developed in 2017 by [[Marcos López de Prado]] at [[Guggenheim Partners]] and [[Cornell University]].<ref name="Lopez2018" /> It is primarily used in financial [[machine learning]] to ensure the independence of training and testing samples when labels depend on future events. It provides an alternative to conventional cross-validation and walk-forward [[backtesting]] methods, which often yield overly optimistic performance estimates due to information leakage and overfitting.<ref name="JPM" /><ref name="JCF" />		'''Purged cross-validation''' is a variant of ''k''-fold [[Cross-validation (statistics)\|cross-validation]] designed to prevent look-ahead bias in time series and other structured data, developed in 2017 by [[Marcos López de Prado]] at [[Guggenheim Partners]] and [[Cornell University]].<ref name="Lopez2018" /> It is primarily used in financial [[machine learning]] to ensure the independence of training and testing samples when labels depend on future events. It provides an alternative to conventional cross-validation and walk-forward [[backtesting]] methods, which often yield overly optimistic performance estimates due to information leakage and overfitting.<ref name="JPM" /><ref name="JCF" />

	== Motivation ==		== Motivation ==