Timestamp continuity
Every symbol and timeframe is checked for missing bars, duplicate timestamps, non-monotonic order, and weekend/session artifacts.
HistoricalFX turns raw historical price archives into validation-ready datasets for backtesting, Python research, CSV workflows, and MetaTrader imports. The commercial value is not just access to prices; it is reducing the cleaning, conversion, and data-quality work traders usually have to do themselves.
The business is being built around auditable data releases. The current manifest ties the paid R2 bundle to the local release-quality report, so validation claims can be proven release by release instead of hand-waved.
This manifest describes the current uploaded paid R2 bundle and the local release-quality audit. Coverage varies by symbol; use pair-level reports before making strategy assumptions.
Last manifest generated: 2026-06-06T00:57:17.159Z
The goal is to make historical FX data boring: predictable columns, consistent timestamps, reproducible exports, and fewer hidden assumptions in every backtest.
Every symbol and timeframe is checked for missing bars, duplicate timestamps, non-monotonic order, and weekend/session artifacts.
Bars are rejected or flagged when high/low/open/close relationships are impossible, malformed, or outside expected market structure.
Outlier moves are detected against neighboring bars and higher-timeframe ranges so obvious feed spikes do not poison a backtest.
M5, M15, H1, H4, daily, and weekly files are generated from the same normalized minute base to keep timeframes internally consistent.
Parquet exports are loaded after generation to verify schema, date parsing, numeric columns, and row counts. CSV and MetaTrader conversion files are verified separately when rebuilt.
Dataset builds are scripted so future refreshes can be regenerated, compared, and audited instead of hand-edited file by file.
This is the core product engine. As the business matures, the same pipeline can power subscriptions, commercial licenses, API access, and custom validation reports.
Ingest raw minute archives by symbol and month.
Normalize timestamps and column names into one canonical OHLCV schema.
Remove duplicates and impossible bars.
Detect gaps, quiet-session edge cases, weekend bars, and suspicious spikes.
Generate derived timeframes from the normalized minute base.
Export buyer-ready Parquet packages; rebuild CSV and MetaTrader conversion files only when matching delivery artifacts are ready.
Run post-export validation and sample-load checks before release.
Raw data is easy to compare on price. Cleaned, documented, reproducible data can become infrastructure. The same validation layer can support one-time downloads, commercial licenses, API subscriptions, and custom data-quality work for teams.
Download sample →The current product focuses on audited OHLCV bar releases. The next defensible layers are expanded historical rebuilds, commercial licensing, refreshed data builds, and API access for repeat users.
support@...