Duon Labs Logo
TRANSPARENT BENCHMARKS

VOYONS MODELS ARENA

Live rankings tracking Voyons model performance. Every forecast is measured. Every comparison is transparent.

6
Active Models
6
Ranked Models
1517
Top Rating
5,235
Total Games
QUALIFIED MODELS

OFFICIAL RANKINGS

RANK MODEL
1
voyons-nano-25.6-release
Rating: 1517
Win Rate: 54%
Released: Jun 01, 2025
2
voyons-tiny-25.7-early
Rating: 1505
Win Rate: 51%
Released: Jul 01, 2025
3
voyons-nano-25.7-release
Rating: 1502
Win Rate: 50%
Released: Jul 01, 2025
4
voyons-tiny-25.6-release
Rating: 1493
Win Rate: 48%
Released: Jun 01, 2025
5
voyons-tiny-25.7-release
Rating: 1489
Win Rate: 47%
Released: Jul 01, 2025
6
voyons-tiny-25.5-release
Rating: 1441
Win Rate: 35%
Released: May 01, 2025
Mathematical Formulas
EVALUATION METHODOLOGY

HOW WE MEASURE PERFORMANCE

Duon Labs maintains transparent benchmarks for all Voyons models in production. Models are continuously evaluated by forecasting top cryptocurrency pairs across multiple timeframes (15m, 1h, 1d) in real-time market conditions. Each variant competes head-to-head on identical forecasting tasks to determine relative performance.

CONTINUOUS MARKET EVALUATION

Real-time forecasting across crypto markets.

Models continuously forecast major cryptocurrency pairs (BTC/USDT, ETH/USDT, and more) across multiple timeframes: 15-minute for rapid movements, 1-hour for intraday patterns, and daily for longer trends. This diverse evaluation ensures models perform well across different market conditions and trading horizons.

timeframes: [15m, 1h, 1d] × top_crypto_pairs

EXTROPY-BASED SCORING

The metric that matters: forecast precision.

Each model's forecast quality is measured using extropy, an information-theoretic metric. We evaluate precision by using the input candle data to forecast the next 3 candles, then calculate extropy between the actual samples and forecasted samples. This measures how well the model captures the underlying patterns in the data.

response_last_candle_extropy = AVG(last_3_candles_extropies)

GLICKO-2 RATING SYSTEM

Advanced skill-based rankings.

We use the Glicko-2 rating system to rank models based on head-to-head performance. Each model starts at 1500 rating points. Winning against stronger opponents provides more rating gain, while losing to weaker opponents results in larger rating drops. The system converges quickly to each model's true skill level.

starting_rating: 1500 | higher_is_better

QUALIFICATION CRITERIA

Statistical significance before ranking.

To ensure meaningful rankings, models must prove themselves through volume and diversity: minimum 100 games played against at least 2 different opponents. This ensures that rankings reflect consistent performance across varied competition rather than isolated results against a single opponent.

qualified: games ≥ 100 AND opponents ≥ 2

EXPLORE VOYONS

Experience the power of Voyons' probabilistic forecasting. Access these models through our API and build your own applications.