trajectory collection + analysis pipeline.
Goal:
collect 10k-50k finished token trajectories
↓
find historical edges
↓
build live LONG/SHORT signal engine
There are only 3 important stages.
Collect live tokens + finished trajectories.
Files:
- official_api_collector.py
- fast_finished_collector.py
Result:
- token_snapshots
- official_api_token_state
- finished_tokens
Convert trajectories into features + historical strategies.
Files:
- analyze.py
- paper_sim.py
- sweep.py
- clusters.py
- auto_run.py
Result:
- trajectory_features
- paper_trades
- strategy_sweep
- trajectory_clusters
Use historical edges on new live tokens.
Files:
- live_features.py
- live_signals.py
- telegram_bot.py
Result:
LONG / SHORT / SKIP
risk
suggested leverage
historical setup match
Right now the ONLY important thing is:
collect as many finished trajectories as possible
Target:
10k minimum
50k ideal
Do NOT focus on real trading yet.
Uses official public GraphQL API.
Collects:
- live token state
- price
- volume
- buys/sells
- start/end dates
- speed mode
Writes:
- token_snapshots
- official_api_token_state
Runs 24/7.
Browser collector using logged-in browser profile.
Purpose:
collect finished trajectories
Only scans:
expired + ready tokens
Writes:
- finished_tokens
Runs 24/7.
Runs:
analyze.py
paper_sim.py
sweep.py
Use periodically.
Groups token trajectories into similar behavior clusters.
Used later for:
pattern matching
Monitoring only for now.
Later:
live trading signals
Live snapshots over time.
One token can create many rows.
90k snapshots != 90k tokens
Current latest state of each token.
One row per token.
Final completed trajectories.
This is the MOST IMPORTANT dataset.
Calculated statistics/features for trajectories.
Historical strategy backtests.
These require browser session / Cloudflare bypass.
fast_finished_collector.py
Reason:
needs persistent logged-in browser profile
Safe/stateless services.
official_api_collector.py
telegram_bot.py
Possible later:
auto_run.py scheduler
Copy:
.env.example
→
.env
Important variables:
DATABASE_URL=
CATAPULT_API_KEY=LOCAL_PROFILE_DIR=browser_profile_fast2
FAST_FINISHED_READY_DELAY_SECONDS=90
FAST_FINISHED_USE_OFFICIAL_READY_QUEUE=true
FAST_FINISHED_ALLOW_SNAPSHOT_FALLBACK=falsepython official_api_collector.pypython fast_finished_collector.pypython auto_run.py
python clusters.pysaved >> not_ready
Example:
saved: 500
not_ready: 20
not_ready >> saved
Usually means:
ready delay too low
Increase:
FAST_FINISHED_READY_DELAY_SECONDS=12010k finished trajectories
25k+
50k+
Later the pipeline becomes:
token appears
↓
live features
↓
historical pattern matching
↓
LONG / SHORT / SKIP
↓
risk + leverage suggestion
↓
telegram alert