RADE V2 prototyping #42

Open
drowe67 wants to merge 168 commits into main from dr-radev2

Conversation


drowe67 commented Jan 17, 2025

Bandwidth/PAPR

Exploring ideas to improve the 99% power bandwidth (spectral mask) relative to RADE V1. Just prototyping with "mixed rate" training and inference, i.e. no pilots or CP, and genie phase.

  • Worked out how to put a BPF in the training loop (a conv1d layer with training disabled); see the sketch after this list
  • Takeaway: phase-only (0 dB PAPR) works quite well
  • Clip-BPF x 3 produces a reasonable 99% power BW, 0 dB PAPR, and good loss
  • Document ML EQ training and inference in README.md when we get to the final V2 version. Just collect notes here in the comments until then
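
A minimal sketch of the fixed-BPF idea, assuming a PyTorch training graph; the filter design, tap count, and class name below are illustrative, not the actual train.py code. The band-pass FIR sits in a Conv1d layer whose weights are frozen, so gradients flow through it back to the transmitter while the taps themselves are never updated:

import numpy as np
import torch
import torch.nn as nn
from scipy.signal import firwin

class FixedTxBPF(nn.Module):
    # Band-pass FIR inside the training loop; taps are fixed (training disabled)
    def __init__(self, fs=8000, f_low=400, f_high=2400, ntaps=101):
        super().__init__()
        taps = firwin(ntaps, [f_low, f_high], pass_zero=False, fs=fs)
        self.bpf = nn.Conv1d(1, 1, ntaps, padding=ntaps // 2, bias=False)
        self.bpf.weight.data = torch.tensor(taps, dtype=torch.float32).view(1, 1, -1)
        self.bpf.weight.requires_grad = False   # gradients pass through, taps never update

    def forward(self, tx):                      # tx: (batch, 1, samples), real samples
        return self.bpf(tx)

Because the firwin taps are symmetric (linear phase), the cross-correlation convention of Conv1d gives the same result as true convolution.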

Training:

python3 train.py --cuda-visible-devices 0 --sequence-length 400 --batch-size 512 --epochs 200 --lr 0.003 --lr-decay-factor 0.0001 ~/Downloads/tts_speech_16k_speexdsp.f32 250117_test --bottleneck 3 --h_file h_nc20_train_mpp.f32 --range_EbNo --plot_loss --auxdata --txbpf
Epoch 200 Loss 0.116

Testing:

./inference.sh 250117_test/checkpoints/checkpoint_epoch_200.pth wav/brian_g8sez.wav - --bottleneck 3 --auxdata --write_tx tx_bpf.f32 --write_latent z.f32 --txbpf
          Eb/No   C/No     SNR3k  Rb'    Eq     PAPR
Target..: 100.00  133.01   98.24  2000
Measured: 102.89          101.12       1243.47  0.00
loss: 0.121 BER: 0.000

octave:154> radae_plots; do_plots('z.f32','tx_bpf.f32')
bandwidth (Hz): 1255.813953 power/total_power: 0.990037

Red lines mark 99% power bandwidth:

[Screenshot from 2025-01-22: Tx spectrum, red lines marking the 99% power bandwidth]
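
For reference, a numpy sketch of the two figures quoted above (PAPR and 99% power bandwidth), assuming tx_bpf.f32 holds interleaved float32 I/Q at Fs = 8000 Hz; the exact file format and the bandwidth definition used in radae_plots.m may differ slightly:

import numpy as np

fs = 8000
raw = np.fromfile("tx_bpf.f32", dtype=np.float32)
tx = raw[0::2] + 1j * raw[1::2]                 # assumed interleaved I/Q

# PAPR: peak power over mean power, in dB
papr_dB = 10 * np.log10(np.max(np.abs(tx) ** 2) / np.mean(np.abs(tx) ** 2))

# 99% power (occupied) bandwidth: trim 0.5% of total power from each spectral tail
S = np.abs(np.fft.fftshift(np.fft.fft(tx))) ** 2
f = np.fft.fftshift(np.fft.fftfreq(len(tx), 1 / fs))
cum = np.cumsum(S) / np.sum(S)
bw_Hz = f[np.searchsorted(cum, 0.995)] - f[np.searchsorted(cum, 0.005)]

print(f"PAPR: {papr_dB:.2f} dB  99% power bandwidth: {bw_Hz:.1f} Hz")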

ML EQ

Classical DSP:

python3 ml_eq.py --eq dsp --notrain --EbNodB 4 --phase_offset

MSE loss function:

python3 ml_eq.py --EbNodB 4 --phase_offset --lr 0.001 --epochs 100

Phase loss function:

python3 ml_eq.py --EbNodB 4 --phase_offset --lr 0.001 --epochs 100 --loss_phase
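
A sketch of the two loss functions being compared, assuming the EQ network produces an estimated symbol y_hat against a reference y carried as real/imag pairs; the exact tensors and weighting in ml_eq.py may differ. MSE penalises both amplitude and phase error, while the phase loss ignores amplitude, which may suit a phase-only (0 dB PAPR) waveform:

import torch

def mse_loss(y_hat, y):
    # y_hat, y: (..., 2) real/imag pairs
    return torch.mean((y_hat - y) ** 2)

def phase_loss(y_hat, y):
    # Penalise only the angle between estimated and reference symbols:
    # 1 - cos(delta_theta), computed from the normalised inner product
    dot = y_hat[..., 0] * y[..., 0] + y_hat[..., 1] * y[..., 1]
    mag = (torch.norm(y_hat, dim=-1) * torch.norm(y, dim=-1)).clamp_min(1e-9)
    return torch.mean(1.0 - dot / mag)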


drowe67 commented Feb 3, 2025

Frame 2 EQ examples

  1. Ideal (perfect EQ)

    python3 ml_eq.py --frame 2 --notrain --eq bypass --EbNodB 4
    <snip>
    EbNodB:  4.00 n_bits: 240000 n_errors: 3027 BER: 0.013
    
  2. Classical DSP linear EQ (see the single-tap sketch after this list):

    python3 ml_eq.py --eq dsp --notrain --EbNodB 4 --phase_offset --frame 2
    <snip>
    EbNodB:  4.00 n_bits: 240000 n_errors: 3921 BER: 0.016
    
  3. ML EQ (using MSE loss function):

    python3 ml_eq.py --frame 2 --lr 0.1 --epochs 100 --EbNodB 4 --phase_offset --n_syms 1000000 --batch_size 128
    <snip>
    EbNodB:  4.00 n_bits: 24000000 n_errors: 437933 BER: 0.018
    

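For context, a sketch of a classical single-tap per-carrier EQ of the sort the dsp option might use, assuming one known reference symbol per carrier in each frame; the actual ml_eq.py frame structure and DSP EQ are likely different in detail:

import numpy as np

def single_tap_eq(rx_data, rx_ref, tx_ref):
    # rx_data: (n_carriers, n_data) received data symbols
    # rx_ref, tx_ref: (n_carriers,) received and known reference symbols
    h_hat = rx_ref / tx_ref                       # per-carrier channel estimate
    return rx_data * np.conj(h_hat)[:, None] / (np.abs(h_hat) ** 2 + 1e-12)[:, None]
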

drowe67 commented Feb 4, 2025

ML waveform training

  1. Generate a 10-hour complex h file:
    Fs=8000; Rs=50; Nc=20; multipath_samples('mpp', Fs, Rs, Nc, 10*60*60, 'h_nc20_train_mpp.c64',"",1);
    
  2. Training (a sketch of applying the complex fading file is shown after this list):
    python3 train.py --cuda-visible-devices 0 --sequence-length 400 --batch-size 512 --epochs 200 --lr 0.003 --lr-decay-factor 0.0001 ~/Downloads/tts_speech_16k_speexdsp.f32 250204_test --bottleneck 3 --h_file h_nc20_train_mpp.c64 --h_complex --range_EbNo --plot_loss --auxdata
    

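A sketch of how the complex fading file might be applied during training, assuming h_nc20_train_mpp.c64 holds complex64 per-carrier gains, frame by frame, for Nc = 20 carriers; the real train.py data path and noise scaling may differ:

import numpy as np
import torch

Nc = 20
h = np.fromfile("h_nc20_train_mpp.c64", dtype=np.complex64).reshape(-1, Nc)

def apply_channel(tx_syms, frame_idx, EbNodB):
    # tx_syms: (batch, Nc) complex tx symbols for one modem frame
    g = torch.from_numpy(h[frame_idx])            # (Nc,) complex fading gains
    rx = tx_syms * g                              # per-carrier multiplicative fading
    EsNo = 10.0 ** (EbNodB / 10.0)                # illustrative noise scaling
    sigma = (1.0 / (2.0 * EsNo)) ** 0.5
    noise = sigma * torch.complex(torch.randn_like(rx.real), torch.randn_like(rx.real))
    return rx + noise
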

drowe67 commented Jan 13, 2026

Jan 2026 Timing corner case

While extending ota_test.sh to cover V2, it was discovered that the timing estimator sometimes returns values that upset the V2 decoder, leading to high loss. To reproduce, we can use the ota_test.sh test framework:

./test/ota_test_cal.sh ~/codec2-dev/build_linux/ -30 0.4 --mpp --freq -25
./ota_test.sh -d -r rx.wav -l wav/brian_g8sez.wav
<snip>
+ python3 loss.py brian_g8sez_features_in.f32 brian_g8sez_features_out_tx1.f32 --features_hat2 features_out_rx1.f32 --compare
loss1: 0.121 loss2: 0.299 delta: 0.178
+ python3 loss.py brian_g8sez_features_in.f32 brian_g8sez_features_out_tx2.f32 --features_hat2 features_out_rx2.f32 --compare
loss1: 0.091 loss2: 3.416 delta: 3.326

Note the very high loss2.

The interaction between the timing estimator and the ML decoder hasn't been well understood to date. A timing model has been developed and written up in Section 11 of 2025_rade_hf3. In V1 the pilot-based phase correction took care of timing offsets; in V2 the network must be able to handle them.
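
To make the timing sensitivity concrete, a small numpy sketch, assuming a DFT-based multicarrier demod like V1's: a timing offset of delta samples appears after the DFT as a linear phase ramp across the carriers, proportional to k*delta. The pilot phase correction absorbed this ramp per carrier in V1; without pilots, the V2 network has to tolerate it.

import numpy as np

N = 160                                    # illustrative symbol length (Fs/Rs = 8000/50)
delta = 10                                 # timing offset in samples
n = np.arange(N)
x = np.exp(2j * np.pi * 3 * n / N)         # a single tone on carrier k = 3
X = np.fft.fft(x)
X_late = np.fft.fft(np.roll(x, -delta))    # same symbol, DFT window delta samples late

# Phase rotation on carrier 3 matches the predicted ramp 2*pi*k*delta/N
print(np.angle(X_late[3] / X[3]), 2 * np.pi * 3 * delta / N)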

Running just the V2 receiver:

python3 ./rx2.py 250725/checkpoints/checkpoint_epoch_200.pth 250725_ml_sync /tmp/tmp.ehSKyo9nPa.f32 features_out_rx2.f32 --latent-dim 56 --w1_dec 128 --agc --quiet
python3 loss.py brian_g8sez_features_in.f32 brian_g8sez_features_out_tx2.f32 --features_hat2 features_out_rx2.f32 --compare

The rx2.py command line was copied from the ./ota_test.sh run; note the tmp filename will vary. The problem can be illustrated by dumping the timing variables:

python3 ./rx2.py 250725/checkpoints/checkpoint_epoch_200.pth 250725_ml_sync /tmp/tmp.ehSKyo9nPa.f32 features_out_rx2.f32 --latent-dim 56 --w1_dec 128 --agc --quiet --write_delta_hat delta_hat.int16 --write_delta_hat_pp delta_hat_pp.int16 --write_sig_det sig_det.int16
octave:1> delta_hat=load_raw('delta_hat.int16'); figure(1); clf; plot(delta_hat); delta_hat_pp = load_raw('delta_hat_pp.int16'); hold on; plot(delta_hat_pp); sig_det=load_raw('sig_det.int16'); plot(sig_det*175); hold off;

The post-processor output delta_hat_pp locks onto a timing estimate at the high end of the MPP channel delay range (around 60), and this value causes a high loss in the decoded signal. When we fix the timing estimate at 50 using --fix_delta_hat 50, we get acceptable loss.
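
The same plot can be made with numpy/matplotlib if Octave isn't handy (a sketch, assuming the dumps are raw 16-bit signed values, one per modem frame, as the load_raw() calls above imply):

import numpy as np
import matplotlib.pyplot as plt

delta_hat = np.fromfile("delta_hat.int16", dtype=np.int16)
delta_hat_pp = np.fromfile("delta_hat_pp.int16", dtype=np.int16)
sig_det = np.fromfile("sig_det.int16", dtype=np.int16)

plt.plot(delta_hat, label="delta_hat")
plt.plot(delta_hat_pp, label="delta_hat_pp")
plt.plot(sig_det * 175, label="sig_det (scaled)")
plt.xlabel("modem frame"); plt.ylabel("timing estimate (samples)")
plt.legend(); plt.show()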

A script that supports testing timing offsets and verifying the models in the paper:

./test/v2_timing.sh --correct_time_offset -22

Testing the proposed solution:

octave:> multipath_samples("mpp_low", 8000, 50, 14, 10, "", "g_mpp_low.f32")
octave:> multipath_samples("mpp_high", 8000, 50, 14, 10, "", "g_mpp_high.f32")

./test/v2_timing.sh --g_file g_mpp_low.f32 --correct_time_offset -8 --fix_delta_hat 32
./test/v2_timing.sh --g_file g_mpp_low.f32 --correct_time_offset -8 --fix_delta_hat 48
./test/v2_timing.sh --g_file g_mpp_high.f32 --correct_time_offset -8 --fix_delta_hat 32
./test/v2_timing.sh --g_file g_mpp_high.f32 --correct_time_offset -8 --fix_delta_hat 48
