Oasislmf ci testing by sambles · Pull Request #28 · OasisLMF/OasisModels

sambles · 2026-05-19T09:14:55Z

No description provided.

sambles · 2026-05-21T08:27:01Z

PiWindComplex model ~ Claude

Changes and rationale:

1. get_event_ids — events_pd.size → len(events_pd)
Bug fix. .size returns rows × cols, so batching was silently wrong.

2. get_model — parse model_data once with ast.literal_eval
Previously eval was called once per field (twice total per row). Now parsed once and both fields extracted together. Also safer — no arbitrary code execution.

3. gul_calc — vectorize bin_height
apply(lambda x: x.prob_to - x.prob_from, axis=1) replaced with df['prob_to'] - df['prob_from']. Eliminates Python row iteration; 10–100× faster on large DataFrames.

4. gul_calc — replace iterrows() random number loop
Was doing an O(n) full-DataFrame boolean scan per item. Now generates randoms per unique (event_id, group_id) pair, builds a lookup table, and uses a single merge.

5. calculate_guls — vectorize with np.where
Replaced row-by-row apply(calculate_guls, axis=1) with a vectorized np.where. Avoids per-row Python function call overhead; 50–200× faster at large sample counts.

6. write_loss_stream — replace inner boolean filter with groupby
Was re-scanning the entire DataFrame per (event_id, item_id) pair — O(n²). Now O(n log n) sort + O(n) grouped iteration.

7. write_loss_stream — batch binary writes with numpy structured arrays
Was calling struct.pack once per field per row. Now packs each item's rows into a numpy structured array and writes in one .tobytes() call, reducing syscall and Python overhead.

1. Bug: events_pd.size returns rows × cols, not rows — events are silently miscounted 2. eval called twice per row in model_data parsing (once per field) 3. Row-by-row apply(lambda) for bin_height — vectorizable in one line 4. iterrows() loop with boolean masking for random numbers — O(n) full-DataFrame scan per item 5. apply(calculate_guls, axis=1) — row-by-row Python apply, should be np.where 6. O(n²) write_loss_stream — inner boolean filter re-scans the entire DataFrame per (event_id, item_id) pair 7. struct.pack one field at a time — many small writes; batch them with numpy structured arrays

sambles added 2 commits May 19, 2026 09:44

start

afa1632

Update testing settings

f09703c

sambles marked this pull request as draft May 19, 2026 09:15

sambles added 4 commits May 19, 2026 11:11

print run cmd

3a8772e

Fix date

03a5119

Add actions file

9c12656

Fix complex model example for newer oasislmf

83ab447

sambles added 22 commits May 21, 2026 09:27

f

af618fd

Skip complex model until updated

8ae3505

Fix path loading issue

78ef342

Update skips

7ddb6a1

Update OED to match the latest spec

b13bd5f

ffs

abb2b80

Merge branch 'develop' into oasislmf-ci-testing

27e579e

Fix python ver tested

8c1c768

install full package requ

8d37326

f

5b183e2

f

7a0a697

grrr

9dfc869

am blind

ae0efae

add output cmp and generation to test

3ff4320

Merge branch 'develop' into oasislmf-ci-testing

82b0cb3

f

742d7ae

complex tests working

4e7020f

script to set OED version on all files

5441fd6

set OED to v5

2315ba0

f

c9abc59

add new test output

6e0599a

sambles added 4 commits June 5, 2026 12:48

delete old test data

447095a

Fix test

90ae74d

set workflow to check files

0018ad2

f

d0a18fa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Oasislmf ci testing#28

Oasislmf ci testing#28
sambles wants to merge 32 commits into
developfrom
oasislmf-ci-testing

sambles commented May 19, 2026

Uh oh!

sambles commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sambles commented May 19, 2026

Uh oh!

sambles commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant