Skip to content

v.0.2.0#12

Draft
anderstorstensson wants to merge 13 commits intomainfrom
next
Draft

v.0.2.0#12
anderstorstensson wants to merge 13 commits intomainfrom
next

Conversation

@anderstorstensson
Copy link
Member

No description provided.

anderstorstensson and others added 13 commits February 28, 2026 09:42
…rmats

Add load_from_h5() to read iRfcb 0.8.0+ HDF5 classifier output files
(.h5) with threshold support. Update load_from_csv() to support
use_threshold parameter via the class_name_auto column. File index
scanning now discovers *_class*.h5 files alongside CSV and MAT.

Loading priority is CSV > H5 > MAT. The classification threshold
checkbox now applies to all three formats and is placed below the
Classification Folder path in Settings for better discoverability.

Includes hdf5r as optional dependency (Suggests), test fixtures,
and comprehensive tests for all new functionality.
Add Import PNG → SQLite feature that reads annotations from PNG images
organized in class-name subfolders (iRfcb convention). Includes a
multi-step UI flow with class mapping dialog for unmatched classes and
overwrite warnings for existing samples.

New exported functions: scan_png_class_folder() and
import_png_folder_to_db(). 32 new tests, vignette and FAQ updates.
Adds filter dropdowns above the class selector in Class Review mode,
allowing users to scope reviews by year, month, and IFCB instrument.
All filters default to "All". Changing year cascades to update
available months and instruments. Database functions list_classes_db
and load_class_annotations_db now accept optional filter parameters.
New list_annotation_metadata_db function extracts available filter
options from annotation sample names.
Connect directly to remote IFCB Dashboard instances to browse samples,
download images on demand, and cache them locally. Includes settings UI
toggle between local folders and dashboard mode, bulk zip downloads with
throttled parallelism, ADC/autoclass on-demand fetching, and graceful
SQLite-only fallback when ADC is unavailable for MAT export.

New exported functions: parse_dashboard_url(), list_dashboard_bins(),
download_dashboard_images(), download_dashboard_images_bulk(),
download_dashboard_adc(), download_dashboard_autoclass(), and
get_dashboard_cache_dir().

Also fixes reticulate venv binding in test setup.R so scipy tests no
longer skip incorrectly.
Add a Predict button in Sample Mode that classifies images using a
remote CNN model via iRfcb::ifcb_classify_images(). Gradio API URL
and model are configured in Settings > Live Prediction with debounced
URL input and dynamic model fetching. Predictions show per-image
progress, respect the classification threshold setting, skip manually
reclassified images, and auto-add new classes to the class list.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…ettings

Download individual PNGs instead of full zip archives in class review mode,
making dashboard class review much faster. Add configurable download
parameters (parallel downloads, sleep time, timeout, retries) in Settings.
Allow local classification files (CSV/H5/MAT) to be used in dashboard mode
as an alternative to dashboard auto-classifications.
Set a 15-second curl timeout (10s connect) on individual image requests
so non-existent images fail fast instead of hanging. Stop retrying on
404/410 responses since the image clearly doesn't exist. Track consecutive
failures per sample and skip all remaining images from a sample after 2
failures, avoiding long waits when annotations reference samples not
available on the dashboard.
Prevent SSRF by rejecting non-HTTP(S) URLs in parse_dashboard_url() and
the Gradio predict observer. Fix download_dashboard_autoclass() to parse
actual ROI numbers from the CSV pid column instead of assuming sequential
1..N numbering.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

1 participant