Skip to content

Daskify features#179

Draft
ckmah wants to merge 25 commits intomasterfrom
daskify-features
Draft

Daskify features#179
ckmah wants to merge 25 commits intomasterfrom
daskify-features

Conversation

@ckmah
Copy link
Collaborator

@ckmah ckmah commented Feb 25, 2025

ckmah added 13 commits February 23, 2025 19:40
- Introduced `pytest.ini` for test discovery and logging configuration.
- Updated `_utils.py` to persist points in Dask.
- Enhanced image measurement module with new functions: `mean_intensity`, `regionprops`, and `moments_optimized`.
- Added clustering functionality in `_cluster.py` for unsupervised learning on point features.
- Implemented density and distance calculations in `_density.py` and `_distance.py`.
- Introduced optimized measurement engine in `_measure_optimized.py` using Polars and Numba.
- Added tests for optimized measure functions in `test_measure_optimized.py` to ensure functionality and performance benchmarks.
- Updated shape measures in `_measure.py` to support parallel processing and improved logging.
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

ckmah and others added 12 commits July 25, 2025 21:02
… ripley metrics

- Created SVG files for distance, polarity, moments, and ripley benchmarks to visualize performance metrics.
- Added corresponding JSON files to store benchmark results and machine information.
- Introduced new plotting scripts to generate benchmark analysis and summary plots.
- Updated dependencies in `pyproject.toml` to include `polars` for enhanced data processing capabilities.
… data

- Removed `xgboost` from dependencies and updated `scikit-learn` to a more flexible version.
- Consolidated development dependencies under a new `dev` section in `pyproject.toml`.
- Enhanced the synthetic dataset creation function to unify shapes, points, and optional images/labels for testing.
- Updated tests to utilize the new synthetic data generation approach, ensuring consistency across shape and point feature tests.
- Improved logging in test fixtures for better traceability during test execution.
…lity

Co-authored-by: ckmah <3103744+ckmah@users.noreply.github.com>
- Keep data in sparse format throughout computation where possible
- Add chunk_size parameter for controlling memory usage during writes
- Update set_points_metadata to preserve sparse DataFrame columns
- Add automatic zarr writing for efficient storage via ome-zarr
- Improve memory efficiency with sparse-aware operations

Co-authored-by: ckmah <3103744+ckmah@users.noreply.github.com>
Co-authored-by: ckmah <3103744+ckmah@users.noreply.github.com>
- Create raster grid programmatically from shape bounds
- Store flux results as multi-channel 2D image with chunked dask arrays
- Channels include: gene values, embeddings, color RGB, and counts
- Leverage ome-zarr native chunked writing via Image2DModel
- Update tests to validate image-based storage
- Remove dependency on pre-existing raster points

Co-authored-by: ckmah <3103744+ckmah@users.noreply.github.com>
- Add adaptive scale factors based on image dimensions
- Only create multiscale pyramid if image is large enough
- Prevents errors with small images in tests
- Ensures proper ome-zarr metadata for all image sizes

Co-authored-by: ckmah <3103744+ckmah@users.noreply.github.com>
…ning

- Introduced functions for creating raster grids and computing cell flux values.
- Implemented SVD model training for dimensionality reduction of flux data.
- Added support for streaming computation with zarr-backed SpatialData.
- Updated flux function to handle batch processing and efficient writing to zarr.
- Adjusted parameters for training size and improved memory management during processing.

Co-authored-by: ckmah <3103744+ckmah@users.noreply.github.com>
…ithm

Refactor flux algorithm to use Image2DModel with chunked ome-zarr storage
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants