MorphoCLIP

Cross-modal contrastive learning aligning molecular graphs with Cell Painting morphological profiles for zero-shot mechanism-of-action retrieval.

Overview

MorphoCLIP trains a CLIP-style dual encoder that maps molecular structures and Cell Painting CellProfiler feature vectors into a shared embedding space. Once trained, the model can be queried with any SMILES string to retrieve phenotypically similar compounds, with no MoA labels required at inference time.
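
The retrieval step can be illustrated with a minimal sketch. It assumes the query molecule and the library profiles have already been embedded into the shared space, and it ranks the library by cosine similarity. The function names (`l2_normalize`, `retrieve_top_k`), the compound IDs, and the toy 3-d vectors are hypothetical and are not taken from this repository. Plain Python is used so the example runs without dependencies.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]

def retrieve_top_k(query_embedding, profile_embeddings, k=2):
    """Rank stored profile embeddings by cosine similarity to the query.

    query_embedding: embedding of the query molecule (assumed to come from
        the molecule encoder; the encoder itself is not shown here).
    profile_embeddings: dict mapping compound ID -> morphology embedding.
    Returns the k best (compound_id, similarity) pairs, most similar first.
    """
    q = l2_normalize(query_embedding)
    scored = []
    for compound_id, emb in profile_embeddings.items():
        p = l2_normalize(emb)
        scored.append((compound_id, sum(a * b for a, b in zip(q, p))))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]

# Toy 3-d embeddings stand in for real model outputs; the IDs are made up.
library = {
    "cpd_A": [0.9, 0.1, 0.0],
    "cpd_B": [0.0, 1.0, 0.0],
    "cpd_C": [0.7, 0.7, 0.0],
}
hits = retrieve_top_k([1.0, 0.0, 0.0], library, k=2)
for compound_id, score in hits:
    print(f"{compound_id}\t{score:.3f}")
```

No MoA label appears anywhere in this lookup. Any annotation attached to the retrieved compounds would be read off afterwards, which is what makes the retrieval zero-shot.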

Key novelty

First CLIP-style alignment between molecular graph encodings and Cell Painting profiles
Hard negative mining strategy using known off-target MoA pairs
Zero-shot MoA retrieval benchmark on JUMP-CP × ChEMBL

Installation

cd morphoclip
pip install -r requirements.txt

Data

python scripts/download_data.py
python scripts/preprocess.py

Training

python scripts/train.py --config configs/default.yaml

Evaluation

python scripts/evaluate.py --checkpoint checkpoints/best_model.pt

Tests

pytest tests/ -v

Architecture

SMILES ──► GATv2 (4 layers) ──► mean pool ──► projection ──► L2-norm ──► 256-d embedding
                                                                                │
                                                                    cosine similarity + InfoNCE
                                                                                │
812-d CellProfiler ──► Residual MLP ──► projection ──► L2-norm ──► 256-d embedding
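
The "cosine similarity + InfoNCE" step in the diagram can be sketched as follows. This is a dependency-free illustration, not the code in `src`. It assumes the symmetric, CLIP-style form of the loss (averaged over both retrieval directions) and a temperature of 0.07; neither detail is stated in this README. The function names are hypothetical.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length (the L2-norm step in the diagram)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]

def log_softmax(row):
    """Numerically stable log-softmax over one row of logits."""
    m = max(row)
    log_z = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - log_z for x in row]

def info_nce(mol_embs, morph_embs, temperature=0.07):
    """Symmetric InfoNCE over a batch of paired embeddings.

    Row i of mol_embs and row i of morph_embs form a positive pair; every
    other row in the batch acts as a negative. The temperature default is
    an assumed value, not one taken from configs/default.yaml.
    """
    mol = [l2_normalize(v) for v in mol_embs]
    morph = [l2_normalize(v) for v in morph_embs]
    n = len(mol)
    # logits[i][j] = cosine similarity of molecule i and profile j, scaled.
    logits = [
        [sum(a * b for a, b in zip(m, p)) / temperature for p in morph]
        for m in mol
    ]
    # Molecule -> morphology: the correct column for row i is column i.
    loss_m2p = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    # Morphology -> molecule: the same computation on the transposed matrix.
    transposed = [list(col) for col in zip(*logits)]
    loss_p2m = -sum(log_softmax(transposed[i])[i] for i in range(n)) / n
    return 0.5 * (loss_m2p + loss_p2m)

# Correctly paired toy embeddings give a loss near zero; swapping the
# pairing makes every positive the least similar item, so the loss is large.
aligned = info_nce([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
swapped = info_nce([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
print(aligned < swapped)
```

A training implementation would normally compute the same quantity on batched tensors, applying a cross-entropy loss to the similarity matrix and its transpose.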

