Skip to yearly menu bar Skip to main content


Poster
in
Workshop: AI for New Drug Modalities

Active learning for efficient discovery of optimal gene combinations in the combinatorial perturbation space

Jason Qin · Hans-Hermann Wessels · Carlos Fernandez-Granda · Yuhan Hao


Abstract:

The advancement of novel combinatorial CRISPR screening technologies enables the identification of synergistic gene combinations on a large scale. This is crucial for developing novel and effective combination therapies, but the combinatorial space makes exhaustive experimentation infeasible. We introduce NAIAD, an active learning framework that efficiently discovers optimal gene pairs that induce desired cellular phenotypes. NAIAD leverages single-gene perturbation effects and adaptive gene embeddings that scale with the training data size, mitigating overfitting in small-sample learning while capturing complex gene interactions as more data is collected. Evaluated on four CRISPR combinatorial perturbation datasets totaling over 350,000 genetic interactions, NAIAD, trained on a small dataset, outperforms existing models by up to 40% relative to the second-best. NAIAD's acquisition function prioritizes gene pairs with the maximum predicted effects, resulting in the highest marginal gain in each round and accelerating discovery with fewer CRISPR experimental iterations. This framework improves the identification of novel, effective gene combinations, enabling more efficient CRISPR library design and offering promising applications in genomics research and therapeutic development.

Chat is not available.