SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework
AI 摘要
SubQuad通过优化流程和目标函数,实现了免疫组库分析的加速、减负和公平性提升。
主要贡献
- 提出了 antigen-aware 的近亚二次检索方法
- 设计了可微分门控模块自适应融合对齐和嵌入通道
- 实现了自动化校准流程以确保罕见亚群的比例代表性
方法论
结合 MinHash 预过滤、GPU加速的affinity kernels、多模态融合和公平约束聚类,构建端到端的免疫组库分析流程。
原文摘要
Comparative analysis of adaptive immune repertoires at population scale is hampered by two practical bottlenecks: the near-quadratic cost of pairwise affinity evaluations and dataset imbalances that obscure clinically important minority clonotypes. We introduce SubQuad, an end-to-end pipeline that addresses these challenges by combining antigen-aware, near-subquadratic retrieval with GPU-accelerated affinity kernels, learned multimodal fusion, and fairness-constrained clustering. The system employs compact MinHash prefiltering to sharply reduce candidate comparisons, a differentiable gating module that adaptively weights complementary alignment and embedding channels on a per-pair basis, and an automated calibration routine that enforces proportional representation of rare antigen-specific subgroups. On large viral and tumor repertoires SubQuad achieves measured gains in throughput and peak memory usage while preserving or improving recall@k, cluster purity, and subgroup equity. By co-designing indexing, similarity fusion, and equity-aware objectives, SubQuad offers a scalable, bias-aware platform for repertoire mining and downstream translational tasks such as vaccine target prioritization and biomarker discovery.