Multimodal Learning 相关度: 5/10

SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework

Rong Fu, Zijian Zhang, Wenxin Zhang, Kun Liu, Jiekai Wu, Xianda Li, Simon Fong
arXiv: 2602.17330v1 发布: 2026-02-19 更新: 2026-02-19

AI 摘要

SubQuad通过优化流程和目标函数,实现了免疫组库分析的加速、减负和公平性提升。

主要贡献

  • 提出了 antigen-aware 的近亚二次检索方法
  • 设计了可微分门控模块自适应融合对齐和嵌入通道
  • 实现了自动化校准流程以确保罕见亚群的比例代表性

方法论

结合 MinHash 预过滤、GPU加速的affinity kernels、多模态融合和公平约束聚类,构建端到端的免疫组库分析流程。

原文摘要

Comparative analysis of adaptive immune repertoires at population scale is hampered by two practical bottlenecks: the near-quadratic cost of pairwise affinity evaluations and dataset imbalances that obscure clinically important minority clonotypes. We introduce SubQuad, an end-to-end pipeline that addresses these challenges by combining antigen-aware, near-subquadratic retrieval with GPU-accelerated affinity kernels, learned multimodal fusion, and fairness-constrained clustering. The system employs compact MinHash prefiltering to sharply reduce candidate comparisons, a differentiable gating module that adaptively weights complementary alignment and embedding channels on a per-pair basis, and an automated calibration routine that enforces proportional representation of rare antigen-specific subgroups. On large viral and tumor repertoires SubQuad achieves measured gains in throughput and peak memory usage while preserving or improving recall@k, cluster purity, and subgroup equity. By co-designing indexing, similarity fusion, and equity-aware objectives, SubQuad offers a scalable, bias-aware platform for repertoire mining and downstream translational tasks such as vaccine target prioritization and biomarker discovery.

标签

免疫组库 亲和力评估 公平性 聚类

arXiv 分类

cs.LG cs.AI