LLM Reasoning 相关度: 7/10

Event Embedding of Protein Networks : Compositional Learning of Biological Function

Antonin Sulc
arXiv: 2604.00911v1 发布: 2026-04-01 更新: 2026-04-01

AI 摘要

论文研究了在蛋白质互作网络中使用组合结构嵌入进行生物功能预测的有效性。

主要贡献

  • 提出基于组合结构的蛋白质网络嵌入方法
  • 证明组合结构能提高通路一致性和功能类比准确性
  • 揭示组合结构有利于生物网络的关系和组合推理

方法论

使用 Event2Vec (加性序列嵌入模型) 在 STRING 数据库的人类蛋白质互作网络上训练 64 维嵌入,与 DeepWalk 进行比较。

原文摘要

In this work, we study whether enforcing strict compositional structure in sequence embeddings yields meaningful geometric organization when applied to protein-protein interaction networks. Using Event2Vec, an additive sequence embedding model, we train 64-dimensional representations on random walks from the human STRING interactome, and compare against a DeepWalk baseline based on Word2Vec, trained on the same walks. We find that compositional structure substantially improves pathway coherence (30.2$\times$ vs 2.9$\times$ above random), functional analogy accuracy (mean similarity 0.966 vs 0.650), and hierarchical pathway organization, while geometric properties such as norm--degree anticorrelation are shared with or exceeded by the non-compositional baseline. These results indicate that enforced compositionality specifically benefits relational and compositional reasoning tasks in biological networks.

标签

蛋白质互作网络 网络嵌入 生物信息学 组合学习

arXiv 分类

cs.LG