We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.BM

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Biomolecules

Title: Limits on Inferring T-cell Specificity from Partial Information

Abstract: A key challenge in molecular biology is to decipher the mapping of protein sequence to function. To perform this mapping requires the identification of sequence features most informative about function. Here, we quantify the amount of information (in bits) that T-cell receptor (TCR) sequence features provide about antigen specificity. We identify informative features by their degree of conservation among antigen-specific receptors relative to null expectations. We find that TCR specificity synergistically depends on the hypervariable regions of both receptor chains, with a degree of synergy that strongly depends on the ligand. Using a coincidence-based approach to measuring information enables us to directly bound the accuracy with which TCR specificity can be predicted from partial matches to reference sequences. We anticipate that our statistical framework will be of use for developing machine learning models for TCR specificity prediction and for optimizing TCRs for cell therapies. The proposed coincidence-based information measures might find further applications in bounding the performance of pairwise classifiers in other fields.
Comments: 24 pages, 15 figures
Subjects: Biomolecules (q-bio.BM); Statistical Mechanics (cond-mat.stat-mech); Information Theory (cs.IT)
Cite as: arXiv:2404.12565 [q-bio.BM]
  (or arXiv:2404.12565v1 [q-bio.BM] for this version)

Submission history

From: Andreas Tiffeau-Mayer [view email]
[v1] Fri, 19 Apr 2024 01:02:08 GMT (2721kb,D)

Link back to: arXiv, form interface, contact.