Question: My data has a clonotype (say clonotype1) where the TRB chain is paired with a TRA chain and a second clonotype (say clonotype2) that has only TRB chain whose sequence is the same as the TRB chain of clonotype1. Could you please explain the reason why
cellranger vdj assigned the same TCR chain to different clonotypes?
Answer: In Cell Ranger v4 and lower, two cells were merged into a clonotype only when the CDR3 nucleotide sequences of all productive chains in the cells were an exact match. For example, if cell1 had productive TRB and TRA chains while cell2 had a productive TRB chain (no TRA chain), they would be assigned to separate clonotypes even if the CDR3 nucleotide sequences of the TRB chains match between cell1 and cell2.
TRA is generally expressed at lower levels than TRB. Therefore, sometimes the TRA chain is not detected or not annotated as productive. This could be the cause of the missing or nonproductive TRA in cell2 in your example.
The algorithm in Cell Ranger v5 and later has been upgraded to merge cells with just the TRA or TRB chains into clonotypes that contain both chain types, thus outputting fewer, biologically relevant clonotypes.