Question: How can we add genes to a reference package for Cell Ranger?
Answer: Adding genes to an existing Cell Ranger reference package, such as the ones available on our website, takes three steps:
First, add the additional FASTA sequence records to the genome FASTA file of the reference package.
Second, update the GTF file (genes/genes.gtf). The GTF file format is essentially a list of records, one per line, each comprising nine tab-delimited, non-empty fields:
|1||Chromosome||Must refer to a chromosome/contig in the genome FASTA.|
|2||Source||Unused.|
|3||Feature||Feature type. Cell Ranger uses rows where this is "exon".|
|4||Start||Start position on the reference (1-based inclusive).|
|5||End||End position on the reference (1-based inclusive).|
|6||Score||Unused. Suggested value ".".|
|7||Strand||Strandedness of this feature on the reference: "+" or "-".|
|8||Frame||Unused. Suggested value ".".|
|9||Attributes||A semicolon-delimited list of key-value pairs of the form key "value". The keys gene_id and transcript_id are required.|
mylocus annotation exon 100 200 . + . gene_id "mygene"; transcript_id "mygene";
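As a minimal sketch of the record format described above, the following Python builds one nine-field, tab-delimited GTF exon line and rejects empty fields. The helper name `make_gtf_exon_line` and its parameters are hypothetical, chosen for illustration only; they are not part of Cell Ranger. The sketch assumes the strand is either "+" or "-".

```python
def make_gtf_exon_line(contig, start, end, strand, gene_id, transcript_id,
                       source="annotation"):
    """Return one tab-delimited GTF exon record (hypothetical helper)."""
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    # Start and End are 1-based and inclusive, so start may equal end.
    if not (1 <= start <= end):
        raise ValueError("expected 1 <= start <= end")
    attributes = f'gene_id "{gene_id}"; transcript_id "{transcript_id}";'
    # Score (field 6) and Frame (field 8) are unused, so "." is written.
    fields = [contig, source, "exon", str(start), str(end),
              ".", strand, ".", attributes]
    if any(not field for field in fields):
        raise ValueError("all nine GTF fields must be non-empty")
    return "\t".join(fields)


line = make_gtf_exon_line("mylocus", 100, 200, "+", "mygene", "mygene")
print(line)
```

The printed line reproduces the example record above with real tab characters between fields, which is what the GTF format expects; spaces between fields are a common source of parsing errors.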
Third, after adding the necessary records to your FASTA file and the additional lines to your GTF file, run cellranger mkref as normal.
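Before running cellranger mkref, it can help to confirm that every contig named in the GTF also appears in the FASTA, since the first GTF field must refer to a contig in the genome FASTA. The sketch below is one possible check, not an official tool: the helper names, the in-memory example data, the file names `genome.fa` and `genes.gtf`, and the output name `mygenome_custom` are all placeholders. It only assembles and prints the mkref command (using the `--genome`, `--fasta`, and `--genes` arguments) rather than executing it.

```python
import shlex


def fasta_contigs(fasta_text):
    """Collect contig names: the first word after '>' on each header line."""
    names = set()
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            header = line[1:].split()
            if header:
                names.add(header[0])
    return names


def gtf_contigs(gtf_text):
    """Collect the first field of every non-empty, non-comment GTF line."""
    names = set()
    for line in gtf_text.splitlines():
        if line.strip() and not line.startswith("#"):
            names.add(line.split("\t")[0])
    return names


def mkref_command(genome_name, fasta_path, gtf_path):
    """Assemble the cellranger mkref argument list without running it."""
    return ["cellranger", "mkref",
            f"--genome={genome_name}",
            f"--fasta={fasta_path}",
            f"--genes={gtf_path}"]


# Illustrative toy data; real inputs would be read from the edited files.
fasta_text = ">chr1 example contig\nACGTACGT\n>mylocus\nACGTTTGA\n"
gtf_text = ('mylocus\tannotation\texon\t1\t8\t.\t+\t.\t'
            'gene_id "mygene"; transcript_id "mygene";\n')

missing = gtf_contigs(gtf_text) - fasta_contigs(fasta_text)
command = mkref_command("mygenome_custom", "genome.fa", "genes.gtf")
print("contigs missing from FASTA:", sorted(missing))
print(shlex.join(command))
```

If the set of missing contigs is not empty, the GTF refers to a sequence that was never added to the FASTA, and that mismatch is worth fixing before building the reference.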
For more information, please see the "Adding one or more genes to your reference" section on the Using Custom References page. The tutorial on building a custom reference also illustrates the addition of new genes.