When using additional final distribution (PCR) plates from the Extended Throughput Kit v1.1, each plate has the same set of 96 i5-index sequences, but receives a different set of i7 index sequences from the P7 primer pool used (RNA-A-AP1 to RNA-A-AP4). This data is analyzed by generating separate "fastq libraries" for each plate (i7 pool), analyzing each separately through the core workflow (barcodeParser, STARSolo, etc.) and then combining the outputs at the single-cell level (gene expression matrix and metrics).
Cells from different plates are distinguished by appending the library-name (P7 primer pool) to the cell barcode in the gene expression matrix outputs.
If all plates are sequenced together in the same run, this can be done in one nextflow analysis run:
- Create a
samples.csvfor all plates and samples- List each sample on each plate (repeating the
samplename for each plate used) - Set the
libIndexcolumn to the name of the barcode pool (e.g.RNA-A-AP1) - See samples.ext.csv for an example with two samples on two plates
- List each sample on each plate (repeating the
- Launch the workflow
- This will produce outputs for each individual plate as well as merged outputs (with default
mergeset), combining all cells for each sample across all plates
- This will produce outputs for each individual plate as well as merged outputs (with default
A small test-case can be run with:
nextflow run /PATH/TO/ScaleRna -profile PROFILE -params-file /PATH/TO/ScaleRna/docs/examples/extended-throughput/runParams.yml --outDir output.ext
If plates were sequenced on separate sequencing runs, each plate can first be analyzed individually and the results later merged in a separate nextflow run
- Create a
samples.csvfor each separate plate- Including the
libIndexcolumn with the P7 pool used for that plate
- Including the
- Run the nextflow analysis workflow separately for each plate
- Create another
samples.csvthat lists all samples and plates to be merged- Add a new
resultDircolumn that points to the workflow output directories for the individual plate runs - See samples.ext-merge.csv
- Add a new
- Run the workflow with the
--reportingoption--reportinguses the previous alignment results instead of re-processing all reads- Do not specify any input reads at this step (no
--runFolderor--fastqDir)
To run this workflow starting from fastq files, rather than a runFolder / BCLs, see Fastq Generation