This is a preview of subscription content, access via your institution
Relevant articles
Open Access articles citing this article.
-
Endosymbiont diversity across native and invasive brown widow spider populations
Scientific Reports Open Access 12 April 2024
-
happi: a hierarchical approach to pangenomics inference
Genome Biology Open Access 29 September 2023
-
Using strain-resolved analysis to identify contamination in metagenomics data
Microbiome Open Access 02 March 2023
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
References
Segerstolpe, Å. et al. Cell Metab. 24, 593–607 (2016).
Cao, J. et al. Science 357, 661–667 (2017).
Illumina Inc. Effects of Index Misassignment on Multiplexing and Downstream Analysis. Publication No. 770-2017-004-C QB # 5420 (Illumina Inc., 2017).
Sinha, R. et al. bioRxiv preprint at https://www.biorxiv.org/content/early/2017/04/09/125724 (2017).
Griffiths, J.A., Richard, A.C., Bach, K., Lun, A.T.L. & Marioni, J.C. bioRxiv preprint at https://www.biorxiv.org/content/early/2018/01/30/177048 (2018).
Costello, M. et al. bioRxiv preprint at https://www.biorxiv.org/content/early/2017/10/10/200790 (2017).
Acknowledgements
This work was supported by the Swedish Research Council (grant 2017-01062 to R. Sandberg), the Bert L. and N. Kuggie Vallee Foundation (to R. Sandberg), a National Science Foundation Graduate Research Fellowship (grant DGE 1147470 to G.S.), the NIH (grant R01CA86065 to I.L.W.), the California Institute for Regenerative Medicine (grant GC1R-06673-A), to M. Snyder; Collaborative Reserch Project to I.L.W. and the Virginia and D.K. Ludwig Fund for Cancer Research (to I.L.W.)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Integrated supplementary information
Supplementary Figure 1 Read count distributions at different thresholds of filtering.
(A) Histogram of gene read count distributions (post-correction HiSeq4000) for all 384 cells of the HSC plate, after applying read count filtering at the indicated threshold (1 to 10 reads). (B) The read count distribution of the same plate (sequenced on NextSeq) without correction or read count filtering. We noted that corrected data still had an excess of low read counts which we eliminated by using a read-count threshold. Note that using a read count threshold is not sufficient to remove cross-contamination signal (Fig. S2).
Supplementary Figure 2 Evaluation of read count threshold for index switching correction.
(A) Histogram of false positive gene expression, defined as gene expression detected on the HiSeq 4000 before (blue) and after (yellow) correction where no expression was detected in the same cell on the NextSeq 500. (B) Histogram with fraction false positive expression signals removed per gene. (C) Histogram of false negative gene expression defined as no detectable gene expression on the HiSeq 4000 before (blue) and after (yellow) correction where expression was detected in the same cell on the NextSeq 500.
Supplementary Figure 3 Corrected data rescued index-driven HSC sub-clusters.
(A) Robust PCA (rPCA) analyses of HSCs based on HiSeq 4000 sequence counts, coloured by the Nextera i5 index. The rPCA algorithm assigns the most prominent outlier observations (i.e. cells) into the last principal components, which corresponded to wells that had been amplified using the same i7 and i5 indices (n = 384 cells). (B) Loadings of cells on PC24, stratified by i5 Nextera primers (n = 384 cells, 16 stratifications) (C) Loading of cells on P25, stratified by i7 Nextera primers (n = 384 cells, 24 stratifications). Center: Median, Hinges: 1st and 3rd quartiles, Whiskers: 1.5 interquartile range (IQR) (D) rPCA clustering of HSCs (as in A) for corrected HiSeq 4000 sequence counts, colored by Nextera i5 index primer. (E-F) As in (C-D) for corrected HiSeq 4000 sequence counts.
Supplementary Figure 4 Corrected data rescued index-driven plate-associated clustering.
(A) PCA analysis of HSCs from two plates of libraries sequenced on the HiSeq 4000, colored by library plate. PC2 scores separate cells by plates (n = 384 cells per plate). (B) As in (A) for corrected transcriptome data.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–4 and Supplementary Methods (PDF 938 kb)
Supplementary Software
unspread software for index switching correction (ZIP 6 kb)
Rights and permissions
About this article
Cite this article
Larsson, A., Stanley, G., Sinha, R. et al. Computational correction of index switching in multiplexed sequencing libraries. Nat Methods 15, 305–307 (2018). https://doi.org/10.1038/nmeth.4666
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.4666
This article is cited by
-
Endosymbiont diversity across native and invasive brown widow spider populations
Scientific Reports (2024)
-
Using strain-resolved analysis to identify contamination in metagenomics data
Microbiome (2023)
-
happi: a hierarchical approach to pangenomics inference
Genome Biology (2023)
-
De novo identification of microbial contaminants in low microbial biomass microbiomes with Squeegee
Nature Communications (2022)
-
SpotClean adjusts for spot swapping in spatial transcriptomics data
Nature Communications (2022)