Research Progress of Next-Generation Sequencing Reference Materials
LIU Yang1,2,SU Zhaozhong1,2,ZENG Fanjun3,YUAN Huijun1,ZHANG Yongzhuo2
1. College of Life Science and Engineering, Lanzhou University of Technology, Lanzhou, Gansu 730050, China
2. National Institute of Metrology, Beijing 100029, China
3. International Travel Health Care Center, General Administration of Customs (Beijing) , Beijing 100013, China
Abstract:With the advent of next-generation sequencing technology, gene sequence has been widely studied.Due to the complexity of genetic material, technical errors introduced during sample preparation, sequencing, and analysis, and systematic biases between different sequencing platforms, high-throughput sequencing results suffer in terms of accuracy and consistency across platforms. However, the use of standard materials in the sequencing process can solve these problems. Next-generation sequencing reference materials are usually genetic materials with good characteristics or synthetic reference materials added and simulated electronic data sets. The application of next-generation sequencing reference materials will help calibrate the measurement results of next-generation sequencing and evaluate the instrument performance, which is crucial to ensure the accuracy and consistency of sequencing results.
SANGER F, NICKLEN S, COULSON A R. DNA sequencing with chain-terminating inhibitors [J]. Biotechnology, 1992, 24: 104-108.
[41]
OLSON N D, LUND S P, COLMAN R E, et al. Best practices for evaluating single nucleotide variant calling methods for microbial genomics [J]. Front Genet, 2015, 6: 235.
[2]
CHURCH G M, GAO Y, KOSURI S. Next-Generation Digital Information Storage in DNA [J]. Science, 2012, 337 (6102): 1628.
[4]
MATTHIJS G, SOUCHE E, ALDERS M, et al. Guidelines for diagnostic next-generation sequencing [J]. Eur J Hum Genet, 2016, 24 (1): 2-5.
[6]
HUNKAPILLER T, KAISER R J, KOOP B F, et al. Large-scale and automated DNA sequence determination [J]. Science, 1991, 254 (5028): 59-67.
[11]
FULLER C W, MIDDENDORF L R, BENNER S A, et al. The challenges of sequencing by synthesis [J]. Nature biotechnology, 2009, 27 (11): 1013-1023.
[12]
YEGNASUBRAMANIAN S. Preparation of fragment libraries for next-generation sequencing on the applied biosystems SOLiD platform [J]. Methods in Enzymology, 2013, 529: 185-200.
[14]
ZHU F Y, CHEN M X, YE N H, et al. Comparative performance of the BGISEQ-500 and Illumina HiSeq4000 sequencing platforms for transcriptome analysis in plants [J]. Plant Methods, 2018, 14: 1-14.
[16]
LIU D, ZHOU H, SHI D, et al. Quality Control of Next-generation Sequencing-based In vitro Diagnostic Test for Onco-relevant Mutations Using Multiplex Reference Materials in Plasma [J]. Journal of Cancer, 2018, 9 (9): 1680-1688.
[21]
ZIOGAS D E, KYROCHRISTOS I D, ROUKOS D H. Next-generation sequencing: from conventional applications to breakthrough genomic analyses and precision oncology [J]. Expert review of medical devices, 2018, 15 (1): 1-3.
[8]
SCHATZ M C, DELCHER A L, SALZBERG S L. Assembly of large genomes using second-generation sequencing [J]. Genome Research, 2010, 20 (9): 1165-1173.
[18]
BROCKMAN W, ALVAREZ P, YOUNG S, et al. Quality scores and SNP detection in sequencing-by-synthesis systems [J]. Genome Research, 2008, 18 (5): 763-770.
[22]
BANSAL V. A statistical method for the detection of variants from next-generation resequencing of DNA pools [J]. Bioinformatics, 2010, 26 (12): 318-324.
[24]
WHITE G H, FARRANCE I. Uncertainty of measurement in quantitative medical testing: a laboratory implementation guide [J]. The Clinical Biochemist Reviews, 2004, 25 (4): 1-24.
[26]
REUMERS J, DE RIJK P, ZHAO H, et al. Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing [J]. Nature Biotechnology, 2011, 30 (1): 61-68.
[28]
KALMAN L V, TARLETON J C, PERCY A K, et al. Development of a genomic DNA reference material panel for Rett syndrome (MECP2-related disorders) genetic testing [J]. J Mol Diagn, 2014, 16 (2): 273-279.
[31]
FANG L T, ZHU B, ZHAO Y, et al. Establishing community reference samples, data and call sets for benchmarking cancer mutation detection using whole-genome sequencing [J]. Nat Biotechnol, 2021, 39 (9): 1151-1160.
[32]
CONESA A, MADRIGAL P, TARAZONA S, et al. A survey of best practices for RNA-seq data analysis [J]. Genome Biol, 2016, 17 (1): 1-19.
[34]
SEQC/MAQC-III Consortium. A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium [J]. Nat Biotechnol, 2014, 32 (9): 903-914.
[38]
Human Microbiome Project Consortium. A framework for human microbiome research [J]. Nature, 2012, 486 (7402): 215-221.
[7]
SEO T S, BAI X, KIM D H, et al. Four-color DNA sequencing by synthesis on a chip using photocleavable fluorescent nucleotides [J]. Proceedings of the National Academy of Sciences of the United States of America, 2005, 102 (17): 5926-5931.
[10]
TAWFIK D S, GRIFFITHS A D. Man-made cell-like compartments for molecular evolution [J]. Nature biotechnology, 1998, 16 (7): 652-656.
[17]
ZHANG C, WANG Y, HU X, et al. An Improved NGS Library Construction Approach Using DNA Isolated from Human Cancer Formalin-Fixed Paraffin-Embedded Samples [J]. Anatomical Record, 2019, 302 (6): 941-946.
[20]
VOELKERDING K V, DAMES S, DURTSCHI J D. Next Generation Sequencing for Clinical Diagnostics-Principles and Application to Targeted Resequencing for Hypertrophic Cardiomyopathy: A Paper from the 2009 William Beaumont Hospital Symposium on Molecular Pathology [J]. The Journal of molecular diagnostics: JMD, 2010, 12 (5): 539-551.
[5]
GARGIS A S, KALMAN L, LUBIN I M. Assuring the Quality of Next-Generation Sequencing in Clinical Microbiology and Public Health Laboratories [J]. Journal of clinical microbiology, 2016, 54 (12): 2857-2865.
[15]
CHEN S, LI S, XIE W, et al. Performance comparison between rapid sequencing platforms for ultra-low coverage sequencing strategy [J]. PloS One, 2014, 9 (3): e92192.
[25]
ROSS M G, RUSS C, COSTELLO M, et al. Characterizing and measuring bias in sequence data [J]. Genome Biology, 2013, 14 : 1-20.
[27]
TORSVIK A, STIEBER D, ENGER P, et al. U-251 revisited: genetic drift and phenotypic consequences of long-term cultures of glioblastoma cells [J]. Cancer Med, 2014, 3 (4): 812-824.
[30]
KALMAN L, LEONARD J, GERRY N, et al. Quality assurance for Duchenne and Becker muscular dystrophy genetic testing: development of a genomic DNA reference material panel [J]. J Mol Diagn, 2011, 13 (2): 167-174.
[35]
WHITE H E, MATEJTSCHUK P, RIGSBY P, et al. Establishment of the first World Health Organization International Genetic Reference Panel for quantitation of BCR-ABL mRNA [J]. Blood, The Journal of the American Society of Hematology, 2010, 116(22): 111-117.
[37]
BROWN C T, HUG L A, THOMAS B C, et al. Unusual biology across a group comprising more than 15% of domain Bacteria [J]. Nature, 2015, 523 (7559): 208-211.
[40]
SINGER E, BUSHNELL B, COLEMAN-DERR D, et al. High-resolution phylogenetic microbial community profiling [J]. ISME J, 2016, 10 (8): 2020-2032.
[42]
QUAIL M A, SMITH M, JACKSON D, et al. SASI-Seq: sample assurance Spike-Ins, and highly differentiating 384 barcoding for Illumina sequencing [J]. BMC Genomics, 2014, 15 : 1-13.
[44]
DEVESON I W, CHEN W Y, WONG T, et al. Representing genetic variation with synthetic DNA standards [J]. Nat Methods, 2016, 13 (9): 784-791.
[45]
External RNA Controls Consortium. Proposed methods for testing and selecting the ERCC external RNA controls [J]. BMC Genomics, 2005, 6(1): 150.
[9]
CARNEIRO M O, RUSS C, ROSS M G, et al. Pacific biosciences sequencing technology for genotyping and variation discovery in human data [J]. BMC Genomics, 2012, 13 (1): 1-7.
[19]
SBONER A, MU X J, GREENBAUM D, et al. The real cost of sequencing: higher than you think! [J]. Genome biology, 2011, 12 : 1-10.
[29]
KALMAN L, TARLETON J, HITCH M, et al. Development of a genomic DNA reference material panel for myotonic dystrophy type 1 (DM1) genetic testing [J]. J Mol Diagn, 2013, 15 (4): 518-525.
[39]
SINHA R, ABNET C C, WHITE O, et al. The microbiome quality control project: baseline study design and future directions [J]. Genome Biol, 2015, 16: 1-16.
[47]
LESHKOWITZ D, FELDMESSER E, Friedlander G, et al. Using Synthetic Mouse Spike-In Transcripts to Evaluate RNA-Seq Analysis Tools [J]. PLoS One, 2016, 11 (4): e0153782.
[48]
HARDWICK S A, CHEN W Y, WONG T, et al. Spliced synthetic genes as internal controls in RNA sequencing experiments [J]. Nat Methods, 2016, 13 (9): 792-798.
[49]
JIANG L, SCHLESINGER F, DAVIS C A, et al. Synthetic spike-in standards for RNA-seq experiments [J]. Genome Res, 2011, 21 (9): 1543-1551.
[50]
DABER R, SUKHADIA S, MORRISSETTE J J. Understanding the limitations of next generation sequencing informatics, an approach to clinical pipeline validation using artificial data sets [J]. Cancer Genet, 2013, 206 (12): 441-448.
[52]
EWING A D, HOULAHAN K E, HU Y, et al. Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection [J]. Nat Methods, 2015, 12 (7): 623-630.
[3]
VAN DIJK E L, JASZCZYSZYN Y, THERMES C. Library preparation methods for next-generation sequencing: tone down the bias [J]. Experimental cell research, 2014, 322 (1): 12-20.
[13]
ROEH S, WEBER P, REX-HAFFNER M, et al. Sequencing on the SOLiD 5500xl System-in-depth characterization of the GC bias [J]. Nucleus, 2017, 8 (4): 370-380.
[23]
ZOOK J M, CATOE D, MCDANIEL J, et al. Extensive sequencing of seven human genomes to characterize benchmark reference materials [J]. Scientific data, 2016, 3(1): 1-26.
[33]
NOVORADOVSKAYA N, WHITFIELD M L, BASEHORE L S, et al. Universal Reference RNA as a standard for microarray experiments [J]. BMC genomics, 2004, 5 (1): 1-13.
[43]
STROM C M, JANECZKO R A, ANDERSON B, et al. Technical validation of a multiplex platform to detect thirty mutations in eight genetic diseases prevalent in individuals of Ashkenazi Jewish descent [J]. Genet Med, 2005, 7 (9): 633-639.
[53]
DUNCAVAGE E J, ABEL H J, MERKER J D, et al. A Model Study of In Silico Proficiency Testing for Clinical Next-Generation Sequencing [J]. Arch Pathol Lab Med, 2016, 140 (10): 1085-1091.
[36]
ESCOBAR-ZEPEDA A,VERA-PONCE DE LEóN A, Sanchez-Flores A. The Road to Metagenomics: From Microbiology to DNA Sequencing Technologies and Bioinformatics [J]. Front Genet, 2015, 6: 348.
[46]
CRONIN M, GHOSH K, SISTARE F, et al. Universal RNA reference materials for gene expression [J]. Clin Chem, 2004, 50 (8): 1464-1471.
[51]
ZOOK J M, CHAPMAN B, WANG J, et al. Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls [J]. Nat Biotechnol, 2014, 32 (3): 246-251.