Similar articles
 Found 20 similar articles; search took 15 ms
1.
2.
3.
Next-generation sequencing (NGS) is a cost-effective technology capable of screening several genes simultaneously; however, its application in a clinical context requires an established workflow to acquire reliable sequencing results. Here, we report an optimized NGS workflow analyzing 22 lung cancer-related genes to sequence critical samples such as DNA from formalin-fixed paraffin-embedded (FFPE) blocks and circulating free DNA (cfDNA). Snap-frozen and matched FFPE gDNA from 12 non-small cell lung cancer (NSCLC) patients, whose gDNA fragmentation status was previously evaluated using a multiplex PCR-based quality control, were successfully sequenced with Ion Torrent PGM™. The robust bioinformatic pipeline allowed us to correctly call both single nucleotide variants (SNVs) and indels with a detection limit of 5%, achieving 100% specificity and 96% sensitivity. This workflow was also validated in 13 FFPE NSCLC biopsies. Furthermore, a specific protocol for low-input gDNA capable of producing good sequencing data with high coverage, high uniformity, and a low error rate was also optimized. In conclusion, we demonstrate the feasibility of obtaining gDNA from FFPE samples suitable for NGS by performing appropriate quality controls. The optimized workflow, capable of screening low-input gDNA, highlights NGS as a potential tool in the detection, disease monitoring, and treatment of NSCLC.
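
The specificity and sensitivity figures quoted in the abstract above are ratios of confusion-matrix counts. A minimal sketch of that arithmetic follows; the function name and the counts are hypothetical illustrations, not data or code from the study.

```python
def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int) -> tuple[float, float]:
    """Return (sensitivity, specificity) from variant-calling confusion counts.

    sensitivity = TP / (TP + FN): share of true variants that were called.
    specificity = TN / (TN + FP): share of non-variant sites left uncalled.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative site")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical counts chosen only to illustrate the calculation.
sens, spec = sensitivity_specificity(tp=24, fn=1, tn=50, fp=0)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```

With these made-up counts, one missed variant out of 25 gives 96% sensitivity, and zero false positives gives 100% specificity.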

4.
Influenza viruses still pose a serious threat to humans, and we have not yet been able to effectively predict future pandemic strains and prepare vaccines in advance. One of the main reasons is the high genetic diversity of influenza viruses. We do not know the individual clonotypes of a virus population because some are the majority and others make up only a small fraction of the population. First-generation (FGS) and next-generation sequencing (NGS) technologies have inherent limitations and cannot resolve information on minority clonotypes in the virus population. Third-generation sequencing (TGS) technologies with ultra-long reads have the potential to solve this problem but have a high error rate. Here, we evaluated emerging direct RNA sequencing and cDNA sequencing with the MinION platform and established a novel approach that combines the high accuracy of Illumina sequencing technology and the long reads of nanopore sequencing technology to resolve both variants and clonotypes of influenza virus. Furthermore, a new program was written to eliminate the effect of nanopore sequencing errors on the analysis of the results. By using this pipeline, we identified 47 clonotypes in our experiment. We conclude that this approach can quickly discriminate the clonotypes of virus genes, allowing researchers to understand virus adaptation and evolution at the population level.

5.
Massively parallel sequencing technologies promise highly sensitive detection of low-level mutations, especially in mitochondrial DNA (mtDNA) studies. However, the processes from DNA extraction and library construction to bioinformatic analysis include several varying tasks. Further, there is no validated recommendation for the comprehensive procedure. In this study, we examined potential pitfalls affecting the sequencing results based on two-person mtDNA mixtures. To this end, we compared three DNA polymerases and six different variant callers in five mixtures with variant allele frequencies between 50% and 0.5%, generated with two different amplification protocols. In total, 48 samples were sequenced on Illumina MiSeq. Low-level variant calling at the 1% variant level and below was assessed by comparing trimming and PCR duplicate removal as well as six different variant callers. The results indicate that sensitivity, specificity, and precision depend strongly on the polymerase investigated but also vary with the analysis tools. Our data highlight the advantage of prior standardization and validation of the individual laboratory setup with a DNA mixture model. Finally, we provide an artificial heteroplasmy benchmark dataset that can help improve somatic variant callers or pipelines, which may be of great interest for research related to cancer and aging.

6.
The highly challenging hexaploid wheat (Triticum aestivum) genome is becoming ever more accessible due to the continued development of multiple reference genomes, a factor which aids the effort to better understand variation in important traits. Although the process of variant calling is relatively straightforward, selecting the best combination of computational tools for the read alignment and variant calling stages of the analysis and efficiently filtering false variant calls are not always easy tasks. Previous studies have analyzed the impact of methods on the quality metrics in diploid organisms. Given that variant identification in wheat largely relies on accurate mining of exome data, there is a critical need to better understand how different methods affect the analysis of whole exome sequencing (WES) data in polyploid species. This study aims to address this by performing whole exome sequencing of 48 wheat cultivars and assessing the performance of various variant calling pipelines at their suggested settings. The results show that all the pipelines require filtering to eliminate false-positive calls. The high consensus among the reference SNPs called by the best-performing pipelines suggests that filtering provides accurate and reproducible results. This study also provides detailed comparisons for high sensitivity and precision at individual and population levels for the raw and filtered SNP calls.

7.
The purpose of this study was to develop a flexible, cost-efficient, next-generation sequencing (NGS) protocol for genetic testing. Long-range polymerase chain reaction (PCR) amplicons of up to 20 kb in size were designed to amplify entire genomic regions for a panel (n = 35) of inherited retinal disease (IRD)-associated loci. Amplicons were pooled and sequenced by NGS. The analysis was applied to 227 probands diagnosed with IRD: (A) 108 previously molecularly diagnosed, (B) 94 without previous genetic testing, and (C) 25 undiagnosed after whole-exome sequencing (WES). The method was validated with 100% sensitivity on cohort A. Long-range PCR-based sequencing revealed likely causative variant(s) in 51% and 24% of probands from cohorts B and C, respectively. Breakpoints of 3 copy number variants (CNVs) could be characterized. Spiking in long-range PCR libraries extended the coverage of WES. Read phasing confirmed compound heterozygosity in 5 probands. The proposed sequencing protocol provided deep coverage of the entire gene, including intronic and promoter regions. Our method can be used (i) as a first-tier assay to reduce genetic testing costs, (ii) to elucidate missing heritability cases, (iii) to characterize breakpoints of CNVs at nucleotide resolution, (iv) to extend WES data to non-coding regions by spiking in long-range PCR libraries, and (v) to help with phasing of candidate variants.

8.
Oxford Nanopore sequencing can be used to obtain complete bacterial genomes. However, the error rates of Oxford Nanopore long reads are higher than those of Illumina short reads. Long-read assemblers using a variety of assembly algorithms have been developed to overcome this deficiency, but they have not been benchmarked for genomic analyses of bacterial pathogens using Oxford Nanopore long reads. In this study, the long-read assemblers Canu, Flye, Miniasm/Racon, Raven, Redbean, and Shasta were therefore benchmarked using Oxford Nanopore long reads of bacterial pathogens. Ten species were tested for mediocre- and low-quality simulated reads, and 10 species were tested for real reads. Raven was the most robust assembler, obtaining complete and accurate genomes. All Miniasm/Racon and Raven assemblies of mediocre-quality reads provided accurate antimicrobial resistance (AMR) profiles, while the Raven assembly of Klebsiella variicola with low-quality reads was the only assembly with an accurate AMR profile among all assemblers and species. All assemblers functioned well for predicting virulence genes using mediocre-quality and real reads, whereas only the Raven assemblies of low-quality reads had accurate numbers of virulence genes. Regarding multilocus sequence typing (MLST), Miniasm/Racon was the most effective assembler for mediocre-quality reads, while only the Raven assemblies of Escherichia coli O157:H7 and K. variicola with low-quality reads showed positive MLST results. Miniasm/Racon and Raven were the best performers for MLST using real reads. The Miniasm/Racon and Raven assemblies showed accurate phylogenetic inference. For the pan-genome analyses, Raven was the strongest assembler for simulated reads, whereas Miniasm/Racon and Raven performed the best for real reads. Overall, the most robust and accurate assembler was Raven, closely followed by Miniasm/Racon.

9.
10.
Mytilus coruscus (family Mytilidae) is one of the most important marine shellfish species in Korea. During the past few decades, this species has become endangered due to habitat loss and overfishing. Despite this species' importance, information on its genetic background is scarce. In this study, we developed microsatellite markers for M. coruscus using next-generation sequencing. A total of 263,900 raw reads were obtained from a quarter-plate run on the 454 GS-FLX titanium platform, and 176,327 unique sequences were generated with an average length of 381 bp; 2569 (1.45%) sequences contained a minimum of five di- to tetra-nucleotide repeat motifs. Of the 51 loci screened, 46 were amplified successfully, and 22 were polymorphic among 30 individuals, including seven with trinucleotide repeats and three with tetranucleotide repeats. All loci exhibited high genetic variability, with an average of 17.32 alleles per locus, and the mean observed and expected heterozygosities were 0.67 and 0.90, respectively. In addition, cross-amplification was tested for all 22 loci in a congeneric species, M. galloprovincialis. None of the primer pairs resulted in effective amplification, which might be due to their high mutation rates. Our work demonstrated the utility of next-generation 454 sequencing as a method for the rapid and cost-effective identification of microsatellites. The high degree of polymorphism exhibited by the 22 newly developed microsatellites will be useful in future conservation genetic studies of this species.
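
The screening criterion mentioned in the abstract above (at least five tandem copies of a di- to tetra-nucleotide motif) can be illustrated with a regular expression. This is only a sketch under stated assumptions: the abstract does not name the tool the authors used, and `find_microsatellites` and the example sequence are hypothetical.

```python
import re

# A motif of 2-4 bases (shortest tried first) followed by >= 4 more copies,
# i.e. at least five tandem copies in total.
_SSR = re.compile(r"(([ACGT]{2,4}?)\2{4,})")


def find_microsatellites(seq: str) -> list[tuple[int, str, int]]:
    """Return (start, motif, copies) for each perfect di- to tetra-nucleotide repeat."""
    hits = []
    for m in _SSR.finditer(seq.upper()):
        motif = m.group(2)
        if len(set(motif)) == 1:
            continue  # skip homopolymer runs such as "AAAA..."
        hits.append((m.start(), motif, len(m.group(1)) // len(motif)))
    return hits


# Made-up sequence: six AT copies and five CAG copies with flanking bases.
example = "GGC" + "AT" * 6 + "TTGACC" + "CAG" * 5 + "T"
print(find_microsatellites(example))
```

The lazy quantifier makes the pattern prefer the shortest motif, so a run like ATATATATAT is reported as a dinucleotide repeat and not as a tetranucleotide ATAT repeat. Imperfect (interrupted) repeats are not detected by this simple pattern.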

11.
The taxonomic composition of microbial communities can be assessed using universal marker amplicon sequencing. The most common taxonomic markers are the 16S rDNA for bacterial communities and the internal transcribed spacer (ITS) region for fungal communities, but various other markers are used for barcoding eukaryotes. A crucial step in the bioinformatic analysis of amplicon sequences is the identification of representative sequences. This can be achieved using a clustering approach or by denoising raw sequencing reads. DADA2 is a widely adopted algorithm, released as an R library, that denoises marker-specific amplicons from next-generation sequencing and produces a set of representative sequences referred to as ‘Amplicon Sequence Variants’ (ASVs). Here, we present Dadaist2, a modular pipeline that provides a complete suite for analysis ranging from raw sequencing reads to the statistics of numerical ecology. Dadaist2 implements a new approach that is specifically optimised for amplicons with variable lengths, such as the fungal ITS. The pipeline focuses on streamlining the data flow from the command line to R, with multiple options for statistical analysis and plotting, both interactive and automatic.

12.
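As oil and gas field exploration and development deepen and reserves grow and change, the areas over which reserves are distributed keep expanding and being revised, and traditional manual calculation and management of reserve-area maps can no longer meet production needs. With improvements in computer processing of massive data and graphics computation, a unified MAPGIS K9 platform can be used to establish spatial graphic data standards. Drawing on spatial data computation and analysis and spatial reconstruction techniques, a reserves analysis workflow is built that unifies reserve quantity, spatial location, and reporting time. It supports reserve-change analyses such as updating, comparing, summarizing, and querying reserve-area maps, and provides an advanced technical means for spatial graphics management in oilfields and for real-time dynamic tracking of oilfield exploration and development.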

13.
14.
15.
Mutations in POC1B are a rare cause of inherited retinal degeneration. In this study, we present a thorough phenotypic and genotypic characterization of three individuals harboring putatively pathogenic variants in the POC1B gene. All patients displayed a similar, slowly progressive retinopathy (cone dystrophy or cone-rod dystrophy) with normal funduscopy but disrupted outer retinal layers on optical coherence tomography and variable age of onset. Other symptoms were decreased visual acuity and photophobia. Whole genome sequencing revealed a novel homozygous frameshift variant in one patient. Another patient was shown to harbor a novel deep intronic variant in compound heterozygous state with a previously reported canonical splice site variant. The third patient showed a novel nonsense variant and a novel non-canonical splice site variant. We aimed to validate the effect of the deep intronic variant and the non-canonical splice site variant by means of in vitro splice assays. In addition, direct RNA analysis was performed in one patient. Splicing analysis revealed that the non-canonical splice site variant c.561-3T>C leads to exon skipping while the novel deep intronic variant c.1033-327T>A causes pseudoexon activation. Our data expand the genetic landscape of POC1B mutations and confirm the benefit of genome sequencing in combination with downstream functional validation using minigene assays for the analysis of putative splice variants. In addition, we provide clinical multimodal phenotyping of the affected individuals.

16.
The combination of phage display technology with high-throughput sequencing enables in-depth analysis of library diversity and selection-driven dynamics. We applied short-read sequencing of the mutagenized region to focused display libraries of two homologous nucleic acid modification eraser proteins (AlkB and FTO) biopanned against methylated DNA. This revealed enriched genotypes with small indels and concomitant doubtful amino acid motifs within the FTO library. Nanopore sequencing of the entire display vector showed additional enrichment of large deletions overlooked by region-specific sequencing, which further affected the interpretation of the obtained amino acid motifs. We could attribute enrichment of these corrupted clones to amplification bias due to arduous FTO display slowing down host cell growth as well as phage production. This amplification bias appeared to be stronger than affinity-based target selection. Recommendations are provided for proper sequence analysis of phage display data, which can improve motif discovery in libraries of proteins that are difficult to display.

17.
One of the major goals in DNA‐based personalized medicine is the development of sequence‐specific small molecules to target the genome. SAHA‐PIPs belong to this class of small molecules. In the context of the complex eukaryotic genome, the differential biological effects of SAHA‐PIPs are unclear. This question can be addressed by identifying the binding regions across the genome; however, it is a challenge to enrich small‐molecule‐bound DNA without chemical crosslinking. Here, we developed a method that employs high‐throughput sequencing to map the binding sites of small molecules throughout the chromatinized human genome. Analysis of the sequencing data confirmed the presence of specific binding sites for SAHA‐PIPs among the enriched sequence reads. Mapping the binding sites and enriched regions on the human genome clarifies the reason for the distinct biological effects of SAHA‐PIPs. This approach will be useful for identifying the function of other small molecules on a large scale.

18.
Genome-wide association studies (GWAS) found locus 3p21.31 to be associated with severe COVID-19. CCR5 resides at the same locus and, given its known biological role in other infectious diseases, we investigated whether common noncoding and rare coding variants affecting CCR5 can predispose to severe COVID-19. We combined single nucleotide polymorphisms (SNPs) that met the suggestive significance level (P ≤ 1 × 10⁻⁵) at the 3p21.31 locus in public GWAS datasets (6406 COVID-19 hospitalized patients and 902,088 controls) with gene expression data from 208 lung tissues, Hi-C, and ChIP-seq data. Through whole exome sequencing (WES), we explored rare coding variants in 147 severe COVID-19 patients. We identified three SNPs (rs9845542, rs12639314, and rs35951367) associated with severe COVID-19 whose risk alleles correlated with low CCR5 expression in lung tissues. rs35951367 resided in a CTCF binding site that interacts with the CCR5 gene in lung tissues and was confirmed to be associated with severe COVID-19 in two independent datasets. We also identified a rare coding variant (rs34418657) associated with the risk of developing severe COVID-19. Our results suggest a biological role of CCR5 in the progression of COVID-19, as common and rare genetic variants can increase the risk of developing severe COVID-19 by affecting the functions of CCR5.

19.
Megalobrama pellegrini is an endemic fish species found in the upper Yangtze River basin in China. This species has become endangered due to the construction of the Three Gorges Dam and overfishing. However, the available genetic data for this species are limited. Here, we developed 26 polymorphic microsatellite markers from the M. pellegrini genome using next-generation sequencing techniques. A total of 257,497 raw reads were obtained from a quarter-plate run on the 454 GS-FLX titanium platform, and 49,811 unique sequences were generated with an average length of 404 bp; 24,522 (49.2%) sequences contained microsatellite repeats. Of the 53 loci screened, 33 were amplified successfully and 26 were polymorphic. The genetic diversity in M. pellegrini was moderate, with an average of 3.08 alleles per locus, and the mean observed and expected heterozygosities were 0.47 and 0.51, respectively. In addition, we tested cross-species amplification for all 33 loci in four additional breams: M. amblycephala, M. skolkovii, M. terminalis, and Sinibrama wui. The cross-species amplification showed a high level of transferability (79%-97%), which might be due to their very close genetic relationships. The polymorphic microsatellites developed in the current study will not only contribute to further conservation genetic studies and parentage analyses of this endangered species, but also facilitate future work on other closely related species.
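
Observed and expected heterozygosity, as reported in the abstract above, are commonly computed per locus from diploid genotypes. The sketch below assumes the plain (uncorrected) gene-diversity formula He = 1 - Σ pᵢ²; the abstract does not state which estimator or software was used, and the function name and genotypes are hypothetical.

```python
from collections import Counter


def heterozygosity(genotypes: list[tuple[int, int]]) -> tuple[float, float]:
    """Return (observed, expected) heterozygosity for one locus.

    genotypes: one (allele_a, allele_b) pair per diploid individual,
    e.g. microsatellite allele sizes in bp.
    """
    if not genotypes:
        raise ValueError("at least one genotype is required")
    # Observed: fraction of individuals carrying two different alleles.
    observed = sum(a != b for a, b in genotypes) / len(genotypes)
    # Expected: 1 minus the sum of squared allele frequencies.
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = sum(counts.values())
    expected = 1.0 - sum((n / total) ** 2 for n in counts.values())
    return observed, expected


# Made-up genotypes for a single locus in four individuals.
ho, he = heterozygosity([(150, 150), (150, 154), (154, 158), (150, 158)])
print(f"Ho={ho:.3f} He={he:.3f}")
```

Averaging these per-locus values across all loci gives means comparable in form to those quoted in the abstract. Small samples are often handled with an unbiased correction, which this sketch omits.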

20.