首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
2.
Lepidopteran species are mostly pests, causing serious annual economic losses. High-quality genome sequencing and assembly uncover the genetic foundation of pest occurrence and provide guidance for pest control measures. Long-read sequencing technology and assembly algorithm advances have improved the ability to timeously produce high-quality genomes. Lepidoptera includes a wide variety of insects with high genetic diversity and heterozygosity. Therefore, the selection of an appropriate sequencing and assembly strategy to obtain high-quality genomic information is urgently needed. This research used silkworm as a model to test genome sequencing and assembly through high-coverage datasets by de novo assemblies. We report the first nearly complete telomere-to-telomere reference genome of silkworm Bombyx mori (P50T strain) produced by Pacific Biosciences (PacBio) HiFi sequencing, and highly contiguous and complete genome assemblies of two other silkworm strains by Oxford Nanopore Technologies (ONT) or PacBio continuous long-reads (CLR) that were unrepresented in the public database. Assembly quality was evaluated by use of BUSCO, Inspector, and EagleC. It is necessary to choose an appropriate assembler for draft genome construction, especially for low-depth datasets. For PacBio CLR and ONT sequencing, NextDenovo is superior. For PacBio HiFi sequencing, hifiasm is better. Quality assessment is essential for genome assembly and can provide better and more accurate results. For chromosome-level high-quality genome construction, we recommend using 3D-DNA with EagleC evaluation. Our study references how to obtain and evaluate high-quality genome assemblies, and is a resource for biological control, comparative genomics, and evolutionary studies of Lepidopteran pests and related species.  相似文献   

3.
High-quality genome sequences help to elucidate the genetic basis of numerous biological processes and track species evolution. For flax (Linum usitatissimum L.)—a multifunctional crop, high-quality assemblies from Oxford Nanopore Technologies (ONT) data were unavailable, largely due to the difficulty of isolating pure high-molecular-weight DNA. This article proposes a scheme for gaining a contiguous L. usitatissimum assembly using Nanopore data. We developed a protocol for flax nuclei isolation with subsequent DNA extraction, which allows obtaining about 5 μg of pure high-molecular-weight DNA from 0.5 g of leaves. Such an amount of material can be collected even from a single plant and yields more than 30 Gb of ONT data in two MinION runs. We performed a comparative analysis of different genome assemblers and polishers on the gained data and obtained the final 447.1-Mb assembly of L. usitatissimum line 3896 genome using the Canu—Racon (two iterations)—Medaka combination. The genome comprised 1695 contigs and had an N50 of 6.2 Mb and a completeness of 93.8% of BUSCOs from eudicots_odb10. Our study highlights the impact of the chosen genome construction strategy on the resulting assembly parameters and its eligibility for future genomic studies.  相似文献   

4.
Nanopore sequencing (ONT) is a new and rapidly developing method for determining nucleotide sequences in DNA and RNA. It serves the ability to obtain long reads of thousands of nucleotides without assembly and amplification during sequencing compared to next-generation sequencing. Nanopore sequencing can help for determination of genetic changes leading to antibiotics resistance. This study presents the application of ONT technology in the assembly of an E. coli genome characterized by a deletion of the tolC gene and known single-nucleotide variations leading to antibiotic resistance, in the absence of a reference genome. We performed benchmark studies to determine minimum coverage depth to obtain a complete genome, depending on the quality of the ONT data. A comparison of existing programs was carried out. It was shown that the Flye program demonstrates plausible assembly results relative to others (Shasta, Canu, and Necat). The required coverage depth for successful assembly strongly depends on the size of reads. When using high-quality samples with an average read length of 8 Kbp or more, the coverage depth of 30× is sufficient to assemble the complete genome de novo and reliably determine single-nucleotide variations in it. For samples with shorter reads with mean lengths of 2 Kbp, a higher coverage depth of 50× is required. Avoiding of mechanical mixing is obligatory for samples preparation. Nanopore sequencing can be used alone to determine antibiotics-resistant genetic features of bacterial strains.  相似文献   

5.
To date, different strategies of whole-genome sequencing (WGS) have been developed in order to understand the genome structure and functions. However, the analysis of genomic sequences obtained from natural populations is challenging and the biological interpretation of sequencing data remains the main issue. The MinION device developed by Oxford Nanopore Technologies (ONT) is able to generate long reads with minimal costs and time requirements. These valuable assets qualify it as a suitable method for performing WGS, especially in small laboratories. The long reads resulted using this sequencing approach can cover large structural variants and repetitive sequences commonly present in the genomes of eukaryotes. Using MinION, we performed two WGS assessments of a Romanian local strain of Drosophila melanogaster, referred to as Horezu_LaPeri (Horezu). In total, 1,317,857 reads with a size of 8.9 gigabytes (Gb) were generated. Canu and Flye de novo assembly tools were employed to obtain four distinct assemblies with both unfiltered and filtered reads, achieving maximum reference genome coverages of 94.8% (Canu) and 91.4% (Flye). In order to test the quality of these assemblies, we performed a two-step evaluation. Firstly, we considered the BUSCO scores and inquired for a supplemental set of genes using BLAST. Subsequently, we appraised the total content of natural transposons (NTs) relative to the reference genome (ISO1 strain) and mapped the mdg1 retroelement as a resolution assayer. Our results reveal that filtered data provide only slightly enhanced results when considering genes identification, but the use of unfiltered data had a consistent positive impact on the global evaluation of the NTs content. Our comparative studies also revealed differences between Flye and Canu assemblies regarding the annotation of unique versus repetitive genomic features. In our hands, Flye proved to be moderately better for gene identification, while Canu clearly outperformed Flye for NTs analysis. Data concerning the NTs content were compared to those obtained with ONT for the D. melanogaster ISO1 strain, revealing that our strategy conducted to better results. Additionally, the parameters of our ONT reads and assemblies are similar to those reported for ONT experiments performed on various model organisms, revealing that our assembly data are appropriate for a proficient annotation of the Horezu genome.  相似文献   

6.
7.
The reconstruction of individual haplotypes can facilitate the interpretation of disease risks; however, high costs and technical challenges still hinder their assessment in clinical settings. Second-generation sequencing is the gold standard for variant discovery but, due to the production of short reads covering small genomic regions, allows only indirect haplotyping based on statistical methods. In contrast, third-generation methods such as the nanopore sequencing platform developed by Oxford Nanopore Technologies (ONT) generate long reads that can be used for direct haplotyping, with fewer drawbacks. However, robust standards for variant phasing in ONT-based target resequencing efforts are not yet available. In this study, we presented a streamlined proof-of-concept workflow for variant calling and phasing based on ONT data in a clinically relevant 12-kb region of the APOE locus, a hotspot for variants and haplotypes associated with aging-related diseases and longevity. Starting with sequencing data from simple amplicons of the target locus, we demonstrated that ONT data allow for reliable single-nucleotide variant (SNV) calling and phasing from as little as 60 reads, although the recognition of indels is less efficient. Even so, we identified the best combination of ONT read sets (600) and software (BWA/Minimap2 and HapCUT2) that enables full haplotype reconstruction when both SNVs and indels have been identified previously using a highly-accurate sequencing platform. In conclusion, we established a rapid and inexpensive workflow for variant phasing based on ONT long reads. This allowed for the analysis of multiple samples in parallel and can easily be implemented in routine clinical practice, including diagnostic testing.  相似文献   

8.
9.
For tiling of the SARS-CoV-2 genome, the ARTIC Network provided a V4 protocol using 99 pairs of primers for amplicon production and is currently the widely used amplicon-based approach. However, this technique has regions of low sequence coverage and is labour-, time-, and cost-intensive. Moreover, it requires 14 pairs of primers in two separate PCRs to obtain spike gene sequences. To overcome these disadvantages, we proposed a single PCR to efficiently detect spike gene mutations. We proposed a bioinformatic protocol that can process FASTQ reads into spike gene consensus sequences to accurately call spike protein variants from sequenced samples or to fairly express the cases of missing amplicons. We evaluated the in silico detection rate of primer sets that yield amplicon sizes of 400, 1200, and 2500 bp for spike gene sequencing of SARS-CoV-2 to be 59.49, 76.19, and 92.20%, respectively. The in silico detection rate of our proposed single PCR primers was 97.07%. We demonstrated the robustness of our analytical protocol against 3000 Oxford Nanopore sequencing runs of distinct datasets, thus ensuring high-integrity sequencing of spike genes for variant SARS-CoV-2 determination. Our protocol works well with the data yielded from versatile primer designs, making it easy to determine spike protein variants.  相似文献   

10.
11.
The high-throughput molecular analysis of gene targeting (GT) events is made technically challenging by the residual presetabce of donor molecules. Large donor molecules restrict primer placement, resulting in long amplicons that cannot be readily analyzed using standard NGS pipelines or qPCR-based approaches such as ddPCR. In plants, removal of excess donor is time and resource intensive, often requiring plant regeneration and weeks to months of effort. Here, we utilized Oxford Nanopore Amplicon Sequencing (ONAS) to bypass the limitations imposed by donor molecules with 1 kb of homology to the target and dissected GT outcomes at three loci in Nicotiana benthamia leaves. We developed a novel bioinformatic pipeline, Phased ANalysis of Genome Editing Amplicons (PANGEA), to reduce the effect of ONAS error on amplicon analysis and captured tens of thousands of somatic plant GT events. Additionally, PANGEA allowed us to collect thousands of GT conversion tracts 5 days after reagent delivery with no selection, revealing that most events utilized tracts less than 100 bp in length when incorporating an 18 bp or 3 bp insertion. These data demonstrate the usefulness of ONAS and PANGEA for plant GT analysis and provide a mechanistic basis for future plant GT optimization.  相似文献   

12.
13.
14.
The early vascular plants in the genus Selaginella, which is the sole genus of the Selaginellaceae family, have an important place in evolutionary history, along with ferns, as such plants are valuable resources for deciphering plant evolution. In this study, we sequenced and assembled the plastid genome (plastome) sequences of two Selaginella tamariscina individuals, as well as Selaginella stauntoniana and Selaginella involvens. Unlike the inverted repeat (IR) structures typically found in plant plastomes, Selaginella species had direct repeat (DR) structures, which were confirmed by Oxford Nanopore long-read sequence assembly. Comparative analyses of 19 lycophytes, including two Huperzia and one Isoetes species, revealed unique phylogenetic relationships between Selaginella species and related lycophytes, reflected by structural rearrangements involving two rounds of large inversions that resulted in dynamic changes between IR and DR blocks in the plastome sequence. Furthermore, we present other uncommon characteristics, including a small genome size, drastic reductions in gene and intron numbers, a high GC content, and extensive RNA editing. Although the 16 Selaginella species examined may not fully represent the genus, our findings suggest that Selaginella plastomes have undergone unique evolutionary events yielding genomic features unparalleled in other lycophytes, ferns, or seed plants.  相似文献   

15.
Background: Long noncoding RNAs (lncRNAs) have been implicated in the pathogenesis of cardiovascular diseases. We aimed to identify novel lncRNAs associated with the early response to ischemia in the heart. Methods and Results: RNA sequencing data gathered from 81 paired left ventricle samples from patients undergoing cardiopulmonary bypass was collected before and after a period of ischemia. Novel lncRNAs were validated with Oxford Nanopore Technologies long-read sequencing. Gene modules associated with an early ischemic response were identified and the subcellular location of selected lncRNAs was determined with RNAscope. A total of 2446 mRNAs, 270 annotated lncRNAs and one novel lncRNA differed in response to ischemia (adjusted p < 0.001, absolute fold change >1.2). The novel lncRNA belonged to a gene module of highly correlated genes that also included 39 annotated lncRNAs. This module associated with ischemia (Pearson correlation coefficient = −0.69, p = 1 × 10−23) and activation of cell death pathways (p < 6 × 10−9). A further nine novel cardiac lncRNAs were identified, of which, one overlapped five cis-eQTL eSNPs for the gene RWD Domain-Containing Sumoylation Enhancer (RWDD3) and was itself correlated with RWDD3 expression (Pearson correlation coefficient −0.2, p = 0.002). Conclusion: We have identified 10 novel lncRNAs, one of which was associated with myocardial ischemia and may have potential as a novel therapeutic target or early marker for myocardial dysfunction.  相似文献   

16.
17.
Dilated cardiomyopathy (DCM) is a common cause of heart failure (HF) and is of familial origin in 20–40% of cases. Genetic testing by next-generation sequencing (NGS) has yielded a definite diagnosis in many cases; however, some remain elusive. In this study, we used a combination of NGS, human-induced pluripotent-stem-cell-derived cardiomyocytes (iPSC-CMs) and nanopore long-read sequencing to identify the causal variant in a multi-generational pedigree of DCM. A four-generation family with familial DCM was investigated. Next-generation sequencing (NGS) was performed on 22 family members. Skin biopsies from two affected family members were used to generate iPSCs, which were then differentiated into iPSC-CMs. Short-read RNA sequencing was used for the evaluation of the target gene expression, and long-read RNA nanopore sequencing was used to evaluate the relevance of the splice variants. The pedigree suggested a highly penetrant, autosomal dominant mode of inheritance. The phenotype of the family was suggestive of laminopathy, but previous genetic testing using both Sanger and panel sequencing only yielded conflicting evidence for LMNA p.R644C (rs142000963), which was not fully segregated. By re-sequencing four additional affected family members, further non-coding LMNA variants could be detected: rs149339264, rs199686967, rs201379016, and rs794728589. To explore the roles of these variants, iPSC-CMs were generated. RNA sequencing showed the LMNA expression levels to be significantly lower in the iPSC-CMs of the LMNA variant carriers. We demonstrated a dysregulated sarcomeric structure and altered calcium homeostasis in the iPSC-CMs of the LMNA variant carriers. Using targeted nanopore long-read sequencing, we revealed the biological significance of the variant c.356+1G>A, which generates a novel 5′ splice site in exon 1 of the cardiac isomer of LMNA, causing a nonsense mRNA product with almost complete RNA decay and haploinsufficiency. Using novel molecular analysis and nanopore technology, we demonstrated the pathogenesis of the rs794728589 (c.356+1G>A) splice variant in LMNA. This study highlights the importance of precise diagnostics in the clinical management and workup of cardiomyopathies.  相似文献   

18.
19.
20.
The performance of tile roofing assemblies as well as untreated cedar shake roofing assemblies exposed to continuous firebrand showers were compared. Specifically, experiments were conducted for two types of concrete tile roofing assemblies (flat and profiled), one type of terracotta tile roofing assembly (flat) and an untreated (without any fire retardant) cedar shake roofing assembly. The design of the roofing assemblies was based on construction guidelines in the USA. The duration of the firebrand flux was fixed at 20 min, and the wind speed was varied from 6 m/s to 9 m/s. These wind speeds were chosen to be able to compare roofing assembly performance to similar assemblies exposed to a batch‐feed firebrand generator which had limited duration of firebrand exposure (6 min). The average firebrand mass flux that arrived at the surface of the roofing assemblies was 0.3 g/m2s Results indicated that for the untreated cedar shake assemblies, ignition occurred easily from the firebrand assault, and this type of roofing assembly generated their own firebrands after ignition. To attempt to quantify the degree of penetration, the number of firebrands that penetrated the tile roofing assemblies, and deposited onto the underlayment/counter‐batten system was counted as function of wind speed for each assembly. Firebrand penetration was observed, even for the flat tile assemblies. It is believed that these are the first‐ever experiments described in the peer‐reviewed literature to expose wood and tile roofing experiments to continuous wind‐driven firebrand showers. Copyright © 2016 John Wiley & Sons, Ltd.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号