首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
The presence of any one of the five unlinked MAL loci (MAL1, MAL2, MAL3, MAL4 and MAL6) confers the ability to ferment maltose on the yeast Saccharomyces cerevisiae. Each locus is composed of three genes encoding maltose permease, alpha-glucosidase and MAL activator. Chromosomal DNA of seven representative baking strains has been separated by pulse-field gel electrophoresis and probed with three genes in MAL6 locus. The DNA bands to which all of the three MAL-derived probes simultaneously hybridized were chromosome VII carrying MAL1 in all of the strains tested, chromosome XI carrying MAL4 in six strains, chromosome III carrying MAL2 in three strains and chromosomes II and VIII carrying MAL3 and MAL6, respectively, in the one strain. The number of MAL loci in baking strains was comparable to those of brewing strains.  相似文献   

2.
The contiguous sequence of 1,003,450 bp spanning map positions 64% to 92% of the genome of Synechocystis sp. strain PCC6803 has been deduced. Computer analysis of the sequence predicts that this region contains at least 818 potential ORFs, in which 255 (31%) were either genes that had already been identified or their homologues, 84 (10%) were homologues to registered hypothetical genes, and 149 (18%) showed weak similarities to reported genes. The remaining 330 ORFs showed no apparent similarity to any reported genes or carried no significant protein motifs. The potential ORFs as a whole occupied 86% of the sequenced region, implying compact arrangement of genes in the genome. As to the structural RNA genes, one rRNA operon consisting of 5,028 bp and at least 11 species of tRNA genes were identified. It is noteworthy that 10 out of the 11 tRNA species showed significant sequence similarities to tRNAs reported in plant chloroplasts. As other notable unique sequences, three classes of IS-like elements each with characteristics typical of IS elements were identified, and a typical unit of WD(Trp-Asp)-repeats which have only been detected in the regulatory proteins of eukaryotes was identified within the large 5,079-bp ORF located at map position 69%.  相似文献   

3.
Maltose and maltotriose are the two most abundant fermentable sugars in brewer's wort, and the rate of uptake of these sugars by brewer's yeast can have a major impact on fermentation performance. In spite of this, no information is currently available on the genetics of maltose and maltotriose uptake in brewing strains of yeast. In this work, we studied 30 brewing strains of yeast (5 ale strains and 25 lager strains) with the aim of examining the alleles of maltose and maltotriose transporter genes contained by them. To do this, we hybridized gene probes to chromosome blots. Studies performed with laboratory strains have shown that maltose utilization is conferred by any one of five unlinked but highly homologous MAL loci (MAL1 to MAL4 and MAL6). Gene 1 at each locus encodes a maltose transporter. All of the strains of brewer's yeast examined except two were found to contain MAL11 and MAL31 sequences, and only one of these strains lacked MAL41. MAL21 was not present in the five ale strains and 12 of the lager strains. MAL61 was not found in any of the yeast strains. In three of the lager strains, there was evidence that MAL transporter gene sequences occurred on chromosomes other than those known to carry MAL loci. Sequences corresponding to the AGT1 gene, which encodes a transporter of several alpha-glucosides, including maltose and maltotriose, were detected in all but one of the yeast strains. Homologues of AGT1 were identified in three of the lager strains, and two of these homologues were mapped, one to chromosome II and the other to chromosome XI. AGT1 appears to be a member of a family of closely related genes, which may have arisen in brewer's yeast in response to selective pressure.  相似文献   

4.
The nucleotide sequence of 37,639 bp of the right arm of chromosome XII has been determined. Twenty-five open reading frames (ORFs) longer than 300 bp were detected, two of which extend into the flanking cosmids. Only two (L2931 and L2961) of the 25 ORFs correspond to previously sequenced genes (HOG1 and YAP3, respectively). Another ORF is distinct from YAP3 but shows pronounced similarity to it. About half of the remaining ORFs show similarity to other genes or display characteristic protein signatures. In particular, ORF L2952 has striking homology with the probable cell cycle control protein crn of Drosophila melanogaster. L2949 has significant similarity to the human ZFM1 (related to a potential suppressor oncogene) and mouse CW17R genes, though it lacks the carboxy-terminal oligoproline and oligoglutamine stretches encoded by these mammalian genes. The small ORF L2922 is similar to part of the much larger yeast flocculation gene FLO1. Other sequences found in the 37639 bp fragment are one delta and one solo-sigma element, the tRNA-Arg3 gene, the small nuclear RNA gene SNR6 and three ARS consensus sequences.  相似文献   

5.
The contiguous 874.423 base pair sequence corresponding to the 50.0-68.8 min region on the genetic map of the Escherichia coli K-12 (W3110) was constructed by the determination of DNA sequences in the 50.0-57.9 min region (360 kb) and two large (100 kb in all) and five short gaps in the 57.9-68.8 min region whose sequences had been registered in the DNA databases. We analyzed its sequence features and found that this region contained at least 894 potential open reading frames (ORFs), of which 346 (38.7%) were previously reported, 158 (17.7%) were homologous to other known genes, 232 (26.0%) were identical or similar to hypothetical genes registered in databases, and the remaining 158 (17.7%) showed no significant similarity to any other genes. A homology search of the ORFs also identified several new gene clusters. Those include two clusters of fimbrial genes, a gene cluster of three genes encoding homologues of the human long chain fatty acid degradation enzyme complex in the mitochondrial membrane, a cluster of at least nine genes involved in the utilization of ethanolamine, a cluster of the secondary set of 11 hyc genes participating in the formate hydrogenlyase reaction and a cluster of five genes coding for the homologues of degradation enzymes for aromatic hydrocarbons in Pseudomonas putida. We also noted a variety of novel genes, including two ORFs, which were homologous to the putative genes encoding xanthine dehydrogenase in the fly and a protein responsible for axonal guidance and outgrowth of the rat, mouse and nematode. An isoleucine tRNA gene, designated ileY, was also newly identified at 60.0 min.  相似文献   

6.
The complete DNA sequence of cosmid clone p59 comprising 37,549 bp derived from chromosome X was determined from an ordered set of subclones. The sequence contains 14 open reading frames (ORFs) containing at least 100 consecutive sense codons. Four of the ORFs represent already known and sequenced yeast genes: B645 is identical to the SME1 gene encoding a protein kinase, required for induction of meiosis in yeast, D819 represents the MEF2 gene probably encoding a second mitochondrial elongation factor-like protein, D678 is identical to the yeast GSH1 gene encoding gamma-glutamylcysteine synthetase and B746 is identical to the CSD3 gene, which plays an as yet unidentified role in chitin biosynthesis and/or its regulation. The deduced amino acid sequence of A550 is 63% identical to the Cc eta subunit of a murine TCP-1-containing chaperonin and more than 35% identical to thermophilic factor 55 from Sulfolobus shibatae, as well as to a number of proteins belonging to the chaperonin TCP-1 family. Open reading frame F551 exhibits homology to two regions of the DAL80 gene located on yeast chromosome XI encoding a pleiotropic negative regulatory protein. In addition, extensive homology was detected in three regions including parts of ORFs A560, B746/CSD3 and the incomplete ORF C852 to three consecutive ORFs of unknown function in the middle of the right arm of chromosome XI. Finally, the sequence contained a tRNA(Arg3) (AGC) gene.  相似文献   

7.
The low-Ca2+-response (LCR) plasmid pCD1 of the plague agent Yersinia pestis KIM5 was sequenced and analyzed for its genetic structure. pCD1 (70,509 bp) has an IncFIIA-like replicon and a SopABC-like partition region. We have assigned 60 apparently intact open reading frames (ORFs) that are not contained within transposable elements. Of these, 47 are proven or possible members of the LCR, a major virulence property of human-pathogenic Yersinia spp., that had been identified previously in one or more of Y. pestis or the enteropathogenic yersiniae Yersinia enterocolitica and Yersinia pseudotuberculosis. Of these 47 LCR-related ORFs, 35 constitute a continuous LCR cluster. The other LCR-related ORFs are interspersed among three intact insertion sequence (IS) elements (IS100 and two new IS elements, IS1616 and IS1617) and numerous defective or partial transposable elements. Regional variations in percent GC content and among ORFs encoding effector proteins of the LCR are additional evidence of a complex history for this plasmid. Our analysis suggested the possible addition of a new Syc- and Yop-encoding operon to the LCR-related pCD1 genes and gave no support for the existence of YopL. YadA likely is not expressed, as was the case for Y. pestis EV76, and the gene for the lipoprotein YlpA found in Y. enterocolitica likely is a pseudogene in Y. pestis. The yopM gene is longer than previously thought (by a sequence encoding two leucine-rich repeats), the ORF upstream of ypkA-yopJ is discussed as a potential Syc gene, and a previously undescribed ORF downstream of yopE was identified as being potentially significant. Eight other ORFs not associated with IS elements were identified and deserve future investigation into their functions.  相似文献   

8.
Murine gammaherpesvirus 68 (gammaHV68) infects mice, thus providing a tractable small-animal model for analysis of the acute and chronic pathogenesis of gammaherpesviruses. To facilitate molecular analysis of gammaHV68 pathogenesis, we have sequenced the gammaHV68 genome. The genome contains 118,237 bp of unique sequence flanked by multiple copies of a 1,213-bp terminal repeat. The GC content of the unique portion of the genome is 46%, while the GC content of the terminal repeat is 78%. The unique portion of the genome is estimated to encode at least 80 genes and is largely colinear with the genomes of Kaposi's sarcoma herpesvirus (KSHV; also known as human herpesvirus 8), herpesvirus saimiri (HVS), and Epstein-Barr virus (EBV). We detected 63 open reading frames (ORFs) homologous to HVS and KSHV ORFs and used the HVS/KSHV numbering system to designate these ORFs. gammaHV68 shares with HVS and KSHV ORFs homologous to a complement regulatory protein (ORF 4), a D-type cyclin (ORF 72), and a G-protein-coupled receptor with close homology to the interleukin-8 receptor (ORF 74). One ORF (K3) was identified in gammaHV68 as homologous to both ORFs K3 and K5 of KSHV and contains a domain found in a bovine herpesvirus 4 major immediate-early protein. We also detected 16 methionine-initiated ORFs predicted to encode proteins at least 100 amino acids in length that are unique to gammaHV68 (ORFs M1 to 14). ORF M1 has striking homology to poxvirus serpins, while ORF M11 encodes a potential homolog of Bcl-2-like molecules encoded by other gammaherpesviruses (gene 16 of HVS and KSHV and the BHRF1 gene of EBV). In addition, clustered at the left end of the unique region are eight sequences with significant homology to bacterial tRNAs. The unique region of the genome contains two internal repeats: a 40-bp repeat located between bp 26778 and 28191 in the genome and a 100-bp repeat located between bp 98981 and 101170. Analysis of the gammaHV68, HVS, EBV, and KSHV genomes demonstrated that each of these viruses have large colinear gene blocks interspersed by regions containing virus-specific ORFs. Interestingly, genes associated with EBV cell tropism, latency, and transformation are all contained within these regions encoding virus-specific genes. This finding suggests that pathogenesis-associated genes of gammaherpesviruses, including gammaHV68, may be contained in similarly positioned genome regions. The availability of the gammaHV68 genomic sequence will facilitate analysis of critical issues in gammaherpesvirus biology via integration of molecular and pathogenetic studies in a small-animal model.  相似文献   

9.
The nucleotide sequence of 22,803 bp on the left arm of chromosome VII was determined by polymerase chain reaction-based approaches to compensate for the unstable character of cosmid clones from this region of the chromosome. The coding density of the sequence is particularly high (more than 83%). Twelve open reading frames (ORFs) longer than 300 bp were found, two of which (at the left side) have been described previously (James et al., 1995) after sequencing of an overlapping cosmid. Four other ORFs correspond to published sequences of the known genes ARO2, RPL9A, TIP1 and MRF1. ARO2 codes for chorismate synthetase. RPL9A for protein L9 of the large ribosomal subunit and MRF1 for a mitochondrial translation release factor. The TIP1 product interacts with Sec20p and is thus involved in transport from endoplasmic reticulum to Golgi. Five of the remaining ORFs have not been identified previously, while the sixth (YGL142c) has been partially sequenced as it lies 5' upstream of MRF1. These six ORFs are relatively large (between 933 and 3657 nucleotides). YGL146c, YGL142c, YGL140c and YGL139w have no significant homology to any protein sequence presently available in the public databases, but show two, nine, nine and eight putative transmembrane spans, respectively. YGL144c has a serine active site signature of lipases. YGL141w has limited homology to several human proteins, one of which mediates complex formation between papillomavirus E6 oncoprotein and tumor suppressor protein p53.  相似文献   

10.
We have initiated large-scale sequencing of the third smallest chromosome of the CL Brener strain of Trypanosoma cruzi and we report here the complete sequence of a contig consisting of three cosmids. This contig covers 93.4 kb and has been found to contain 20-30 novel genes and several repeat elements, including a novel chromosome 3-specific 400-bp repeat sequence. The intergenic sequences were found to be rich in di- and trinucleotide repeats of varying lengths and also contained several known T. cruzi repeat elements. The sequence contains 29 open reading frames (ORFs) longer than 700 bp, the longest being 5157 bp, and a large number of shorter ORFs. Of the long ORFs, seven show homology to known genes in parasites and other organisms, whereas four ORFs were confirmed by sequencing of cDNA clones. Two shorter ORFs were confirmed by a database homology and a cDNA clone, respectively, and one RNA gene was identified. The identified genes include two copies of the gene for alanine-aminotransferase as well as genes for glucose-6-phosphate isomerase, protein kinases and phosphatases, and an ATP synthase subunit. An interesting feature of the sequence was that the genes appear to be organized in two long clusters containing multiple genes on the same strand. The two clusters are transcribed in opposite directions and they are separated by an approximately 20-kb long, relatively GC-rich sequence, that contains two large repetitive elements as well as a pseudogene for cruzipain and a gene for U2snRNA. It is likely that this strand switch region contains one or more regulatory and promoter regions. The reported sequence provides the first insight into the genome organization of T. cruzi and shows the potential of this approach for rapid identification of novel genes. [The sequence data described in this paper have been submitted to the GenBank data library under accession nos. AF052831-AF052833.]  相似文献   

11.
We have sequenced a 61.989 bp stretch located between genes RAD7 and FIP1 of Saccharomyces cerevisiae chromosome X. This stretch contains 36 open reading frames (ORFs) of at least 100 codons. Fourteen of these correspond to sequences previously published as HIT1, CDC8, YAP17, CBF1, NAT1, RPA12, CCT5, TOR1, RFC2, PEM2, CDC11, MIR1, STE18 and GRR1. The proteins deduced from four ORFs (YJR059w, YJR065c, YJR075w, YJR078w) have significant similarity to proteins of known function from yeast or other organisms, including S. cerevisiae serine/threonine-specific protein kinase. Schizosaccharomyces pombe Act2 protein, S. cerevisiae mannosyltransferase OCH1 protein and mouse indoleamine 2,3-dioxygenase, respectively. Four of the remaining 18 ORFs have similarity to proteins with unknown function, six are weakly similar to other known sequences, while another eight exhibit no similarity to any known sequence. In addition, three tRNA genes have been recognized. Three genes clustered within 22 kb (YJR059w, YJR061w and TOR1) have counterparts arranged within 15 kb on the left arm of chromosome XI.  相似文献   

12.
13.
14.
15.
A 6.2-kb DNA fragment containing the genes for the nitrilotriacetate (NTA) monooxygenase of Chelatobacter heintzii ATCC 29600 was cloned and characterized by DNA sequencing and expression studies. The nucleotide sequence contained three major open reading frames (ORFs). Two of the ORFs, which were oriented divergently with an intergenic region of 307 bp, could be assigned to the NTA monooxygenase components A and B. The predicted N-terminal amino acid sequences of these ORFs were identical with those determined for the purified components. We therefore named these genes ntaA (for component A of NTA monooxygenase) and ntaB (for component B). The ntaA and ntaB genes could be expressed in Escherichia coli DH5alpha, and the gene products were visualized after Western blotting (immunoblotting) and incubation with polyclonal antibodies against component A or B. By mixing overproduced NtaB from E. coli and purified component A from C. heintzii ATCC 29600, reconstitution of a functional NTA monooxygenase complex was possible. The deduced gene product of ntaA showed only significant homology to SoxA (involved in dibenzothiophene degradation) and to SnaA (involved in pristamycin synthesis); that of ntaB shared weak homologies in one domain with other NADH:flavine mononucleotide oxidoreductases. These homologies provide no conclusive answer as to the possible evolutionary origin of the NTA monooxygenase. The deduced gene product of the third ORF (ORF1) had homology in the N-terminal region with the GntR class of bacterial regulator proteins and therefore may encode a regulator protein, possibly involved in regulation of ntaA and ntaB expression.  相似文献   

16.
The complete nucleotide sequence of the chloroplast genome (150,613 bp) from the unicellular green alga Chlorella vulgaris C-27 has been determined. The genome contains no large inverted repeat and has one copy of rRNA gene cluster consisting of 16S, 23S, and 5S rRNA genes. It contains 31 tRNA genes, of which the tRNALeu(GAG) gene has not been found in land plant chloroplast DNAs analyzed so far. Sixty-nine protein genes and eight ORFs conserved with those found in land plant chloroplasts have also been found. The most striking is the existence of two adjacent genes homologous to bacterial genes involved in cell division, minD and minE, which are arranged in the same order in Escherichia coli. This finding suggests that the mechanism of chloroplast division is similar to bacterial division. Other than minD and minE homologues, genes encoding ribosomal proteins L5, L12, L19, and S9 (rpl5, rpl12, rpl19, and rps9); a chlorophyll biosynthesis Mg chelating subunit (chlI); and elongation factor EF-Tu (tufA), which have not been reported from land plant chloroplast DNAs, are present in this genome. However, many of the new chloroplast genes recently found in red and brown algae have not been found in C. vulgaris. Furthermore, this algal species possesses two long ORFs related to ycf1 and ycf2 that are exclusively found in land plants. These observations suggest that C. vulgaris is closer to land plants than to red and brown algae.  相似文献   

17.
Two regions from the genome of the virulent Lactobacillus delbrueckii subsp. lactic bacteriophage LL-H were sequenced (2330 and 12939 bp; 44% of the 34.6-kb genome). Together with the previously sequenced region containing the major capsid protein-encoding gene (2498 bp), the sequence had 21 open reading frames (ORFs) on the main coding strand. Only two putative ORFs were detected on the complementary strand. The ORFs covered 93.2% of the sequence. All but four of the ORFs were preceded by a ribosome-binding site. Only four longer non-coding stretches of sequences (175-278 nucleotides (nt) in size) were present. The longest of the non-coding regions contained an A + T-rich sequence that is surrounded by eight perfect copies of an 8-nt sequence that is present both as direct and inverted repeats. This region could represent the origin of replication. All the previously mapped structural protein-encoding genes of phage LL-H were included in the sequence. Genes were identified for the following five proteins: gp19 (encoded by gene g17), gp58 (g71), gp61 (g57), gp75 (g70) and gp89 (g88). N-terminal amino-acid sequencing was performed on gp19 and gp75, and it was found that the N-terminal Met had been post-translationally removed from both proteins.  相似文献   

18.
The nucleotide sequence of a 26.7 kb DNA segment from the left arm of Saccharomyces cerevisiae chromosome IV is presented. An analysis of this segment revealed 11 open reading frames (ORFs) longer than 300 bp and one split gene. These ORFs include the genes encoding the large subunit of RNA polymerase II, the biotin apo-protein ligase, an ADP-ribosylation factor (ARF 2), the 'L35'-ribosomal protein, a rho GDP dissociation factor, and the sequence encoding the protein phosphatase 2A. Further sequence analysis revealed a short ORF encoding the ribosomal protein YL41B, an intron in a 5' untranslated region and an extended homology with another cosmid (X83276) located on the same chromosome. The potential biological relevance of these findings is discussed.  相似文献   

19.
The polypeptide and structural gene for a high-molecular-mass c-type cytochrome, cytochrome c553O, was isolated from the methanotroph Methylococcus capsulatus Bath. Cytochrome c553O is a homodimer with a subunit molecular mass of 124,350 Da and an isoelectric point of 6. 0. The heme c concentration was estimated to be 8.2 +/- 0.4 mol of heme c per subunit. The electron paramagnetic resonance spectrum showed the presence of multiple low spin, S = 1/2, hemes. A degenerate oligonucleotide probe synthesized based on the N-terminal amino acid sequence of cytochrome c553O was used to identify a DNA fragment from M. capsulatus Bath that contains occ, the gene encoding cytochrome c553O. occ is part of a gene cluster which contains three other open reading frames (ORFs). ORF1 encodes a putative periplasmic c-type cytochrome with a molecular mass of 118, 620 Da that shows approximately 40% amino acid sequence identity with occ and contains nine c-heme-binding motifs. ORF3 encodes a putative periplasmic c-type cytochrome with a molecular mass of 94, 000 Da and contains seven c-heme-binding motifs but shows no sequence homology to occ or ORF1. ORF4 encodes a putative 11,100-Da protein. The four ORFs have no apparent similarity to any proteins in the GenBank database. The subunit molecular masses, arrangement and number of hemes, and amino acid sequences demonstrate that cytochrome c553O and the gene products of ORF1 and ORF3 constitute a new class of c-type cytochrome.  相似文献   

20.
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号