scispace - formally typeset
Search or ask a question

Showing papers on "Sequence analysis published in 1981"


Journal ArticleDOI
TL;DR: The length of the terminal repeat sequence for O. nova was confirmed in cloned macronuclear DNA molecules and the sequences is similar but differs in the lengths of the duplex region (28 base pairs) and of the putative 3' extension (14 base pairs).
Abstract: In hypotrichous ciliates, all of the macronuclear DNA is in the form of low molecular weight molecules with an average size of approximately 2200 base pairs. Total macronuclear DNA from four hypotrichs has been shown to have inverted terminal repeats by direct sequence analysis. In Oxytricha nova, Oxytricha sp., and Stylonychia pustulata, this terminal sequence may be written as 5'-C4A4C4A4C4 ... 3'-G4T4G4T4G4T4G4T4G4 ... In Euplotes aediculatus, the sequences is similar but differs in the lengths of the duplex region (28 base pairs) and of the putative 3' extension (14 base pairs). Also in Euplotes, a second common sequence of 5 base pairs (A-A-C-T-T-T-T-G-A-A) occurs internal to the terminal repeat and a 17-base-pair heterogeneous region: 5'-C4A4C4A4C4A4C4(X)17T-T-G-A-A ... 3'-G2T4G4T4G4T4G4T4G4T4G4(X)17A-A-C-T-T ... The length of the terminal repeat sequence for O. nova was confirmed in cloned macronuclear DNA molecules.

461 citations


Journal ArticleDOI
01 Jul 1981-Cell
TL;DR: V virtually the entire immune response to phosphorylcholine is derived from a single V H -coding sequence, denoted T15 after the prototype V H sequence of this group of antibodies.

424 citations


Journal ArticleDOI
TL;DR: Cl cloning, amplification in bacteria, and sequence analysis of DNA complementary to Prl mRNA isolated from human pituitary Prl-secreting adenomas establish an evolutionary clock and postulate that the chromosomal segregation of human Prl and human growth hormone occurred about 392 million years ago and that growth hormone and chorionic sommatomammotropin underwent an intrachromosomal recombination within the last 10 million years.

319 citations


Journal ArticleDOI
TL;DR: Both mRNAs contain 10-15 heterogeneous nonviral nucleotides at their 5' ends that appear to be derived from cellular RNAs used for priming the transcription of viral RNAs.
Abstract: RNA segment 7 of the influenza A virus genome codes for at least two proteins, M1 and M2, which are synthesized from separate mRNA species. Sequence analysis of the M2 mRNA has shown that it contains an interrupted sequence of 689 nucleotides. The approximately 51 virus-specific nucleotides comprising the 5'-end leader sequence of the M2 mRNA are the same as those found at the 5' end of the colinear M1 mRNA. Following the leader sequence of the M2 mRNA, where is a 271-nucleotide body region that is 3' coterminal with the M1 mRNA. Another small potential mRNA (mRNA3) related to RNA segment 7 has been found. mRNA3 has a leader sequence of approximately 11 virus-specific nucleotides that are the same as the 5' end of the M1 and M2 mRNAs, followed by an interrupted sequence of 729 nucleotides, and then a body region of approximately 271 nucleotides that is the same as that of the M2 mRNA. The nucleotide sequences found at the junctions of the interrupted sequences in M2 mRNA and mRNA3 are similar to those found at the splicing points of intervening sequences in eukaryotic mRNAs. In addition, both mRNAs contain 10-15 heterogeneous nonviral nucleotides at their 5' ends that appear to be derived from cellular RNAs used for priming the transcription of viral RNAs. Because the 5'-end sequences of the M1 mRNA and the M2 mRNA are the same and share the 5'-proximal initiation codon for protein synthesis, the first nine amino acids would be the same in the M1 and M2 protein and then the sequences would diverge. The approximately 271-nucleotide body region of the M2 mRNA can be translated in the +1 reading frame, and the sequence indicates that M1 and M2 overlap by 14 amino acids. The coding potential of the mRNA3 is for only nine amino acids, and these would be identical to the COOH-terminal region of the membrane protein (M1).

309 citations


Journal ArticleDOI
TL;DR: In this article, a method for cloning mRNAs has been used, which results in a high yield of recombinants containing complete 5'-terminal mRNA sequences, and the data strongly support the conclusion that the 5'-untranslated region of the lysozyme mRNA is heterogeneous in length.
Abstract: A method for cloning mRNAs has been used which results in a high yield of recombinants containing complete 5'-terminal mRNA sequences. It is not dependent on self-priming to generate double-stranded DNA and therefore the S1 nuclease digestion step is not required. Instead, the cDNA is dCMP-tailed at its 3'-end with terminal deoxynucleotidyl transferase (TdT). The synthesis of the second strand is primed by oligo(dG) hybridized to the 3'-tail. Double-stranded cDNA is subsequently tailed with dCTP and annealed to dGMP-tailed vector DNA. This approach overcomes the loss of the 5'-terminal mRNA sequences and the problem of artifacts which may be introduced into cloned cDNA sequences. Chicken lysozyme cDNA was cloned into pBR322 by this procedure with a transformation efficiency of 5 x 10(3) recombinant clones per ng of ds-cDNA. Sequence analysis revealed that at least nine out of nineteen randomly isolated plasmids contained the entire 5'-untranslated mRNA sequence. The data strongly support the conclusion that the 5'-untranslated region of the lysozyme mRNA is heterogeneous in length.

303 citations


Journal ArticleDOI
TL;DR: It is suggested that three proteases are required to produce these proteins from their polyprotein precursor: a viral protease, which functions in the cytosol to release the capsid protein, signalase, and a protease of the Golgi complex that cleaves after double basic residues, to process the precursor form of one of the glycoproteins.
Abstract: The nucleotide sequence of intracellular 26S mRNA of Sindbis virus has been determined by direct sequence analysis of the cDNA made to this RNA with reverse transcriptase. From this, the amino acid sequences of the encoded virus structural proteins, which include a basic capsid protein and two integral membrane glycoproteins, have been deduced. The features of these proteins as related to their functions are discussed. We suggest that three proteases are required to produce these proteins from their polyprotein precursor: a viral protease, which functions in the cytosol to release the capsid protein, signalase, which makes two cleavages to separate the glycoproteins; and a protease of the Golgi complex that cleaves after double basic residues, to process the precursor form of one of the glycoproteins.

294 citations


Book
01 Jan 1981
TL;DR: This chapter discusses the preliminary characterization of the protein, methods for the detection of peptides, and applications of partial sequence analysis and relationship to DNA sequencing in the protein chemistry laboratory.
Abstract: Preface. List of Abbreviations. Chapter headings: 1. Introduction. 2. Preliminary characterization of the protein. 3. Specific cleavage of the protein. 4. Separation and purification of peptides. 5. Methods for the detection of peptides. 6. Determination of peptide sequences. 7. Structures introduced by post-translational modification. 8. Deduction of the total primary structure of a protein from the sequences of constituent peptides. 9. Integral membrane proteins. 10. Applications of partial sequence analysis and relationship to DNA sequencing. Appendix I. Manufacturers and supplies of equipment and reagents for protein sequence analysis. II. Purification of solvents and reagents. III. Safety in the protein chemistry laboratory. IV. Notes on work with very small amounts of proteins and peptides. References. Subject index.

294 citations


Journal ArticleDOI
TL;DR: The fragmentation pathways observed are defined and the utility in sequence analysis is commented on and a simple acetylation experiment is shown to aid rapid sequence assignment.

262 citations


Journal ArticleDOI
01 Sep 1981-Cell
TL;DR: Mapping studies suggest that gene 27.1 may be a pseudogene for eigher a Qa antigen or an as yet undefined transplantation antigen as well as those encoding transplantation antigens among the 30 to 40 different genomic clones isolated from the sperm library.

244 citations


Journal ArticleDOI
TL;DR: The DNA sequence analysis showed that the RNA molecule synthesized from the copA gene is untranslatable but has the potential for a high degree of secondary structure.
Abstract: Replication of plasmid R1 is controlled by the products of two genes, copA and copB, that act as inhibitors of replication. Here it is shown that one small RNA synthesized from the copA gene acts as replication inhibitor. This RNA molecule was identified from analyses of RNAs synthesized in EScherichia coli minicells carrying R1 miniplasmids or chimeric plasmids containing the copA gene. In minicells, This RNA was found to be unstable with a half-life of less than a few minutes. Two mutant hybrid plasmids lacking the inhibitor function did not express the RNA normally made from plasmids carrying the wild-type copA allele. Nucleotide sequence analysis of one of the copA mutants showed that a base substitution had occurred within the promoter sequence in front of the copA gene. DNA sequence analysis of the other mutant showed that a putative transcription-termination sequence was affected. The DNA sequence analysis also showed that the RNA molecule synthesized from the copA gene is untranslatable but has the potential for a high degree of secondary structure.

190 citations


Journal ArticleDOI
17 Dec 1981-Nature
TL;DR: Four single base pair mutations in the region that codes for RNA I create new incompatibility groups while preserving the mechanism of control of initiation of DNA replication in a ColE1-type plasmid.
Abstract: Four single base pair mutations in the region that codes for RNA I create new incompatibility groups while preserving the mechanism of control of initiation of DNA replication in a ColE1-type plasmid. Sequence analysis of these base substitutions points to the primary importance of the central loop of the cloverleaf structure of RNA I in this control mechanism.

Journal ArticleDOI
TL;DR: Native ovine hypothalamic SS-28 was found to be highly potent in inhibit growth hormone release from cultured rat anterior pituitary cells and indicated that SS- 28 itself may possess regulatory functions.
Abstract: A large somatostatin-like polypeptide of apparent molecular weight 3000-4500 [4K somatostatin (SS)] was isolated from ovine hypothalamus. The polypeptide was obtained in the methionine sulfoxide form. Two microsequence analyses of 0.6 and 1.8 nmol of 4K SS were performed with a modified 890 C spinning cup sequencer. The sequencing data together with results of amino acid analysis and C-terminal end-group determination indicated that 4K SS was identical with somatostatin-28 (SS-28) isolated from procine upper small intestine and sequenced by Pradayrol et al. [Pradayrol, L., Jornvall, H., Mutt, V., & Ribet, A. (1980) FEBS Lett. 109, 55-58]. No free cysteine sulfhydryl group could be detected, so that it was assumed that the two cysteine residues of ovine SS-28 formed an intramolecular disulfide bond. Besides the structure of SS-28, the N-terminal first 30 residues of an unknown polypeptide from ovine hypothalamus were sequenced as follows: H-Ile-Pro-Ile-Tyr-Glu-Lys-Lys-Tyr-Gly-Gln-Val-Pro-Met-Cys-Asp-Ala-Gly-Glu-Gln- Cys-Ala-Val-Arg-Lys-Gly-Ala-Arg-Ile-Gly-Lys. Trypsin cleaved the somatostatin (SS) entity less selectively from ovine hypothalamic SS-28 than from rat hypothalamic 12 000-dalton SS-like polypeptide (12K SS). Native ovine hypothalamic SS-28 was found to be highly potent in inhibit growth hormone release from cultured rat anterior pituitary cells. The results raised doubts that ovine SS-28 would be an SS precursor and indicated that SS-28 itself may possess regulatory functions.

Journal ArticleDOI
TL;DR: Host-range mutants of adenovirus 5 that contain a defect in region E1A fail to replicate in HeLa cells and to transform rodent cells, and results strongly imply that the Mr 51,000 polypeptide is needed to regulate expression of adjacent viral genes during the early phase of adanovirus infection.
Abstract: Host-range mutants of adenovirus 5 that contain a defect in region E1A (0-4.5 units) fail to replicate in HeLa cells and to transform rodent cells. In HeLa cells, these mutants synthesize only the two RNAs from E1A that share the same 5' and 3' termini but differ in length by the amount of internal sequence removed by splicing. RNA from wild-type virus, selected by hybridization to DNA from region E1A, translates into polypeptides of Mr 51,000 and 48,000 that are highly acidic in isoelectric focusing gels. These acidic Mr 51,000 and Mr 48,000 polypeptides are encoded by the longer and shorter E1A RNAs, respectively. Two of the host-range mutants, H5hr1 and H5hr2, fail to synthesize the Mr 51,000 polypeptide but do produce the Mr 48,000 polypeptide and a novel polypeptide thought to be a truncated portion of the Mr 51,000 polypeptide. H5hr1 and H5hr2 are hypothesized to have termination codons in sequences found only in RNA encoding the Mr 51,000 polypeptide. This prediction is verified for H5hr1 by DNA sequence analysis. The other three host-range mutants (H5hr3-5) synthesize both acidic polypeptides and are predicted to be missense. These results strongly imply that the Mr 51,000 polypeptide, alone or in combination with the Mr 48,000 polypeptide, is needed to regulate expression of adjacent viral genes during the early phase of adenovirus infection.

Journal ArticleDOI
TL;DR: It is possible, therefore, that the prosequence of procathepsin D serves as an activation peptide that keeps the enzyme inactive during intracellular transport to the lysosome.

Journal ArticleDOI
TL;DR: The nucleotide sequence of uncloned mouse satellite DNA has been determined by analyzing Sau96I restriction fragments that correspond to the repeat unit of the satellite DNA and an unambiguous sequence of 234 bp has been obtained.
Abstract: The nucleotide sequence of uncloned mouse satellite DNA has been determined by analyzing Sau96I restriction fragments that correspond to the repeat unit of the satellite DNA. An unambiguous sequence of 234 bp has been obtained. The sequence of the first 250 bases from dimeric satellite fragments present in Sau96I limit digests corresponds almost exactly to two tandemly arranged monomer sequences including a complete Sau96I site in the center. This is in agreement with the hypothesis that a low level of divergence which cannot be detected in sequence analyses of uncloned DNA is responsible for the appearance of dimeric fragments. Most of the sequence of the 5% fraction of Sau96 monomers that are susceptible to TaqI has also been determined and has been found to agree completely with the prototype sequence. The monomer sequence is internally repetitious being composed of eight diverged subrepeats. The divergence pattern has interesting implications for theories on the evolution of mouse satellite DNA.

Journal ArticleDOI
TL;DR: The conclusion of these investigations is that tRNA's represent ancient molecules that existed in the form of a mutant distribution prior to their integration into genomes.
Abstract: Evolutionary history of tRNA is studied by comparative sequence analysis of two specified tRNA's at various phylogenetic levels and of tRNA families within four different species. Criteria are developed that allow 1) to distinguish between convergent and divergent evolution, 2) to determine the mechanism of divergence and 3) to estimate the degree of randomization of the variable parts of the sequences. The conclusion of these investigations is that tRNA's represent ancient molecules that existed in the form of a mutant distribution prior to their integration into genomes.

Journal ArticleDOI
TL;DR: Poly(A)-RNA enriched for type I procollagen sequences was isolated from normal human fibroblasts and used as template to synthesize double-stranded cDNA with avian myeloblastosis virus (AMV) reverse transcriptase.
Abstract: Poly(A)-RNA enriched for type I procollagen sequences was isolated from normal human fibroblasts and used as template to synthesize double-stranded cDNA with avian myeloblastosis virus (AMV) reverse transcriptase. After the ends had been blunted with nuclease S1 and dGMP tails had been added with terminal deoxynucleotidyltransferase, the double-stranded cDNA was annealed with pBR322 DNA that had previously been cleaved with EcoRI, blunted with AMV reverse transcriptase, and dCMP-tailed with terminal deoxynucleotidyltransferase. The chimeric molecule was used to transform Escherichia coli strain HB101. Ninety-five recombinant clones were obtained and screened by dot hybridization analysis using 32P-labeled cDNA synthesized from the original poly(A)-RNA collagen-enriched population. Three positive clones were isolated and further characterized by blot hybridization techniques and by EcoRII digestion. One clone with an insert of 2.2 kilobases was shown to contain sequences encoding for the pro-alpha 2 chain of human type I procollagen. DNA sequence analysis of a 172-nucleotide fragment demonstrated that the cloned cDNA extends from amino acid position 450 of the alpha 2 chain to the middle of the COOH-terminal propeptide.

Journal ArticleDOI
TL;DR: The study shows that pairing between subunits is nonrandom, which is consistent with evidence that glycinin is synthesized as a Mr = 60,000 precursor that undergoes post-translational modification to form the individual linked subunits.

Journal ArticleDOI
21 May 1981-Nature
TL;DR: Comparison with the yeast (Saccharomyces cerevisiae) 18S sequence reveals extensive regions of high homology interspersed with tracts having little or no homology.
Abstract: 18S ribosomal RNA in Xenopus laevis is 1,825 nucleotides long, as inferred from sequence analysis of an 18S gene. All the 40 rRNA methyl groups can be located in the sequence. Comparison with the yeast (Saccharomyces cerevisiae) 18S sequence reveals extensive regions of high homology interspersed with tracts having little or no homology. Regions of high homology contain almost all the RNA methyl groups. Major regions of low homology are considerably richer in C + G in Xenopus than in yeast.

Journal ArticleDOI
01 Mar 1981-Cell
TL;DR: Southern transfer hybridization experiments, on DNA digested with enzymes that do not cut the major repeat unit of the family of genes of interest, reveal histone gene orphons in the sea urchin, ribosomal gene orPhon loci in yeast, and Ribosomal and H3 hist one gene orPHons in Drosophila melanogaster.

Journal ArticleDOI
TL;DR: Comparison of the nucleotide and protein sequences reveals no homology between the endonuclease and methylase which might provide insight into the origin of the restriction-modification system or the mechanism of common substrate recognition.

Journal ArticleDOI
TL;DR: The complete primary structure of a lipase (triacylglycerol hydrolase; EC 3.1.3) is presented for the first time and few similarities which might be significant were detected between the environment and relative position of certain half cystines in lipase and colipase, as well as between two tyrosine-rich regions existing in both proteins.

Journal ArticleDOI
TL;DR: Sequence analysis of genomic DNA shows that transcription can proceed through the major polyadenylation site and that alternative polyadenyation sites are used in the Amy-1A gene.
Abstract: Two alpha-amylase mRNAs which differ in the length of their 3' non-translated region accumulate in the cytoplasm in both mouse liver and salivary gland tissues. The two species in each tissue are transcribed from the same gene (Amy-1A). The minor species is approximately 20-nucleotides preceding the poly(A) tract. Sequence analysis of genomic DNA shows that these extra 237 nucleotides are specified by sequences contiguous to those shared by the two mRNAs. These data demonstrate that transcription can proceed through the major polyadenylation site and that alternative polyadenylation sites are used in the Amy-1A gene. Sequences which trail the two polyadenylation sites exhibit extensive homology and might therefore be involved in polyadenylation or transcription termination.

Journal ArticleDOI
TL;DR: From the nucleotide sequence of the DBP gene it can be derived that DBP contains 529 amino acid residues and has an actual molecular weight of 59,049 daltons.
Abstract: We have determined the nucleotide sequence of a region of adenovirus type 5 (Ad5) DNA located between map positions 61.7 and 71.4, which covers the gene form the 72 kD DNA binding protein (DBP) and the sequence encoding the amino-terminal part of the 100 kD protein. Sequence analysis of cDNA copies of DBP mRNA revealed the existence of two abundant species of spliced mRNA molecules. One species consists of two short leader sequences from positions 75.2 (67 and 68 nucleotides long) and 68.8 (77 nucleotides long), respectively, and the main body of the RNA molecules. The other species contains only the leader sequence from position 75.2 and the main body. The amino acid sequence of DBP is encoded entirely by a long open reading frame of 1587 nucleotides in the main body of DBP mRNA. From the nucleotide sequence of the DBP gene it can be derived that DBP contains 529 amino acid residues and has an actual molecular weight of 59,049 daltons. The sites of mutation in the mutants H5hr404 and H5ts125 were determined at the nucleotide level. Single nucleotide alterations were detected in H5hr404 and H5ts125 in the sequences corresponding to the amino-terminal part and the carboxy-terminal part of DBP, respectively. The implications of these mutations are discussed.

Journal ArticleDOI
TL;DR: The characterized frameshift mutations in lacI described here are useful for constructing systems to detect other frameshifts and deletion mutations.


Journal ArticleDOI
TL;DR: Comparisons of many of the tryptic peptides of the two forms by means of high pressure liquid chromatography, as well as amino acid composition and sequence analysis, indicated that peptides from these forms, with one exception, are different.

Journal ArticleDOI
12 Nov 1981-Nature
TL;DR: DNA sequence analysis of a select 4 kbp of the gene has precisely described 14 exons which comprise one-third of the sequences encoding the triple-helical domain of the collagen protein.
Abstract: Fifty-four kilobase pairs (kbp) of cloned chicken DNA containing the entire 38-kbp pro alpha 2 (I) collagen gene have been isolated and characterized. DNA sequence analysis of a select 4 kbp of the gene has precisely described 14 exons which comprise one-third of the sequences encoding the triple-helical domain of the collagen protein. These exons range in size from 45 to 108 base pairs (bp), are all multiples of the 9 bp that code for the repeating triplet, Gly-X-Y, and have an average size of 70 bp. About 50 introns interrupt this gene. Nevertheless, introns do not separate the coding sequences for the ends of the central triple-helical structural domain and the ends of the propeptide domains.

Journal ArticleDOI
01 Jun 1981-Cell
TL;DR: The members of the foldback (FB) family of transposable elements in Drosophila are heterogeneous, with both the inverted terminal repeats and the total element sizes extremely variable; the ends of the inverted repeats represent closely conserved sequences, similar for all members.

Journal ArticleDOI
01 Nov 1981-Cell
TL;DR: It is shown that virtually all the rDNA in Dictyostelium exists in the linear palindromic form, and that the heterogeneity can be accounted for by variable tracts of an unprecedented irregular satellite sequence with the general formula [CnT]m.