Libraries of tens of thousands of transposon mutants generated from each of four human gut Bacteroides strains, two representing the same species, were introduced simultaneously into gnotobiotic mice together with 11 other wild-type strains to generate a 15-member artificial human gut microbiota. Mice received one of two distinct diets monotonously, or both in different ordered sequences. Quantifying the abundance of mutants in different diet contexts allowed gene-level characterization of fitness determinants, niche, stability, and resilience and yielded a prebiotic (arabinoxylan) that allowed targeted manipulation of the community. The approach described is generalizable and should be useful both for defining mechanisms critical for sustaining, and for developing approaches to deliberately reconfigure, the highly adaptive and durable relationship between the human gut microbiota and host in ways that promote wellness.
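The idea of reading gene-level fitness from mutant abundance can be illustrated with a minimal sketch. It assumes that fitness is summarized as the log2 ratio of a gene's relative mutant abundance after selection to its relative abundance in the input library; this is a common convention, not necessarily the scoring used in the study. The function name `log2_fitness`, the pseudocount, and all read counts below are hypothetical.

```python
import math

def log2_fitness(input_counts, output_counts, pseudocount=1.0):
    """Per-gene log2 ratio of output to input relative abundance.

    Both arguments map a gene name to the summed read count of its
    transposon mutants. A pseudocount avoids division by zero when a
    mutant drops out entirely. (Illustrative scoring, an assumption.)
    """
    genes = input_counts.keys() & output_counts.keys()
    n = len(genes)
    in_total = sum(input_counts[g] for g in genes) + pseudocount * n
    out_total = sum(output_counts[g] for g in genes) + pseudocount * n
    scores = {}
    for g in genes:
        in_frac = (input_counts[g] + pseudocount) / in_total
        out_frac = (output_counts[g] + pseudocount) / out_total
        scores[g] = math.log2(out_frac / in_frac)
    return scores

# Hypothetical read counts: input library vs. community sampled on one diet.
before = {"geneA": 1000, "geneB": 1000, "geneC": 1000}
after_diet1 = {"geneA": 1000, "geneB": 100, "geneC": 1900}

scores = log2_fitness(before, after_diet1)
for gene in sorted(scores):
    print(gene, round(scores[gene], 2))
```

Under this convention a negative score marks a gene whose disruption reduces fitness in that diet context, and comparing scores across diets would flag diet-specific fitness determinants.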
Members of the genus Xenorhabdus are entomopathogenic bacteria that associate with nematodes. The nematode-bacteria pair infects and kills insects, with both partners contributing to insect pathogenesis and the bacteria providing nutrition to the nematode from available insect-derived nutrients. The nematode provides the bacteria with protection from predators, access to nutrients, and a mechanism of dispersal. Members of the bacterial genus Photorhabdus also associate with nematodes to kill insects, and both genera of bacteria provide similar services to their different nematode hosts through unique physiological and metabolic mechanisms. We posited that these differences would be reflected in their respective genomes. To test this, we sequenced to completion the genomes of Xenorhabdus nematophila ATCC 19061 and Xenorhabdus bovienii SS-2004. As expected, both Xenorhabdus genomes encode many insecticidal compounds, commensurate with their entomopathogenic lifestyle. Despite the similarities in lifestyle between Xenorhabdus and Photorhabdus bacteria, a comparative analysis of the Xenorhabdus, Photorhabdus luminescens, and P. asymbiotica genomes suggests genomic divergence. These findings indicate that evolutionary changes shaped by symbiotic interactions can follow different routes to achieve similar end points.
Azotobacter vinelandii is a soil bacterium related to the Pseudomonas genus that fixes nitrogen under aerobic conditions while simultaneously protecting nitrogenase from oxygen damage. In response to carbon availability, this organism undergoes a simple differentiation process to form cysts that are resistant to drought and other physical and chemical agents. Here we report the complete genome sequence of A. vinelandii DJ, which has a single circular genome of 5,365,318 bp. In order to reconcile an obligate aerobic lifestyle with exquisitely oxygen-sensitive processes, A. vinelandii is specialized in terms of its complement of respiratory proteins. It is able to produce alginate, a polymer that further protects the organism from excess exogenous oxygen, and it has multiple duplications of alginate modification genes, which may alter alginate composition in response to oxygen availability. The genome analysis identified the chromosomal locations of the genes coding for the three known oxygen-sensitive nitrogenases, as well as genes coding for other oxygen-sensitive enzymes, such as carbon monoxide dehydrogenase and formate dehydrogenase. These findings offer new prospects for the wider application of A. vinelandii as a host for the production and characterization of oxygen-sensitive proteins.
The human gut is home to trillions of microbes, thousands of bacterial phylotypes, as well as hydrogen-consuming methanogenic archaea. Studies in gnotobiotic mice indicate that Methanobrevibacter smithii, the dominant archaeon in the human gut ecosystem, affects the specificity and efficiency of bacterial digestion of dietary polysaccharides, thereby influencing host calorie harvest and adiposity. Metagenomic studies of the gut microbial communities of genetically obese mice and their lean littermates have shown that the former contain an enhanced representation of genes involved in polysaccharide degradation, possess more archaea, and exhibit a greater capacity to promote adiposity when transplanted into germ-free recipients. These findings have led to the hypothesis that M. smithii may be a therapeutic target for reducing energy harvest in obese humans. To explore this possibility, we have sequenced its 1,853,160-bp genome and compared it to other human gut-associated M. smithii strains and other Archaea. We have also examined M. smithii's transcriptome and metabolome in gnotobiotic mice that do or do not harbor Bacteroides thetaiotaomicron, a prominent saccharolytic bacterial member of our gut microbiota. Our results indicate that M. smithii is well equipped to persist in the distal intestine through (i) production of surface glycans resembling those found in the gut mucosa, (ii) regulated expression of adhesin-like proteins, (iii) consumption of a variety of fermentation products produced by saccharolytic bacteria, and (iv) effective competition for nitrogenous nutrient pools. These findings provide a framework for designing strategies to change the representation and/or properties of M. smithii in the human gut microbiota.
The adult human intestine contains trillions of bacteria, representing hundreds of species and thousands of subspecies. Little is known about the selective pressures that have shaped and are shaping this community's component species, which are dominated by members of the Bacteroidetes and Firmicutes divisions. To examine how the intestinal environment affects microbial genome evolution, we have sequenced the genomes of two members of the normal distal human gut microbiota, Bacteroides vulgatus and Bacteroides distasonis, and by comparison with the few other sequenced gut and non-gut Bacteroidetes, analyzed their niche and habitat adaptations. The results show that lateral gene transfer, mobile elements, and gene amplification have played important roles in affecting the ability of gut-dwelling Bacteroidetes to vary their cell surface, sense their environment, and harvest nutrient resources present in the distal intestine. Our findings show that these processes have been a driving force in the adaptation of Bacteroidetes to the distal gut environment, and emphasize the importance of considering the evolution of humans from an additional perspective, namely the evolution of our microbiomes.
Human chromosome 2 is unique to the human lineage in being the product of a head-to-head fusion of two intermediate-sized ancestral chromosomes. Chromosome 4 has received attention primarily related to the search for the Huntington's disease gene, but also for genes associated with Wolf-Hirschhorn syndrome, polycystic kidney disease and a form of muscular dystrophy. Here we present approximately 237 million base pairs of sequence for chromosome 2, and 186 million base pairs for chromosome 4, representing more than 99.6% of their euchromatic sequences. Our initial analyses have identified 1,346 protein-coding genes and 1,239 pseudogenes on chromosome 2, and 796 protein-coding genes and 778 pseudogenes on chromosome 4. Extensive analyses confirm the underlying construction of the sequence, and expand our understanding of the structure and evolution of mammalian chromosomes, including gene deserts, segmental duplications and highly variant regions.
Salmonella enterica serovars often have a broad host range, and some cause both gastrointestinal and systemic disease. But the serovars Paratyphi A and Typhi are restricted to humans and cause only systemic disease. It has been estimated that Typhi arose in the last few thousand years. The sequence and microarray analysis of the Paratyphi A genome indicates that it is similar to the Typhi genome but suggests that it has a more recent evolutionary origin. Both genomes have independently accumulated many pseudogenes among their approximately 4,400 protein coding sequences: 173 in Paratyphi A and approximately 210 in Typhi. The recent convergence of these two similar genomes on a similar phenotype is subtly reflected in their genotypes: only 30 genes are degraded in both serovars. Nevertheless, these 30 genes include three known to be important in gastroenteritis, which does not occur in these serovars, and four for Salmonella-translocated effectors, which are normally secreted into host cells to subvert host functions. Loss of function also occurs by mutation in different genes in the same pathway (e.g., in chemotaxis and in the production of fimbriae).
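The pseudogene figures above can be turned into a short worked calculation showing how little the two sets of degraded genes overlap. The sketch uses only the counts quoted in the abstract; because the Typhi and total coding-sequence counts are given as approximate, every derived value is approximate too. The variable names are illustrative.

```python
# Counts quoted in the abstract (the Typhi and coding-sequence figures are approximate).
paratyphi_a_pseudogenes = 173
typhi_pseudogenes = 210
shared_pseudogenes = 30
coding_sequences = 4400

# Genes degraded in at least one serovar (inclusion-exclusion).
union = paratyphi_a_pseudogenes + typhi_pseudogenes - shared_pseudogenes

# Overlap as a fraction of the union (Jaccard index) and of each set.
jaccard = shared_pseudogenes / union
shared_in_paratyphi_a = shared_pseudogenes / paratyphi_a_pseudogenes
shared_in_typhi = shared_pseudogenes / typhi_pseudogenes

# Share of all coding sequences that are pseudogenes in each serovar.
paratyphi_a_rate = paratyphi_a_pseudogenes / coding_sequences
typhi_rate = typhi_pseudogenes / coding_sequences

print(f"union of pseudogenes: {union}")
print(f"Jaccard overlap: {jaccard:.3f}")
print(f"shared / Paratyphi A: {shared_in_paratyphi_a:.3f}")
print(f"shared / Typhi: {shared_in_typhi:.3f}")
print(f"pseudogene rate: {paratyphi_a_rate:.3f} vs {typhi_rate:.3f}")
```

With these numbers the shared genes make up roughly 8.5% of all genes degraded in either serovar, which is consistent with the statement that the two genomes accumulated most of their pseudogenes independently.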
Human chromosome 7 has historically received prominent attention in the human genetics community, primarily related to the search for the cystic fibrosis gene and the frequent cytogenetic changes associated with various forms of cancer. Here we present more than 153 million base pairs representing 99.4% of the euchromatic sequence of chromosome 7, the first metacentric chromosome completed so far. The sequence has excellent concordance with previously established physical and genetic maps, and it exhibits an unusual amount of segmentally duplicated sequence (8.2%), with marked differences between the two arms. Our initial analyses have identified 1,150 protein-coding genes, 605 of which have been confirmed by complementary DNA sequences, and an additional 941 pseudogenes. Of genes confirmed by transcript sequences, some are polymorphic for mutations that disrupt the reading frame.
Salmonella enterica subspecies I, serovar Typhimurium (S. typhimurium), is a leading cause of human gastroenteritis, and is used as a mouse model of human typhoid fever. The incidence of non-typhoid salmonellosis is increasing worldwide, causing millions of infections and many deaths in the human population each year. Here we sequenced the 4,857-kilobase (kb) chromosome and 94-kb virulence plasmid of S. typhimurium strain LT2. The distribution of close homologues of S. typhimurium LT2 genes in eight related enterobacteria was determined using previously completed genomes of three related bacteria, sample sequencing of both S. enterica serovar Paratyphi A (S. paratyphi A) and Klebsiella pneumoniae, and hybridization of three unsequenced genomes to a microarray of S. typhimurium LT2 genes. Lateral transfer of genes is frequent, with 11% of the S. typhimurium LT2 genes missing from S. enterica serovar Typhi (S. typhi), and 29% missing from Escherichia coli K12. The 352 gene homologues of S. typhimurium LT2 confined to subspecies I of S. enterica (which contains most mammalian and bird pathogens) are useful for studies of epidemiology, host specificity and pathogenesis. Most of these homologues were previously unknown, and 50 may be exported to the periplasm or outer membrane, rendering them accessible as therapeutic or vaccine targets.
The genome of the model plant Arabidopsis thaliana has been sequenced by an international collaboration, The Arabidopsis Genome Initiative. Here we report the complete sequence of chromosome 5. This chromosome is 26 megabases long; it is the second largest Arabidopsis chromosome and represents 21% of the sequenced regions of the genome. The sequences of chromosomes 2 and 4 have been reported previously, and those of chromosomes 1 and 3, together with an analysis of the complete genome sequence, are reported in this issue. Analysis of the sequence of chromosome 5 yields further insights into centromere structure and the sequence determinants of heterochromatin condensation. The 5,874 genes encoded on chromosome 5 reveal several new functions in plants, and the patterns of gene organization provide insights into the mechanisms and extent of genome evolution in plants.
Knowledge of the complete genomic DNA sequence of an organism allows a systematic approach to defining its genetic components. The genomic sequence provides access to the complete structures of all genes, including those without known function, their control elements, and, by inference, the proteins they encode, as well as all other biologically important sequences. Furthermore, the sequence is a rich and permanent source of information for the design of further biological studies of the organism and for the study of evolution through cross-species sequence comparison. The power of this approach has been amply demonstrated by the determination of the sequences of a number of microbial and model organisms. The next step is to obtain the complete sequence of the entire human genome. Here we report the sequence of the euchromatic part of human chromosome 22. The sequence obtained consists of 12 contiguous segments spanning 33.4 megabases, contains at least 545 genes and 134 pseudogenes, and provides the first view of the complex chromosomal landscapes that will be found in the rest of the genome.
The higher plant Arabidopsis thaliana (Arabidopsis) is an important model for identifying plant genes and determining their function. To assist biological investigations and to define chromosome structure, a coordinated effort to sequence the Arabidopsis genome was initiated in late 1996. Here we report one of the first milestones of this project, the sequence of chromosome 4. Analysis of 17.38 megabases of unique sequence, representing about 17% of the genome, reveals 3,744 protein-coding genes, 81 transfer RNAs and numerous repeat elements. Heterochromatic regions surrounding the putative centromere, which has not yet been completely sequenced, are characterized by an increased frequency of a variety of repeats, new repeats, reduced recombination, lowered gene density and lowered gene expression. Roughly 60% of the predicted protein-coding genes have been functionally characterized on the basis of their homology to known genes. Many genes encode predicted proteins that are homologous to human and Caenorhabditis elegans proteins.