[Frontiers in Bioscience 14, 3214-3221, January 1, 2009] |
|
|
Microbial diversity in the human intestine and novel insights from metagenomics Marco Ventura1, Francesca Turroni1, Carlos Canchaya1, Elaine E. Vaughan2, Paul W. O'Toole3, Douwe van Sinderen3
1 TABLE OF CONTENTS
1. ABSTRACT Bacterial communities reside in very different ecological niches on and within the human host, such as those associated with the alimentary tract. The human gastrointestinal tract is populated with as many as 100 trillion bacterial cells, whose collective genome likely reflects the co-evolution between the microbial community and its host. Recent progress has highlighted the intriguing diversity of these bacterial populations and their important contributions to human physiology. Thus, a thorough understanding of the autochthonous component of the intestinal microbiota is expected to provide crucial information not only on how to develop therapies for various gastrointestinal diseases but also on how to choose the next generation of probiotic bacteria as part of novel functional foods. Recently, novel culture-independent approaches such as metagenomics-based techniques were shown to be crucially important for the exploration of the biodiversity of the human intestinal microbiota.
2. INTRODUCTION Mammals, including humans, have, throughout their evolution, been associated with complex microbial communities, which inhabit the surfaces and alimentary tract of their host and which outnumber the human somatic and germ cells by a factor of ten (1). The distal part of the gut, i.e. the intestine, represents the environment where microbial diversity is high while its abundance is at its theoretical maximum. The human intestinal microbiota is estimated to be composed of 1013 to 1014 microorganisms whose collective genome, the microbiome, contains at least 100-fold more genes than the complete human genome (2). This complex microbiota is considered to be critically important for various gut functions, such as host nutrition, regulation of epithelial development (3), regulation of host fat storage (4), stimulation of intestinal angiogenesis (5), inflammatory immune responses (6.) and pathogen resistance (7, 8). Deviations in the composition of the microbiota from the typical adult pattern as well as from the canonical infant pattern have been associated with a variety of conditions ranging from inflammatory bowel disease to allergy, obesity and atopic disease (9). Thus, our desire to understand intestinal microbial ecology is linked to our wish to manage and maintain human health. The composition of the adult intestinal microbiota has been intensively studied, by means of culture-based methodologies and, more recently, through culture-independent technologies based on the amplification and direct sequencing of small subunit ribosomal DNA sequences (10, 11, 12, 13). The intestinal microbiota have also been analysed by other, recently developed community profiling culture-independent techniques, including DNA-microarray, fluorescent in situ hybridization (FISH), and methods such as PCR coupled to denaturing gradient and temperature electrophoresis (PCR-DGGE and PCR-TGGE) (for review see 14). The microbial populations that reside in the distal human intestine are predicted to consist of more than 400 bacterial species and around 1,000 phylotypes, belonging to a limited number of broad taxonomic units (11, 13). Within these units, members of the anaerobic genera such as Bacteroides, Eubacterium, Bifidobacterium, Ruminococcus, Clostridium and Faecalibacterium have been found to constitute the majority of the adult human intestinal microbiota, representing a relatively constant bacterial community that remains relatively stable over time (11, 14). Conversely, the infant intestinal microbiota appears to be much more simple and variable in its composition, and less stable over time (15). In this review we will focus on the current knowledge regarding diversity and development of the human gut microbiota as determined by molecular 16S ribosomal RNA approaches, as well as on the various identified factors that govern its establishment. We further discuss metagenomics approaches as a powerful tool to explore the biodiversity of the intestinal microbiota. 3. MICROBIOTA DIVERSITY OF THE HUMAN GIT AS DETERMINED BY MOLECULAR METHODS BASED ON 16S rRNA The human gut ecosystem is very dynamic and consists of various bacterial populations that are either permanent gut residents, i.e. autochthonous components, or transient inhabitants, i.e. allochthonous members, introduced from the environment. The relationship between bacteria and the human host can be viewed as a continuum between symbiosis, commensalism and pathogenesis, where the former two relationships can be considered as mutualistic. The human intestine is assumed to be colonized by a vast array of symbionts and commensals that provide various metabolic capabilities, which humans consequently did not need to develop themselves (16). The genetic diversity found in the human intestine is believed to be the result of intensive selection and co-evolution, where Horizontal Gene Transfer (HGT) has been suggested to act as the principal evolutionary force in shaping the human microbiome (15). The human intestinal microbiota is characterized by a relatively small number of bacterial and archaeal divisions, compared to other environments such as soil or stromatolite (17). Culture-independent approaches based on 16S rRNA gene sequences revealed that the intestinal human microbiota typically comprises representatives of only nine major bacterial divisions, i.e., Bacteroidetes, Firmicutes, Fusobacteria, Proteobacteria, Verrucomicrobia, Cyanobacteria, Spirochaeates, VadinBE97 and Actinobacteria (13, 15). Notably, proportions and compositions of Bacteroidetes and bifidobacteria were consistently stable within individuals whereas the Clostridium group (e.g. Clostridium coccoides and Clostridium leptum) showed large inter-individual variation (18). Thus, the microbial composition of such ecosystems is relatively simple compared to other habitats such as the soil environment, which contains at least 20 bacterial divisions (17). With respect to the phylogenetic architecture of the human intestinal microbiota, it has been suggested that this is derived from the diversification of a discrete, and therefore restricted initial microbial community. Notably, the limited diversification of the intestinal microbiota may reflect the relatively short time of existence of the human intestine as a habitat, i.e. 100 million years for mammals, as opposed to over 3.85 billion years for the ocean (19). 4. FACTORS INFLUENCING THE COMPOSITION OF THE GASTRO INTESTINAL MICROBIOTA The microbiota of the adult GIT is composed of microorganisms that represent superior competitors for a fixed set of niches (15). This interpretation prompted an intensive search for factors that define a microbial intestinal niche such as the genotype of the host and the diet. However, human microbial communities are also influenced by other, less predictable factors such as interactions between individual resident members of the microbiota, as well as transient members such as pathogens and probiotics. The genotype of the host is thought to partly explain variation of the GIT microbiota between individuals through the differential action of the genes that specify and control the immune system (20). For example, the immune system of the host influences the microbiota composition by restricting microbial penetration in the host tissues (21, 22). Furthermore, the genotype of the host modulates the intestinal microbiota through the availability of specific attachment sites and host-derived resources. In the case of Bacteroides thetaiotaomicron, which is considered the model commensal enteric microorganism, it has been shown that this bacterium regulates the production of fucosylated glycans by the host epithelium in order to respond to the bacterial utilization of fucose as a growth substrate (1). Although such interactions have been studied in murine models, it is plausible that they also occur in humans (1). If this is indeed the case, they may be influenced by human polymorphisms in fucosyltransferase enzymes, which are responsible for different blood types (23). The effect of genotype on the composition of the human microbiota was demonstrated in a gut ecological study involving monozygotic twins (24). According to this report, there were greater similarities between the gut microbiota of monozygotic twins than between monozygotic twins and their unrelated marital partners. The monozygotic twins still exhibited small differences in their intestinal microbiota, suggesting that although host genotype does appear to influence the intestinal microbiota, genetically identical subjects living in very similar environments can still exhibit significant differences in their intestinal microbiota (24). The importance of the initial colonizing microbial community on the eventual microbial composition of the gut is evident from animal studies. The composition of the mouse gut microbiota is significantly affected by maternal transmission (25). Similarly, investigations in which the microbiota was transferred from one host to another, from normal mice or zebrafish to axenic mice or zebrafish, showed that the microbial community which establishes itself in the gut at the time of birth, together with the features of the gut habitat, are determining factors for the microbiota composition (26). Diet is also a pivotal variable in influencing the composition of the intestinal microbiota. Studies on chemically well defined diet components have proven a clear correlation between diet and the presence of specific bacterial groups. It has been shown that a diet rich in inulin and related fibers promote an increase in bifidobacteria (27), whereas the intake of dietary sulphate favours several genera of sulphate-reducing bacteria over methanogenic Archaea (28). Diet affects not only the microbiota composition, but more significantly the metabolic activities of the microorganisms (29,,30). These are also influenced by factors relating to the gut environment, i.e. local conditions of pH, oxygen and hydrogen, metabolite concentration and gut transit time (31). However, representative strains of several bacterial genera were shown to be generalists with comparable growth abilities on overlapping ranges of substrates (27, 28). In addition, it should be noted that mucin and other host-derived compounds are on their own already sufficient to support a diverse microbial community (32, 33). Bacterial interactions together with resource competition are considered crucial determinants of intestinal niches (34). Bacterial co-operations in the GIT happen during the breakdown of complex carbohydrates, which originate from the diet or host (35). Finally, bacterial diversity in the human GIT might be influenced by the action of antimicrobial compounds such as bacteriocins or bacteriophages, which can promote complex dynamics due to evolutionary trade-offs between resistance and sensitivity (36), and which can also provide protection against bacterial infections (8). Recently, a comparative analysis was performed between healthy individuals and patients with Crohn's disease (CD) with respect to their gut mucosa-associated bacteriophage community. Based on the results, it was postulated that bacteriophages play a key role in the dysregulated immune response of IBD patients to the mucosal-associated bacterial population (38). 5. DEVELOPMENT OF THE HUMAN INTESTINAL MICROBIOTA Following birth every human (baby) develops a complex and active intestinal ecosystem starting from an assumed sterile environment. In just a matter of a couple of days the GIT of a newborn becomes home to a large bacterial community, whose total number quickly exceeds the total number of cells of its host (15). However, there are several controversial reports regarding the composition of the infant GIT microbiota and about the factors that influence it (15, 38). PCR based techniques such as PCR-DGGE revealed that species most closely related to the genera Bifidobacterium and Ruminococcus dominate the intestinal microbiota of infants during the first year of life (39). The GIT microbiota of infants delivered by caesarean section has been described to differ significantly from that of infants delivered vaginally (15, 38), which would be consistent with the finding that the infant GIT microbiota harbours elements of the maternal vaginal microbiota (40). Furthermore, the gestational outcome, for example hospitalized preterm infants versus healthy full-term babies, dramatically influences the composition of the developing GIT microbiota (41, 42). This could be influenced by the immune system maturity and age of priming. Considering the importance of the symbiosis between intestinal microbes and the human host, it is plausible that the development of the microbiota in the infant intestine would have evolved under strong selective pressure, acting on both the intestinal niche as well as on its bacterial colonizers. So a restricted number of co-evolved commensal bacteria may be exceptionally well adapted to occupy the intestinal ecological niche of the infant intestine. It has been found that the GIT microbiota in the first month of life is highly variable between individuals. The intestinal microbiota that first develops immediately following delivery is to a significant extent influenced by specific bacteria to which the infant is exposed in its environment. Such environmental exposures are likely to include elements of the maternal vaginal, fecal, and skin microbiota (15). Over time, however, the ecological fitness advantages of specific bacterial groups that typically dominate the adult colonic microbiota lead to a displacement or shift in numbers of the initial bacterial colonizers. In addition, progressive changes in the GIT environment, due to intrinsic developmental modifications of the GIT mucosa as well as transition to a more "adult" diet, combined with effects elicited by the microbiota themselves may provide the selection for the most highly adapted bacteria (43). 6. MICROBIAL DIVERSITY ASSESSED BY COMPOSITIONAL METAGENOMICS Over the last decade, many investigations have focused on culture-independent approaches to evaluate the complexity of the intestinal microbiota. Among these novel investigative methods that are used to study the microbial ecology of complex bacterial communities, the so-called metagenomics approach is considered to represent the method that produces results of 'gold standard' quality. Metagenomics is the study of microbial communities through sequence-based, compositional and/or functional analyses of all the combined microbial genomes contained within an environmental sample (Figure 1). With respect to its application to human intestinal microbiota, metagenomic studies of mucosal as well as faecal samples revealed the presence of representatives of the phylogenetic groups Firmicutes, Bacteroidetes, Proteobacteria, Fusobacteria, Verrucomicrobia and Actinobacteria (11, 12). However, a large portion of the phylotypes detected belong to the Bacteroidetes and Firmicutes phyla, while only one of the detected species belonging to the methanogenic archaeon, Methanobrevibacterium smithii (11, 12, 44, 45). Notably, with respect to phylotypic variation among the microbiota, it was found that only the Firmicutes phylum was comprised of a large number of phylogenetic types (e.g., the clostridial class). Also, an elevated number of phylotypes are related to butyrate-producing bacteria, all of which fall within the clostridial clusters IV, XIVa and XVI of Collins et al. (46). The other most common bacterial phylum detected in human gut samples is the Bacteroidetes with a small number of representative species, frequently including members of the Bacteroides fragilis subgroup, Bacteroides uniformis, Bacteroides thetaiotaomicron and Bacteroides distasonis (12). Recently, a compositional metagenomic study involving healthy individuals as well as with Crohn's disease revealed a reduced complexity of the bacterial phylum Firmicutes as well as over-representation of species such as uncultured Porphyromonadaceae as a signature of the faecal microbiota in patients with CD (47). The bacterial phyla appear to be unevenly distributed along the length of GIT. Metagenomic analyses of jejunum samples revealed a distinctive community composition which included an abundance of bacteria belonging to the Streptococcus genus, whereas Bacteroidetes and Clostridium genera were the predominant microbial groups identified in the distinctive composition of the microbiota in the distal ileum, ascending colon and rectum (12). Notably, bacterial diversity within the human intestine and faeces is very high while also displaying a significant inter-subject variability (11). It has been suggested that the faecal microbiota represents a combination of shed mucosal bacteria (autochthonous components of the intestinal microbiota) and a separate non-adherent luminal population (allochthonous components of the intestinal microbiota) (48-49). 6.1. Advantages and disadvantages of 16S rRNA- and metagenomic clone library-based diversity approaches The scientific rigor of metagenomic technology in defining the bacterial composition of complex bacterial communities is widely accepted (14) (Table 1). This method clearly emphasizes the impact of genomics on microbial ecology (50) and combines the use of molecular biology and genetics to identify and characterise genetic material from complex environments (51). In contrast to culture-dependent approaches, metagenomic investigations can overcome many technical limitations and problems that are linked to the handling of exigent bacteria such as those from intestinal environments. Moreover the combination of PCR of 16S rRNA gene and direct sequencing has expanded our knowledge of microbial diversity; it has been shown that less than 1% of the microbial communities in soil are accessible through culture-based techniques, and it is likely that this situation is very similar for many other microbial niches including the GIT. In addition, the proportion of non-culturable bacteria appears to increase with the age of the human subject, because the number of novel phylotypes was strikingly high when the microbiota of elderly subjects was analyzed by 16S clone library sequencing and T-RFLP (52). This highlights the importance of a culture-independent method for assessing the microbial GIT community structure in these individuals. Metagenomic analysis has significant advantages. For example it allows a high throughput approach to process samples, which contrasts significantly with the time-consuming and labour-intensive classical culturing techniques. Moreover, metagenomic studies do not require immediate processing, as samples can be frozen and subjected to analysis later, while DNA can easily be transported between laboratories (51). However, the metagenomic approach is not free from any drawbacks. Indeed, there are currently few studies on diversity from metagenomic clone libraries but it is expected that as high through put sequencing becomes cheaper, metagenomics as a routine quantitative method for diversity analysis will be within the reach of most laboratories. It is rather complicated with respect to the standardization of extraction of genetic material to cover every member of a given bacterial community. In fact, the results achieved by metagenomics studies may be significantly influenced by the efficacy of the protocols used for directly extracting DNA from environmental samples (e.g., fecal or human tissues) as well as by the specificity of PCR primers and associated PCR conditions (53). Also for Proteobacteria, Fusobacteria and Verrucomicrobia phyla some contradicting results are described (11, 12). A relatively small number of sequences corresponding to the above mentioned taxa were found in one particular metagenomic study (11), in contrast to another investigation of human intestinal biopsies which revealed a large number and proportion of these phylotypes. In addition to the bacterial phyla describe above there may be other groups (e.g. Actinobacteria) for which some methodological problems may be present. For instance, very often bifidobacteria have not detected (10, 14) or have been underestimated (11) by metagenomic approaches, which may be related to low amplification efficiencies due to high G+C content. Furthermore, even if metagenomic investigations of human intestinal microbiota generate very valuable data that outline the microbial diversity of this niche, they should nevertheless be interpreted with caution not only because of technical limitations outlined above, but also because the large majority of published metagenomic data are based on faecal samples rather than intestinal biopsies. Thus, the identified phylotypes do not necessarily represent the actual diversity of the intestinal microbiota (54). In fact, the faecal microbial community (or at least the microbial DNA present in faeces) represents all bacterial species living in the GIT and so metagenomic analysis of faecal samples seems to overestimate the diversity of the human gut microbiota, a notion which was corroborated by the finding that the microbiota of the proximal regions of the human GIT is different from that present in other parts of the intestine (55). Despite this, faecal samples are often chosen to investigate the microbial composition of the intestinal microflora because from a practical, clinical and ethical perspective they are easy to collect. A further limitation of the available metagenomic data is that the composition of the predominant bacterial community is host-specific (24) and that the number of genes within a given community is so large that the number of analyzed fragments derived from a single sample must be enormous to obtain a statistically relevant representation of the complete microbiome (56). Furthermore, not all elements of a library, even if such a library were to be complete, are sequenced and bacterial species that are only minor components of the microbiota may appear to be absent. Therefore, considerable caution must be applied in the design, analysis and interpretation of the results obtained by metagenomic studies (13). Finally, there are sometimes difficulties in cloning DNA from Gram positive bacteria into the commonly used laboratory Gram negative bacterial hosts. 7. CONCLUSIONS Despite the presumed importance played by the GIT microbiota in maintaining a healthy human host, large knowledge gaps still exist with respect to the microbial composition, its development and strain diversity, as well as their associated metabolic effects. In fact, diversity (at the 16S rRNA level) is currently a biomarker for functional activity of specific microbes and ultimately for health effect. Recently, culture-independent studies, in particular metagenomic investigations, have significantly expanded our knowledge of the human intestinal microbiota. It has been shown that the composition of gut communities is variable over time with certain bacterial components acting as superior competitors for all available niches. Moreover, functional analyses of the intestinal microbiota, through sequence exploration of metagenomic libraries or by means of micro arrays that were based on key intestinal commensal bacteria, have provided in depth information as to how these bacteria function within their host and how they interact with that host and with each other. Novel sequencing technologies (e.g. pyrosequencing techniques) coupled with development of bioinformatic means will provide more in depth analysis of the biodiversity existing in the intestinal microbiota. Moreover, the expansion of high through put functional genomic tools, e.g. micro array technologies will provide the necessary postgenomic platforms to investigate the functionality of human bacterial genomes in the human gut. Moreover, other metagenomics approaches such as the total microbiota genome sequencing might be important in order to identify new phylotypes with no previous cultured relatives. The analysis of such genome sequences together with functional genomics approaches will be crucial in order to highlight sequence redundancy and thus may provide very valuable information on both, strategies for survival under GIT conditions and processes behind the mutualistic relationship established between certain microorganisms and the host. In fact, symbionts may dedicate part of their genomes to functions that are beneficial for both bacterium and host. Thus, the identification of such functions will aid in the development of novel probiotics. Novel "omics" technologies such as transcriptomics, proteomics and metabonomics, will open new avenues of research in the study of the functionality of the human gut microbiota and may explain the role played by certain microbiota's molecules (e.g., metabolites) in the developing of diseaseor in the establishment of health. Understanding the composition of the intestinal microbiota represents a crucial step towards a knowledge-based design of functional foods or for the selection of novel probiotic bacteria. For example, it is important that any newly developed probiotic bacterium must be genetically adapted to survive in the intestinal ecological niche. Such requirements will provide these bacteria with genetically determined capabilities to interact with both the human host as well as with the other indigenous components of the intestinal microflora, thus allowing them to exert their health-promoting effect(s) (e.g., probiotic role). Moreover, probiotic bacteria genetically adapted to live in the human intestinal niche will likely encode various "biological defences" to counteract the establishment of potentially dangerous microorganisms. Greater depth of knowledge on the autochthonous component of the intestinal microbiota will provide the possibility to screen or develop important characteristics within novel probiotic microbes, including the capacity to be highly stable in the human intestine and especially have reliable or enhanced probiotic effects. Furthermore as our understanding of the ecology and rich diversity of the human gut microbiota develops, members isolated from intestinal biopsies (autochthonous component of the intestinal microbiota), beside faecal isolates, will contribute to the next generation of probiotic microbes. Furthermore, the exploration of human gut microbiota will possibly lead in the long term to the identification of novel probiotics, that are different from the commonly used lactobacilli and bifidobacteria and that possess new health benefits. In addition, other criteria that should be taken into account when selecting novel probiotics are the differentiation of strains targeted for different host groups such as adult versus children or elderly people, and those that are compatible with the genetic background of the targeted host (57). Greater knowledge on evolution of species and strain diversity in the intestine can thus open up new possibilities that are expected to lead to the development of a next generation of designer probiotics with clearly described and scientifically supported health benefits. 8. ACKNOWLEDGMENTS This work was financially supported by Science Foundation Ireland through the Alimentary Pharmabiotic Centre (POT, DvS), by the Dept. Agriculture and Food FHRI award to the ELDERMET consortium (POT, DvS), by the Italian Award for Outstanding Young Researcher scheme "Incentivazione alla mobilita' di studiosi stranieri e italiani residenti all'estero" to the Marie Curie Reintegration Grant (MERG-CT-2005-03080) and by Parmalat spa, Italy to MV. 9. RFERENCE 1. L. V. Hooper, J. Xu, P. G. Falk, T. Midtvedt and J. I. Gordon: A molecular sensor that allows a gut commensal to control its nutrient foundation in a competitive ecosystem. Proc Natl Acad Sci 96, 9833-9838 (1999) Key Words: Intestinal microbiota, metagenomics, novel probiotic, probiotic bacteria Send correspondence to: Marco Ventura, Department of Genetics, Biology of Microorganisms, Anthropology and Evolution University of Parma, Italy, Parco Area delle Scienze 11a, 43100 Parma, Italy, Tel: 339-521-905666, Fax: 339-521-905604, E-mail:marco.ventura@unipr.it |