The symbiont-associated (SA) environmental package is a new extension to the minimum information about any (x) sequence (MIxS) standards, established by the Parasite Microbiome Project (PMP) consortium, in collaboration with the Genomics Standard Consortium. The SA was built upon the host-associated MIxS standard, but reflects the nestedness of symbiont-associated microbiota within and across host-symbiont-microbe interactions. This package is designed to facilitate the collection and reporting of a broad range of metadata information that apply to symbionts such as life history traits, association with one or multiple host organisms, or the nature of host-symbiont interactions along the mutualism-parasitism continuum. To better reflect the inherent nestedness of all biological systems, we present a novel feature that allows users to co-localize samples, to nest a package within another package, and to identify replicates. Adoption of the MIxS-SA and of the new terms will facilitate reports of complex sampling design from a myriad of environments.
Interspecific interactions are ubiquitous across the Tree of Life. With the realization that eukaryotic organisms can harbor rich microbial communities, came also the view that these smaller partners may in fact play important roles in mediating host-symbiont associations, thus adding a further layer to this complex set of nested interactions, i.e. host-symbiont-microbe [1,2,3,4,5,6,7,8]. As the number of studies exploring the microorganisms associated with symbiotic organisms increases, likewise does the need for compliant standardized metadata that provides contextual information associated with each study and sample. Standardized metadata allows for the integration of data across organisms, resources, and within data repositories. Here, we present the symbiont-associated (SA) environmental package as a new extension to the minimum information about any (x) sequence (MIxS) standards , which will be included in MIxS version 6. Whilst the MIxS-SA expands upon the MIxS host-associated environmental package , it reflects the need for a new standard that takes into account the distinct life history traits of symbionts, their association with one or multiple host organisms, the complex nature of host-symbiont interactions along the mutualism-parasitism continuum, and the nestedness of symbiont-associated microbiota. We also propose adding the term ‘relationship to other packages’ to all environmental packages across the domains of life, to allow for integrated analysis of symbiont and host microbiota by linking metadata elements across environmental packages. This will allow users to nest a package within another package, and to identify replicates. This added feature is pivotal for the study of the microbiome of symbionts that are themselves nested within a host, reflects the inherent nestedness of all ecosystems and will facilitate reports of complex sampling design from a myriad of environments.
Collecting relevant metadata (data describing data) is now widely recognized as critical to contextualize samples and increase their reusability and reproducibility [9,10,11]. The Genomic Standards Consortium (GSC, https://gensc.org) has developed and maintains a suite of minimal information metadata standards for describing sequence metadata (checklists) for genome (MIGS), metagenome (MIMS), marker gene sequences (MIMARKS), simple amplified genome (MISAG), metagenome-assembled genome (MIMAG), virus genomes (MIUViG) and environmental packages for describing habitat-specific contextual data of the sampling environment [9, 10, 12, 13], collectively referred to as the Minimum Information about any (x) Sequence (MIxS) standard (ref. , https://gensc.org/mixs/).
The MIxS standards are used broadly across the microbiome research communities. These standards have been integrated into large scale microbiome projects (e.g. Human Microbiome Project, https://www.hmpdacc.org/), Earth Microbiome Project (https://earthmicrobiome.org/), Microbiology of the Built Environment (MoBE, https://www.microbe.net), microbiome bioinformatics platforms (e.g., QIIME, Qiita, mothur, JGI GOLD, MG-RAST, EBI, NCBI) and are now required upon manuscript submission. A primary advantage of the MIxS standards is the collation of large aggregates of associated metadata that can be harnessed to uncover, and eventually comprehend, patterns of microbial diversity and ecology.
The MIxS-SA package was initially drafted during the 1st Parasite Microbiome project workshop that involved the contribution of members of the GSC in addition to microbial ecologists, parasitologists, pathologists and marine biologists . Participants rapidly identified the need to incorporate information on the nestedness of symbiont-associated systems, and the absence within the MIxS host-associated package of descriptors of complex life histories of mutualistic and parasitic symbionts. Until now, researchers have either omitted this information or added research-specific symbiont-associated annotations, limiting significantly the potential to compare, combine and/or reuse data from different systems and studies. Whereas the MIxS-SA package was initially designed for the study of parasite-microbes interaction, the scope of the package was expanded to include non-parasitic symbionts. This addition is a necessary expansion due to the context-dependent nature of symbiotic interactions and the ability of a given symbiont to interact differently with different organisms. Notably, the resulting MIxS-SA package reduces the need to develop additional highly similar packages for different types of symbionts.
Symbiotic associations are generally classified as mutualistic (mutually beneficial association), commensal (beneficial association to one of the partners, but not harmful to the other), or parasitic (detrimental association to one of the partners) . In the context of the symbiont-associated package, the term symbiont applies to macro and microorganisms that can establish a physical interaction with at least one other organism at some stage of their life cycle regardless of the nature and dependence of the interaction. As such, this definition also covers symbiotic organisms that establish facultative and accidental associations (e.g., dead-end hosts), not requiring evolutionary processes to explain their association, but excludes free-living organisms that establish a symbiotic relationship with another free-living organism (e.g., flowers and bees). The MIxS-SA package presented herein has gone through an open and iterative review process engaging the GSC community and experts studying symbiotic organisms across various symbiont and host taxa.
Here, we present the selected list of metadata descriptors for symbiont-associated microbiota studies, including a subset of mandatory (M) terms that underpin metadata compliance (Table 1; Supplementary Information SI-1 contains all MIxS-SA items). In order to allow comparative studies of the microbiota of, sometimes closely related, free-living and symbiotic organisms, the MIxS-SA includes terms already found in the MIxS host-associated package. Thus, in MixS-SA, the term “host” (when used alone) refers to the host of the biological sample which is the symbiotic organism. New terms were created to characterise the “host of the symbiotic host”. We provide symbiont-associated package specific “Expected values” and “Examples”. Changes to the package (addition of terms, modification etc.) can be proposed by the community by creating a ticket on the MIxS GitHub page (https://github.com/GenomicsStandardsConsortium/mixs).
Given the diversity of symbiotic interactions and that the nature and dependence of such interactions can be context-dependent rather than a fixed trait, it was crucial to define terms and provide value syntax that were inclusive for diverse types of symbioses and also across the symbiont life histories and transmission processes. For example, the term “host dependence” (a mandatory item) and “type of symbiosis” (a conditional item) are discrete but complementary items. While “host dependence” aims to provide a general characterization of the known type of host dependence for the symbiotic organism (e.g., facultative), “type of symbiosis” was specifically designed to further characterize the type of biological interaction established between the symbiotic organism and its respective host at the moment the biological sample was taken (e.g., mutualistic). As a result, the MIxS-SA package features mandatory and conditionally mandatory, and optional features that enable flexibility according to the knowledge of the study system at the time of sampling. Two examples of MIxS-SA-compliant metadata are provided in Supplementary information (SI-2), and the respective study designs are presented in Fig. 1. The examples refer to 16 S rRNA gene studies of (a) the bacterial communities of the parasite Coitocaecum parvum, a trematode, across four of its life stages: the sporocyst, the metacercaria and the adult, as well as the free-living cercaria , and (b) of the leaves and roots of the parasitic plant Orobanche hederae and its ivy host, Hedera spp. .
While identical terms are often used in several of the 17 environmental packages currently available (https://gensc.org/mixs/), here we introduce three additional new terms: one is shared by several relevant MIxS environmental packages, and the two others will feature within the core MIxS package. The new term “observed host symbionts” provides a more comprehensive descriptor for the subject organism associations with smaller symbionts and it has been added to the host-associated, human-associated, plant-associated, human-vaginal, human-skin, human-oral and human-gut packages. The term “biotic relationship” has been added to the core package as a conditional descriptor of the relationship between the subject organism and other larger host organism(s). Finally, it appears necessary to include in the MIxS core a new term that takes into account the nested feature of most associations found in nature, such as host-symbiont-microorganism, in which multiple packages are necessary to describe the samples of the study (e.g., water, sediment, host-associated, and symbiont-associated). The proposed term “relationship to other samples” indicates the direct relationship between two samples from the same Bioproject, that are described in different environmental package(s). This proposed feature, still under development, will allow for integrated analyses of the microbiota of symbiotic organisms and their direct environment, even in the context of co-infections (e.g., symbiont-associated SA1234 is “within” host-associated sample HA8974, “next to” symbiont-associated sample SA7890). This feature will also benefit other studies by providing ecologically-relevant contextual information (e.g., host-associated HA2567 is “within” environmental water sample W1234, “next to” host-associated sample HA5679, ‘next’ to environmental soil sample S5897). In conclusion, it is our hope that the MIxS-SA, together with the new terms, will enable researchers to better conduct integrated analyses of multi-level biological systems with the ultimate goal of better understanding the role of microbes associated with symbionts.
Dheilly NM, Poulin R, Thomas F. Biological warfare: microorganisms as drivers of host-parasite interactions. Infect Genet Evol. 2015;34:251–9.
Dheilly NM, Ewald PW, Brindley PJ, Fichorova RN, Thomas F. Parasite-microbe-host interactions and cancer risk. PLoS Pathog. 2019;15:e1007912.
Bass D, Stentiford GD, Wang HC, Koskella B, Tyler CR. The pathobiome in animal and plant diseases. Trends Ecol Evol. 2019;34:996–1008.
Brinker P, Fontaine MC, Beukeboom LW, Salles JF. Host, symbionts, and the microbiome: the missing tripartite interaction. Trends Microbiol. 2019;27:480–8.
Barrow P, Dujardin JC, Fasel N, Greenwood AD, Osterrieder K, Lomonossoff G, et al. Viruses of protozoan parasites and viral therapy: Is the time now right? Virol J. 2020;17:142.
Husnik F, Tashyreva D, Boscaro V, George EE, Lukeš J, Keeling PJ. Bacterial and archaeal symbioses with protists. Curr. Biol. 2021;31:R862–R877.
Hahn MA, Dheilly NM. Experimental models to study the role of microbes in host-parasite interactions. Front. Microbiol. 2016;7:1300.
Robinson AJ, House GL, Morales DP, Kelliher JM, Gallegos-Graves LV, LeBrun ES, et al. Widespread bacterial diversity within the bacteriome of fungi. Commun Biol. 2021;4:1168.
Yilmaz P, Kottmann R, Field D, Knight R, Cole JR, Amaral-Zettler L, et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotechnol. 2011;29:415–20.
Bowers R, Kyrpides NC, Stepanauskas R, Harmon-Smith M, Doud D, Reddy TBK, et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol. 2017;35:725–31.
Schriml LM, Chuvochina M, Davies N, Eloe-Fadrosh EA, Finn RD, Hugenholtz P, et al. COVID-19 pandemic reveals the peril of ignoring metadata standards. Sci Data. 2020;7:188.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7.
Roux S, Adriaenssens EM, Dutilh BE, Koonin EV, Kropinski AM, Krupovic M, et al. Minimum Information about an Uncultivated Virus Genome (MIUViG). Nat Biotechnol. 2019;37:29–37.
Dheilly NM, Martínez Martínez J, Rosario K, Brindley PJ, Fichorova RN, Kaye JZ, et al. Parasite microbiome project: grand challenges. PLoS Pathog. 2019;15:e1008028.
Leung TLF, Poulin R. Parasitism, commensalism, and mutualism: exploring the many shades of symbioses. Vie Milieu. 2008;58:107–15.
Jorge F, Dheilly NM, Poulin R. Persistence of a core microbiome through the ontogeny of a multi-host parasite. Front Microbiol. 2020;11:954.
Fitzpatrick CR, Schneider AC. Unique bacterial assembly, composition, and interactions in a parasitic plant and its host. J Exp Bot. 2020;71:2198–209.
This contribution is part of the Parasite Microbiome Project. The authors sincerely thank the Gordon and Betty Moore Foundation for fully sponsoring the 1st Parasite Microbiome project workshop, and for supporting Open Access publishing. FJ was funded by a grant from the Marsden Fund (Royal Society of New Zealand) to RP (Principal Investigator) and NMD (Associate Investigator). The National Institutes of Health grants RO1CA164719 (PJB), and R01AI144016-01(LJK). VH was supported by the FEDER grant InFoBioS n°EX011185 (région Val de Loire). JMM acknowledges support from award OCE-1933285, National Science Foundation, USA. ML is part of the PARADISE consortium, supported by funding from the European Union’s Horizon 2020 Research and Innovation programme under grant agreement No 773830: One Health European Joint Programme. JL was supported by the ERD funds (16_019/0000759).
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Jorge, F., Brealey, J.C., Brindley, P.J. et al. MIxS-SA: a MIxS extension defining the minimum information standard for sequence data from symbiont-associated micro-organisms. ISME COMMUN. 2, 9 (2022). https://doi.org/10.1038/s43705-022-00092-w