Development of new genetic resources for faba bean (Vicia faba L.) breeding through the discovery of gene-based SNP markers and the construction of a high-density consensus map.

Faba bean (Vicia faba L.) is a pulse crop of high nutritional value and high importance for sustainable agriculture and soil protection. With the objective of identifying gene-based SNPs, transcriptome sequencing was performed in order to reduce faba bean genome complexity. A set of 1,819 gene-based SNP markers polymorphic in three recombinant line populations was selected to enable the construction of a high-density consensus genetic map encompassing 1,728 markers well distributed in six linkage groups and spanning 1,547.71 cM with an average inter-marker distance of 0.89 cM. Orthology-based comparison of the faba bean consensus map with legume genome assemblies highlighted synteny patterns that partly reflected the phylogenetic relationships among species. Solid blocks of macrosynteny were observed between faba bean and the most closely-related sequenced legume species such as pea, barrel medic or chickpea. Numerous blocks could also be identified in more divergent species such as common bean or cowpea. The genetic tools developed in this work can be used in association mapping, genetic diversity, linkage disequilibrium or comparative genomics and provide a backbone for map-based cloning. This will make the identification of candidate genes of interest more efficient and will accelerate marker-assisted selection (MAS) and genomic-assisted breeding (GAB) in faba bean.

rate and abortion of ovules 2 , the need for pollinators for outcrossing and fertilization of ovules 8 and the strong influence of symbiosis for optimal concentration of nitrogen (N) in the grain and for N soil fertility 9 .
Faba bean is a diploid outcrossing species (2n = 12) with a "giant genome" 10 of approximately 13 Gb distributed on six chromosomes. The high content in transposable elements 11 complexifies the faba bean genome assembly and map-based cloning. Most of the linkage maps generated so far have a low to medium saturation and are based on morphological, isozyme, restriction fragment length polymorphism (RFLP), random amplified polymorphic DNA (RAPD), sequence characterized amplified region (SCAR), intron targeted amplified polymorphism (ITAP), simple sequence repeat (SSR) and low-density single-nucleotide polymorphism (SNP) markers [12][13][14][15][16][17][18][19][20][21] . SNP-based genetic maps have been recently developed 21 . To date, the most saturated map is the one reported by Webb et al. 21 consisting of 687 SNPs. This map provided a first glimpse of synteny of faba bean with other legumes such as barrel medic (Medicago truncatula L.; Mt), lupine (Lupinus albus L.), soybean (Glycine max (L.) M.; Gm) or lentil (Lens culinaris M.) 16,21 . In fact, the SNPs used to construct the consensus map of Webb et al. 21 were designed based on orthologous sequences in M. truncatula with the objective of physically anchoring the faba bean consensus map to the Medicago genome. Webb et al. 21 also took advantage of the macrosynteny between these two species to discern the level of conservation of the genetic organization between faba bean and lentil, one of its most closely related crop species. A high conservation of genomic blocks between V. faba and these legume species was reported.
The development of dense and robust genetic maps that involve multiple populations and gene-based markers is a prerequisite for marker-assisted selection (MAS) and paves the way to V. faba genome assembly. Now that pulse genomes are becoming available, it is important to implement more accurate comparative genomic approaches that will reinforce faba bean breeding programs through a faster and more efficient identification of candidate genes. Transcriptome sequencing has been intensively used in the development of SNP markers for genetic mapping and diversity panel structuration of model species and crops with large genomes [21][22][23][24][25][26][27][28] . In addition, Illumina MiSeq. 29 has made possible the identification, location and functional characterization of genes that control traits of interest and has been used to provide a more comprehensive view of diversity and gene function in plants [30][31][32] . The design of SNP markers is strongly recommended for the construction of genetic linkage maps, as they stand out for their uniform distribution throughout the genome, for being numerous and for their tendency to be biallelic and codominant 33,34 . Therefore, we chose to exploit next-generation sequencing (NGS) technologies and transcriptome sequencing to specifically address the expressed gene fraction (exome) for the discovery of gene-based SNPs. The objective of our work was to develop high-resolution genetic linkage maps in three interconnected faba bean recombinant line populations and build a high-density consensus map. Our work also took advantage of macrosynteny among legume relatives to locate faba bean genomic regions conserved in different sequenced species. A comparative alignment and mapping between the faba bean consensus map developed herein and the genomes of barrel medic, birdsfoot trefoil (Lotus japonicus L.; Lj), chickpea (Cicer arietinum L.; Ca), common bean (Phaseolus vulgaris L., Pv), cowpea (Vigna unguiculata (L.) W.; Vu), pea (Pisum sativum L.; Ps) and soybean has been performed. The identification of highly syntenic and collinear areas between faba bean and already sequenced species will facilitate candidate gene discovery. In this way, SNP markers developed in genes resulting from the present work will be very useful in MAS and in map-based isolation of candidate genes.

Materials and Methods
plant material. Three connected bi-parental mapping populations have been built using cultivar (cv.) HIVERNA as a common female parent, and the accessions NOVA GRADISKA, SILIAN, and QUASAR as male parents for Pop1, Pop2 and Pop3, respectively. The winter type cv. HIVERNA of German origin was selected as a common parent between the three populations. The parents NOVA GRADISKA (originated in Croatia) and SILIAN (originated in Northern Sudan) are both minor type landraces sown in late and early spring, respectively; while QUASAR (originated in United Kingdom) is a winter type cultivar adapted to oceanic climate (cool winters with abundant rainfall). NOVA GRADISKA and QUASAR have previously been reported as resistant to the faba bean weevil (Bruchus spp.) attack 35  transcript assembly and Snp Discovery. SNP discovery was carried out as described by Duarte et al. 28 with slight modifications. Total RNAs were extracted from the parental lines and checked for quality and integrity. RNAs were then converted to full-length double-stranded cDNA and normalized with the Mint-2 and Trimmer-2 kits (Evrogen, Moscow, Russia), respectively. Later, q-RT-PCR assays were developed on a set of genes of different abundances to assess the efficiency of normalization. Normalized double-stranded cDNAs were sheared into 450 bp fragment size using the Covaris E220 system (Covaris Inc., Massachusetts, USA). Individual indexed NGS libraries were then produced with the SPRIworks HT reagent kit (Beckman Coulter, Indianapolis, USA). An equimolar pool of the four libraries was sequenced on the Illumina MiSeq platform (V2 chemistry, PE 2 × 250 nt, 12 million of clusters) (Illumina, California, USA). FastQC was used to check raw data quality. Then, sequencing adaptor removal and quality trimming were performed using trim_galore and SMART oligos (normalization primers) that were masked in the raw sequence readings using an in house script. MIRA 4.0 (http://mira-assembler.sourceforge.net) software was used to perform a de novo assembly of all samples at once. Reads of parental genotypes were remapped on the assembly using BWA 36 and only contigs with more than 10x coverage were kept for further analysis. Transdecoder 37 was used to predict open reading frames in contigs. BUSCO v3.0.1 38 was used on viridiplantae odb10 database to assess completeness of the dataset. Functional annotation was performed using eggNOG-mapper 39 on eggNOG database 5.0 40 .
The SNP discovery was then performed with SAMtools mpileup 36 and BCFtools call 41 . Only homozygous SNPs were retained.
Genotyping f 3 populations. DNA samples were extracted from leaf tissues of the parents and the individuals of the three populations using the NucleoSpin Plant II Mini kit (https://www.mn-net.com/, Hoerdt, France) following the manufacturer's protocol. DNAs were normalized before being fragmented with Adaptive Focused Acoustics Technology (Covaris Inc., Massachusetts, USA). A 250 bp target size value was obtained by using a Covaris E220 system, according to the manufacture's instructions. Then DNA fragments underwent a NGS library preparation procedure consisting in end repair and Illumina adaptor ligation using the KAPA HTP kit (Roche, Basel, Switzerland). Individual index sequences were added to each library for identifying reads and sorting them according to their initial origin. Two thousand SNPs were targeted to design capture probes. The Sequence Capture was done by using SeqCap EZ Developer kit from Roche according to the manufacture's instructions. The sequence capture reaction efficiency was evaluated by measuring, with a qPCR assessment, a relative fold enrichment and loss of respectively targeted and non-targeted regions before and after the sequence capture reaction.
The captured samples were sequenced on HiSeq sequencing platform (Illumina, California, USA) with a Paired End sequencing strategy of 2 reads of 100 bases. The objective was to produce around 3 Million sequencing clusters per sample.
Raw reads were trimmed for adaptor sequence using cutadapt 1.8.3 42 and then aligned on the targeted regions with Novoalign (http://www.novocraft.com, Selangor, Malaysia). Genotype at each position of interest was determined using SAMtools mpileup 36 and in house Perl scripts to filter out low quality positions, call the SNPs, and to produce a genotyping matrix for all 2000 selected markers.
Genetic maps construction for each population. Markers were first filtered for deviation from Mendelian segregation (37.5:25:37.5) using the following index with threshold of 0.8. Then, markers were assigned to linkage groups by estimating the pairwise recombination frequencies within each population using the maximum likelihood procedure with the forward-backward algorithm and a LOD score of 5.0 as the threshold for significant linkage in the software JoinMap V5.0 43 . In a second stage, markers were ordered and assigned to their positions in each linkage group by likelihood maximization. Candidate orders' likelihoods were computed as for F 2 populations using Spell-QTL Bayesian inference 44 and heuristic local maximization was performed using custom R scripts implementing the algorithm described earlier in Ganal et al. 45 . This heuristic was based on the serial inference of marker orders, proceeding according to the following steps. First, 10 "seed" markers were randomly chosen in each chromosome, and one statistically robust scaffold map replicate was constructed from each seed marker, by iteratively choosing the most strongly linked neighbour with at least 10 cM between adjacent markers. Then given these scaffold maps, marker density was increased to produce framework maps containing as many markers as possible while keeping a LOD score >3.0 for the robustness of marker orders. Finally, the complete maps were obtained by placement of additional markers using bin-mapping 45 . As a post-processing step, it was necessary to calculate the distance between pairs of markers taking into account that the recombination rates estimated by the algorithm assumed that the populations were F 2 whereas they were in fact F 3 . For that, we first determined the correspondence between rF 2 , the recombination rate estimated assuming an F 2 population, and the true recombination rate, r. That correspondence was obtained in two steps. In the first step we used the explicit formula relating rF 2 to the two-locus genotype frequencies fAaBb, fAaBB = fAABb, fAABB = faabb, and fAAbb=faaBB where the two parental types are denoted A (respectively a) and B (respectively b) and the labels refer to unphased genotypes. Then in the second step we inferred r from those 7 frequencies by maximum likelihood using the formulas for those frequencies in F3 as a function of r. Finally, centiMorgan (cM) distances were calculated using Haldane's mapping function 46 . consensus genetic map construction. SNP sequences anchored to the sequences of the SNPs previously mapped by Webb et al. 21 were used to assign each linkage group (defined by the initial seed used in the map construction) to one chromosome number. For each chromosome, we produce a consensus map that summarizes the recombination information within the three populations. Our method is quite general and in particular it does not assume collinearity of the individual maps. Its key feature is the minimization of an index I D which measures the differences between the (unknown) consensus map M* and all of the individual maps M1, M2. We forced our consensus map to include all the markers present in the individual maps. M* is to be specified by assigning a genetic position for each marker and of course that will also define the marker ordering in M*. Without loss of generality, the genetic position of the first marker of M* can be considered as the origin of genetic coordinates. The task is then to estimate the genetic positions of the rest of the markers. In our framework based on the minimization of the index I D , we first define a "distance" between two maps via the formula: where i labels the markers that are common in the two maps and j(i) labels the markers that are not only in common in the two maps but also meet a criterion for their distance to marker i. Specifically, if marker j(i) is quite far www.nature.com/scientificreports www.nature.com/scientificreports/ from marker i, that corresponding interval does not provide much information, so it is better to exclude it from the sum. In general, there are many of these markers and thus it is also computationally efficient to exclude them. Inversely, if a marker j(i) is very close to i, the corresponding distance is often not very well determined and so again it is better to exclude it. Therefore, our criterion imposes both a minimum and maximum distance between j(i) and i. In addition, to avoid having many j(i) markers for some i and only a few for others, we keep only a subset of the possible markers j(i) for a given i. where n runs through all the individual maps from which the consensus is being built. If the maps have been determined using very different population sizes, we weight each term of this sum by the corresponding population size. In this way, if a map has a much smaller population than the other maps, it will have little influence on the construction of M* which is justified since its marker positions are not very accurate. On the contrary, if a map has a much larger population than the other maps, it will have a strong influence on the construction of M*, which is again justified since its marker positions are rather precisely determined.
Synteny with other legume crops. The flanking sequences of the SNP markers placed in the consensus map were searched against C. arietinum (v1) 47 (Table S1). After filtering on coverage (see Materials and Methods section), a transcriptome resource of 39,423 high-quality contigs with a N50 of 1460 bp (Table S2) and a BUSCO completeness score of 84.8% was built (Table S3). Functional annotation was obtained for 24,507 contigs (Fig. S1). Each contig of the unigene set was assigned, if possible, to the categories of biological processes, molecular functions and cellular components (Fig. S1). Among the biological processes, the metabolism of nucleobase, nucleoside, nucleotide and nucleic acid (10.26%), biosynthesis (9.67%) and cell organization and biogenesis (8.02%) were the main classes represented (Fig. S1A). Catalytic (36.49%) and transferase (13.17%) activities contributed in greater proportion to the category of molecular function (Fig. S1B). The cytoplasmic (31.68%) and nucleus (15.91%) cellular components were the most represented classes within the annotation (Fig. S1C). SNP discovery, selection, genotyping and individual genetic linkage of the three F3 populations. A total of 105,828 homozygous SNPs (Fig. S2) were detected on 19,190 contigs (5.5 SNP/contig) with an average coverage of 47.7×. In total, 64.77% of the SNPs were transitions while 35.23% were transversions. Out of these robust SNPs, 2,000 were selected based on the following parameters: SNP quality score, polymorphism in more than one population (see Materials and Methods section), potential synteny with pea and Medicago truncatula and 1 SNP maximum per contig. Capture probes were then designed to allow large-scale targeted genotyping. Of the 2,000 gene-based SNP markers selected above, 1,911 markers were successfully scored on the progenies of three F 3 inbred populations, i.e., Pop1-3 (Table S4). Of them, 95.2% (1,819 SNPs) were polymorphic in at least one population. Altogether, 1,446 SNPs were polymorphic in Pop1, 1,499 in Pop2 and 1,409 in Pop3 (Table S4).
Linkage mapping of the three F3 populations. Individual genetic maps were constructed for the three F 3 populations after filtering markers for distortion and missing data (see Materials and Methods section). Two hundred thirty-three (Pop1), 189 (Pop2) and 175 (Pop3) SNP markers were placed on the individual scaffold maps (Fig. S3). The framework maps included 350 (Pop1), 209 (Pop2) and 326 (Pop3) markers (Fig. S4). The full maps had 1,438 markers for Pop1, 1,312 markers for Pop2 and 1,406 markers for Pop3 that covered all six faba bean LGs (Table 1 (Table 1). Only a few large gaps (>10 cM) were observed: three gaps on the linkage map for Pop1 (LGII and LGV), nine gaps on the map from Pop2 (LGI, LGIII, LGIV, LGV and LGVI) and eight gaps on the map from Pop3 (LGI, LGII, LGIII and LGV) ( Table 1). Nine hundred twenty-eight markers were common to the www.nature.com/scientificreports www.nature.com/scientificreports/ three populations (Fig. 2). One thousand five hundred and four SNPs were mapped onto at least two of the three linkage maps. Pairwise comparisons of the positions and the orders of common markers were performed among the three populations to assess synteny and collinearity (Fig. 3). The positions of the marker were consistent in the LGs of the three populations. High positive correlations (Spearman test) between map orders were obtained: Pop1-Pop2, r = 0.98; Pop1-Pop3, r = 0.99 and Pop2-Pop3, r = 0.98; P < 0.001).

Segregation distortion of the individual maps.
There was a significant distortion of segregation (χ 2 test, P < 0.05) with respect to the expected Mendelian segregation ratio (37.5:25:37.5) for a minority of markers in the three populations (5.57% in Pop1, 9.14% in Pop2, 8.52% in Pop3). Only a few markers exceeded the index distortion threshold of 0.8 used for mapping (see Materials and Methods section).
Pop2 presented a region with segregation distortion towards SILIAN alleles at the bottom of LGI (Figs. 4 and S6). Although with less intensity, LGI of Pop1 also displayed a region in which segregation favoured the alleles of the male parent NOVA GRADISKA (Figs. 4 and S6). Outside of this region, the segregation distortion slightly favoured NOVA GRADISKA alleles in Pop1 (except in LGIII where HIVERNA alleles were over-represented) and SILIAN alleles in Pop2 (except in LGVI where HIVERNA alleles were favoured) while in Pop3 HIVERNA alleles were favoured (except in LGIII and LGIV where QUASAR alleles were more frequent) (Fig. S6).
integration of the individual maps into a consensus map of faba bean. General collinearity among the individual genetic linkage maps (Fig. 3) helped to construct a dense consensus map of 1,547.71 cM that included 1,728 markers (95% of the polymorphic SNPs) (Table 1, Fig. 5, Table S8). The number of markers per LG varied from a minimum of 153 (LGV) to a maximum of 375 (LGI) SNPs (Table 1). The density of markers in this consensus map was high in all LGs, with an average density of 1.12 markers/cM. The average distance between two markers was 1.17 cM (Table 1). Nonetheless, 9 intervals were found with a distance between two markers LGI LGII LGIII LGIV LGV LGVI www.nature.com/scientificreports www.nature.com/scientificreports/ greater than 10 cM, the largest gap being 21.51 cM ( Table 1). The position and order of markers on the individual and consensus maps was overall conserved (r = 0.99 for Pop1-consensus, Pop2-consensus Pop3-consensus, P < 0.001) (Figs. 6 and S7). A few local inversions of the order of markers were observed (Fig. S7).
Comparison between the faba bean consensus map SNP marker sequences reported here and those presented in the Webb et al. 21 consensus map using BLASTn search highlighted markers located in the same genes. Eighty-eight common markers (e.g. markers corresponding to the same Medicago truncatula gene sequence) were found. The distribution and position of these common markers between both maps showed highly conserved www.nature.com/scientificreports www.nature.com/scientificreports/ collinearity between them in the six LGs, although a few marker inversions were also observed (Fig. S8), which is another evidence of the reliability of the data presented here.
Macrosynteny between the faba bean consensus map and the genomes of related legume species. Syntenic and collinear relationships between the faba bean consensus map presented in this study and the genomes of related legume species are summarized in Fig. 7. As expected, the degree of synteny and collinearity between faba bean and the legume species compared here increased when the phylogenetic distance decreased and vice versa. The genomes of P. sativum, C. arietinum and M. truncatula showed high levels of macrosynteny with our consensus map (Fig. 7). The best blast hits on the M. truncatula and P. sativum genomes for the faba bean markers' flanking sequences are described in Table S4, including their annotations and positions. There was a high conservation of synteny and collinearity between LGs II, IV, V and VI of faba bean and PsChr5, PsChr4, PsChr3 and PsChr7 of P. sativum and LGs II, III, IV and V of faba bean and MtChr3, MtChr1, MtChr4 and MtChr7 of M. truncatula, respectively (Fig. 7).

Discussion
new faba bean genetic resources: transcriptome, gene-based Snps, gene-based Snp markers and a consensus map. Europe suffers a significant deficit of plant proteins that makes it necessary to import up to 70% of the plant-based proteins consumed 54 . Grain legumes including faba bean are good candidates to boost EU plant protein production due to the high protein content of their seeds. Despite this potential and www.nature.com/scientificreports www.nature.com/scientificreports/ the important environmental services related to grain legume production, these crops represent only 3-4% of the arable land. The low investment in breeding programs has limited the development of stable high-yielding varieties, resistant to biotic and abiotic stresses 55   www.nature.com/scientificreports www.nature.com/scientificreports/ (File S1), of which 24,507 contigs were annotated. Although this amount is lower than that of other studies 56,57 , it is a sufficiently high number for the discovery of robust SNP markers well distributed throughout the faba bean genome. These data are also available for use in transcriptome comparisons with other faba bean genotypes and between faba bean and other species. In addition, the faba bean transcriptome was used to identify 105,828 gene-based SNPs (File S2). The present work makes this new source of SNPs available to the faba bean community, who can develop additional SNP markers useful in other genetic backgrounds. Two thousand non-redundant loci (Table S4)   www.nature.com/scientificreports www.nature.com/scientificreports/ recombinant lines from three populations and the four parental lines that originated them. One thousand nine hundred eleven SNP markers (95.5%) were validated after a successful genotyping (Table S4). Our set of 1,819 polymorphic gene-based SNPs is a valuable tool for the faba bean community and particularly for breeders. Since the markers were designed in genes, they are highly informative and allow establishing syntenic relationships with other species 32,58,59 . The high quality of this set of markers was confirmed after the construction of three individual genetic maps derived from three different populations.
High collinearity between individual maps led to the construction of the densest faba bean consensus map known to date. The map includes 1,728 well-distributed markers along six LGs that correspond to the six faba bean chromosomes and covers 1,547.71 cM with a dense marker placement (0.89 cM between adjacent pairs of markers on average) (Fig. 5). In accordance with previous cytogenetic studies LGI, corresponding to chromosome 1, was the largest linkage group 60,61 . The total map size is consistent with the 1,403.8 cM size of the consensus map previously published by Webb et al. 21 . Despite having different genetic backgrounds, both consensus maps showed a good collinearity (Fig. S8), which confirming the good quality of both maps. Consensus gene-based genetic linkage maps are useful in meta-QTL analysis, phylogenetic and comparative genomic studies, map-based cloning and GAB, especially in the absence of a genome sequence. The phenotypic characteristics of the parents of the recombinant lines make these populations and maps useful resources. The next steps in our research will be to perform Quantitative Trait Locus (QTL) analyses to identify potential candidate genes for resistance to faba bean seed weevils in Pop1 and Pop3. In a previous work 35 , we reported that the male parents NOVA GRADISKA and QUASAR present partial resistance to the attack of bruchids. Differences in parental responses to bruchids attacks suggested distinct resistance mechanisms in the two accessions. This would be of great advantage in breeding since different genes could be pyramided and introgressed simultaneously in cultivars, making the resistance to faba bean weevils more durable and contributing to agriculture with less need for pesticides. In the case of Pop2, a distorted region was located on the top of LGI (Figs. 4 and S6). Knowledge about this area is of great importance for MAS because the genes located in this part of LGI of Pop2 seem to segregate together in favour of the genetic background of SILIAN. If favourable alleles of a gene of interest were located in this area of the LGI but at the same time there were nearby genes that carried unfavourable alleles, most likely, they would all segregate together. Thus, the introgression of favourable agronomic traits will be quite difficult in such a situation. Other previously published faba bean maps have also noted the presence of distorted regions throughout the different linkage groups 62-64 . Syntenic regions shared between faba bean and other legumes will facilitate future comparative genomic studies. Exploitation of the syntenic relationships between the faba bean consensus map developed in this work and available legume genomes will make the identification of candidate genes of important traits easier in the future and will enable forthcoming synteny-based gene cloning approaches. As expected, robust macrosyntenic blocks that sometimes nearly cover a complete chromosome were found between faba bean LGs and pea, barrel medic and chickpea chromosomes since their phylogenetic proximity is greater than that of the rest of the compared sequenced legumes (Fig. 7). In accordance with our results, Webb et al. 21 also reported good levels of synteny between their consensus map, the genome of M. truncatula and the genetic map of lentil developed by Sharpe et al. 27 . In addition, we have been able to locate abundant blocks of macrosynteny between faba bean and common bean, cowpea or birdsfoot trefoil despite their greater evolutionary distances (Fig. 7), providing further evidence of the mapping accuracy. By contrast, the number of markers in our consensus map may not be enough to clarify the macrosynteny between faba bean and soybean due to the extensive chromosomal rearrangements and polyploidization of the soybean genome. Hopefully, gene conservation with soybean will be more evident once the faba bean genome is available, as has happened in the case of the pea 48 . Although duplication of the soybean genome and chromosomal rearrangement are a limitation for translational genomics with faba bean, synteny in duplicate regions would be a good resource to exploit. Despite the large size of the faba bean genome, synteny data reflects a globally conserved organization with respect to the genome of the legumes studied here. There is of course a certain amount of reorganization that can be easily observed, for example, in the condensed LGI of faba bean that gathers the genes located on chromosomes 1, 2 and 5 of pea. These results include V. faba as an additional syntenic species in the paleogenomic scheme described in Kreplak et al. 48 .
In conclusion, this work provides to faba bean researchers and breeders a new faba bean exome assembly originated from transcriptome data of four accessions (HIVERNA, NOVA GRADISKA, SILIAN and QUASAR), a set of 105,828 gene-based SNPs and 1,819 mapped SNP markers on three individual linkage maps and one consensus map. The high quality of the assembly resulted in the identification of a large number of SNPs of the most informative type due to their location in genes. The molecular markers designed from this set of SNPs were validated in three recombinant populations, resulting in the densest faba bean consensus map to date. The SNP markers designed here are available for genotyping other inbred populations that could be integrated later into our consensus map. These robust resources will be useful for trait mapping, genetic diversity and linkage disequilibrium studies or map-based cloning, and will enable faba bean MAS and GAB as well as the identification of candidate genes of agronomic interest through synteny-based approaches.