Polychromatic solar energy conversion in pigment-protein chimeras that unite the two kingdoms of (bacterio)chlorophyll-based photosynthesis

Natural photosynthesis can be divided between the chlorophyll-containing plants, algae and cyanobacteria that make up the oxygenic phototrophs and a diversity of bacteriochlorophyll-containing bacteria that make up the anoxygenic phototrophs. Photosynthetic light harvesting and reaction centre proteins from both kingdoms have been exploited for solar energy conversion, solar fuel synthesis and sensing technologies, but the energy harvesting abilities of these devices are limited by each protein’s individual palette of pigments. In this work we demonstrate a range of genetically-encoded, self-assembling photosystems in which recombinant plant light harvesting complexes are covalently locked with reaction centres from a purple photosynthetic bacterium, producing macromolecular chimeras that display mechanisms of polychromatic solar energy harvesting and conversion. Our findings illustrate the power of a synthetic biology approach in which bottom-up construction of photosystems using naturally diverse but mechanistically complementary components can be achieved in a predictable fashion through the encoding of adaptable, plug-and-play covalent interfaces.

O ur everyday experience of photosynthesis is dominated by the blue/red-absorbing pigment chlorophyll, a magnesium tetrapyrrole that acts as both a harvester of solar energy and a carrier of electrons and holes. Variants of this versatile molecule, principally chlorophyll a and chlorophyll b, are found in the plants, algae and cyanobacteria that make up the oxygenic phototrophs. Less well-known are the anoxygenic phototrophs, bacteria that use electron donors other than water and have one or more variants of bacteriochlorophyll as their principal photosynthetic pigment. Although these bacteria are less obvious in our environment, oxygen-tolerant species are widespread in oceanic surface waters where they make a sizeable contribution to global solar energy conversion 1 . A few species, including the bacteriochlorophyll a-containing Rhodobacter (Rba.) sphaeroides, have played major roles in our understanding of excitation energy transfer in light-harvesting "antenna" complexes (LHCs) [2][3][4] and charge separation in photochemical reaction centres (RCs) 5,6 .
Improving the performance of photosynthesis and finding new ways to exploit natural solar energy conversion have become important research topics 7,8 , and there is growing interest in the use of photosynthetic proteins as environmentally benign components in biohybrid devices for solar energy conversion [9][10][11][12][13][14] . Photoexcitation of a RC in such a device triggers intra-protein charge separation, producing a potential difference between opposite "poles" of the protein that drives subsequent electron transfer to create a photocurrent and photovoltage. In addition to solar energy conversion per se, proposed applications of photoprotein devices have included biosensing, light/UV sensing, touch sensing and solar fuel synthesis [9][10][11][12][13][14][15][16] . Photosynthetic proteins are attractive as device components because they are environmentally sustainable and benign, they achieve solar energy conversion with a very high quantum efficiency (charges separated per photon absorbed) and they can be adapted to purpose through protein engineering. However, a limitation is their selective use of available solar energy 7,8 , a consequence of their particular palette of lightharvesting pigments (Fig. 1a). This can be evidenced in devices through the recording of action spectra of external quantum efficiency (EQE-the number of charges transferred per incident photon), which exhibit peaks and troughs that correspond to the absorbance spectra of the particular light-harvesting pigments that are coupled to charge separation in the device 12,[17][18][19][20][21] .
One option for the expansion of a protein's light-harvesting capacity is to attach to it chromophores such as synthetic dyes [22][23][24] or emissive nanoparticles [25][26][27] . Drawbacks of this approach are that synthetic dyes are often expensive and prone to photobleaching 26 , while fluorescent nanoparticles can be toxic and achieving well-controlled assembly of protein-nanoparticle conjugates is challenging 28 . More akin to the present study is a report of a fusion protein between a single Yellow Fluorescent Protein (YFP) and the purple bacterial RC, which has the effect of somewhat enhancing light harvesting in a region where RC absorbance is weak by adding a single chromophore 29 .
A striking observation is the complementary nature of the absorbance spectra of chlorophyll and bacteriochlorophyll photosystems (Fig. 1a). This is enabled by the somewhat different electronic structures of their principal pigments ( Supplementary  Fig. 1a) and facilitates the occupancy of complementary ecological niches by oxygenic and anoxygenic phototrophs. Chlorophyll absorbs most strongly in the blue and red whereas the absorbance of bacteriochlorophyll is shifted to the near-ultraviolet and near-infrared. The absorbance spectra of plant and bacterial carotenoids between 400 and 600 nm are also somewhat complementary (Fig. 1a). Thus, anoxygenic phototrophs harvest parts of the solar spectrum which oxygenic phototrophs do not absorb well, and vice versa.
Following nature's lead, here we present the use of genetic encoding to achieve in vitro self-assembly, from diverse components (Fig. 1b, c), of photoprotein "chimeras" that display polychromatic solar energy harvesting and conversion. The components are the Rba. sphaeroides RC 5,6 and the LHCII [30][31][32][33] and heterodimeric LHCI [34][35][36][37][38] proteins from Arabidopsis (A.) thaliana ( Supplementary Fig. 1b-e). Highly specific and programmable self-assembly is achieved through adaptation of these components with the constituents of a two-component protein interface domain (Supplementary Fig. 1f) that covalently locks together two photosynthetic membrane proteins that have no natural propensity to associate in a specific and/or controllable manner. The resulting macromolecular, adaptable chimeric photosystems have defined compositions, and display solar energy conversion across the near-UV, visible and near-IR.

Results
Solar energy conversion by unadapted photosystem components. We first looked at whether plant LHCIIs can pass harvested energy to purple bacterial RCs in dilute solution in the absence of complementary genetic adaptations to promote specific heterodimerisation (complexes defined as "unadapted"). On receipt of excitation energy, photochemical charge separation in the Rba. sphaeroides RC is a rapid four-step process (Fig. 1d) that produces a metastable oxidised primary electron donor (P870 + ) and reduced acceptor ubiquinone (Q B − ); energy transfer can therefore be detected as a quenching of LHC emission accompanied by an enhancement of P870 oxidation. Although bacterial RCs and plant LHCIIs (see Methods for sources) have overlapping absorbance and emission spectra between 640 and 800 nm (Fig. 1b), no appreciable energy transfer was observed when wild-type (WT) RCs were mixed in solution with an LHCII because they have no capacity for binding to one another. The addition of purified wild-type (WT) RCs did not significantly reduce emission from LHCII (Fig. 2a) and photo-oxidative bleaching of the absorbance band of this RC's P870 primary electron donor BChls in response to 650 nm excitation was not significantly enhanced by the addition of LHCII (Fig. 2b), which absorbs strongly at this wavelength (Fig. 1b).
In comparison to LHCII, the spectral overlap (J) between LHC emission and RC absorbance is~80% larger in the case of LHCI (Fig. 1c, Supplementary Table 1) which contains a pair of "redform" chlorophyll a that possess a charge-transfer state that mixes with the low-energy exciton state 38 . Although the addition of WT RCs did bring about a decrease in LHCI emission (Fig. 2a), there was no associated significant increase in RC P870 photobleaching in the presence of LHCI (Fig. 2c), leading to the conclusion that the observed emission quenching was not due to energy transfer. Protein concentrations used for the fluorescence measurements were too low (max absorbance < 0.07) for this drop in LHCI emission to be attributable to reabsorption by the added RCs, and an equivalent drop was not seen for LHCII and WT RCs at similar concentrations (Fig. 2a). As it is known that the emission quantum yield of LHCI in vitro is much more sensitive to its environment than is the case for LHCII 36 , the observed drop in LHCI emission on adding WT RCs is attributed to a change in its intrinsic quantum yield rather than being a signature of energy transfer.
Although no significant energy transfer was seen between these proteins in dilute solution, to establish the principle that plant LHCs can pass energy to bacterial RCs when brought sufficiently close together, mixtures of LHC and WT RC proteins were adhered to a nanostructured silver cathode and their capacity for generating photocurrents examined (see Methods). In this photoelectrochemical system (Fig. 2d) cytochrome c (cyt c) is used to "wire" charge separation in the RC to the cathode, and ubiquinone-0 (Q 0 ) shuttles electrons to the counter electrode 20,39,40 . Electrodes drop-cast with purified WT RCs produced a photocurrent in response to RC-specific 870 nm light and a weaker current in response to 650 nm excitation where RC absorbance is very low (Supplementary Fig. 2a). An EQE action spectrum showed good correspondence with the RC absorbance spectrum (Fig. 2e, magenta versus black), confirming that the photocurrent was attributable to light capture by the pigments of the RC. As expected, an electrode fabricated with purified LHCII failed to show any photocurrent response during 650 nm excitation of the main low-energy LHCII absorbance band ( Supplementary Fig. 2a).
For electrodes fabricated from mixtures of WT RCs and LHCs, in addition to the expected RC bands the EQE spectra contained a component between 620 and 700 nm that corresponded to the lowenergy absorbance band of LHCII or LHCI (Fig. 2e, green). A contribution from the high-energy Soret absorbance band of LHCII or LHCI was also observed in EQE spectra (Supplementary Figs. 3a,b and 4). This demonstrated that bacteriochlorophyll-based purple bacterial RCs can utilise chlorophyll-based plant LHCs for energy harvesting, producing charge separation and a photocurrent response, provided they are brought within Förster resonance energy transfer (FRET) distance of one another. In this case this was realised by colocalising the two proteins on the surface of a biophotoelectrode.
Design and production of components for chimeric photosystems. In an attempt to activate chlorophyll to bacteriochlorophyll energy transfer in dilute solution, RCs and LHCs were adapted using the SpyTag/SpyCatcher protein fusion system 41 as a programmable interface (see Supplementary Note 1). When mixed in solution, highly specific binding of the short SpyTag peptide to the SpyCatcher protein domain initiates autocatalysis of an isopeptide bond between the two involving aspartate and lysine residues ( Supplementary Fig. 1f), producing a single, covalently locked, water-soluble protein domain 41 .
Self-assembly of RC-LHC chimeras. Following ultracentrifugation, purified RCs and LHCIIs could be visualised on sucrose density gradients as either a red or green band, respectively (Fig. 3d, gradients 1 and 2), and these two proteins also migrated separately in gradients loaded with a mixture with only either the SpyTag or SpyCatcher adaptations (Fig. 3d, gradients 3 and 4). In contrast, mixing any SpyCatcherΔ-adapted RC with any SpyTag (Δ)-adapted LHCII produced a product, dubbed a "chimera", that migrated further than either monomeric protein.
The two examples shown in Fig. 3d (gradients 5 and 6) are chimeras from a RC4C/Td-dLHCII mix (dubbed "RC#LHCII") and from a RCC/ LHCII-T mix (dubbed "LHCII#RC"). The symbol "#" denotes the spontaneously formed SpyCatcher/SpyTag interface domain. Chimera formation could also be detected on a native blue gel ( Supplementary Fig. 6a). Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) combined with western blotting using anti-His antibodies confirmed that chimera self-assembly was due to the formation of a covalent bond between the SpyTag (Δ)-adapted Lhcb1 polypeptide of LHCII and the SpyCatcherΔadapted PufL polypeptide of the RC ( Supplementary Fig. 6b, Supplementary Note 2). The reaction half-time for chimera formation varied between 10 and 90 min depending on the particular combination of adapted RC and LHCII (detailed in Supplementary Note 3).
LHCI-RC chimeras could also be assembled by incubation of LHCI-Td or Td-LHCI-Td with a threefold excess of RCC. This again produced higher molecular weight products that could be separated from unreacted RCs on blue native gels (Fig. 3e). As designed, assembly of RCC with doubly adapted Td-LHCI-Td complexes produced higher molecular weight products than with singly adapted LHCI-Td complexes (Fig. 3e, right). Equivalent results were obtained with LHCI adapted with the full SpyTag and also with RC4C ( Supplementary Fig. 7a). Analysis by SDS-PAGE and western blotting showed that chimera self-assembly was due to formation of a covalent bond between the SpyCatcherΔ-adapted PufL of the RC and Lhca4 of a singly SpyTagΔ-adapted LHCI (to form chimera LHCI#RC) or Lhca4 and Lhca1 of a doubly SpyTagΔ-adapted LHCI (to form chimera RC#LHCI#RC) ( Supplementary Fig. 7b). Sucrose density gradient ultracentrifugation (Fig. 3f) showed that LHCI#RC chimeras (gradient 5) were clearly larger than LHCI alone (gradients 2-4) or unadapted RCs (gradients 1, 3 and 4), and RC#LHCI#RC chimeras (gradient 6) were larger again.
Covalent locking of the structure enabled purification of all LHCI-RC and LHCII-RC chimeras by a combination of nickel affinity and size-exclusion chromatography, absorbance spectroscopy being used to identify fractions containing protein oligomers with the designed molar ratio ( Supplementary Fig. 8).
A change in protein morphology on chimera formation could be observed by transmission electron microscopy (TEM). Images of a mix of unadapted WT RCs and dLHCII showed a large number of monodispersed, regularly sized objects of <10 nm Chlorophyll to bacteriochlorophyll energy transfer. In solution, LHCII emission was quenched within each chimera in comparison to a control sample formed from an equivalent mix of the SpyTag-adapted LHCII and WT RCs ( Fig. 4a; see spectra and other combinations in Supplementary Fig. 9). This was indicative of energy transfer, likely through a FRET mechanism at the distances implied by the chimera models (Fig. 3h, i), that was activated in these proteins in dilute solution by physically linking the RC to the LHCII. These trends, observed with 650 nm excitation, were also seen in data on the same complexes obtained with other three excitation wavelengths, with no variation in emission spectrum line shape ( Supplementary Fig. 9). As well as being diagnostic of correctly refolded LHCII proteins, this lack of dependence of emission spectrum on excitation wavelength showed that the reduction in LHCII emission on chimera formation was not due to parasitic RC absorbance, which would be expected to be wavelength dependent (and also seen when WT RCs were mixed with each LHCII).
To determine the fate of transferred energy, measurements of RC P870 photooxidation in response to 650 nm excitation were carried out on the LHCII#RC and RC#LHCII chimeras and fitted to a simple interconversion reaction (Eq. (1); all parameters are summarised in Supplementary Table 3). Bleaching of 870 nm absorbance was stronger in LHCII#RC chimeras than in controls comprising the RCC protein alone or a mixture of RCC with unadapted dLHCII complexes (Fig. 4b). The same was found for the RC#LHCII chimera ( Supplementary Fig. 10a). Hence, decreased emission by the LHCII energy donor was accompanied by enhanced photooxidation of the RC energy acceptor, confirming energy transfer between the two proteins in solution that was switched on only after linking them by the SpyCatcher/ Tag domain.
Turning to LHCI, a greater reduction of LHCI emission was seen on forming either LHCI#RC or RC#LHCI#RC chimeras than after mixing the same adapted LHCI proteins with WT RCs (Fig. 4c; and other combinations are shown in Supplementary  Fig. 11). This effect was again seen to be independent of excitation wavelength ( Supplementary Fig. 11a) showing it was not due to the absorbance of excitation light by the tethered RC(s). This emission quenching was accompanied by significant enhancement of P870 photooxidation in LHCI chimeras with one or two RCC, compared to that seen with RCC alone (Fig. 4d), confirming energy transfer. Doubly modified RC#LHCI#RC complexes showed less P870 bleaching than LHCI#RC complexes due to two tethered RCs competing for the exciton reservoir rather than one (see below). Purified chimeras were also adhered to nanostructured silver electrodes to test their functionality. All were able to generate photocurrents, showing that dynamic interactions between the RC, cyt c and ubiquinone at the electrode surface, required for the generation of a photocurrent, were not obstructed by attaching the RC to LHCII or LHCI. All EQE action spectra recorded for chimeras exhibited low-energy (Fig. 4e, f)  Energy transfer efficiency in chimeras. Apparent efficiencies of energy transfer from LHCII or LHCI to the RC in solution were estimated either from data on emission of the LHC energy donor (E FL ) or from data on photobleaching of the RC energy acceptor (E P870 ) (see Methods, Eqs. (2)-(4)). Efficiency E FL was based on the additional quenching of LHC emission in a chimera relative to that in a compositionally matched mixture of the relevant LHC variant and WT RCs (Eqs. (2) and (3)) or additional quenching in an LHC/WT RC mixture relative to that in a concentrationmatched LHC-only sample. Efficiency E P870 was based on the enhanced rate of RC P870 photobleaching in a chimera relative to a matched RC-only control (Eq. 4).
Values of E P870 calculated from experimental data are shown in   Fig. 4 Energy transfer in chimeras in solution and on surfaces. a Emission at 682 nm from (left) the Td-dLHCII protein alone (grey), after addition of a twofold excess of WT RCs (green) and in a LHCII#RC chimera (cyan) formed on mixing with a twofold excess of RCC, and (right) equivalent data for LHCII-T. b Data and fits for photobleaching and dark recovery of P870 absorbance in RCC, a 1:1 RCC plus dLHCII mixture and the LHCII#RC chimera. c Emission from (left) LHCI-Td alone (grey), after addition of a threefold excess of WT RCs (green) and in a LHCI#RC chimera (navy) formed on mixing with a threefold excess of RCC, and (right) equivalent data for Td-LHCI-Td. Error bars in a and c represent standard deviations from six repeats (three technical repeats of two biological repeats) and data points are shown as overlaid circles. d Data and fits for photobleaching and dark recovery of P870 absorbance in RCC and the two RC-LHCI chimeras. e Solution absorbance and EQE spectra for WT RCs compared with those for the two LHCII-RC chimeras. f Solution absorbance and EQE spectra for WT RCs compared with those for the two RC-LHCI chimeras. g Schematic of adsorption of independent RC (red) and LHCII (green) complexes on an electrode. Yellow arrows indicate possible energy transfer connections. h Equivalent schematic of adsorption of RC#LHCII chimeras. i Simulated apparent ET efficiencies as a function of packing density for independent RC and LHCII proteins or the LHCII#RC chimera. Error bars represent standard deviations from ten simulation repeats. Individual data points are represented by circles or squares. no propensity to associate (see Supplementary Fig. 12 and Supplementary Table 4 for other control combinations). In marked contrast, E P870 was over 20% in the corresponding RC-LHCII or RC-LHCI chimera ( Table 1). For all chimeras the value of E FL derived from LHC emission data was in excellent agreement with the values of E P870 derived from RC absorbance data (Table 1). This correspondence between independently determined efficiencies from separate data sets reinforced the conclusion that energy transfer was taking place from the plant LHCs to the bacterial RCs within the chimera.
"On electrode" apparent energy transfer efficiencies (E electrode ) were also determined from the EQE action spectra, as described in Methods. In general, values of E electrode were higher than either estimate of energy transfer efficiency in solution (Table 1). This was particularly striking for mixtures of WT RCs and SpyTag(Δ)adapted LHCII or LHCI (shown schematically in Fig. 4g) where energy transfer in solution had a very low apparent efficiency. However, for the RC/LHCII chimeras in particular the value of E electrode was also substantially higher than E P870 or E FL (Table 1, Fig. 4h), suggesting that adhering the chimeras to a surface turned on inter-chimera ET that supplemented the intra-chimera ET observed in solution. This effect was less pronounced for the RC/ LHCI chimeras, particularly for complex RC#LHCI#RC where there were already two RCs per LHCI antenna (Table 1).
To examine whether the benefits of pre-linking RCs and LHCs in a chimera would be seen across a range of surface packing densities, a 2D Monte Carlo simulation was carried out as detailed in Supplementary Notes 4 and 5 (and summarised in Supplementary Fig. 13). In this either LHCII#RC chimeras or a mixture of LHCII-T and WT RC proteins were represented as hard-discs on a 2D surface and centre-to-centre distances calculated as a function of packing density. The outcome of this simulation was an apparent energy transfer efficiency (E sim ) based on how protein packing densities affected overall interprotein distances. In the high packing regime, E sim was in good agreement with the slightly higher E electrode determined for chimeras than for a mixture of unadapted proteins (Fig. 4i,  right). As the packing density dropped to a low value (right to left in Fig. 4i), E sim for the chimeras gradually declined to around the values for E P870 and E FL estimated for the LHCII#RC chimera in solution (22.8%/19.6%). In contrast, E sim for the protein mixture declined steeply to less than 2% at the lowest packing density, again in agreement with estimates of E P870 and E FL for the protein mixture in solution (1.2%/0.8%). This reinforced the conclusion that pre-tethering of the RC and LHCII protein into a chimera brought an added benefit even under conditions where co-localisation of the proteins on a surface switched on energy transfer between the two irrespective of tethering. Presumably pre-tethering can mitigate against situations where, for example, formation of RC-rich or LHCII-rich sub-domains and sub-optimal mixing can lead to some proteins being outside the FRET distance ( Supplementary  Fig. 14, marked with blue triangles).
The EQE spectra were also used to estimate the percentage improvement in the use of visible light by an LHC/RC biophotoelectrode compared to a RC bio-photoelectrode. Consistent with values of E electrode , the presence of an LHC consistently boosted the use of visible light, with the strongest effects seen for electrodes fabricated from chimeras (see Methods and Table 1).

Discussion
The data establish that it is possible to genetically encode in vitro self-assembly of a hybrid chlorophyll/bacteriochlorophyll solar energy conversion system using a highly specific split-interface domain. To our knowledge such combinations of chlorin and bacteriochlorin pigments are not used for light harvesting in nature, although in green sulfur bacteria the multiple BChl a light harvesting and electron transfer cofactors of the RC are supplemented by four molecules of Chl a that are used electron acceptors during charge separation 49 . In a similar vein, in the related heliobacterial RC the multiple BChl g (an isomer of Chl a) cofactors are supplemented by two molecules of 8 1 -hydroxychlorophyll a that also act as electron transfer acceptors 50 . Hence some organisms have evolved to supplement bacteriochlorin cofactors with chlorins to achieve charge separation, but not to expand solar energy harvesting in the way demonstrated here.
The SpyCatcher/Tag system provided a versatile means of constructing self-assembling hybrid photosystems. LHCII could be modified with SpyTag at either its N-or C-terminus, and by also using heterodimeric LHCI proteins that were either singly or doubly SpyTag modified the oligomeric state of the chimeras could be varied between heterodimers (RC#LHCII and LHCII#RC), heterotrimers (LHCI#RC) and heterotetramers (RC#LHCI#RC). The SpyCatcher/Tag linking domain produced predictable and stable products due to its very high partner specificity and the autocatalytic formation of a locking covalent bond. This binding reaction, which under the present conditions was found to have a half-time of between 10 and 90 min, was irreversible, relatively insensitive to reaction conditions and was free from side products (i.e. a failed reaction did not lead to depletion of reactants). The assembly strategy used, using E. coli and Rba. sphaeroides as separate bacterial factories for the synthesis of protein components that could be assembled in vitro, avoided the need to re-engineer a host organism to be able to produce both chlorophyll and bacteriochlorophyll (and different types of carotenoid). This methodology therefore provides a route for the bottom-up redesign of a photosystem in vitro despite the challenges of working with large, multi-component integral membrane complexes. a Mean ± standard deviation (n = 5 for EP870, n = 6 for EFL and n = 14 for Eelectrode). b These low apparent energy transfer efficiencies may have arisen from some reabsorption of LHC fluorescence by RCs or a small degree of aggregation. Data for additional control mixtures can be found in Supplementary Table 4. c The variance between EP870 and EFL in these two cases is attributed to the latter largely reflecting a decrease in LHCI quantum yield on adding WT RCs rather than being due to ET (see text). d RCs conjugated to each of Lhca1 and Lhca4 in the LHCI heterodimer.
The mechanism of solar energy conversion operating in the chimeras, based on the well-understood photophysical properties of the component proteins, is summarised in Fig. 5. Energy captured by the pigment systems of LHCII or LHCI will be passed to the RC in a downhill manner, exciting the primary electron donor bacteriochlorophylls (P870*) and initiating charge separation to form P870 + Q B − . Energy harvested by the chlorophyll b (or carotenoid-not shown) pigments of either LHC is passed to the lower energy chlorophyll a. Inter-protein energy transfer is likely to involve a sub-set of red-shifted chlorophyll a in either LHC, and entry of energy into the RC is likely to occur principally via the bacteriopheophytin cofactors (H A/B ) as their absorbance has the greatest spectral overlap with LHC emission (Fig. 1b, c).
As evident from comparing Fig. 1c with Fig. 1b, LHCI exhibits a red-enhanced fluorescence that produces an~80% stronger spectral overlap with RC absorbance (factor J in Supplementary  Table 1) compared to LHCII. Despite this, the efficiency of ET in the LHCI#RC chimera was not significantly higher than that in either the LHCII#RC or RC#LHCII chimera. This is likely due to the reconstituted LHCI heterodimers being in a partially quenched state 48,51 that reportedly reduces their quantum yield to only 29% of that of LHCII 36 , so counteracting the potential benefits of an enhanced spectral overlap. In agreement with this our estimates of quantum yield were 30% for LHCI-Td and 28% for Td-LHCI-Td (Supplementary Table 1). In future work it might be possible to partially overcome this through SpyTag modification of LHCI in a native organism, as the quantum yield of purified native LHCI has been reported to be~64% that of LHCII, more than double that of recombinant LHCI 36 .
Estimates of ET efficiency in RC#LHCI#RC chimeras in solution were consistently higher than those for the LHCI#RC chimera (parameters E P870 and E FL in Table 1), consistent with the presence of two ET acceptors in the former. Estimates of the ET efficiency to the second RC added to Lhca1 in RC#LHCI#RC, made using Eq. (5), yielded values that were either 50% or 69% of that for transfer to the first RC attached to Lhca4. This is consistent with the presence of a relatively low-energy red-form chlorophyll a dimer in the Lhca4 subunit ( Supplementary Fig. 1d) that is responsible for the red-enhancement of the LHCI emission spectrum 36,38,47,48 , and which may have produced more efficient ET to the RC attached to Lhca4 than that attached to Lhca1.
To conclude, this work shows that genetically adapting two diverse photosynthetic membrane proteins with the components of an extramembrane interface domain enables in vitro selfassembly of a chimeric photosystem in which UV/near-IR solar energy conversion by a bacteriochlorophyll-based RC is augmented by visible light capture by chlorophyll-based LHCs. This approach inspired by a concept of synthetic biology, to adapt naturally incompatible biological modules to interface in a standardised way through genetic encoding, creates covalently stabilised macromolecular photosystems that are predictable and programmable. In addition to providing photosynthetic structures and energy transfer pathways to explore, these polychromatic photosystems constitute interesting materials for biohybrid devices that in recent years have expanded in application beyond photoelectrochemical solar energy conversion to fuel molecule synthesis, energy storage, biosensing, touch sensing and photodetection. Finally, the demonstrated flexibility with which RCs and LHCs could be interfaced opens the possibility of constructing more elaborate, self-assembling chimeric photosystems that employ multiple orthogonal linking modules 52,53 and a wider range of photosynthetic and redox proteins that, despite being separated by billions of years of evolution, can be adapted for future solar energy conversion through genetic programming.

Methods
RC expression and purification. Details of the designs of adapted RCs are described in Supplementary Note 1. All adapted RCs were expressed in a strain of Rba. sphaeroides engineered to lack light-harvesting complexes 54,55 . Bacterial cells grown under dark/semiaerobic conditions and harvested by centrifugation were suspended in 20 mM tris(hydroxymethyl)aminomethane (Tris, pH 8.0), supplemented with protease inhibitor and DNAase, and lysed in a cell disruptor (Constant Systems) at 20,000 psi. Cell debris was removed by centrifugation at 18,000 rpm for 15 min at 4°C. The supernatant was then incubated in the dark at 4°C for 1 h with 1.5% (v/v) lauryldimethylamine N-oxide (LDAO) and 200 mM NaCl. After ultracentrifugation at 38,000 rpm for 30 min at 4°C the supernatant was collected and initial purification carried out using a Ni-NTA (nitrilotriacetic acid) column (GE Healthcare). The protein eluate was concentrated and then further purified using a Superdex 200 gel filtration column (GE Healthcare) in 20 mM Tris (pH 8) containing 0.04% N-dodecyl-β-D-maltopyranoside (Tris/DDM buffer). Eluted fractions with a low 280 to 804 nm absorbance ratio (<1.5) were pooled, concentrated and stored at −80°C.
LHCII expression, refolding and purification. Details of the designs of the adapted LHCII proteins are described in Supplementary Note 1. The starting point for production of the designed LHCII holoproteins was a pET-28a expression vector containing a modified gene encoding the Lhcb1.3 protein from A. thaliana (UniProtKB entry P04778). Modification of this gene was carried out by Gibson assembly using oligonucleotides sourced from Eurofins or using the Q5 ® Site-Directed Mutagenesis Kit from NEB. Designed apoproteins were expressed in E. coli Rosetta™ 2 (Novagen). Protein expression was induced using 1 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) after the cell density reached OD 600 = 0.8 and cells were then incubated at 37°C and 250 r.p.m. agitation for 4-8 h. Harvested cells were resuspended in Lysis Buffer (50 mM Tris (pH 8.0)/2.5% (w/v) sucrose/1 mM ethylenediaminetetraacetic acid (EDTA)) with protease inhibitor and DNase 43 . Resuspended cells were lysed in a cell disruptor (Constant Systems) at 25,000 psi. Separation of inclusion bodies (IBs) from other cell fractions was accomplished by centrifugation of the cell lysate at 12,000g for 15 min at 4°C. IBs from the pellet were washed once with Detergent Buffer (20 mM Tris (pH 7.8)/200 mM NaCl/2 mM EDTA/1% sodium deoxycholate/1% octyl β-D-glucopyranoside (OG)/10 mM dithiothreitol (DTT)), three times with Triton Buffer (20 mM Tris (pH 8.0)/1 mM DTT/0.5% (w/v) Triton X-100) and finally with TE Buffer (20 mM Tris (pH 8.0)/1 mM EDTA). Washed IBs were stored at −80°C as pellets and the protein purity was checked by SDS-PAGE using quick Coomassie staining (Generon). The concentration of total protein was measured by a DC™ protein assay (Bio-Rad).
Total chlorophylls and carotenoids were extracted from fresh spinach leaves 43 . Leaves were homogenised in cold grinding buffer (0.4 M sorbitol/10 mM NaCl/5 mM MgCl 2 /0.5% (w/v) milk powder/0.1 M Tricine (pH 7.8)) using a Waring immersion blender (20 g leaves per 100 mL buffer). Chloroplasts were separated by filtration through a nylon cloth and centrifugation of the filtrate at 1500g for 10 min at 4°C. The chloroplast pellet was washed by twice resuspending in cold wash buffer (50 mM sorbitol/10 mM EDTA/5 mM Tricine (pH 7.8)) and centrifuging at 10,000g for 10 min at 4°C. Having removed all supernatant, total pigment was extracted in the dark by adding 80% acetone buffered with sodium carbonate to the chloroplasts and incubating on ice. Unextracted material was removed by centrifugation at 12,000g for 15 min at 4°C. The supernatant was well mixed with 0.4 volumes of diethyl ether in a separating funnel. After layer separation was obtained by mixing in 0.33 M NaCl, the upper, pigmented ether phase was decanted, dried by the addition of anhydrous sodium sulfate and solvent was evaporated under vacuum.
Carotenoids were extracted from washed chloroplasts 43 . The chloroplast pellet was resuspended in 96% ethanol buffered with sodium carbonate and incubated on ice as for the total pigment extraction except that 80% KOH (w/v) was added and the mixture incubated at 4°C overnight. After saponification the solution was placed in a separating funnel and mixed gently with an equal volume of diethyl ether and then with 0.8 volumes of 0.33 M NaCl. After distinct layers had formed the lower green phase was removed. Three volumes of water were then added with gentle mixing to clean the upper orange layer. After two layers had formed the top phase was isolated, dried with anhydrous sodium sulfate and the solvent was evaporated under vacuum.
Extracted pigments were re-dissolved in 100% acetone and their concentration was determined by absorbance spectroscopy in 80% buffered acetone (20 mM Tris, pH 8.0). Concentrations of Chl a, Chl b and total carotenoid were calculated based on a published equation 56 .
For refolding 43 , apoprotein in IBs was solubilised by resuspending in TE buffer (50 mM Tris (pH 8)/1 mM EDTA) and briefly heating to 95°C after mixing 1:1 in 2× Reconstitution Buffer (4% (w/v) lithium dodecyl sulfate/20 mM DTT/2 mM benzamidine/10 mM aminocaproic acid/5% (w/v) sucrose/200 mM 4-(2hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES, pH 8). Purified chlorophyll a/b and carotenoid pigments were pre-dissolved in ethanol and added to the apoprotein with gentle vortexing. Light was avoided after this step to prevent photodamage. The volume of ethanol was kept at no more than 4% of total reaction volume and a mass ratio of apoprotein/total pigment/carotenoid (w/w/w) equal to 40/25/4 was used. OG was then added to 2% (w/v) and the mixture incubated on ice for 10 min. KCl was then added to 200 mM for 20 min on ice to precipitate LDS. The solution containing reconstituted LHCIIs was recovered after centrifugation for 10 min at 16,000g and 4°C.
Each refolded LHCII was purified by nickel affinity chromatography 43 and then by gel filtration chromatography. Fractions with the lowest A 470 to A 674 ratio and invariant emission profiles in response to 440, 475 and 500 nm excitation were kept and pooled. Purified proteins were stored at −80°C before use as concentrated solutions in Tris/DDM. An extinction coefficient at the chlorophyll a Q y band of 546,000 M −1 cm −1 was used to estimate LHCII concentration 25 . The refolded LHCII complexes had absorbance spectra that were similar to one another ( Supplementary Fig. 5b) and to spectra previously published by others [43][44][45][46] . Their emission spectra were highly similar ( Supplementary Fig. 5c), and the line shapes of these spectra were invariant with excitation wavelength (Supplementary Fig. 5d), a feature diagnostic of a structurally intact LHCII. Pigment compositions were similar to those typically reported for recombinant LHCII (Supplementary Fig. 5e) 46 .
LHCI expression, refolding and purification. Details of the designs of the adapted LHCI proteins are described in Supplementary Note 1. Expression plasmids were pET-28a containing synthetic genes sourced from Eurofins. Following apoprotein expression in E. coli, LHCI heterodimers were assembled by refolding with purified pigments as described above for LHCII 34,36,38,47,48 . A 20% excess (by mass) of either L1 or Td-L1 was mixed with Td-L4 to reduce the level of free Td-L4 monomer after refolding. The apoprotein:total pigment ratio was kept the same as for LHCII refolding. Nickel affinity chromatography was used to separate the Histagged LHCI dimer from residual Lhca1 monomer (which was not His-tagged). Each LHCI was then further purified by gel filtration chromatography and stored at −80°C before use as a concentrated solution in Tris/DDM. An extinction coefficient for the chlorophyll a Q y band equal to 1,092,000 M −1 cm −1 was used to evaluate LHCI concentration since its chlorophyll a content is approximately twice that of a refolded LHCII monomer 34 .
Chimera formation and verification. The standard approach to chimera formation was to mix RCs with a twofold molar excess of LHCII, or a threefold molar excess of LHCI, and then separate the chimera from unreacted components by gel filtration chromatography, using absorbance spectroscopy and each constituent molar extinction coefficient to assess the RC:LHC molar ratio in each column fraction (see Supplementary Fig. 8).
Formation of chimeras was initially verified by sucrose density gradient ultracentrifugation (Fig. 3d, f). Linear sucrose gradients were prepared by freezing and thawing 10 mL of 21% (w/v) sucrose in 20 mM Tris/0.04% DDM (pH 8.0). Each gradient was loaded with 400 μL of sample with each photoprotein at a concentration of 2.5 μM and then capped with 1 mL of 20 mM Tris/0.04% DDM (pH 8.0). Gradients were ultracentrifuged in a Sorvall TH-641 swing-out rotor at 38,000 rpm for 18 h at 4°C.
For native blue PAGE, precast NativePAGE 4-20% gels (Thermo) were run in a Bis-Tris buffer system. Coomassie blue dye at 0.02% (w/v) was used in the cathode buffer but not in the loading buffer. The gel cassette was placed in an ice bath and run at 150 V for 1 h followed by 250 V for 2 h 57 . SDS-PAGE was carried out using precast 4-20% gradient gels (Bio-Rad). A standard loading of 20 pmol RC was used. Loaded gels were run at 200 V for 45 min and stained overnight at room temperature with Quick Coomassie Stain (Generon).
Western blotting was carried out following protein transfer onto a nitrocellulose membrane (GE Healthcare) on a TE 77 PWR Semi-Dry Transfer Unit (45 mA/gel and 30 min with a NOVA Blot kit) in 30 min. The membrane was blocked overnight with 5% milk PBS-Tween (PBS/T) buffer and then incubated with horse radish peroxidase (HRP)-conjugated antibodies in the same buffer for 1 h. The membrane was developed using 1× LumniGLO(R) (CST®) after rinsing the membrane three times with PBS/T buffer. Finally, the result was recorded on an ODYSSEY imaging system (LI-COR Biosciences). Re-probing of the membrane was accomplished by stripping and a repeat process of incubation in 5% milk PBS/ T buffer. Stripping of membrane was achieved by incubating twice in mild stripping buffer (200 mM glycine, 0.1% SDS (w/v), 1% Tween 20 (v/v), pH 2.2) for 5 min and twice in TBST buffer (50 mM Tris, 150 mM NaCl, 0.1% Tween 20 (v/v), pH 7.6), before finally transferring in PBS/T. For polyhistidine tag detection, a 1000-fold dilution of HRP-conjugated anti-His antibody was used (A7058, Sigma) and a 2000-fold dilution of HRP-conjugated c-Myc antibody was used for Myc tag detection (GT0002, Insight Biotech).
Spectroscopy. Absorbance spectra were recorded on a Varian Cary60 spectrophotometer and emission spectra on a Varian Cary Eclipse Fluorimeter in nitrogen-gassed, freshly prepared Tris/DDM. Photooxidation of the RC P870 primary electron donor was measured using an optical fibre attachment for the Cary60 and a four-way cuvette holder (Ocean Optics, Inc.). For excitation, light from an HL-2000 light source (Ocean Optics, Inc.) was passed through an optical fibre and a 25-nm band-pass filter centred at 650 nm (Edmund Optics Ltd). Incident light intensity was approximately 0.1 mW cm −2 , which excited~15% of the RC population. Light-on/off was controlled using the electronic shutter on the light source triggered by a TGP110 pulse generator (Aim-TTi Ltd, UK). After incubation with a 5-fold excess of ubiquinone-0 (UQ 0 ) in the dark for 10 min, samples at an RC concentration of 0.5 μM (0.25 μM with LHCI-Td) were housed in a 3 mm path length, four-sided microcuvette (110-15-QS, Hellma ® Analytics). Each measurement was repeated five times and the traces were fitted to a model assuming a simple interconversion between the ground and photo-oxidised state: Parameters k f and k r from these fits are shown in Supplementary LHC quantum yield estimation. The quantum yield of each LHC was determined by comparison to the dye DCM (4-(dicyanomethylene)-2-methyl-6-(4-dimethylaminostyryl)-4H-pyran; Sigma) dissolved in methanol. To avoid self-shading the absorbance of each LHC and DCM was set around 0.07 across the relevant spectral region (Supplementary Fig. 15a). Emission from DCM and each LHC (average of 10 measurements; Supplementary Fig. 15b) was corrected for spectral response and used to calculate their relative integral photon fluxes 58,59 . The value for Φ D was estimated with reference to Φ DCM = 0.435 (ref. 60 ) and the refractive indices of water (n water = 1.333) and methanol (n methanol = 1.328).
Photochronoamperometry and EQE action spectra. Nanostructured silver electrodes were prepared by subjecting thoroughly polished planar disc polycrystalline Ag electrodes (CHI Instruments) to an electrochemical procedure consisting of four oxidation/reduction cycles (+0.4 V for 30 s/−0.4 V for 30 s) in 100 mM KCl 20 . Pigment-proteins at concentrations between 20 and 100 µM were drop-casted onto prepared electrodes in the dark at 4°C for 1 h and unbound protein was removed by repeated mechanically controlled dipping in 20 mM Tris (pH 8) at 4°C. Coated electrodes were immersed in 20 mM Tris (pH 8)/50 µM KCl/20 µM horse heart cyt c/1.5 mM ubiquinone-0 (Q 0 ) in a room temperature electrochemical cell fitted with an Ag/AgCl/3 M KCl reference electrode and a platinum counter electrode. Photocurrents were measured at a bias potential of −50 mV vs Ag/AgCl, controlled by a PGSTAT128N potentiostat (Metrohm Autolab). Illumination was supplied by 870 or 650 nm LED (Roithner Lasertechnik) with irradiances of 32 or 6.7 mW cm −2 , respectively, at the electrode surface with about 50 nm FWHM (full-width at half-maximum) for both. EQE action spectra were recorded using a tungsten-halogen source passed through a monochromator ( Supplementary Fig. 16a) 20 . All control samples had equimolar LHC and RC except a WT RC/Td-LHCI-Td mix where the molar ratio of RC to LHC was two.
Transmission electron microscopy. Negative stain TEM was carried out on an equimolar mixture of 500 nM WT RCs and dLHCII, 500 nM LHCII#RC heterodimers, 100 nM LHCI#RC or 100 nM RC#LHCI#RC. The protein samples were kept on ice before adsorbing to a glowing discharge-treated carbon-coated copper grid. After incubating with the grid for 1 min, excess liquid was removed with a clean filter paper. Samples were stained with 3% uranyl acid (UA) and dried completely before imaged with a FEI Tecnai 12 120 kV BioTwin Spirit TEM system 27 .
Estimation of energy transfer efficiency. Apparent efficiencies of ET were calculated from LHC emission spectra (E FL ) using where FL chimera was the intensity of LHC emission in a chimera and FL WTRC+LHC was that in a concentration-matched mixture of the appropriate LHCII or LHCI variant and the WT RC. A similar approach was used for estimating the apparent ET efficiency in mixtures of WT RCs and LHCII or LHCI, expressing FL WTRC+LHC as a function of the emission from the same concentration of the LHC (FL LHC ). For LHCII, where the line shape of the emission spectrum did not vary as it is a single quantum system 61 , maximum emission values were used in Eq. (2) as a simple measure of emission intensity. For LHCI, which has multiple distinct emission states 38 , values of emission intensity (FL int ) were produced by integration across the emission spectrum using Eq. (3), and then applied in Eq. (2).
Apparent efficiencies of ET were also calculated from the rate of P870 photobleaching (k f ) from the kinetic analyses summarised in Supplementary Table 2. To enable this the intensity of the 650 nm excitation light used in these experiments was kept low such that no more than~15% of P870 was oxidised within the lifetime of P870 + (~1 s), ensuring that photooxidation directly represented the quantity of energy received by either direct absorption by the RC or ET from the tethered LHC. The apparent efficiency of ET (E P870 ) was estimated from the rate of P870 photobleaching using where k f was the rate of P870 oxidation in a chimera or the equivalent RC-only control (RC) (Supplementary Table 3). Integration of incident photon flux (P) and the 1-transmittance of RCs or LHCs as a function of wavelength provided the number of photons absorbed by either RCs or LHCIIs per unit area per second (Supplementary Table 3). Equation (4) was also used for estimation of E electrode , with parameter k f replaced by the maximum EQE around 650 nm (i.e. the same illumination region as used for measurements of P870 oxidation). The efficiency was determined by comparing the LHC's contribution to the EQE with the sample absorbance at the corresponding wavelength ( Supplementary Fig. 4). In addition to enabling direct comparison of E electrode and E P870 , the data at 650 nm were not affected by parasitic absorption or unwanted emission from cyt c, UQ 0 or nanostructured silver used in photocurrent measurements 20,40 .
For RC#LHCI#RC chimeras there were two acceptors per LHCI, one connected to the Lhca1 subunit and one to the Lhca4 subunit. Efficiencies of energy transfer to the Lhca1-connected RC (E a1 ) were estimated from: where E FL,P870 was the apparent energy transfer efficiency for the RC#LHCI#RC chimera estimated from either LHC fluorescence or P870 photobleaching and E a4 was the corresponding apparent energy transfer efficiency for the LHCI#RC chimera where the single RC is attached to Lhca4. From E FL the value of E a1 was 10.4% (compared to E a4 = 20.7%) and from E P870 the value of E a1 was 13.9% (compared to E a4 = 20.1%). Deduction of Eq. (5) can be found in Supplementary Note 5.
Estimation of solar radiance coverage enhancement. The effect of the LHCs on the performance of a bio-photoelectrode in response to visible light was estimated using where S(λ) was the air mass 1.5 standard solar power reference spectrum as a function of wavelength (photons m −2 nm −1 ), EQE T was the EQE spectrum of each LHC + RC system and EQE RC was that of the RC-only component ( Supplementary  Fig. 4, green versus magenta shade). Integration provided an estimate of the improvement in the use of solar energy (U LHCII ) between 400 and 700 nm where the chlorophyll-based LHCs absorb. Values are compiled in Table 1.
Protein structures and chimera modelling. Protein structures used in modelling were Protein Data Bank entries 3ZUW for the Rba. sphaeroides RC 62 , 2BHW for the LHCII from pea 31 , 4KX8 for the LHCI from pea 37 and 4MLI for SpyCatcher/ Tag 42 . Schematic models of chimeras were produced using Modeller 63 .
Reporting summary. Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability
The data underlying Figs. 1a-c, 2a-c, e, 3d-g and 4a-f, i as well as Supplementary Figs. 2, 4, 6-11, 18-20 are provided as a Source Data file. All data and biological materials reported in this study can be provided by the corresponding author upon reasonable request.

Code availability
Codes for data processing, analysis, plotting and simulation were implemented within MATLAB (release 2018b; MathWorks) and can be provided by the corresponding author on reasonable request.