Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes


The zooplankter Calanus finmarchicus is a member of the so-called “Calanus Complex”, a group of copepods that constitutes a key element of the Arctic polar marine ecosystem, providing a crucial link between primary production and higher trophic levels. Climate change induces the shift of C. finmarchicus to higher latitudes with currently unknown impacts on its endogenous timing. Here we generated a daily transcriptome of C. finmarchicus at two high Arctic stations, during the more extreme time of Midnight Sun, the summer solstice. While the southern station (74.5 °N) was sea ice-free, the northern one (82.5 °N) was sea ice-covered. The mRNAs of the 42 samples have been sequenced with an average of 126 ± 5 million reads (mean ± SE) per sample, and aligned to the reference transcriptome. We detail the quality assessment of the datasets and the complete annotation procedure, providing the possibility to investigate daily gene expression of this ecologically important species at high Arctic latitudes, and to compare gene expression according to latitude and sea ice-coverage.

Measurement(s) transcriptome • sequence feature annotation
Technology Type(s) RNA sequencing • sequence annotation
Factor Type(s) location • time
Sample Characteristic - Organism Calanus finmarchicus
Sample Characteristic - Environment polar
Sample Characteristic - Location Arctic Ocean

Machine-accessible metadata file describing the reported data:

Background & Summary

The copepod Calanus finmarchicus (Crustacea, Copepoda) is a key zooplankton species in the northern Atlantic food web as it converts sugars from algae into energy rich lipids that sustain higher consumers including marine fish larvae and seabirds1,2,3. Its high abundance and biomass also makes it an important contributor to ocean carbon flux4. The species inhabits a large latitudinal range from ~40° up to 80° N5. However, recent findings show that C. finmarchicus is undergoing temperature driven geographical shifts northwards because of climate change6,7,8, the effects of which are at their most extreme in the Northern Atlantic and Barents Sea. Therefore, the copepods will experience a change between the photoperiods they are adapted to at lower latitudes and the extreme high-latitude photoperiods. Photoperiodic variation is particularly pronounced in the Arctic with rapid change over short latitudinal ranges. The impact of such extreme photoperiods on non-endemic species is unknown, and the northward expansion of organisms at high latitudes may be limited by the adaptive capacity of their endogenous timing systems to extreme photoperiods8,9.

Endogenous timing systems, or biological clocks, are ubiquitous ancient and highly adaptive mechanisms enabling organisms to track and anticipate environmental cycles and prepare biological processes accordingly10,11. Since the identification of circadian clock genes in C. finmarchicus12, studies have shown that this species possesses a functional clock that might be involved in the timing of both diel and seasonal events, such as the ecologically and biogeochemically important diel vertical migration (DVM)13 or diapause14. However, the Arctic environment is characterized by dramatic seasonality resulting in permanent illumination during Midnight Sun and permanent darkness during Polar Night15. As the circadian clock is entrained and synchronized by daily light/dark cycles, the persistence of daily biological processes in Arctic organisms during the absence of those remains uncertain16,17, as well as the consequences for newcomer species due to global warming8,9. Moreover, the Arctic is characterized by strong fluctuations in sea ice-cover, reflecting on biotic and abiotic factors, such as species communities and interactions or light penetration18,19.

Copepods are among the important non-model invertebrates for which genomic resources are still limited, one barrier being that many species, including C. finmarchicus, have large genomes20,21. The de novo transcriptome of C. finmarchicus22 represents a useful resource for assessing the impact of global warming in this species of high ecological interest. In addition to differential gene expression analyzes, RNA sequencing has increased the ability to study the expression of rhythmically expressed mRNAs23,24,25. Indeed, at the molecular level, the endogenous clock machinery drives the rhythmic expression of downstream genes whose rhythmic translation and function ultimately underlie daily oscillations at cellular and organismal levels25. Note that in the field, environmental cycles also directly generate rhythms independently from the clock. Thus, temporal transcriptomic studies allow a major breakthrough in the understanding of daily dynamics of biological processes in the field.

In this study, we performed RNA sequencing on temporally collected in situ samples to generate a daily transcriptome of C. finmarchicus in the high Arctic during summer solstice period when the sun remains high above the horizon with minimal altitude variation. Sampling of C. finmarchicus stage V copepodites was performed at 4 h intervals within a 24 h cycle at two ocean stations along a latitudinal gradient. The northern station (82.5 °N, Nansen Basin) was characterized by sea ice-coverage, while the southern one (74.5 °N, Barents Sea) was sea ice-free. In addition to providing the raw data, we describe its quality assessment and the alignment to the reference transcriptome to verify reliability and determine transcript quantification. Finally a complete annotation is performed and two normalized datasets are provided for further transcriptomic data exploration of this species.


Sampling design

The sampling strategy was specifically designed for the detection of rhythmic transcripts25,26 although it does not exclude classic differential expression analysis27. Sampling design and analysis strategy are presented in Fig. 1, Table 1 and Supplementary Table 1. Sampling covered a complete 24 h cycle at 4 h intervals, resulting in seven time points per station. At each station, samplings were performed at similar time intervals of: 14–15 h, 18–19 h, 22–23 h, 2–3 h, 6–7 h, and 10–11 h (all times noted in local time (UTC + 2)). Sampling at “North” station, JR85, started on 18th June (3 days before the summer solstice) at 10–11 h and ended on 19th June at 10–11 h. Sampling at “South” station, B13, started on 30th June (9 days after the summer solstice) at 14–15 h and ended on 1st July at 14–15 h. At each timepoint the water column was sampled from 200 m to the surface with vertical hauls of a WP2 plankton net (opening : 57 cm, net length: 236 cm, mesh size: 200 µm) with a meshed bucket cod end (mesh size: 200 µm) at a speed of 0.5 m*s−1. Transferring the animals from the net into the stabilization solution was done within less than 12 minutes for all samplings. A 12 h period of incubation at 2–4 °C was allowed to soak the samples thoroughly with the RNAlater stabilization solution (Ambion, UK) before they were transferred to −80 °C for further transport and storage.

Fig. 1

Overview of the experimental workflow used to generate the transcriptomic data output (all times noted in local time (UTC + 2)). Sample details are available in Supplementary Table 1.

Table 1 Summary of sampling and sequencing strategy. Details are available in Supplementary Table 1.

Sites description

Sampling has been conducted during Cruise JR17006 of the RRS James Clark Ross in summer 2018 at two stations along a latitudinal gradient. The station “North” was sea ice-covered and located in the Nansen Basin (JR85; 82.56°N, 30.85°E). The station “South” was sea ice-free and located in the southern Barents Sea (B13; 74.5°N, 30°E). Water depth at “North” was 3700 m and at “South” was 360 m. The sun’s altitude was always above the horizon but still showed diel oscillations of altitude above the horizon from 16 ° at midnight to 30.9 ° at midday at “North”, and from 7.7 ° at midnight to 38.6 ° at midday at “South” at the times of sampling (all times noted in local time (UTC + 2)). Sites were exposed to semidiurnal tide regimes, i.e., 2 tides per day, with a maximum amplitude of ± 0.47 m at JR85 and ± 0.36 m at B13 at times of sampling. Maps with the location of the sea ice edge at the time of sampling at “North” are available from the meereisportal28 ( Modeled data of sun altitude were obtained from the United States Naval Observatory (, USNO, USA). Information on the tidal dynamics have been drawn from the TPX08 model29 by using the OTPS package (Tidal Prediction Software,, via the mbotps program30 (MB-System). Solar altitude, tidal height and sea-ice cover during the sampling campaign at both latitudes are detailed in Supplementary Table 2. Temperature, pressure (depth), conductivity (salinity), oxygen saturation (SBE 43, Sea-Bird Electronics) and Chlorophyll a (Chl a) fluorescence (Aquatracka III fluorometer, Chelsea Technologies Group, UK) were measured from the surface to 200 m depth and are available in Hueppe et al.31.

Copepod sorting

Copepods were sorted at 2 °C under a stereo microscope for species (C. finmarchicus) and stage (CV). To distinguish C. finmarchicus from its closely related congener C. glacialis, morphological indicators were used, in particular the redness of the antenna, which has been shown to be a good indicator in the regions of sampling32; see also the molecular validation of morphological identification, below. For each timepoint and station, 3 replicates of 15 C. finmachicus CV were sorted. The choice to pool 15 individuals was made to (1) get the sufficient amount of RNA required for RNA sequencing and quantitative real-time PCR analyses and (2) increase the number of individuals (315 copepods per station in total) thereby decreasing the effect of individual variability.

RNA extraction

Each replicate was distributed to a 2 ml Precellys® homogenization tube (Bertin Instruments, France), containing a mix of 1.4 mm and 2.8 mm ceramic beads and homogenized in 600 µl of TRIzol® reagent (ThermoFisher Scientific, USA) with a Precellys® 24 Tissue Homogenizer (Bertin Instruments, France), using two times 15 sec. of homogenization at 5000 rpm with a 10 sec. break between. For RNA extraction, a Phenol/Chloroform based single-step extraction in combination with a spin column based solid phase extraction (Direct-zol™ RNA MiniPrep Kit, Zymo Research, USA) was used. Genomic DNA was removed by DNase I digestion on column as part of the RNA extraction kit and total RNA was eluted in ultra-pure water. A portion of the RNA of each of the samples was used to investigate relative expression of 8 candidate genes with SYBRGreen based quantitative real-time PCR (qPCR) on candidate genes, using the 2−∆Ct method32 and the geometric mean of elongation factor 1α and 16 s rRNA as reference, as described by Hueppe et al.31. Another portion of each samples was send to GeT-PlaGe core facility in dried-ice for RNA sequencing.

RNA sequencing

RNA sequencing was performed at the GeT-PlaGe core facility, INRAE Toulouse. The 42 RNA sequencing libraries were prepared according to Illumina’s protocols using the Illumina TruSeq Stranded mRNA sample prep kit to analyse mRNA. Briefly, mRNA were selected using poly-T beads. Then, RNA were fragmented to generate double stranded cDNA and adaptors were ligated to be sequenced. 11 cycles of PCR were applied to amplify libraries. Library quality was assessed using a Fragment Analyser (Advanced Analytical Technologies, Inc., Iowa, USA) and libraries were quantified by qPCR using the Kapa Library Quantification Kit (Roche). RNA sequencing experiments have been performed on a NovaSeq S4 lane (Illumina, California, USA) using a paired-end read length of 2 × 150 pb with the Illumina NovaSeq Reagent Kits.

Reads alignment and quantification

42 RNA sequencing libraries were obtained (Fig. 1, Table 1, and Supplementary Table 1). The number of paired reads per library was between 74 million and 276 million with an average of 126 ± 5 million (mean ± SE) reads. The RNA sequencing libraries reads quality were evaluated using FastQC33. Contamination was checked by aligning reads against E. coli, Yeast and PhiX genomes.

The Calanus finmarchicus de novo transcriptome22, based on different life stages and deposited to Bioproject PRJNA236528, was used as the reference transcriptome. It is composed of 206,012 contigs and presents good results of quality assessment, with a nearly complete BUSCO set22,34. Reads were aligned to the de novo transcriptome with BWA-MEM ( Quantification was performed with SAMtools35 idxStats to generate the quantification matrix. The matrix was filtered with edgeR36 and only contigs with more than 1 CPM (Count Per Million) in at least one sample were kept, providing a matrix of 76,550 contigs. Information on the datasets resulting from this study is available in Table 2.

Table 2 List of available datasets related to the study (NCBI Bioproject PRJNA62888640 and figshare collection 512770441).


We provided different annotations for all further analysis. Contigs were aligned with DIAMOND37 on NR (2019-09-29), Swissprot and Trembl (2018-12) to retrieve corresponding best annotations. An annotation matrix was then generated by selecting the best hit for each database if: i) the percent of the query length covered by the alignment was higher than 60%; ii) the percent of the subject length covered by the alignment was higher than 40%; iii) the percent of identity of the alignment was higher than 40%. Contigs were also processed with InterProScan38 to scan InterProScan signatures. A GO was assigned to each contig with an InterProScan hit containing a GO annotation. Information on the datasets resulting from this study is available in Table 2. Note that a previous annotation of Calanus finmarchicus reference transcriptome22 against Non-redundant (NR) protein database is also available at


Two normalizations are proposed (down-sampling normalization and RLE normalization) but the choice of normalization depends on the analysis required downstream. For a rhythmic analysis, we suggest down-sampling the mapped reads to the lowest number among the 42 samples (down-sampling normalization), i.e. to 70.4 million properly mapped reads per sample for all samples (after filtering), in order to adjust for differences in sequencing depth among samples23,25,39. This was performed with StreamSampler.jar ( EdgeR36 was used to perform RLE normalization, since it is more appropriate for differential expression analysis. Information on the datasets resulting from this study is available in Table 2.

Data Records

Raw reads were gathered in the NCBI BioProject PRJNA62888640 which includes all BioSamples used for the study (Table 2, Supplementary table 1). We also provide the following in figshare collection 512770441 (Table 2): the quantification matrix for the 206,012 contigs; the list of identifiers corresponding to the 76,550 contigs after filtering; the two suggested normalization matrices (down-sampling and RLE) and; the datasets annotations (DIAMOND annotation matrix, InterProScan annotation, GO association).

Technical Validation

Molecular validation of morphological identification

Since C. finmarchicus’ Arctic congener C. glacialis also occurs in the region of sampling and differences between the species can be very subtle42, morphological identification was validated by molecular species identification on a subset of samples from the same stations21,43. DNA was extracted from individual copepods using the HotShot method44, and the species-specific nuclear insertion/deletion (InDel) marker G-150 was amplified using a modified protocol from Smolina et al.45. Identification was done by accessing the size of the resulting amplicon via electrophoresis on a 2% agarose gel. Results have shown that 99% of the individuals identified as C. finmarchicus by the morphological identification method were also clearly identified as C. finmarchicus by the molecular identification method, while 0.1% were not clearly identified and 0.7% were identified as the Arctic congener Calanus glacialis (n = 305 individuals).

Extraction and RNA integrity

RNA extraction procedures were performed with randomization of samples to ensure reliable and unbiased data production. RNA purity was assessed by OD measurements with a NanoDrop 8000 spectrophotometer (ThermoFisher Scientific, USA), and all 260/280 and 260/230 OD ratio was superior to 1.9. RNA integrity was evaluated with a Fragment Analyzer (Advanced Analytical Technologies, Inc., Iowa, USA; RNA Kit (15nt) Standard Sensitivity, Agilent). Due to a non-conventional 28 S/18 S ribosomal ratio in this species, sample quality was evaluated on the electropherogram46. No degradation in the inter region was observed. Total RNA samples were stored at −80 °C.

Raw reads assessment and quantification overview

All samples passed the FastQC33 “base quality control”. No relevant contamination hit was found after the alignment against E. coli, Yeast and PhiX. The mapping rate against the reference transcriptome22 of 206,012 contigs was higher than 72.4% for properly paired reads and higher than 93.6% considering both paired and single mate reads, validating the raw reads quality (Fig. 2, Supplementary Table 3). Furthermore, over the 42 samples, the maximal percentage of multi-mapped alignment is of 3.31% (Fig. 2, Supplementary Table 3).

Fig. 2

Mapping statistics of the 42 samples against the set of 206,012 contigs. Number of million (M) reads: paired, single mate, and unmapped. Details are available in Supplementary Table 3.

For an overview of the quantification matrix, a principal component analysis (PCA) was generated on the raw pseudo-count (log2 (count + 1)) non-normalized matrix (Fig. 3). Results showed a clear separation between samples from “North” and “South” stations, indicating environmental variations that might be due to latitude and/or sea ice-coverage.

Fig. 3

Principal component analysis of the 42 samples based on expression levels of the whole transcriptome (206,012 contigs). “Groups” grouping the 3 replicates per time and per station.


Of the 206,012 transcripts, 37% (76,550) were expressed above the threshold of 1 CPM. This result corroborates previously observed results on the C. finmarchicus transcriptome22. Thus a large proportion of the whole contigs (63%) exhibited an extremely low level of expression, representing only 1.32 ± 0.04% of total aligned reads at “North”, and 1.27 ± 0.05% at “South” (Table 3, Supplementary Table 4).

Table 3 Average number (million, mean ± SE) of alignments per station and percentage of alignments discarded by filtering contigs with very low expression.

Contigs annotation

By selecting the best hit for each database, the annotation matrix generated with Diamond37 has led to 36,274 and 22,527 contigs with an annotation in at least one database out of the 206,012 and 76,550 contigs respectively (Table 4). Moreover, the number of unique hits for each database is always lower than the number of contigs annotated by the respective database, highlighting the contigs’ functional redundancies.

Table 4 Number of contigs annotated with DIAMOND against NR, TREMBL and Swissprot and number of unique hits in the target database.

The InterProScan annotation provided annotations from many protein signature databases. The main results are presented in Table 5 and Supplementary Table 5. A GO was attributed to 65,924 contigs over the whole transcriptome (206,012 contigs), while 33,057 contigs out of the 76,550 contigs with an expression level higher than 1 CPM in at least one sample had a GO annotation (Table 5, Supplementary Table 5).

Table 5 Number of contigs with an InterProScan annotation and details on main features.

Quantitative real-time PCR data for normalization verification

The relative expression of six core circadian clock genes (clock, cycle, period1, timeless, cryptochrome2, vrille) and 2 circadian clock-related genes (cryptochrome1 and doubletime2) was investigated by quantitative real-time PCR and are available in Supplementary Table 6, allowing the verification of RNA sequencing normalization for further investigations. Regarding the two normalizations, the down-sampling normalization was selected for a rhythmic analysis based on concordant temporal expression profiles with qPCR data (using RAIN algorithm47), while the RLE normalization has been validated for differential expression analysis of the mean level of expression between stations, using the 21 samples of each stations as replicates.

Usage Notes

We present here the first in situ daily transcriptomes from the high Arctic, where molecular investigations of biological rhythms are exceptionally limited15,16. The samplings have been realized during drastic Polar photic conditions, i.e. the summer solstice, when daily oscillations of the Sun are minimal, high in the sky and always above the horizon15. The proposed datasets are thus novel and of interest due to the unique geographical location and time of year, the ecological importance of C. finmarchicus, and the rigorous temporal sampling strategy. Another strength of this dataset is the high depth of the RNA sequencing, with an average of 126 ± 5 million of reads (mean ± SE) per sample, which optimizes the detection of rhythmic transcripts25 in a species with a large genome20,21. Finally, the elaborate annotation of the large transcriptome is now publicly available and is thus accessible for further research.

The sampling strategy is optimized for rhythmic analysis, and particularly adapted for RAIN algorithm analysis23,25,47. Moreover, dataset allows powerful differential gene expression analysis using the 21 samples per station as replicates providing time-integrated detection of differentially expressed genes in C. finmarchicus with latitude/sea ice-cover. With climate driven environmental changes, this dataset ultimately constitutes new insights into transcriptomic regulation in the northward migrating copepod C. finmarchicus.

Code availability

Parameters to software tools involved are described in the following paragraph.

FastQC: version 0,11,2, --nogroup --casava.

DIAMOND: version v0.9.22, parameters: -f 6 qseqid qlen qcovhsp pident score evalue length sseqid slen stitle.

InterProScan: version 5.29–68.0, --goterms -t n -dp -f TSV, gff3 parameters.

BWA: version 0.7.17, standard parameters, mem algorithm.

SAMtools programs (view, sort, index and idxStats, flagstat): version 1.8, standard parameters.

EdgeR: version 3.26.5.

StreamSampler.jar: version 1.0.


  1. 1.

    Prokopchuk, I. & Sentyabov, E. Diets of herring, mackerel, and blue whiting in the Norwegian Sea in relation to Calanus finmarchicus distribution and temperature conditions. ICES J. Mar. Sci. 63, 117–127 (2006).

    Article  Google Scholar 

  2. 2.

    Beaugrand, G., Brander, K. M., Alistair Lindley, J., Souissi, S. & Reid, P. C. Plankton effect on cod recruitment in the North Sea. Nature 426, 661–664 (2003).

    ADS  CAS  Article  Google Scholar 

  3. 3.

    Berge, J., Gabrielsen, T. M., Moline, M. & Renaud, P. E. Evolution of the Arctic Calanus complex: an Arctic marine avocado? J. Plankton Res. 34, 191–195 (2012).

    Article  Google Scholar 

  4. 4.

    Archibald, K. M., Siegel, D. A. & Doney, S. C. Modeling the Impact of Zooplankton Diel Vertical Migration on the Carbon Export Flux of the Biological Pump. Glob. Biogeochem. Cycles 33, 181–199 (2019).

    ADS  CAS  Article  Google Scholar 

  5. 5.

    Helaouët, P. & Beaugrand, G. Macroecology of Calanus finmarchicus and C. helgolandicus in the North Atlantic Ocean and adjacent seas. Mar. Ecol. Prog. Ser. 345, 147–165 (2007).

    ADS  Article  Google Scholar 

  6. 6.

    Murphy, E. J. et al. Understanding the structure and functioning of polar pelagic ecosystems to predict the impacts of change. Proc. R. Soc. B Biol. Sci. 283, 20161646 (2016).

    Article  Google Scholar 

  7. 7.

    Reygondeau, G. & Beaugrand, G. Future climate-driven shifts in distribution of Calanus finmarchicus. Glob. Change Biol. 17, 756–766 (2011).

    ADS  Article  Google Scholar 

  8. 8.

    Saikkonen, K. et al. Climate change-driven species’ range shifts filtered by photoperiodism. Nat. Clim. Change 2, 239–242 (2012).

    ADS  Article  Google Scholar 

  9. 9.

    Huffeldt, N. P. Photic Barriers to Poleward Range-shifts. Trends Ecol. Evol. 35, 652–655 (2020).

    Article  Google Scholar 

  10. 10.

    Emerson, K. J., Bradshaw, W. E. & Holzapfel, C. M. Concordance of the Circadian Clock with the Environment Is Necessary to Maximize Fitness in Natural Populations. Evolution 62, 979–983 (2008).

    Article  Google Scholar 

  11. 11.

    Bradshaw, W. E. & Holzapfel, C. M. What Season Is It Anyway? Circadian Tracking vs. Photoperiodic Anticipation in Insects. J. Biol. Rhythms 25, 155–165 (2010).

    Article  Google Scholar 

  12. 12.

    Christie, A. E., Fontanilla, T. M., Nesbit, K. T. & Lenz, P. H. Prediction of the protein components of a putative Calanus finmarchicus (Crustacea, Copepoda) circadian signaling system using a de novo assembled transcriptome. Comp. Biochem. Physiol. Part D Genomics Proteomics 8, 165–193 (2013).

    CAS  Article  Google Scholar 

  13. 13.

    Häfker, N. S. et al. Circadian Clock Involvement in Zooplankton Diel Vertical Migration. Curr. Biol. CB 27, 2194–2201.e3 (2017).

    Article  Google Scholar 

  14. 14.

    Häfker, N. S. et al. Calanus finmarchicus seasonal cycle and diapause in relation to gene expression, physiology, and endogenous clocks. Limnol. Oceanogr. 63, 2815–2838 (2018).

    ADS  Article  Google Scholar 

  15. 15.

    Schmal, C., Herzel, H. & Myung, J. Clocks in the Wild: Entrainment to Natural Light. Front. Physiol. 11 (2020).

  16. 16.

    Abhilash, L., Shindey, R. & Sharma, V. K. To be or not to be rhythmic? A review of studies on organisms inhabiting constant environments. Biol. Rhythm Res. 48, 677–691 (2017).

    Article  Google Scholar 

  17. 17.

    Bertolini, E. et al. Life at high latitudes does not require circadian behavioral rhythmicity under constant darkness. Curr. Biol. 29, 3928–3936.e3 (2019).

    CAS  Article  Google Scholar 

  18. 18.

    David, C., Lange, B., Rabe, B. & Flores, H. Community structure of under-ice fauna in the Eurasian central Arctic Ocean in relation to environmental properties of sea-ice habitats. Mar. Ecol. Prog. Ser. 522, 15–32 (2015).

    ADS  Article  Google Scholar 

  19. 19.

    Falk-Petersen, S. et al. Lipids and fatty acids in ice algae and phytoplankton from the Marginal Ice Zone in the Barents Sea. Polar Biol. 20, 41–47 (1998).

    Article  Google Scholar 

  20. 20.

    Bron, J. E. et al. Observing copepods through a genomic lens. Front. Zool. 8, 22 (2011).

    Article  Google Scholar 

  21. 21.

    Choquet Marvin et al. Towards population genomics in non-model species with large genomes: a case study of the marine zooplankton Calanus finmarchicus. R. Soc. Open Sci. 6, 180608 (2019).

  22. 22.

    Lenz, P. H. et al. De Novo Assembly of a Transcriptome for Calanus finmarchicus (Crustacea, Copepoda) – The Dominant Zooplankter of the North Atlantic Ocean. PLOS ONE 9, e88589 (2014).

    ADS  Article  Google Scholar 

  23. 23.

    Hughes, M. E. et al. Guidelines for Genome-Scale Analysis of Biological Rhythms. J. Biol. Rhythms 32, 380–393 (2017).

    CAS  Article  Google Scholar 

  24. 24.

    Mermet, J., Yeung, J. & Naef, F. Systems Chronobiology: Global Analysis of Gene Regulation in a 24-Hour Periodic World. Cold Spring Harb. Perspect. Biol. 9, a028720 (2017).

    Article  Google Scholar 

  25. 25.

    Li, J., Grant, G. R., Hogenesch, J. B. & Hughes, M. E. Considerations for RNA-seq analysis of circadian rhythms. Methods Enzymol. 551, 349–367 (2015).

    CAS  Article  Google Scholar 

  26. 26.

    Ness-Cohn, E., Iwanaszko, M., Kath, W., Allada, R. & Braun, R. TimeTrial: An Interactive Application for Optimizing the Design and Analysis of Transcriptomic Times-Series Data in Circadian Biology Research. J. Biol. Rhythms 35, 439–451 (2020).

    Article  Google Scholar 

  27. 27.

    Payton, L. et al. Remodeling of the cycling transcriptome of the oyster Crassostrea gigas by the harmful algae Alexandrium minutum. Sci. Rep. 7, 3480 (2017).

    ADS  Article  Google Scholar 

  28. 28.

    Grosfeld, K. et al. Online Sea-Ice Knowledge and Data Platform. (2016).

  29. 29.

    Egbert, G. D. & Erofeeva, S. Y. Efficient Inverse Modeling of Barotropic Ocean Tides. J. Atmospheric Ocean. Technol. 19, 183–204 (2002).

    ADS  Article  Google Scholar 

  30. 30.

    Caress, D. W. & Chayes, D., N. MB-System Version 5.5.2284. Open source software distributed from the MBARI and L-DEO web sites. (2016).

  31. 31.

    Hüppe, L. et al. Evidence for oscillating circadian clock genes in the copepod Calanus finmarchicus during the summer solstice in the high Arctic. Biol. Lett. 16, 20200257 (2020).

    Article  Google Scholar 

  32. 32.

    Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods 25, 402–408 (2001).

    CAS  Article  Google Scholar 

  33. 33.

    Andrews, S. FastQC: a quality control tool for high throughput sequence data. (Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom, 2010).

    Google Scholar 

  34. 34.

    Tarrant, A. M., Nilsson, B. & Hansen, B. W. Molecular physiology of copepods - from biomarkers to transcriptomes and back again. Comp. Biochem. Physiol. Part D Genomics Proteomics 30, 230–247 (2019).

    CAS  Article  Google Scholar 

  35. 35.

    Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

    Article  Google Scholar 

  36. 36.

    Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).

    CAS  Article  Google Scholar 

  37. 37.

    Buchfink, B., Xie, C. & Huson, D. H. Fast and sensitive protein alignment using DIAMOND. Nat. Methods 12, 59–60 (2015).

    CAS  Article  Google Scholar 

  38. 38.

    Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).

    CAS  Article  Google Scholar 

  39. 39.

    Koike, N. et al. Transcriptional Architecture and Chromatin Landscape of the Core Circadian Clock in Mammals. Science 338, 349–354 (2012).

    ADS  CAS  Article  Google Scholar 

  40. 40.

    NCBI Sequence Read Archive (2020)

  41. 41.

    Payton, L. et al. Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes. figshare (2020).

  42. 42.

    Nielsen, T. G., Kjellerup, S., Smolina, I., Hoarau, G. & Lindeque, P. Live discrimination of Calanus glacialis and C. finmarchicus females: can we trust phenological differences? Mar. Biol. 161, 1299–1306 (2014).

    Article  Google Scholar 

  43. 43.

    Choquet, M. et al. Genetics redraws pelagic biogeography of Calanus. Biol. Lett. 13, 20170588 (2017).

    Article  Google Scholar 

  44. 44.

    Truett, G. E. et al. Preparation of PCR-quality mouse genomic DNA with hot sodium hydroxide and tris (HotSHOT). BioTechniques 29(52), 54 (2000).

    Google Scholar 

  45. 45.

    Smolina, I. et al. Genome- and transcriptome-assisted development of nuclear insertion/deletion markers for Calanus species (Copepoda: Calanoida) identification. Mol. Ecol. Resour. 14, 1072–1079 (2014).

    CAS  PubMed  Google Scholar 

  46. 46.

    DeLeo, D. M., Pérez-Moreno, J. L., Vázquez-Miranda, H. & Bracken-Grissom, H. D. RNA profile diversity across arthropoda: guidelines, methodological artifacts, and expected outcomes. Biol. Methods Protoc. 3 (2018).

  47. 47.

    Thaben, P. F. & Westermark, P. O. Detecting rhythms in time series with RAIN. J. Biol. Rhythms 29, 391–400 (2014).

    Article  Google Scholar 

Download references


This work was supported by the CHASE project, part of the Changing Arctic Ocean programme, jointly funded by the UKRI Natural Environment Research Council (NERC, project number: NE/R012733/1) and the German Federal Ministry of Education and Research (BMBF, project number: 03F0803A). We thank the NERC PRIZE cruise leader Professor Finlo Cottier (Scottish Association for Marine Science, UK) as well as the Captain and crew of the RRS James Clark Ross for their support during the cruise JR17006. Cruise time was supported by the CAO Arctic PRIZE project (NERC: NE/P006302/1). EE was supported by Arctic SIZE, a project co-funded by UiT The Arctic University of Norway and the Tromsø Research Foundation (project number 01vm/h15), and within framework of the state assignment of IO RAS (theme No. 0149- 2019- 0008). We would like to thank Vittoria Roncalli (Stazione Zoologica Anton Dohrn, Italy) for useful discussions. Open Access funding enabled and organized by Projekt DEAL.

Author information




L.P. designed the study, coordinated RNA extraction, contributed to data analysis and wrote the manuscript; C.N. and C.H. performed reads quality assessment, reads alignment on transcriptome, transcriptome annotation and validation and wrote the manuscript; L.H. designed the study, collected samples, extracted RNA and reviewed the manuscript; K.L. and D.W. designed the study, collected samples and reviewed the manuscript; E.E. identified the copepod species on a genetic level and reviewed the manuscript; S.V performed the samples preparation for RNA sequencing; B.M. designed the study, reviewed the manuscript and supervised the study.

Corresponding author

Correspondence to Laura Payton.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit

The Creative Commons Public Domain Dedication waiver applies to the metadata files associated with this article.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Payton, L., Noirot, C., Hoede, C. et al. Daily transcriptomes of the copepod Calanus finmarchicus during the summer solstice at high Arctic latitudes. Sci Data 7, 415 (2020).

Download citation


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing