Vertebral samples from each individual were first crushed in liquid nitrogen. Total cellular RNA was extracted TSA HDAC using TRIzol Reagent (Life Technologies) according to the manufacturer’s recommendations. Total extracted RNA was subjected to DNAse treated (ArcturusPicoPure RNA isolation kit, Life Technologies) and RNA integrity and purity were assessed using a Bioanalyzer 2100 (Agilent Technologies). RNA was quantified using ND-1000 spectrophotometer (NanoDrop
Technologies Inc.). RNA samples from weeks 0 and 4 were pooled (3 fish per pool) according to sampling time and diet, while fish sampled at week 27 were processed separately (Table 1). Libraries were created using TruSeq Sample Prep Kit v2 (Illumina, USA) following the manufacturer’s instruction. Resulting libraries were quantified using a Bioanalyzer BTK inhibitor solubility dmso 2100 (Agilent Technologies).
Samples were multiplexed (6 samples per lane) and sequenced at McGill genomic platform (Montréal, Canada) with HiSeq2000 sequencer and a 100 paired-end (PE) technology. Reads from HiSeq2000 Illumina were processed with Trimmomatic v0.30 (Lohse et al., 2012) to remove low quality (trailing: 20, lowest quality: 30) and short reads (< 60 bp). Trimming also included removal of Illumina adapters together with the most common contamination vectors from UniVec database (https://www.ncbi.nlm.nih.gov/tools/vecscreen/univec/). The combined high quality reads (pools/samples) were de novo assembled using the Trinity assembler ( Haas et al., 2013). Sequencing
yielded 185,369,129 reads for each end. Trimming decreased the amount of reads to 141,986,373. Assembly for Illumina 100PE reads led to 679,869 transcripts for a mean length of 542 bp (Table 1). This Transcriptome Shotgun Assembly project has been deposited at DDBJ/EMBL/GenBank under the accession GBTD00000000. The version described in this paper is the first version, GBTD01000000. From the 679,869 transcripts, 340,737 found homology (Blastn, threshold evalue < 10–4) with referenced ESTs for rainbow trout. Functional annotation revealed that 141,909 and 117,564 transcripts found sequence homology against Nr and Uniprot protein databases, respectively (Blastx, threshold evalue < 10–8). See supplementary file 1 for more details regarding the methods and the results. More information regarding transcripts and matches on Uniprot Methane monooxygenase database is provided in a spreadsheet in supplementary data (Supplementary file 2). Besides, a top-hit distribution revealed that transcripts matched mainly with teleost species (Fig. 1A). In addition, Gene Ontology association (GO) resulted in 11,202 assignment from which 93.4%, 91.1% and 85.9% were allocated to cellular components, molecular function and biological process, respectively (see details Fig. 1B). Finally, only 5.4% of the non-matching sequences against Uniprot displayed theoretical ORFs superior to 100 amino acids (see supplementary methods for more details).