(a) Comparison of measured relative growth phenotypes from the large-scale screen and predicted activities assigned by the elastic net linear model. Marginal histograms show distributions of relative activities along the corresponding axes. n = 5,241 sgRNAs; r2 = squared Pearson correlation coefficient. (b) Comparison of measured relative activity (relative knockdown) in the GFP experiment and predicted relative sgRNA activity. n = 57 sgRNAs; r2 = squared Pearson correlation coefficient. (c) Comparison of predicted relative activities from the linear model and the neural network, based on the validation set of singly-mismatched sgRNAs. n = 5,241 sgRNAs; r2 = squared Pearson correlation coefficient. (d) Regression coefficients assigned to each feature in the linear model. 228 features (gray, blue) describe the position and type of mismatch; 42 features (gold) carry other information about the sgRNA and genomic context surrounding the protospacer. These features are detailed in subsequent panels. (e) Linear coefficients for features of the sgRNA and targeted locus. TSS; transcription start site. (f) Linear coefficients for features covering positions in the distal, intermediate, and seed regions of the targeting sequence (highlighted blue in panel d).