miR-181a initiates and perpetuates oncogenic transformation through the regulation of innate immune signaling

Genomic instability (GI) predisposes cells to malignant transformation, however the molecular mechanisms that allow for the propagation of cells with a high degree of genomic instability remain unclear. Here we report that miR-181a is able to transform fallopian tube secretory epithelial cells through the inhibition of RB1 and stimulator-of-interferon-genes (STING) to propagate cells with a high degree of GI. MiR-181a targeting of RB1 leads to profound nuclear defects and GI generating aberrant cytoplasmic DNA, however simultaneous miR-181a mediated inhibition of STING allows cells to bypass interferon mediated cell death. We also found that high miR-181a is associated with decreased IFNγ response and lymphocyte infiltration in patient tumors. DNA oncoviruses are the only known inhibitors of STING that allow for cellular transformation, thus, our findings are the first to identify a miRNA that can downregulate STING expression to suppress activation of intrinsic interferon signaling. This study introduces miR-181a as a putative biomarker and identifies the miR-181a-STING axis as a promising target for therapeutic exploitation.

Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist.

Statistics
For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section.
n/a Confirmed The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly The statistical test(s) used AND whether they are one-or two-sided Only common tests should be described solely by name; describe more complex techniques in the Methods section.
A description of all covariates tested A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted Give P values as exact values whenever suitable.

For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings
For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated Our web collection on statistics for biologists contains articles on many of the points above.

Software and code
Policy information about availability of computer code Data collection Prism 8.0 was used for all statistical analysis Data analysis Prism 8.0 was used for all statistical analysis, LAS-X imaging software For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information.

Data
Policy information about availability of data All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: -Accession codes, unique identifiers, or web links for publicly available datasets -A list of figures that have associated raw data -A description of any restrictions on data availability All sequencing data has been uploaded Gene Expression Omnibus Database under the accession number GSE150909.

Field-specific reporting
Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. primary FT cell lines were used. For all cell lines they will undergo periodic assessment (every 10 passages or upon the generation of new derivatives) using short tandem repeats (STR) analysis.
Authentication short tandem repeats (STR) analysis was used for validation.
In addition, cells in culture in the laboratory are/will be periodically tested for mycoplasma using the Universal Mycoplasma Detection Kit (ATCC® 30-1012K™) from the ATCC. We routinely perform mycoplasma testing every month. These measures follow ANSI (American National Standards Institute) voluntary consensus standards.

Specimen deposition
Indicate where the specimens have been deposited to permit free access by other researchers.

Dating methods
If new dates are provided, describe how they were obtained (e.g. collection, storage, sample pretreatment and measurement), where they were obtained (i.e. lab name), the calibration program and the protocol for quality assurance OR state that no new dates are provided.
Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information.

Animals and other organisms
Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research Confirm that both raw and final processed data have been deposited in a public database such as GEO.
Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks.

Data access links
May remain private before publication.
For "Initial submission" or "Revised version" documents, provide reviewer access links. For your "Final submission" document, provide a link to the deposited data.

Files in database submission
Provide a list of all files available in the database submission.
Genome browser session (e.g. UCSC) Provide a link to an anonymized genome browser session for "Initial submission" and "Revised version" documents only, to enable peer review. Write "no longer applicable" for "Final submission" documents.
For cell cycle analysis, cells were plated in 10 cm plates at a density of 500,000 cells per plate in biological triplicate (approx. 40% confluency) and cultured for 48 hr. After 48 hr cells were trypsinized and fixed in 70% EtOH overnight. Fixed cells were washed with PBS and then stained with propidium iodide for 30 min at room temperature. Cell cycle for each sample was then analyzed by fluorescence activated cell sorting using an Attune NXT flow cytometer (Invitrogen) and FCS Express software.

Instrument
Attune NXT flow cytometer (Invitrogen) a Software FloJo Cell population abundance For selection of the FTSECs transduced with RB1 shRNA GFP flow cytometry was used. All cell lines selected with flow cytometry were analyzed every 3-5 passages to make sure that the RFP or GFP+ population was >95% with re-sorting done as needed.
Gating strategy FSC vs SSC gating was used to identify the distinct cell types Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information.

Magnetic resonance imaging
Experimental design Design type Indicate task or resting state; event-related or block design.

Specify the number of blocks, trials or experimental units per session and/or subject, and specify the length of each trial or block (if trials are blocked) and interval between trials.
Behavioral performance measures State number and/or type of variables recorded (e.g. correct button press, response time) and what statistics were used to establish that the subjects were performing the task as expected (e.g. mean, range, and/or standard deviation across subjects).

nature research | reporting summary
October 2018 Acquisition Imaging type(s) Specify: functional, structural, diffusion, perfusion. Field strength

Sequence & imaging parameters
Specify the pulse sequence type (gradient echo, spin echo, etc.), imaging type (EPI, spiral, etc.), field of view, matrix size, slice thickness, orientation and TE/TR/flip angle.

Area of acquisition
State whether a whole brain scan was used OR define the area of acquisition, describing how the region was determined.

Noise and artifact removal
Describe your procedure(s) for artifact and structured noise removal, specifying motion parameters, tissue signals and physiological signals (heart rate, respiration).

Volume censoring
Define your software and/or method and criteria for volume censoring, and state the extent of such censoring.

Statistical modeling & inference Model type and settings
Specify type (mass univariate, multivariate, RSA, predictive, etc.) and describe essential details of the model at the first and second levels (e.g. fixed, random or mixed effects; drift or auto-correlation).

Graph analysis
Report the dependent variable and connectivity measure, specifying weighted graph or binarized graph, subject-or group-level, and the global and/or node summaries used (e.g. clustering coefficient, efficiency, etc.).

Multivariate modeling and predictive analysis
Specify independent variables, features extraction and dimension reduction, model, training and evaluation metrics.