This analysis was performed to identify loci that contain tandem 3' UTRs (loci with multiple PASs located in the same last exon). Custom scripts were used to identify loci that contain at least two PASs that share the same coordinates of the start of the last exon. The number of loci with tandem 3' UTRs was calculated for those in which PAS was significantly coupled to alternative exons and for those that did not show any significant interdepenedncies between alternative exons and the PAS usage. <br>
<br>
---
## **Sequence Motif Analysis Relative to Acceptor and Donor Sites** <br>
For each detected gene, we report the first and last nucleotide of each exon as acceptor and donor splice sites, respectively. Each unique genomic position was converted into a BED format and the strand specific sequences of 2 nucleotides were extracted using UCSC Table Browser (GRCh37/hg19) for both acceptor and donor splice sites.
> **`script:`** The python script for extracting dinucleotide sequences of the splice-sites at R1 and R3 domains (i.e., canonical GT and AG motifs) can be found [**here**](https://git.lumc.nl/s.y.anvar/mRNA-Coupling/ipython_notebook/master/scripts/Rdomain_splice_junction_motif.ipynb). <br>
<br>
---
## **RNA Binding Motif Analysis** <br>
We used MEME suite to identify enriched sequence motifs present in exons significantly coupled with alternative TSSs, PASs or other alternative exons. For each unique exon, three regions were considered: R1 domain (containing up to 35bp upstream of the acceptor splice site), R2 domain (containing 32bp downstream of the acceptor splice site and 32bp upstream of the donor splice site) and R3 domain (containing up to 40bp downstream of the donor splice site). R1, R2 and R3 domains were obtained by extracting strand specific FASTA sequences using UCSC Table Browser (GRCh37/hg19). <br>
We locally ran DREME (version 4.11.4) for each region separately and performed a motif search analysis using a negative background (R1, R2 and R3 domains of exons that were not significantly coupled). We ran DREME without any limitation for the motifs' length (similar to poly(A) site motif analysis). In each case, a maximum of 10 motifs with E-value less than 0.05 was reported. The remaining parameters were kept as default. We then compared each motif found by DREME against the human RNA-binding motifs database CISBP-RNA using TOMTOM Motif Comparison tool. We ran the analysis by setting the Pearson correlation coefficient as comparison function and considered only matches with a minimum false discovery rate (q-value) less than 0.05. <br>
<br>
---
## **Reported Bugs and Fixes** <br>
So far, we have not received any bug reports! In this section, we will report any future changes to the procedure or the accompanied scripts. Feel free to send in your suggestions and comments for improvement or additional features. <br>