Data:
Available
via GEO
Format: These files are in single line FASTA format. The header line contains a 6nt degenerate barcode associated with each sequenced read. The same degenerate barcode is reproduced in the first 6nt positions of the read sequence line.
Example:
Random barcode: >CTAGAA AGO bound read: CTAGAACCGTAGCCCTGGCGGATGCCTGGTAGGTGGAAGCGT
Data:
CD4_miR155_clipbase.zip
Format: This
file is in tab-delimited format. Each line contains the sequence and details for one peak. Multiple attributes/details are present
for each peak.
Example:
[Line: 6296] Transcript ID: NM_001205043.1 Gene Symbol: Jarid2 Peak Number: 2 WT read count: 7.02049 KO read count: 2.44848 Diff. (WT-KO): 4.57201 Diff. P-value: 2.57589e-13 Replicate stat: 12 Peak start pos: 127 Peak end pos: 162 Peak sequence: TAGAGAACTGATTTTGTTTTAGCATTAAACTGTTCAAGTTTTTGTACG
Data:
mouse.utr.fa.zip
Format: This file is in
tab-delimited format. Each line
contains one 3' UTR sequence for a gene. The following attributes are present for each gene: Transcript ID, Gene Symbol, Species
ID and UTR Sequence.
Example:
[Line 10682] Transcript ID: NM_001205043.1 Gene Symbol: Jarid2 Taxonomy ID: 10090 3' UTR seq.: AAGATGCCG...