Construct: ORF TRCN0000471138
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF005914.2_s317c1
- Derived from:
- ccsbBroadEn_06998
- DNA Barcode:
- AGGTCATAATAACTGTTCTGGTAC
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SRPK1 (6732)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000471138
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 6732 | SRPK1 | SRSF protein kinase 1 | NM_003137.5 | 99.9% | 99.8% | 629T>C |
2 | human | 6732 | SRPK1 | SRSF protein kinase 1 | NR_034069.1 | 44.5% | (many diffs) | |
3 | mouse | 20815 | Srpk1 | serine/arginine-rich protei... | NM_016795.4 | 88.2% | 91.4% | (many diffs) |
4 | mouse | 20815 | Srpk1 | serine/arginine-rich protei... | XM_006523958.3 | 76.7% | 78.2% | (many diffs) |
5 | mouse | 20815 | Srpk1 | serine/arginine-rich protei... | XM_011246337.2 | 74.8% | 75.3% | (many diffs) |
6 | mouse | 20815 | Srpk1 | serine/arginine-rich protei... | XR_385308.3 | 57.1% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2031
- ORF length:
- 1965
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatgga gcggaaagtg cttgcgctcc aggcccgaaa gaaaaggacc aaggccaaga 121 aggacaaagc ccaaaggaaa tctgaaactc agcaccgagg ctctgctccc cactctgaga 181 gtgatctacc agagcaggaa gaggagattc tgggatctga tgatgatgag caagaagatc 241 ctaatgatta ttgtaaagga ggttatcatc ttgtgaaaat tggagatcta ttcaatggga 301 gataccatgt gatccgaaag ttaggctggg gacacttttc aacagtatgg ttatcatggg 361 atattcaggg gaagaaattt gtggcaatga aagtagttaa aagtgctgaa cattacactg 421 aaacagcact agatgaaatc cggttgctga agtcagttcg caattcagac cctaatgatc 481 caaatagaga aatggttgtt caactactag atgactttaa aatatcagga gttaatggaa 541 cacatatctg catggtattt gaagttttgg ggcatcatct gctcaagtgg atcatcaaat 601 ccaattatca ggggcttcca ctgccttgtg tcaaaaaaat tattcagcaa gtgttacagg 661 gtcttgatta tttacatacc aagtgccgta tcacccacac tgacattaaa ccagagaaca 721 tcttattgtc agtgaatgag cagtacattc ggaggctggc tgcagaagca acagaatggc 781 agcgatctgg agctcctccg ccttccggat ctgcagtcag tactgctccc cagcctaaac 841 cagctgacaa aatgtcaaag aataagaaga agaaattgaa gaagaagcag aagcgccagg 901 cagaattact agagaagcga atgcaggaaa ttgaggaaat ggagaaagag tcgggccctg 961 ggcaaaaaag accaaacaag caagaagaat cagagagtcc tgttgaaaga cccttgaaag 1021 agaacccacc taataaaatg acccaagaaa aacttgaaga gtcaagtacc attggccagg 1081 atcaaacgct tatggaacgt gatacagagg gtggtgcagc agaaattaat tgcaatggag 1141 tgattgaagt cattaattat actcagaaca gtaataatga aacattgaga cataaagagg 1201 atctacataa tgctaatgac tgtgatgtcc aaaatttgaa tcaggaatct agtttcctaa 1261 gctcccaaaa tggagacagc agcacatctc aagaaacaga ctcttgtaca cctataacat 1321 ctgaggtgtc agacaccatg gtgtgccagt cttcctcaac tgtaggtcag tcattcagtg 1381 aacaacacat tagccaactt caagaaagca ttcgggcaga gataccctgt gaagatgaac 1441 aagagcaaga acataacgga ccactggaca acaaaggaaa atccacggct ggaaattttc 1501 ttgttaatcc ccttgagcca aaaaatgcag aaaagctcaa ggtgaagatt gctgaccttg 1561 gaaatgcttg ttgggtgcac aaacatttca ctgaagatat tcaaacaagg caatatcgtt 1621 ccttggaagt tctaaTCGGA TCTGGCTATA ATACCCCTGC TGACATTTGG AGCACGGCAT 1681 GCATGGCCTT TGAACTGGCC ACAGGTGACT ATTTGTTTGA ACCTCATTCA GGGGAAGAGT 1741 ACACTCGAGA TGAAGATCAC ATTGCATTGA TCATAGAACT TCTGGGGAAG GTGCCTCGCA 1801 AGCTCATTGT GGCAGGAAAA TATTCCAAGG AATTTTTCAC CAAAAAAGGT GACCTGAAAC 1861 ATATCACGAA GCTGAAACCT TGGGGCCTTT TTGAGGTTCT AGTGGAGAAG TATGAGTGGT 1921 CGCAGGAAGA GGCAGCTGGC TTCACAGATT TCTTACTGCC CATGTTGGAG CTGATCCCTG 1981 AGAAGAGAGC CACTGCCGCC GAGTGTCTCC GGCACCCTTG GCTTAACTCC TACCCAACTT 2041 ACTTGTACAA AGTGGTTGAT ATCGGTAAGC CTATCCCTAA CCCTCTCCTC GGTCTCGATT 2101 CTACGTAGTA ATGAACTAGT CCGTAACTTG AAAGTATTTC GATTTCTTGG CTTTATATAT 2161 CTTGTGGAAA GGACGAAGGT CATAATAACT GTTCTGGTAC ACGCGTTAAG TCgacaatca 2221 acctctggat tacaaaattt gtgaaagatt