Construct: ORF TRCN0000477910
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF003217.1_s317c1
- Derived from:
- ccsbBroadEn_04333
- DNA Barcode:
- TTTACTCAGAAGTACATTCGAGGT
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- USP42 (84132)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000477910
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 84132 | USP42 | ubiquitin specific peptidas... | NM_032172.3 | 100% | 100% | |
2 | human | 84132 | USP42 | ubiquitin specific peptidas... | NM_001365764.1 | 99.3% | 99.3% | 3946_3952delAAGAGGA;3956_3972del |
3 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_006715790.2 | 99% | 99% | 3942_3943insGGT;3946_3981del |
4 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_006715791.2 | 99% | 99% | 3942_3943insGGT;3946_3981del |
5 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_011515573.1 | 99% | 99% | 3942_3943insGGT;3946_3981del |
6 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_024446968.1 | 99% | 99% | 3942_3943insGGT;3946_3981del |
7 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_024446969.1 | 99% | 99% | 3942_3943insGGT;3946_3981del |
8 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_011515574.1 | 89.8% | 87.2% | (many diffs) |
9 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_011515577.1 | 87.9% | 87.8% | (many diffs) |
10 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_011515578.1 | 87.4% | 87.4% | 0_1ins462;3480_3481insGGT;3484_3519del |
11 | human | 84132 | USP42 | ubiquitin specific peptidas... | XM_005249883.5 | 55.8% | 55.7% | (many diffs) |
12 | human | 84132 | USP42 | ubiquitin specific peptidas... | XR_002956494.1 | 52.9% | | 1_67del;4013_4019delAAGAGGA;4023_7458del |
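
The match percentages in this table are defined as total matches divided by aligned length, gaps included. A minimal Python sketch of that calculation follows. It assumes the two sequences have already been globally aligned and padded with `-` for gaps; the function name `match_percent` and the toy alignment are illustrative and are not taken from the portal.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length, gaps included.

    Both inputs must be rows of the same global alignment, so they
    must have equal length. A gap never counts as a match.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment (hypothetical data): 10 columns, 8 identical, 1 mismatch, 1 gap.
toy_score = match_percent("ACGTACGT-A", "ACGTTCGTAA")
print(f"{toy_score:.1f}%")
```

Because the denominator is the aligned length rather than the length of either sequence, insertions and deletions lower the score in the same way as mismatches.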
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 4017
- ORF length:
- 3948
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcaccat gaccatagtt gacaaagctt ctgaatcttc agacccatca gcctatcaga 121 atcagcctgg cagctccgag gcagtctcac ctggagacat ggatgcaggt tctgccagct 181 ggggtgctgt gtcttcattg aatgatgtgt caaatcacac actttcttta ggaccagtac 241 ctggtgctgt agtttattcg agttcatctg tacctgataa atcaaaacca tcaccacaaa 301 aggatcaagc cctaggtgat ggcatcgctc ctccacagaa agttcttttc ccatctgaga 361 agatttgtct taagtggcaa caaactcata gagttggagc tgggctccag aatttgggca 421 atacctgttt tgccaatgca gcactgcagt gtttaaccta cacaccacct cttgccaatt 481 acatgctatc acatgaacac tccaaaacat gtcatgcaga aggcttttgt atgatgtgta 541 caatgcaagc acatattacc caggcactca gtaatcctgg ggacgttatt aaaccaatgt 601 ttgtcatcaa tgagatgcgg cgtatagcta ggcacttccg ttttggaaac caagaagatg 661 cccatgaatt ccttcaatac actgttgatg ctatgcagaa agcatgcttg aatggcagca 721 ataaattaga cagacacacc caggccacca ctcttgtttg tcagatattt ggaggatacc 781 taagatctag agtcaaatgt ttaaattgca agggcgtttc agatactttt gatccatatc 841 ttgatataac attggagata aaggctgctc agagtgtcaa caaggcattg gagcagtttg 901 tgaagccgga acagcttgat ggagaaaact cgtacaagtg cagcaagtgt aaaaagatgg 961 ttccagcttc aaagaggttc actatccata gatcctctaa tgttcttaca ctttctctga 1021 aacgttttgc aaattttacc ggtggaaaaa ttgctaagga tgtgaaatac cctgagtatc 1081 ttgatattcg gccatatatg tctcaaccca acggagagcc aattgtctac gtcttgtatg 1141 cagtgctggt ccacactggt tttaattgcc atgctggcca ttacttctgc tacataaaag 1201 ctagcaatgg cctctggtat caaatgaatg actccattgt atctaccagt gatattagat 1261 cggtactcag ccaacaagcc tatgtgctct tttatatcag gtcccatgat gtgaaaaatg 1321 gaggtgaact tactcatccc acccatagcc ccggccagtc ctctccccgc cccgtcatca 1381 gtcagcgggt tgtcaccaac aaacaggctg cgccaggctt tatcggacca cagcttccct 1441 ctcacatgat aaagaatcca cctcacttaa atgggactgg accattgaaa gacacgccaa 1501 gcagttccat gtcgagtcct aacgggaatt ccagtgtcaa cagggctagt cctgttaatg 1561 cttcagcttc tgtccaaaac tggtcagtta ataggtcctc agtgatccca gaacatccta 1621 agaaacaaaa aattacaatc agtattcaca acaagttgcc tgttcgccag tgtcagtctc 1681 aacctaacct tcatagtaat 
tctttggaga accctaccaa gcccgttccc tcttctacca 1741 ttaccaattc tgcagtacag tctacctcga acgcatctac gatgtcagtt tctagtaaag 1801 taacaaaacc gatcccccgc agtgaatcct gctcccagcc cgtgatgaat ggcaaatcca 1861 agctgaactc cagcgtgctg gtgccctatg gcgccgagtc ctctgaggac tctgacgagg 1921 agtcaaaggg gctgggcaag gagaatggga ttggtacgat tgtgagctcc cactctcccg 1981 gccaagatgc cgaagatgag gaggccactc cgcacgagct tcaagaaccc atgaccctaa 2041 acggtgctaa tagtgcagac agcgacagtg acccgaaaga aaacggccta gcgcctgatg 2101 gtgccagctg ccaaggccag cctgccctgc actcagaaaa tccctttgct aaggcaaacg 2161 gtcttcctgg aaagttgatg cctgctcctt tgctgtctct cccagaagac aaaatcttag 2221 agaccttcag gcttagcaac aaactgaaag gctcgacgga tgaaatgagt gcacctggag 2281 cagagagggg ccctcccgag gaccgcgacg ccgagcctca gcctggcagc cccgccgccg 2341 aatccctgga ggagccagat gcggccgccg gcctcagcag caccaagaag gctccgccgc 2401 cccgcgatcc cggcaccccc gctaccaaag aaggcgcctg ggaggccatg gccgtcgccc 2461 ccgaggagcc tccgcccagc gccggcgagg acatcgtggg ggacacagca ccccctgacc 2521 tgtgtgatcc cgggagctta acaggcgatg cgagcccgtt gtcccaggac gcaaagggga 2581 tgatcgcgga gggcccgcgg gactcggcgt tggcggaagc cccggaaggg ttgagtccgg 2641 ctccgcctgc gcggtcggag gagccctgcg agcagccact ccttgttcac cccagcgggg 2701 accacgcccg ggacgctcag gacccatccc agagcttggg cgcacccgag gccgcagagc 2761 ggccgccagc tcctgtgctg gacatggccc cggccggtca cccggaaggg gacgctgagc 2821 ctagccccgg cgagagggtc gaggacgccg cggcgccgaa agccccaggc ccttccccag 2881 cgaaggagaa aatcggcagc ctcagaaagg tggaccgagg ccactaccgc agccggagag 2941 agcgctcgtc cagcggggag cccgccagag agagcaggag caagactgag ggccaccgtc 3001 accggcggcg ccgcacctgc ccccgggagc gcgaccgcca ggaccgccac gccccggagc 3061 accaccccgg ccacggcgac aggctcagcc ctggcgagcg ccgctctctg ggcaggtgca 3121 gtcaccacca ctcccgacac cggagcgggg tggagctgga ctgggtcaga caccactaca 3181 ccgagggcga gcgtggctgg ggccgggaga agttctaccc cgacaggccg cgctgggaca 3241 ggtgccggta ctaccatgac aggtacgccc tgtacgctgc ccgggactgg aagcccttcc 3301 acggcggccg cgagcacgag cgggccgggc tgcacgagcg gccgcacaag gaccacaacc 3361 ggggccgtag gggctgcgag ccggcccggg 
agagggagcg gcaccgcccc agcagccccc 3421 gcgcaggcgc gccccacgcc ctcgccccgc accccgaccg cttctcccac gacagaactg 3481 cacttgtagc cggagacaac tgtaacctct ctgatcGGTT TCACGAACAC GAAAATGGAA 3541 AGTCCCGGAA ACGGAGACAC GACAGTGTGG AGAACAGTGA CAGTCATGTT GAAAAGAAAG 3601 CCCGGAGGAG CGAACAGAAG GATCCTCTAG AAGAGCCTAA AGCAAAGAAG CACAAAAAAT 3661 CAAAGAAGAA AAAGAAATCC AAAGACAAAC ACCGAGACCG CGACTCCAGG CATCAGCAGG 3721 ACTCAGACCT CTCAGCAGCG TGCTCTGACG CTGACCTCCA CAGACACAAA AAAAAGAAGA 3781 AGAAAAAGAA GAGACATTCA AGAAAATCAG AGGACTTTGT TAAAGATTCA GAACTGCACT 3841 TACCCAGGGT CACCAGCTTG GAGACTGTCG CCCAGTTCCG GAGAGCCCAG GGTGGCTTTC 3901 CTCTCTCTGG TGGCCCGCCT CTGGAAGGCG TCGGACCTTT CCGTGAGAAA ACGAAACACT 3961 TACGGATGGA AAGCAGGGAT GACAGGTGTC GTCTCTTTGA GTATGGCCAG GGTGATTTGC 4021 CTACTTTCTT GTACAAAGTG GTTGATATCG GTAAGCCTAT CCCTAACCCT CTCCTCGGTC 4081 TCGATTCTAC GTAGTAATGA ACTAGTCCGT AACTTGAAAG TATTTCGATT TCTTGGCTTT 4141 ATATATCTTG TGGAAAGGAC GATTTACTCA GAAGTACATT CGAGGTACGC GTTAAGTCga 4201 caatcaacct ctggattaca aaatttgtga aagatt
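
The ORF coordinates above can be checked against the sequence with a short Python sketch. It assumes a 1-based start and an exclusive end, a convention inferred only from the fact that 4017 - 69 = 3948 matches the stated ORF length; the portal does not state it. The helper names `clean_sequence` and `extract_orf` and the toy input are hypothetical.

```python
import re

def clean_sequence(raw: str) -> str:
    """Drop position numbers and whitespace from a numbered sequence dump.

    Letter case is preserved, since uppercase marks verified bases.
    """
    return re.sub(r"[^A-Za-z]", "", raw)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Return the bases from `start` (1-based) up to but excluding `end`.

    The exclusive end is an assumption inferred from the record,
    where end - start equals the stated ORF length.
    """
    return seq[start - 1 : end - 1]


# Coordinates copied from this record.
ORF_START, ORF_END, ORF_LENGTH = 69, 4017, 3948
assert ORF_END - ORF_START == ORF_LENGTH   # consistent with an exclusive end
assert ORF_LENGTH % 3 == 0                 # whole number of codons

# Toy input in the same numbered layout (hypothetical data).
toy = clean_sequence("1 ccatgaaagg 11 ttgactt")
toy_orf = extract_orf(toy, 3, 12)
print(toy_orf)
```

Applied to the full sequence, `extract_orf(clean_sequence(raw), 69, 4017)` would begin at the `atg` found at positions 69 to 71. The slice contains no stop codon, in line with the note on this construct.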