Construct: ORF ccsbBroadEn_09209
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF004918.1_s300c1, BRDN0000397751
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- USP38 (84640)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 84640 | USP38 | ubiquitin specific peptidas... | NM_032557.6 | 99.9% | 100% | 1437T>C;2589G>A |
2 | human | 84640 | USP38 | ubiquitin specific peptidas... | XM_011532360.3 | 96.6% | 96.7% | 946_947ins102;1335T>C;2487G>A |
3 | human | 84640 | USP38 | ubiquitin specific peptidas... | NM_001290325.1 | 95.6% | 94.2% | (many diffs) |
4 | human | 84640 | USP38 | ubiquitin specific peptidas... | NM_001290326.1 | 56.2% | 56.3% | 0_1ins1365;72T>C;1224G>A |
5 | mouse | 74841 | Usp38 | ubiquitin specific peptidas... | NM_027554.2 | 87.5% | 90.1% | (many diffs) |
6 | mouse | 74841 | Usp38 | ubiquitin specific peptidas... | XM_006531450.3 | 58.5% | 57.8% | (many diffs) |
7 | mouse | 74841 | Usp38 | ubiquitin specific peptidas... | XM_017312992.1 | 45.8% | 45.4% | (many diffs) |
8 | mouse | 74841 | Usp38 | ubiquitin specific peptidas... | XR_387782.2 | 38.1% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 3192
- ORF length:
- 3126
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGA CAAGATCCTG GAGGGCCTTG TGAGTTCCTC GCATCCCCTG CCCCTCAAGC 121 GGGTGATTGT GCGGAAGGTG GTGGAATCGG CGGAGCACTG GCTAGACGAG GCGCAGTGCG 181 AGGCCATGTT TGACCTGACG ACCCGGCTCA TCCTGGAGGG CCAGGACCCT TTCCAGCGGC 241 AGGTGGGGCA CCAGGTGCTG GAGGCCTACG CACGATACCA CCGGCCAGAG TTCGAGTCCT 301 TCTTCAACAA GACCTTCGTG TTGGGCCTCC TTCATCAGGG CTACCACTCT CTGGACAGGA 361 AGGATGTAGC CATCCTGGAC TACATTCACA ACGGCCTGAA GCTGATTATG AGCTGTCCGT 421 CGGTGCTGGA TCTCTTTAGC CTCCTGCAGG TAGAGGTGTT ACGGATGGTG TGTGAGAGGC 481 CGGAGCCGCA GCTCTGTGCC CGACTGAGCG ACCTTCTGAC CGACTTTGTG CAATGCATCC 541 CCAAGGGGAA ATTGTCCATC ACGTTCTGTC AACAGCTGGT TCGAACGATA GGCCATTTCC 601 AGTGCGTGTC CACCCAGGAA AGAGAGCTGC GGGAATATGT CTCCCAGGTG ACAAAAGTGA 661 GTAACTTGCT GCAGAACATC TGGAAGGCCG AGCCTGCCAC ACTACTGCCT TCCCTGCAAG 721 AAGTTTTTGC AAGCATCTCT TCCACAGATG CATCATTTGA ACCTTCTGTA GCATTGGCAA 781 GCCTTGTGCA GCATATTCCT CTTCAGATGA TTACAGTTCT CATCAGGAGC CTTACTACGG 841 ATCCAAATGT AAAAGATGCA AGTATGACCC AAGCCCTTTG CAGAATGATT GACTGGCTAT 901 CCTGGCCATT GGCTCAGCAT GTGGATACAT GGGTAATTGC ACTCCTGAAA GGACTGGCAG 961 CTGTCCAGAA GTTTACTATT TTGATAGATG TTACTTTGCT GAAAATAGAA CTGGTTTTTA 1021 ATCGACTTTG GTTTCCTCTT GTGAGACCTG GTGCTCTTGC AGTTCTTTCT CACATGCTGC 1081 TTAGCTTTCA GCATTCTCCA GAGGCGTTCC ATTTGATTGT TCCTCATGTG GTTAATTTGG 1141 TTCATTCTTT CAAAAATGAT GGTCTGCCTT CAAGTACAGC CTTCTTAGTA CAATTAACAG 1201 AATTGATACA CTGTATGATG TATCATTATT CTGGATTTCC AGATCTCTAT GAACCTATTC 1261 TGGAGGCAAT AAAGGATTTT CCTAAGCCCA GTGAAGAGAA GATTAAGTTA ATTCTCAATC 1321 AAAGTGCCTG GACTTCTCAA TCCAATTCTT TGGCGTCTTG CTTGTCTAGA CTTTCTGGAA 1381 AATCTGAAAC TGGGAAAACT GGTCTTATTA ACCTAGGAAA TACATGTTAT ATGAACAGTG 1441 TTATACAAGC CTTGTTTATG GCCACAGATT TCAGGAGACA AGTATTATCT TTAAATCTAA 1501 ACGGGTGCAA TTCATTAATG AAAAAATTAC AGCATCTTTT TGCCTTTCTG GCCCATACAC 1561 AGAGGGAAGC ATACGCACCT CGGATATTCT TTGAGGCTTC CAGACCTCCA TGGTTTACTC 1621 CCAGATCACA GCAAGACTGT TCTGAATACC TCAGATTTCT CCTTGACAGG CTCCATGAAG 1681 AAGAAAAGAT CTTGAAAGTT CAGGCCTCAC ACAAGCCTTC TGAAATTCTG GAATGCAGTG 1741 AAACTTCTTT ACAGGAAGTA GCTAGTAAAG CAGCAGTACT AACAGAGACC CCTCGTACAA 1801 GTGACGGTGA GAAGACTTTA ATAGAAAAAA TGTTTGGAGG AAAACTACGA ACTCACATAC 1861 GTTGTTTGAA CTGCAGGAGT ACCTCACAAA AAGTGGAAGC CTTTACAGAT CTTTCGCTTG 1921 CCTTTTGTCC TTCCTCTTCT TTGGAAAACA TGTCTGTCCA AGATCCAGCA TCATCACCCA 1981 GTATACAAGA TGGTGGTCTA ATGCAAGCCT CTGTACCCGG TCCTTCAGAA GAACCAGTAG 2041 TTTATAATCC AACAACAGCT GCCTTCATCT GTGACTCACT TGTGAATGAA AAAACCATAG 2101 GCAGTCCTCC TAATGAGTTT TACTGTTCTG AAAACACTTC TGTCCCTAAC GAATCTAACA 2161 AGATTCTTGT TAATAAAGAT GTACCTCAGA AACCAGGAGG TGAAACCACA CCTTCAGTAA 2221 CTGACTTACT AAATTATTTT TTGGCTCCAG AGATTCTTAC TGGTGATAAC CAATATTATT 2281 GTGAAAACTG TGCCTCTCTG CAAAATGCTG AGAAAACTAT GCAAATCACG GAGGAACCTG 2341 AATACCTTAT TCTTACTCTC CTGAGATTTT CATATGATCA GAAGTATCAT GTGAGAAGGA 2401 AAATTTTAGA CAATGTATCA CTGCCACTGG TTTTGGAGTT GCCAGTTAAA AGAATTACTT 2461 CTTTCTCTTC ATTGTCAGAA AGTTGGTCTG TAGATGTTGA CTTCACTGAT CTTAGTGAGA 2521 ACCTTGCTAA AAAATTAAAG CCTTCAGGGA CTGATGAAGC TTCCTGCACA AAATTGGTGC 2581 CCTATCTATT AAGTTCCGTT GTGGTTCACT CTGGTATATC CTCTGAAAGT GGGCATTACT 2641 ATTCTTATGC CAGAAATATC ACAAGTACAG ACTCTTCATA TCAGATGTAC CACCAGTCTG 2701 AGGCTCTGGC ATTAGCATCC TCCCAGAGTC ATTTACTAGG GAGAGATAGT CCCAGTGCAG 2761 TTTTTGAACA GGATTTGGAA AATAAGGAAA TGTCAAAAGA ATGGTTTTTA TTTAATGACA 2821 GTAGAGTGAC ATTTACTTCA TTTCAGTCAG TCCAGAAAAT TACGAGCAGG TTTCCAAAGG 2881 ACACAGCTTA TGTGCTTTTG TATAAAAAAC AGCATAGTAC TAATGGTTTA AGTGGTAATA 2941 ACCCAACCAG TGGACTCTGG ATAAATGGAG ACCCACCTCT ACAGAAAGAA CTTATGGATG 3001 CTATAACAAA AGACAATAAA CTATATTTAC AGGAACAAGA GTTGAATGCT CGAGCCCGGG 3061 CCCTCCAAGC TGCATCTGCT TCATGTTCAT TTCGGCCCAA TGGATTTGAT GACAACGACC 3121 CACCAGGAAG CTGTGGACCA ACTGGTGGAG GGGGTGGAGG AGGATTTAAT ACAGTTGGCA 3181 GACTCGTATT TTGCCCAACT TTCTTGTACA AAGTtggcat tataagaaag cattgcttat 3241 caatttgttg caacgaac