Construct: ORF ccsbBroadEn_11719
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF010156.1_s300c1, BRDN0000391413
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: USP22 (23326)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.
# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 23326 | USP22 | ubiquitin specific peptidas... | NM_015276.2 | 93.1% | 89.4% | (many diffs) |
2 | human | 23326 | USP22 | ubiquitin specific peptidas... | XM_005256575.2 | 81.8% | 81.8% | 0_1ins279;6C>T |
3 | mouse | 216825 | Usp22 | ubiquitin specific peptidas... | NM_001004143.4 | 83% | 89.5% | (many diffs) |
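The match percentages in this table are defined as total matches divided by aligned length, gaps included. The sketch below illustrates that arithmetic on two pre-aligned strings. The function name `percent_identity`, the `-` gap character, and the toy sequences are assumptions made for illustration; this is not the portal's own implementation, and its alignment and rounding rules are not documented here.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Return 100 * matches / aligned length (gap columns included).

    Both inputs must already be globally aligned and of equal length,
    with '-' marking a gap (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    a, b = aligned_a.upper(), aligned_b.upper()
    # A column counts as a match only if both residues agree and neither is a gap.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


# Toy alignment: 10 columns, one mismatch and two gap columns -> 7 matches.
print(percent_identity("ATGGCG--CC", "ATGACGTTCC"))  # 70.0
```

Because gap columns stay in the denominator, a long insertion in either sequence lowers the percentage even when every aligned residue matches.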
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 69
- ORF end: 1608
- ORF length: 1539
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG
61 TTGGCACCAT GGCGCCGGGT TGGCCCTCAC TATCAGCGGG CTCCCGACAG GAGGCGCCCC
121 AGCTTGCGGC CGGGGGCAGC GCCTACCAGG CAGTTGGCAG GCAGTTCCAG CCCCGGGCCA
181 CGGCACTGCA GGGCCCGAGC CAGGCCAAGT CCTGTATCTG CCATGTCTGT GGCGTCCACC
241 TCAACAGGCT GCATTCCTGC CTCTACTGTG TCTTCTTCGG CTGTTTAACA AAGAAGCATA
301 TTCACGAGCA TGCGAAGGCG AAGCGGCACA ACCTGGCCAT TGATCTGATG TATGGAGGCA
361 TCTACTGTTT TCTGTGCCAG GACTACATCT ATGACAAAGA CATGGAAATA ATCGCCAAGG
421 AGGAGCAGCG AAAAGCTTGG AAAATGCAAG GCGTTGGAGA GAAGTTTTCA ACTTGGGAAC
481 CAACCAAACG GGAGCTTGAA CTGCTGAAGC ACAACCCGAA AAGGAGAAAG ATCACCTCGA
541 ACTGCACCAT AGGTCTGCGT GGGCTGATCA ACCTTGGGAA CACATGCTTC ATGAACTGCA
601 TCGTGCAGGC CCTGACCCAC ACGCCACTTC TGCGGGACTT CTTCCTGTCT GACAGGCACC
661 GCTGTGAGAT GCAGAGCCCC AGCTCCTGTC TGGTCTGTGA GATGTCCTCA CTGTTTCAGG
721 AGTTTTACTC TGGACACCGG TCCCCTCACA TCCCGTATAA GTTGCTGCAC CTGGTGTGGA
781 CCCACGCGAG GCACCTAGCA GGCTACGAGC AGCAGGACGC CCACGAGTTC CTCATCGCGG
841 CCCTGGACGT GCTCCACCGA CACTGCAAAG GTGATGACAA TGGGAAGAAG GCCAACAACC
901 CCAACCACTG CAACTGCATC ATAGACCAGA TCTTCACAGG CGGGTTGCAG TCAGACGTCA
961 CCTGCCAAGT CTGCCATGGA GTCTCCACCA CCATCGACCC CTTCTGGGAC ATCAGCTTGG
1021 ATCTCCCCGG CTCTTCCACC CCATTCTGGC CCCTGAGCCC AGGGAGCGAG GGCAACGTGG
1081 TAAACGGGGA AAGCCACGTG TCGGGAACCA CCACGCTCAC GGACTGCCTG CGACGATTCA
1141 CCAGACCAGA GCACTTGGGC AGCAGCGCCA AGATCAAGTG CAGCGGTTGC CATAGCTACC
1201 AGGAGTCCAC AAAGCAGCTC ACTATGAAGA AACTGCCCAT CGTAGCCTGT TTTCATCTCA
1261 AACGATTTGA ACACTCAGCC AAGCTGCGGC GGAAGATCAC CACGTATGTG TCCTTCCCCC
1321 TGGAGCTGGA CATGACCCCT TTCATGGCCT CCAGCAAAGA GAGCAGGATG AATGGACAGT
1381 ACCAGCAGCC CACGGACAGT CTCAACAATG ACAACAAGTA TTCCCTGTTT GCTGTTGTTA
1441 ACCATCAAGG GACCTTGGAG AGTGGCCACT ACACCAGCTT TATCCGGCAG CACAAAGACC
1501 AGTGGTTCAA GTGTGACGAT GCCATCATCA CCAAGGCCAG CATCAAGGAC GTCCTGGACA
1561 GCGAAGGGTA CTTGCTGTTC TATCACAAAC AGTTCCTGGA ATACGAGTTG CCAACTTTCT
1621 TGTACAAAGT tggcattata agaaagcatt gcttatcaat ttgttgcaac gaac
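The ORF coordinates can be checked against the sequence listing with a few lines of code. The sketch below strips the position numbers from the first 120 bases of the listing, confirms that the codon at position 69 is ATG, and derives the ORF length from the start and end values. It assumes the start is 1-based and the end is exclusive, which is the reading under which 1608 - 69 equals the stated length of 1539; the helper names `parse_listing`, `extract_orf`, and `ends_with_stop` are hypothetical and not part of any portal tooling.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_listing(text: str) -> str:
    """Drop position numbers and whitespace from a numbered sequence listing."""
    return re.sub(r"[\d\s]+", "", text)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice an ORF, assuming a 1-based start and an end-exclusive end."""
    return seq[start - 1 : end - 1]


def ends_with_stop(orf: str) -> bool:
    """Report whether the final codon of an in-frame ORF is a stop codon."""
    return orf[-3:].upper() in STOP_CODONS


# First 120 bases of the listing in this record (two rows of 60).
head = """
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG
61 TTGGCACCAT GGCGCCGGGT TGGCCCTCAC TATCAGCGGG CTCCCGACAG GAGGCGCCCC
"""
seq = parse_listing(head)

orf_start, orf_end = 69, 1608          # values from this record
orf_length = orf_end - orf_start       # end-exclusive assumption
start_codon = seq[orf_start - 1 : orf_start + 2]

print(len(seq), start_codon, orf_length, orf_length // 3)
# 120 ATG 1539 513
```

Passing the full parsed listing to `extract_orf(seq, 69, 1608)` and then to `ends_with_stop` is one way to check the "No stop codon in insert" note; case is preserved by the parser, so the uppercase (empirically verified) region remains distinguishable.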