Construct: ORF ccsbBroadEn_09182
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF005155.1_s300c1, BRDN0000396477
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- THOC3 (84321)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 84321 | THOC3 | THO complex 3 | NM_032361.3 | 99.9% | 100% | 357C>T |
2 | human | 84321 | THOC3 | THO complex 3 | XM_017009985.1 | 74.7% | 74.6% | 1_1delAins265;93C>T |
3 | human | 728554 | LOC728554 | THO complex 3 pseudogene | NR_003615.2 | 57.3% | (many diffs) | |
4 | mouse | 73666 | Thoc3 | THO complex 3 | NM_028597.3 | 89.1% | 98.8% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1119
- ORF length:
- 1053
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGC GGTCCCCGCT GCAGCCATGG GGCCCTCGGC GTTGGGCCAG AGCGGCCCCG 121 GCTCGATGGC CCCGTGGTGC TCAGTGAGCA GCGGCCCGTC GCGCTACGTG CTTGGGATGC 181 AGGAGCTGTT CCGGGGCCAC AGCAAGACGC GCGAGTTCCT GGCGCACAGC GCCAAGGTGC 241 ACTCGGTGGC CTGGAGTTGC GACGGGCGTC GCCTAGCCTC GGGGTCCTTC GACAAGACGG 301 CCAGCGTCTT CTTGCTGGAG AAGGACCGGT TGGTCAAAGA AAACAATTAT CGGGGACATG 361 GGGATAGTGT GGACCAGCTT TGTTGGCATC CAAGTAATCC TGACCTATTT GTTACGGCGT 421 CTGGAGATAA AACCATTCGC ATCTGGGATG TGAGGACTAC AAAATGCATT GCCACTGTGA 481 ACACTAAAGG GGAGAACATT AATATCTGCT GGAGTCCTGA TGGGCAGACC ATTGCTGTAG 541 GCAACAAGGA TGATGTGGTG ACCTTTATTG ATGCCAAGAC ACACCGTTCC AAAGCAGAAG 601 AGCAGTTCAA GTTCGAGGTC AACGAAATCT CCTGGAACAA TGACAATAAT ATGTTCTTCC 661 TGACAAATGG CAATGGTTGT ATCAACATCC TCAGCTACCC AGAACTGAAG CCTGTGCAGT 721 CCATCAACGC CCATCCTTCC AACTGCATCT GTATCAAGTT TGACCCCATG GGGAAGTACT 781 TTGCCACAGG AAGTGCAGAT GCTTTGGTCA GCCTCTGGGA TGTGGATGAG TTAGTGTGTG 841 TTCGGTGCTT TTCCAGGCTG GATTGGCCTG TAAGAACCCT CAGTTTCAGC CATGATGGGA 901 AAATGCTGGC GTCAGCATCG GAAGATCATT TTATTGACAT TGCTGAAGTG GAGACAGGGG 961 ACAAACTATG GGAGGTACAG TGTGAGTCTC CGACCTTCAC AGTGGCGTGG CACCCCAAAA 1021 GGCCTCTGCT GGCATTTGCC TGTGATGACA AAGACGGCAA ATATGACAGC AGCCGGGAAG 1081 CCGGAACTGT GAAGCTGTTT GGGCTTCCTA ATGATTCTTG CCCAACTTTC TTGTACAAAG 1141 Ttggcattat aagaaagcat tgcttatcaa tttgttgcaa cgaac