Construct: ORF TRCN0000470869
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF008814.1_s317c1
- Derived from:
- ccsbBroadEn_07525
- DNA Barcode:
- TTTATGCTAAAACAACGGGCGGCC
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- THOC1 (9984)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000470869
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 9984 | THOC1 | THO complex 1 | NM_005131.3 | 99.9% | 100% | 657C>T |
2 | human | 9984 | THOC1 | THO complex 1 | XM_011525772.3 | 98.8% | 98.9% | 657C>T;1207_1227del |
3 | human | 9984 | THOC1 | THO complex 1 | XM_011525773.1 | 78.1% | 78.1% | 0_1ins414;243C>T;793_813del |
4 | human | 9984 | THOC1 | THO complex 1 | XM_024451292.1 | 53.2% | 53.2% | 0_1ins921 |
5 | human | 9984 | THOC1 | THO complex 1 | XM_011525774.2 | 52.7% | 52.7% | 0_1ins921;286_306del |
6 | human | 9984 | THOC1 | THO complex 1 | XM_017026104.2 | 46.6% | 46.5% | 657C>T;918_919insCTGATGGATTTA;921_922ins1038 |
7 | mouse | 225160 | Thoc1 | THO complex 1 | NM_153552.3 | 90.7% | 96.1% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2037
- ORF length:
- 1971
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatgtc tccgacgccg ccgctcttca gtttgcccga agcgcggacg cggtttacga 121 agtctaccag agaggccttg aacaacaaaa acatcaagcc attgttaagt accttcagcc 181 aggtacctgg cagtgaaaat gaaaaaaaat gtacccttga ccaagctttc agaggtattc 241 tagaagaaga aattataaat cattcatcat gtgaaaacgt tttagctatt atttctcttg 301 ctattggggg agtaactgaa ggtatttgta ccgcatctac accttttgta ttgttgggag 361 atgttttgga ttgtcttcct ttggatcagt gtgacacaat attcactttt gtggaaaaaa 421 atgttgctac ttggaaatca aatacattct attctgctgg gaaaaattac ttactacgta 481 tgtgcaatga tctcctaaga agattgtcta aatcccagaa tacagtcttc tgtggacgga 541 ttcagctctt tttggccagg cttttccctc tgtctgagaa atcaggtctt aacttgcaga 601 gtcagtttaa tctggaaaat gtcactgttt tcaatacaaa tgagcaggaa agcaccctgg 661 gtcagaagca cactgaagat agagaagaag gaatggatgt agaagaaggc gaaatgggag 721 atgaggaagc tccaacaacg tgctctattc caattgatta caacctgtat cgaaaattct 781 ggtcacttca ggattacttc aggaaccctg tgcaatgcta tgagaagatt tcatggaaaa 841 cttttctcaa gtattctgaa gaagttttag ctgtttttaa gagttataaa ttagatgata 901 ctcaggcctc aagaaaaaag atggaagaat tgaaaacagg aggagaacat gtatattttg 961 caaaattttt aacaagtgaa aagctgatgg atttacaact gagtgacagt aactttcgtc 1021 gacacatcct gttgcagtat ctcattttat tccaatatct caaggggcag gtcaaattca 1081 aaagttcaaa ctatgtttta actgatgagc aatcactttg gattgaagat actacaaaat 1141 cagtttatca actactatct gaaaaccccc ccgatggaga aagattttca aagatggtag 1201 agcatatatt aaacactgaa gaaaactgga actcgtggaa aaatgaaggt tgcccaagtt 1261 ttgtgaaaga aagaacatca gataccaaac ctacgagaat aattcggaag agaacagcac 1321 ccgaggactt cctagggaaa ggacccacca aaaaaattct gatgggaaat gaggagttaa 1381 caaggctttg gaatctttgc cctgataata tggaagcctg taaatcagag acaagggaac 1441 acatgcccac tttggaggaa ttctttgaag aagccattga acaggcagac cctgaaaata 1501 tggtggaaaa tgaatataag gctgtgaaca attcaaatta tggttggaga gccctgagac 1561 tattagcacg gagaagccct cacttcttcc agccaaccaa ccagcagttt aaaagtttac 1621 cagaatatct tgaaaatatg gtaataaagc tagccaagga attaccgcct ccttctgaag 1681 aaataaaaac aggtgaggat gaagatgagg aagataatga tgctctactg aaggaaaatg 1741 aaagtcctga tgttcggcga gacaaacctg taacaggaga acaaatagag gtatttgcca 1801 acaagctggg tgaacaatgg aagattctgg ctcccTACTT GGAAATGAAA GACTCAGAAA 1861 TTAGGCAGAT TGAGTGTGAC AGTGAAGACA TGAAGATGAG AGCTAAGCAG CTCCTGGTTG 1921 CCTGGCAAGA TCAAGAGGGA GTTCATGCAA CACCTGAGAA TCTGATTAAT GCACTGAATA 1981 AGTCTGGATT AAGTGACCTT GCAGAAAGTC TAACTAATGA CAATGAGACA AATAGTTACC 2041 CAACTTTCTT GTACAAAGTG GTTGATATCG GTAAGCCTAT CCCTAACCCT CTCCTCGGTC 2101 TCGATTCTAC GTAGTAATGA ACTAGTCCGT AACTTGAAAG TATTTCGATT TCTTGGCTTT 2161 ATATATCTTG TGGAAAGGAC GATTTATGCT AAAACAACGG GCGGCCACGC GTTAAGTCga 2221 caatcaacct ctggattaca aaatttgtga aagatt