Construct: ORF TRCN0000470869

Construct Description:

Construct Type:
ORF
Other Identifiers:
ORF008814.1_s317c1
Derived from:
ccsbBroadEn_07525
DNA Barcode:
TTTATGCTAAAACAACGGGCGGCC
Epitope Tag:
V5
Notes:
No stop codon in insert

Originally Annotated References:

Gene:
THOC1 (9984)

Vector Information:

Vector Backbone:
pLX_317
Pol II Cassette 1:
SV40-PuroR
Pol II Cassette 2:
EF1a-TRCN0000470869
Selection Marker:
PuroR
Visible Reporter:
n/a
Epitope Tag:
V5

Current transcripts matched by this ORF:

Taxon Gene Symbol Description Transcript Nuc. Match %[?] Prot. Match %[?] Match Diffs[?]
1 human 9984 THOC1 THO complex 1 NM_005131.3 99.9% 100% 657C>T
2 human 9984 THOC1 THO complex 1 XM_011525772.3 98.8% 98.9% 657C>T;1207_1227del
3 human 9984 THOC1 THO complex 1 XM_011525773.1 78.1% 78.1% 0_1ins414;243C>T;793_813del
4 human 9984 THOC1 THO complex 1 XM_024451292.1 53.2% 53.2% 0_1ins921
5 human 9984 THOC1 THO complex 1 XM_011525774.2 52.7% 52.7% 0_1ins921;286_306del
6 human 9984 THOC1 THO complex 1 XM_017026104.2 46.6% 46.5% 657C>T;918_919insCTGATGGATTTA;921_922ins1038
7 mouse 225160 Thoc1 THO complex 1 NM_153552.3 90.7% 96.1% (many diffs)
Download CSV

Sequence Information

Note: uppercase bases indicate empirically verified sequence.

ORF start:
66
ORF end:
2037
ORF length:
1971
Sequence:
1tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61ttggcatgtc tccgacgccg ccgctcttca gtttgcccga agcgcggacg cggtttacga
121agtctaccag agaggccttg aacaacaaaa acatcaagcc attgttaagt accttcagcc
181aggtacctgg cagtgaaaat gaaaaaaaat gtacccttga ccaagctttc agaggtattc
241tagaagaaga aattataaat cattcatcat gtgaaaacgt tttagctatt atttctcttg
301ctattggggg agtaactgaa ggtatttgta ccgcatctac accttttgta ttgttgggag
361atgttttgga ttgtcttcct ttggatcagt gtgacacaat attcactttt gtggaaaaaa
421atgttgctac ttggaaatca aatacattct attctgctgg gaaaaattac ttactacgta
481tgtgcaatga tctcctaaga agattgtcta aatcccagaa tacagtcttc tgtggacgga
541ttcagctctt tttggccagg cttttccctc tgtctgagaa atcaggtctt aacttgcaga
601gtcagtttaa tctggaaaat gtcactgttt tcaatacaaa tgagcaggaa agcaccctgg
661gtcagaagca cactgaagat agagaagaag gaatggatgt agaagaaggc gaaatgggag
721atgaggaagc tccaacaacg tgctctattc caattgatta caacctgtat cgaaaattct
781ggtcacttca ggattacttc aggaaccctg tgcaatgcta tgagaagatt tcatggaaaa
841cttttctcaa gtattctgaa gaagttttag ctgtttttaa gagttataaa ttagatgata
901ctcaggcctc aagaaaaaag atggaagaat tgaaaacagg aggagaacat gtatattttg
961caaaattttt aacaagtgaa aagctgatgg atttacaact gagtgacagt aactttcgtc
1021gacacatcct gttgcagtat ctcattttat tccaatatct caaggggcag gtcaaattca
1081aaagttcaaa ctatgtttta actgatgagc aatcactttg gattgaagat actacaaaat
1141cagtttatca actactatct gaaaaccccc ccgatggaga aagattttca aagatggtag
1201agcatatatt aaacactgaa gaaaactgga actcgtggaa aaatgaaggt tgcccaagtt
1261ttgtgaaaga aagaacatca gataccaaac ctacgagaat aattcggaag agaacagcac
1321ccgaggactt cctagggaaa ggacccacca aaaaaattct gatgggaaat gaggagttaa
1381caaggctttg gaatctttgc cctgataata tggaagcctg taaatcagag acaagggaac
1441acatgcccac tttggaggaa ttctttgaag aagccattga acaggcagac cctgaaaata
1501tggtggaaaa tgaatataag gctgtgaaca attcaaatta tggttggaga gccctgagac
1561tattagcacg gagaagccct cacttcttcc agccaaccaa ccagcagttt aaaagtttac
1621cagaatatct tgaaaatatg gtaataaagc tagccaagga attaccgcct ccttctgaag
1681aaataaaaac aggtgaggat gaagatgagg aagataatga tgctctactg aaggaaaatg
1741aaagtcctga tgttcggcga gacaaacctg taacaggaga acaaatagag gtatttgcca
1801acaagctggg tgaacaatgg aagattctgg ctcccTACTT GGAAATGAAA GACTCAGAAA
1861TTAGGCAGAT TGAGTGTGAC AGTGAAGACA TGAAGATGAG AGCTAAGCAG CTCCTGGTTG
1921CCTGGCAAGA TCAAGAGGGA GTTCATGCAA CACCTGAGAA TCTGATTAAT GCACTGAATA
1981AGTCTGGATT AAGTGACCTT GCAGAAAGTC TAACTAATGA CAATGAGACA AATAGTTACC
2041CAACTTTCTT GTACAAAGTG GTTGATATCG GTAAGCCTAT CCCTAACCCT CTCCTCGGTC
2101TCGATTCTAC GTAGTAATGA ACTAGTCCGT AACTTGAAAG TATTTCGATT TCTTGGCTTT
2161ATATATCTTG TGGAAAGGAC GATTTATGCT AAAACAACGG GCGGCCACGC GTTAAGTCga
2221caatcaacct ctggattaca aaatttgtga aagatt

Download FASTA (ORF) (Full)