Construct: ORF ccsbBroadEn_03273

Construct Description:

Construct Type:
ORF
Other Identifiers:
ORF006544.1_s300c1, BRDN0000386149
DNA Barcode:
None
Epitope Tag:
None
Notes:
No stop codon in insert

Originally Annotated References:

Gene:
THEG (51298)

Vector Information:

Vector Backbone:
pDONR223
Pol II Cassette 1:
n/a
Pol II Cassette 2:
n/a
Selection Marker:
n/a
Visible Reporter:
n/a
Epitope Tag:
n/a

Current transcripts matched by this ORF:

#  Taxon  Gene   Symbol  Description             Transcript      Nuc. Match %  Prot. Match %  Match Diffs
1  human  51298  THEG    theg spermatid protein  NM_016585.5     100%          100%
2  human  51298  THEG    theg spermatid protein  NM_199202.3     93.6%         93.6%          430_431ins72
3  human  51298  THEG    theg spermatid protein  XM_011528049.2  84.7%         84.7%          1_108del;1022_1117del
4  human  51298  THEG    theg spermatid protein  XM_011528050.2  79.4%         79.4%          1_108del;538_539ins72;950_1045del
5  human  51298  THEG    theg spermatid protein  XM_024451532.1  74.9%         74.9%          1_108del;535_536ins204
6  human  51298  THEG    theg spermatid protein  XM_011528051.2  69.5%         69.5%          1_108del;535_536ins204;818_913del
7  human  51298  THEG    theg spermatid protein  XM_024451533.1  60.7%         37.6%          1_108del;537_538ins322;864_865ins59
8  human  51298  THEG    theg spermatid protein  XM_011528052.2  51.2%         50.8%          (many diffs)
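
The Match Diffs entries above follow a compact, HGVS-like pattern (for example 1_108del or 430_431ins72). The sketch below is one possible way to split such a string into structured events. The grammar it accepts, the names parse_match_diffs and Diff, and the reading of a deletion as spanning both endpoints inclusively are assumptions made for illustration; the record itself does not define the notation or say which sequence the coordinates refer to.

```python
import re
from dataclasses import dataclass

# Assumed grammar: "<start>_<end>del" or "<start>_<end>ins<count>", joined by ";".
_DIFF_RE = re.compile(r"^(\d+)_(\d+)(del|ins)(\d*)$")


@dataclass(frozen=True)
class Diff:
    start: int   # first coordinate as written in the record
    end: int     # second coordinate as written in the record
    kind: str    # "del" or "ins"
    length: int  # number of bases deleted or inserted


def parse_match_diffs(text: str) -> list:
    """Parse a Match Diffs string into a list of Diff events."""
    diffs = []
    for token in (t.strip() for t in text.split(";")):
        if not token:
            continue  # tolerate empty cells and trailing semicolons
        m = _DIFF_RE.match(token)
        if m is None:
            raise ValueError(f"unrecognized diff token: {token!r}")
        start, end, kind, count = int(m[1]), int(m[2]), m[3], m[4]
        if kind == "del":
            if count:
                raise ValueError(f"unexpected count on deletion: {token!r}")
            # Assumption: the deleted range includes both endpoints.
            length = end - start + 1
        else:
            if not count:
                raise ValueError(f"insertion without a length: {token!r}")
            length = int(count)
        diffs.append(Diff(start, end, kind, length))
    return diffs


events = parse_match_diffs("1_108del;538_539ins72;950_1045del")
print([(d.kind, d.length) for d in events])
# [('del', 108), ('ins', 72), ('del', 96)]
```

Free-text cells such as "(many diffs)" do not fit this pattern and raise ValueError in this sketch, so a caller would need to handle them separately.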

Sequence Information

Note: uppercase bases indicate empirically verified sequence.

ORF start:
66
ORF end:
1203
ORF length:
1137
Sequence:
   1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
  61 TTGGCATGGG GGACAGCAGG CGAAGGTCAC TCGGGAACCA GCCCAGCTCT GAGGCTGCGG
 121 GCAGGTCGGA AAGGGAGCAG GACGGCGACC CCCGTGGCCT CCAGAGCTCT GTGTACGAGA
 181 GCCGGCGGGT CACAGACCCC GAACGCCAGG ACCTGGACAA TGCAGAGCTG GGACCAGAAG
 241 ACCCAGAAGA GGAGCTTCCC CCCGAGGAGG TGGCCGGGGA GGAGTTCCCG GAGACCCTGG
 301 ATCCCAAAGA GGCACTTTCT GAGTTGGAGA GAGTCCTGGA CAAGGACTTG GAAGAGGACA
 361 TTCCTGAAAT CAGCCGGCTG TCCATCAGCC AGAAGCTCCC CAGCACCACC ATGACCAAAG
 421 CAAGGAAGAG GAGGAGGCGG AGGAGGCTCA TGGAGCTGGC AGAGCCCAAG ATAAACTGGC
 481 AAGTCCTGAA AGACAGGAAG GGACGCTGTG GTAAGGGGTA TGCCTGGATC TCCCCATGTA
 541 AGATGAGCTT GCACTTCTGT CTCTGCTGGC CCTCTGTGTA CTGGACCGAG CGGTTCCTTG
 601 AGGACACCAC CCTCACCATC ACAGTGCCCG CGGTGTCCCG CCGCGTGGAG GAACTGTCTC
 661 GGCCCAAGAG ATTCTACCTG GAATATTACA ACAACAACAG GACGACTCCT GTCTGGCCCA
 721 TTCCTCGGTC CTCCCTGGAA TACAGAGCGT CGAGTCGCCT GAAGGAACTG GCCGCCCCGA
 781 AGATTCGTGA TAACTTCTGG AGCATGCCCA TGTCTGAGGT GTCCCAGGTA TCCAGGGCAG
 841 CCCAAATGGC AGTCCCCAGC TCGCGGATCC TCCAGTTGTC AAAGCCGAAG GCCCCAGCCA
 901 CCCTCTTGGA AGAGTGGGAC CCCGTGCCAA AACCCAAGCC ACATGTGTCA GACCATAACC
 961 GCCTCCTTCA CTTGGCCAGG CCCAAAGCTC AGTCGGACAA GTGCGTTCCT GACCGAGATC
1021 CTCGCTGGGA GGTGCTGGAT GTCACCAAGA AGGTGGTGGC CAGCCCCCGG ATCATCTCCC
1081 TGGCCAAGCC CAAAGTGCGC AAGGGCCTCA ACGAGGGATA CGACAGGCGT CCCCTCGCCT
1141 CTATGAGCTT GCCACCCCCA AAAGCATCAC CAGAAAAGTG TGATCAACCC AGGCCTGGCC
1201 TCTACCCAAC TTTCTTGTAC AAAGTtggca ttataagaaa gcattgctta tcaatttgtt
1261 gcaacgaac
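
The ORF coordinates and the uppercase/lowercase convention above can be checked programmatically. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that ORF start is a 1-based position of the first ORF base and that ORF length counts bases from that position. On that reading the listed ORF end (1203) would be one past the last ORF base, since 1203 - 66 = 1137; this interpretation is an assumption, not something the record states.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Join a position-numbered sequence block into one string, keeping case."""
    parts = []
    for line in block.splitlines():
        # Drop the leading position number, whether or not a space follows it.
        body = re.sub(r"^\s*\d+", "", line)
        # Remove the spaces between 10-base groups.
        parts.append("".join(body.split()))
    return "".join(parts)


def extract_orf(seq: str, start: int, length: int) -> str:
    """Return `length` bases beginning at 1-based position `start`."""
    if start < 1 or start - 1 + length > len(seq):
        raise ValueError("ORF coordinates fall outside the sequence")
    return seq[start - 1 : start - 1 + length]


def uppercase_span(seq: str):
    """Return 1-based (first, last) positions of uppercase bases, or None."""
    positions = [i for i, base in enumerate(seq, start=1) if base.isupper()]
    return (positions[0], positions[-1]) if positions else None


# First two lines of the sequence block above, used as a short excerpt.
excerpt = """1gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61TTGGCATGGG GGACAGCAGG CGAAGGTCAC TCGGGAACCA GCCCAGCTCT GAGGCTGCGG"""

seq = parse_numbered_sequence(excerpt)
print(len(seq))                  # 120
print(extract_orf(seq, 66, 3))   # ATG
print(uppercase_span(seq))       # (46, 120)
```

With the full sequence block as input, extract_orf(seq, 66, 1137) would return the complete insert. Because the record notes that the insert has no stop codon, a check on the result should test for a leading ATG and a length divisible by three (1137 = 3 × 379) rather than for a terminal stop.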
