Construct: ORF TRCN0000473508

Construct Description:

Construct Type:
ORF
Other Identifiers:
ORF014008.1_s317c1
Derived from:
ccsbBroadEn_01579
DNA Barcode:
GCTGTGATTAGGCTTCACAGGATT
Epitope Tag:
V5
Notes:
No stop codon in insert
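
The "No stop codon in insert" note can be checked against any ORF string taken from this record. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1). The names `translate` and `has_no_stop` are illustrative and are not part of any portal tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(orf: str) -> str:
    """Translate an in-frame ORF; case is ignored, '*' marks a stop codon."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))


def has_no_stop(orf: str) -> bool:
    """Return True if no stop codon occurs anywhere in the ORF."""
    return "*" not in translate(orf)
```

For example, `translate("atgtacaacatgatg")` (the first five codons of the ORF listed in this record) gives `"MYNMM"`. Ambiguous bases such as `N` are not handled and would raise a `KeyError`.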

Originally Annotated References:

Gene:
SOX2 (6657)

Vector Information:

Vector Backbone:
pLX_317
Pol II Cassette 1:
SV40-PuroR
Pol II Cassette 2:
EF1a-TRCN0000473508
Selection Marker:
PuroR
Visible Reporter:
n/a
Epitope Tag:
n/a

Current transcripts matched by this ORF:

#  Taxon  Gene   Symbol  Description                      Transcript   Nuc. Match %  Prot. Match %  Match Diffs
1  human  6657   SOX2    SRY-box transcription factor 2   NM_003106.4  100%          100%
2  mouse  20674  Sox2    SRY (sex determining region...   NM_011443.4  94%           97.8%          (many diffs)
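
The match percentages above can be read as identity scores between the ORF and each transcript. The record does not say how they were computed, so the following is only a sketch of one common definition: identical columns divided by total alignment columns, given two sequences that have already been aligned. The function name `percent_identity` and the use of `-` as the gap character are assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns where both sequences carry the same residue.

    Both inputs must be pre-aligned to equal length; '-' denotes a gap.
    Comparison is case-insensitive, and gap columns never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)
```

Other conventions divide by the length of the shorter sequence or exclude gap columns from the denominator, so values from this sketch may differ from those shown in the table.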

Sequence Information

Note: uppercase bases indicate empirically verified sequence.

ORF start:
66
ORF end:
1017
ORF length:
951
Sequence:
   1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
  61 ttggcatgta caacatgatg gagacggagc tgaagccgcc gggcccgcag caaacttcgg
 121 ggggcggcgg cggcaactcc accgcggcgg cggccggcgg caaccagaaa aacagcccgg
 181 accgcgtcaa gcggcccatg aatgccttca tggtgtggtc ccgcgggcag cggcgcaaga
 241 tggcccagga gaaccccaag atgcacaact cggagatcag caagcgcctg ggcgccgagt
 301 ggaaactttt gtcggagacg gagaagcggc cgttcatcga cgaggctaag cggctgcgag
 361 cgctgcacat gaaggagcac ccggattata aataccggcc ccggcggaaa accaagacgc
 421 tcatgaagaa ggataagtac acgctgcccg gcgggctgct ggcccccggc ggCAATAGCA
 481 TGGCGAGCGG GGTCGGGGTG GGCGCCGGCC TGGGCGCGGG CGTGAACCAG CGCATGGACA
 541 GTTACGCGCA CATGAACGGC TGGAGCAACG GCAGCTACAG CATGATGCAG GACCAGCTGG
 601 GCTACCCGCA GCACCCGGGC CTCAATGCGC ACGGCGCAGC GCAGATGCAG CCCATGCACC
 661 GCTACGACGT GAGCGCCCTG CAGTACAACT CCATGACCAG CTCGCAGACC TACATGAACG
 721 GCTCGCCCAC CTACAGCATG TCCTACTCGC AGCAGGGCAC CCCTGGCATG GCTCTTGGCT
 781 CCATGGGTTC GGTGGTCAAG TCCGAGGCCA GCTCCAGCCC CCCTGTGGTT ACCTCTTCCT
 841 CCCACTCCAG GGCGCCCTGC CAGGCCGGGG ACCTCCGGGA CATGATCAGC ATGTATCTCC
 901 CCGGCGCCGA GGTGCCGGAA CCCGCCGCCC CCAGCAGACT TCACATGTCC CAGCACTACC
 961 AGAGCGGCCC GGTGCCCGGC ACGGCCATTA ACGGCACACT GCCCCTCTCA CACATGTGCC
1021 CAACTTTCTT GTACAAAGTG GTTGATATCG GTAAGCCTAT CCCTAACCCT CTCCTCGGTC
1081 TCGATTCTAC GTAGTAATGA ACTAGTCCGT AACTTGAAAG TATTTCGATT TCTTGGCTTT
1141 ATATATCTTG TGGAAAGGAC GAGCTGTGAT TAGGCTTCAC AGGATTACGC GTTAAGTCga
1201 caatcaacct ctggattaca aaatttgtga aagatt
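
The ORF coordinates and DNA barcode in this record can be applied to the listed sequence programmatically. The following is a minimal sketch with illustrative function names. It assumes `start` is 1-based and `end` is the first base after the ORF, which is inferred from the record itself (1017 - 66 = 951, the stated ORF length, and an `atg` at position 66) rather than from any documented convention.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join position-prefixed sequence lines into one string, preserving case."""
    bases = []
    for line in text.splitlines():
        # Remove the leading position index, whether or not a space follows it.
        body = re.sub(r"^\s*\d+", "", line)
        # Remove the spaces that separate 10-base groups.
        bases.append(re.sub(r"\s+", "", body))
    return "".join(bases)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice the ORF, assuming a 1-based start and an exclusive end."""
    return seq[start - 1:end - 1]


def find_barcode(seq: str, barcode: str) -> int:
    """Return the 1-based position of the first match, or 0 if absent."""
    return seq.upper().find(barcode.upper()) + 1
```

With the full sequence block passed to `parse_numbered_sequence`, `extract_orf(seq, 66, 1017)` should return 951 bases, and `find_barcode` can confirm that the barcode `GCTGTGATTAGGCTTCACAGGATT` lies in the uppercase region on the line beginning at position 1141. Case is preserved by the parser so that the verified (uppercase) bases remain distinguishable.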
