Construct: ORF ccsbBroad304_01579
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF014008.1_s304c1, BRDN0000401187
- Derived from:
- ccsbBroadEn_01579
- DNA Barcode:
- None
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SOX2 (6657)
Vector Information:
- Vector Backbone:
- pLX_304
- Pol II Cassette 1:
- PGK-BlastR
- Pol II Cassette 2:
- CMV-ccsbBroad304_01579
- Selection Marker:
- BlastR
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 6657 | SOX2 | SRY-box transcription factor 2 | NM_003106.4 | 100% | 100% | |
2 | mouse | 20674 | Sox2 | SRY (sex determining region... | NM_011443.4 | 94% | 97.8% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1017
- ORF length:
- 951
- Sequence:
-
1 ggtctatata agcagagctc tctggctaac tgtcgggatc aacaagtttg tacaaaaaag 61 ttggcatgta caacatgatg gagacggagc tgaagccgcc gggcccgcag caaacttcgg 121 ggggcggcgg cggcaactcc accgcggcgg cggccggcgg caaccagaaa aacagcccgg 181 accgcgtcaa gcggcccatg aatgccttca tggtgtggtc ccgcgggcag cggcgcaaga 241 tggcccagga gaaccccaag atgcacaact cggagatcag caagcgcctg ggcgccgagt 301 ggaaactttt gtcggagacg gagaagcggc cgttcatcga cgaggctaag cggctgcgag 361 cgctgcacat gaaggagcac ccggattata aataccggcc ccggcggaaa accaagacgc 421 TCATGAAGAA GGATAAGTAC ACGCTGCCCG GCGGGCTGCT GGCCCCCGGC GGCAATAGCA 481 TGGCGAGCGG GGTCGGGGTG GGCGCCGGCC TGGGCGCGGG CGTGAACCAG CGCATGGACA 541 GTTACGCGCA CATGAACGGC TGGAGCAACG GCAGCTACAG CATGATGCAG GACCAGCTGG 601 GCTACCCGCA GCACCCGGGC CTCAATGCGC ACGGCGCAGC GCAGATGCAG CCCATGCACC 661 GCTACGACGT GAGCGCCCTG CAGTACAACT CCATGACCAG CTCGCAGACC TACATGAACG 721 GCTCGCCCAC CTACAGCATG TCCTACTCGC AGCAGGGCAC CCCTGGCATG GCTCTTGGCT 781 CCATGGGTTC GGTGGTCAAG TCCGAGGCCA GCTCCAGCCC CCCTGTGGTT ACCTCTTCCT 841 CCCACTCCAG GGCGCCCTGC CAGGCCGGGG ACCTCCGGGA CATGATCAGC ATGTATCTCC 901 CCGGCGCCGA GGTGCCGGAA CCCGCCGCCC CCAGCAGACT TCACATGTCC CAGCACTACC 961 AGAGCGGCCC GGTGCCCGGC ACGGCCATTA ACGGCACACT GCCCCTCTCA CACATGTGCC 1021 CAACTTTCTT GTACaaagtg gttggtaagc ctatccctaa ccctctcctc ggtctcgatt 1081 ctacgtagta atgagctagc gctaaccggt ggcgcgttaa gtcgacaatc aacctctgga 1141 tta