Construct: ORF ccsbBroadEn_01578
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF011324.1_s300c1, BRDN0000395101
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SOS2 (6655)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 6655 | SOS2 | SOS Ras/Rho guanine nucleot... | NM_006939.4 | 100% | 100% | |
2 | mouse | 20663 | Sos2 | son of sevenless homolog 2 ... | XM_006515635.1 | 88.5% | 94.9% | (many diffs) |
3 | mouse | 20663 | Sos2 | son of sevenless homolog 2 ... | NM_001135559.1 | 88.5% | 94.8% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 4065
- ORF length:
- 3996
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG 61 TTGGCACCAT GCAGCAGGCG CCGCAGCCTT ACGAGTTCTT CAGCGAGGAG AACAGTCCGA 121 AATGGCGGGG ACTGTTGGTC TCGGCCCTGC GGAAGGTTCA GGAACAAGTG CATCCCACTC 181 TCTCAGCTAA TGAAGAGTCT CTCTATTATA TTGAAGAGCT GATTTTTCAG CTGCTTAATA 241 AATTATGCAT GGCCCAGCCA AGGACTGTTC AAGATGTAGA GGAGCGAGTT CAGAAGACCT 301 TTCCTCACCC AATTGATAAA TGGGCCATTG CTGATGCACA ATCTGCTATA GAAAAACGAA 361 AACGAAGAAA TCCTCTTTTA CTGCCTGTGG ACAAAATCCA TCCTTCGTTG AAGGAAGTAT 421 TAGGGTACAA AGTGGACTAC CATGTATCCC TATATATTGT GGCTGTACTA GAGTATATCT 481 CAGCTGATAT TTTAAAATTG GCTGGTAATT ATGTTTTTAA TATCCGGCAT TATGAAATAT 541 CTCAGCAGGA CATTAAAGTG TCAATGTGTG CGGATAAGGT TTTGATGGAC ATGTTTGATC 601 AGGATGACAT AGGTTTGGTT TCTCTCTGTG AAGATGAACC TAGTTCTTCT GGTGAATTAA 661 ACTACTATGA TCTTGTCAGA ACTGAAATCG CAGAAGAAAG ACAGTATCTA CGGGAATTAA 721 ATATGATCAT AAAAGTGTTT CGAGAAGCCT TTCTTTCTGA TAGAAAGCTG TTTAAACCTT 781 CTGATATCGA AAAGATTTTT AGTAACATTT CAGATATACA TGAATTGACT GTGAAACTTT 841 TAGGTTTGAT TGAAGACACA GTTGAAATGA CTGATGAAAG CAGTCCTCAT CCCTTAGCTG 901 GCAGCTGTTT TGAAGATTTG GCAGAAGAGC AAGCATTTGA TCCTTATGAA ACATTATCAC 961 AGGACATTCT TTCACCAGAG TTTCATGAAC ATTTCAATAA ATTGATGGCC AGACCTGCAG 1021 TTGCTCTACA CTTTCAGTCC ATTGCTGATG GTTTTAAAGA GGCAGTTCGT TATGTCCTTC 1081 CACGTCTTAT GCTGGTGCCA GTGTATCACT GTTGGCACTA CTTTGAGTTA CTAAAGCAAT 1141 TGAAAGCATG TAGTGAAGAA CAAGAAGACA GAGAATGTTT GAACCAAGCT ATTACTGCTC 1201 TCATGAATCT CCAAGGTAGC ATGGACCGAA TTTACAAGCA GTATTCACCT AGACGTCGAC 1261 CTGGAGATCC TGTTTGCCCT TTTTATAGTC ACCAATTAAG AAGCAAACAC CTGGCTATCA 1321 AAAAAATGAA TGAAATTCAG AAAAATATCG ATGGATGGGA AGGCAAAGAT ATTGGACAGT 1381 GTTGTAATGA ATTCATTATG GAGGGACCAT TGACAAGAAT CGGTGCCAAA CATGAACGGC 1441 ATATTTTTCT GTTTGATGGC TTAATGATCA GTTGTAAACC TAATCATGGC CAGACTCGGC 1501 TTCCAGGTTA CAGTAGTGCA GAATACAGGT TAAAAGAAAA ATTTGTCATG AGGAAAATAC 1561 AAATTTGTGA TAAAGAAGAT ACTTGTGAGC ACAAGCATGC ATTTGAATTA GTATCCAAAG 1621 ATGAGAACAG CATAATATTT GCTGCTAAGT CTGCTGAAGA AAAAAACAAC TGGATGGCAG 1681 CCCTTATTTC TCTTCATTAT CGTAGTACTC TAGATCGAAT GTTAGATTCA GTATTATTGA 1741 AAGAAGAAAA TGAGCAACCA CTGAGATTAC CAAGTCCTGA AGTATATCGT TTTGTAGTAA 1801 AAGACTCTGA GGAAAACATT GTTTTTGAAG ACAACTTGCA AAGTAGAAGT GGCATCCCCA 1861 TTATTAAAGG AGGAACTGTA GTGAAATTAA TTGAAAGGTT AACATATCAT ATGTATGCAG 1921 ATCCCAATTT TGTTCGTACT TTTCTTACCA CATATCGTTC ATTTTGTAAA CCACAGGAAT 1981 TGCTGAGCTT ACTGATTGAA CGGTTTGAAA TTCCAGAGCC AGAACCTACT GACGCAGACA 2041 AATTGGCAAT AGAGAAAGGC GAGCAGCCAA TCAGTGCAGA CCTTAAAAGA TTTCGCAAGG 2101 AATATGTCCA ACCAGTACAA CTTAGGATCT TAAATGTATT TCGGCATTGG GTTGAACATC 2161 ATTTTTATGA CTTTGAAAGA GACTTGGAAT TGCTTGAAAG ACTAGAATCC TTCATTTCAA 2221 GTGTAAGAGG GAAAGCTATG AAAAAATGGG TAGAGTCAAT TGCTAAGATC ATCAGGAGGA 2281 AGAAGCAAGC TCAGGCAAAC GGAGTAAGCC ATAATATTAC CTTTGAAAGT CCACCTCCAC 2341 CAATTGAATG GCATATCAGC AAACCAGGAC AGTTTGAAAC ATTTGATCTC ATGACACTTC 2401 ATCCAATAGA AATTGCACGT CAGCTGACAC TTTTGGAGTC TGATCTTTAC AGGAAAGTTC 2461 AACCGTCTGA ACTTGTAGGG AGTGTGTGGA CCAAAGAAGA TAAAGAAATA AATTCTCCAA 2521 ATTTATTAAA AATGATTCGC CATACCACAA ATCTCACCCT CTGGTTTGAA AAATGCATTG 2581 TGGAAGCAGA AAATTTTGAA GAACGGGTGG CAGTACTAAG TAGAATTATA GAAATTCTGC 2641 AAGTTTTTCA AGATTTGAAT AATTTCAATG GCGTATTGGA GATAGTCAGT GCAGTAAATT 2701 CAGTGTCAGT ATACAGACTA GACCATACCT TTGAGGCACT GCAGGAAAGG AAAAGGAAAA 2761 TTTTGGACGA AGCTGTGGAA TTAAGTCAAG ATCACTTTAA AAAATACCTA GTAAAACTTA 2821 AGTCAATCAA TCCACCTTGT GTGCCTTTTT TTGGAATATA TTTAACAAAT ATTCTGAAGA 2881 CCGAAGAAGG GAATAATGAT TTTTTAAAAA AGAAAGGGAA AGATTTAATC AATTTCAGTA 2941 AGAGGAGGAA AGTAGCTGAA ATTACTGGAG AAATTCAGCA GTATCAGAAT CAGCCTTACT 3001 GTTTACGGAT AGAACCAGAT ATGAGGAGAT TCTTTGAAAA CCTTAACCCC ATGGGAAGTG 3061 CATCTGAAAA AGAGTTTACA GATTATTTGT TCAACAAGTC ACTAGAAATT GAACCTCGAA 3121 ACTGCAAACA GCCACCTCGA TTTCCTAGGA AATCAACTTT TTCCTTAAAA TCTCCTGGAA 3181 TAAGGCCTAA CACAGGCCGA CATGGCTCTA CCTCAGGTAC TTTACGAGGT CACCCAACAC 3241 CATTAGAAAG AGAACCATGT AAAATAAGCT TTAGTCGGAT TGCTGAAACT GAGCTGGAAT 3301 CAACAGTGTC AGCACCAACC TCTCCAAATA CACCATCTAC TCCACCAGTA TCTGCTTCTT 3361 CAGACCTTAG TGTATTTTTA GATGTGGATC TCAACAGCTC CTGTGGCAGC AATAGCATCT 3421 TTGCTCCAGT GCTTTTGCCA CATTCAAAGT CTTTCTTTAG TTCATGTGGT AGTTTACATA 3481 AACTAAGTGA AGAGCCCCTG ATTCCTCCTC CTCTTCCTCC TCGAAAAAAG TTTGATCATG 3541 ATGCTTCAAA TTCCAAGGGA AATATGAAAT CTGATGATGA TCCTCCTGCT ATTCCACCGA 3601 GACAGCCTCC TCCTCCAAAG GTAAAACCCA GAGTTCCTGT TCCTACTGGT GCATTTGATG 3661 GGCCTCTGCA TAGTCCACCT CCGCCACCAC CAAGAGATCC TCTTCCTGAT ACCCCTCCAC 3721 CAGTTCCCCT TCGGCCTCCA GAACACTTTA TAAACTGTCC ATTTAATCTT CAGCCACCTC 3781 CACTGGGGCA TCTTCACAGA GATTCAGACT GGCTCAGAGA CATTAGTACG TGTCCAAATT 3841 CGCCAAGCAC TCCTCCTAGC ACACCCTCTC CAAGGGTACC GCGTCGATGC TATGTGCTCA 3901 GTTCTAGTCA GAATAATCTT GCTCATCCTC CAGCTCCCCC TGTTCCACCA AGGCAGAATT 3961 CAAGCCCTCA TCTGCCAAAA CTGCCACCAA AGACTTACAA ACGGGAGCTT TCGCACCCCC 4021 CATTGTACAG ACTGCCTTTG CTAGAAAATG CAGAAACTCC CCAATTGCCA ACTTTCTTGT 4081 ACAAAGTtgg cattataaga aagcattgct tatcaatttg ttgcaacgaa c