Construct: ORF ccsbBroadEn_10116
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF007098.1_s300c1, BRDN0000396025
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SHC4 (399694)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, refer to the HGVS Quick Reference Guide.
# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 399694 | SHC4 | SHC adaptor protein 4 | NM_203349.4 | 99.8% | 99.5% | 154A>G;730A>G;1340A>G |
2 | human | 399694 | SHC4 | SHC adaptor protein 4 | XM_005254375.3 | 69.1% | 68% | (many diffs) |
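The match percentages in this table are defined as matches divided by aligned length (gaps included), and the Match Diffs column lists substitutions such as `154A>G`. The following is a minimal Python sketch of both ideas. The function names (`percent_identity`, `parse_substitutions`, `codon_number`) are hypothetical and not part of any portal API, the parser handles only simple single-base substitutions, and positions are assumed to be 1-based and relative to the ORF.

```python
import re

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Matches / aligned length (gaps included), as a percentage.

    Both inputs must be rows of the same global alignment, with '-' for gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)

# Assumption: only items of the form <position><ref>><alt> are handled.
SUBSTITUTION = re.compile(r"^(\d+)([ACGT])>([ACGT])$")

def parse_substitutions(diffs: str):
    """Split a ';'-separated diff string into (position, ref, alt) tuples."""
    parsed = []
    for item in diffs.split(";"):
        m = SUBSTITUTION.match(item.strip())
        if m is None:
            raise ValueError(f"not a simple substitution: {item!r}")
        parsed.append((int(m.group(1)), m.group(2), m.group(3)))
    return parsed

def codon_number(nt_pos: int) -> int:
    """1-based codon index containing a 1-based nucleotide position."""
    return (nt_pos - 1) // 3 + 1

subs = parse_substitutions("154A>G;730A>G;1340A>G")
print(subs)
print([codon_number(pos) for pos, _, _ in subs])
```

As a rough consistency check, and assuming the aligned length equals the 1890-nt ORF length with the three listed substitutions as the only mismatches, 1887 / 1890 rounds to the 99.8% shown for NM_203349.4. The actual alignment used by the source may differ.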
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1956
- ORF length:
- 1890
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGCG AGAACGCGGC CAGGACAGCC TGGCAGGACT CGTGCTGTAT GTAGGACTCT
121 TCGGGCACCC CGGGATGCTG CACAGGGCCA AGTACAGCCG CTTTCGGAAC GAGTCGATCA
181 CGTCCTTGGA CGAAGGTAGC TCCGGAGGCT CGGTCGGGGA CAAGGGCTCG CCGCAGCCTC
241 CCCACCCCGC CCTGGCACCT CACCTGCCGA CTGAAGATGC CACCTTGCCG TCGCAGGAGA
301 GCCCCACCCC ACTGTGCACC TTGATCCCCC GCATGGCAAG CATGAAGCTG GCCAACCCGG
361 CCACTTTGCT GAGTCTGAAA AACTTTTGCC TGGGTACCAA AGAGGTGCCT CGGCTGAAGC
421 TCCAGGAAAG CCGGGACCCA GGTTCCAGCG GCCCCTCTTC CCCAGAAACC AGTTTAAGTA
481 GGTCCGGGAC TGCACCTCCA CCGCAGCAGG ACCTGGTGGG ACACAGGGCA ACCGCCCTAA
541 CCCCTGATTC GTGCCCGCTT CCTGGCCCTG GGGAGCCAAC ACTTAGGAGC AGGCAGGACA
601 GGCACTTTCT ACAGCACCTG TTGGGGATGG GCATGAACTA CTGTGTGAGG TACATGGGCT
661 GTGTTGAAGT GCTGCAATCA ATGAGATCAC TGGATTTTGG AATGAGAACC CAAGTTACAA
721 GGGAAGCAAT AAGTCGCCTG TGTGAAGCTG TCCCCGGGGC AAATGGAGCC ATTAAAAAGC
781 GAAAGCCTCC AGTTGAGTTC CTATCAACAG TCCTTGGCAA AAGTAATCTT CAGTTTTCAG
841 GAATGAATAT AAAACTGACC ATCTCAACAT GCAGTCTCAC ATTGATGAAT CTTGACAACC
901 AACAGATTAT TGCAAATCAT CATATGCAGT CTATTTCATT TGCCTCTGGA GGGGATCCTG
961 ATACTACAGA CTATGTTGCC TACGTAGCTA AAGATCCAGT TAATCAACGA GCCTGTCACA
1021 TATTGGAATG CCACAATGGA ATGGCCCAAG ACGTCATAAG TACCATAGGG CAGGCTTTTG
1081 AACTCCGGTT TAAACAGTAC TTGAAAAATC CTTCTTTGAA TACTTCTTGT GAAAGTGAGG
1141 AGGTGCATAT TGATAGCCAT GCCGAGGAGA GAGAAGATCA TGAATATTAC AATGAAATTC
1201 CAGGGAAGCA GCCACCAGTA GGTGGTGTTT CAGATATGCG GATCAAAGTT CAAGCCACGG
1261 AACAAATGGC TTACTGCCCC ATACAGTGTG AAAAGTTGTG CTATTTGCCT GGAAACTCCA
1321 AGTGCAGCAG TGTATATGAG AACTGTTTAG AACAAAGCAG GGCAATAGGT AATGTCCATC
1381 CAAGAGGGGT GCAGTCCCAG CGAGGTACCT CATTATTGAA GCACACGTGC CGAGTGGATC
1441 TCTTTGATGA CCCCTGCTAC ATTAATACAC AGGCTCTTCA AAGTACACCT GGCTCTGCTG
1501 GAAATCAAAG GTCAGCCCAA CCACTGGGGA GCCCATGGCA CTGCGGAAAG GCACCAGAAA
1561 CTGTTCAGCC GGGTGCCACA GCCCAGCCTG CCAGCTCACA TTCTTTGCCA CACATTAAGC
1621 AGCAGCTGTG GAGCGAAGAA TGCTATCATG GCAAGCTGAG CAGGAAGGCG GCAGAGAGCC
1681 TCTTGGTAAA GGATGGGGAC TTTTTGGTTC GAGAGAGTGC AACATCCCCT GGCCAATATG
1741 TGCTGAGTGG ACTACAGGGA GGCCAAGCAA AACATCTTCT CCTGGTGGAT CCTGAAGGCA
1801 AGGTGAGGAC CAAGGATCAT GTATTTGATA ATGTCGGCCA CCTTATCAGA TACCATATGG
1861 ATAACAGTTT GCCAATCATC TCCTCTGGAA GCGAAGTAAG CCTTAAACAA CCAGTGAGAA
1921 AAGATAATAA TCCAGCACTT TTGCATTCCA ACAAATGCCC AACTTTCTTG TACAAAGTtg
1981 gcattataag aaagcattgc ttatcaattt gttgcaacga ac
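The ORF start and ORF length fields above can be used to cut the insert out of the full sequence, which is printed with interleaved position numbers. The following is a minimal Python sketch, shown on a toy sequence rather than the record itself. The helper names (`clean_sequence`, `extract_orf`, `check_orf`) are hypothetical, and the coordinate convention is an assumption: ORF start is treated as 1-based and inclusive. Under that reading, start 66 plus length 1890 places the last ORF base at 1955, so the listed ORF end of 1956 would be an exclusive bound.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def clean_sequence(raw: str) -> str:
    """Drop position numbers, whitespace, and list markers; keep letters only."""
    return "".join(ch for ch in raw if ch.isalpha())

def extract_orf(seq: str, start: int, length: int) -> str:
    """Return `length` bases beginning at 1-based, inclusive position `start`."""
    if start < 1 or start - 1 + length > len(seq):
        raise ValueError("ORF coordinates fall outside the sequence")
    return seq[start - 1 : start - 1 + length]

def check_orf(orf: str, expect_stop: bool = False) -> list:
    """Return a list of problems found; an empty list means the checks passed."""
    orf = orf.upper()
    problems = []
    if len(orf) % 3 != 0:
        problems.append("length is not a multiple of 3")
    if not orf.startswith("ATG"):
        problems.append("does not start with ATG")
    usable = len(orf) - len(orf) % 3
    codons = [orf[i : i + 3] for i in range(0, usable, 3)]
    # With no stop codon expected in the insert, every codon is "body".
    body = codons[:-1] if expect_stop else codons
    if any(codon in STOP_CODONS for codon in body):
        problems.append("in-frame stop codon inside the ORF")
    if expect_stop and (not codons or codons[-1] not in STOP_CODONS):
        problems.append("missing terminal stop codon")
    return problems

# Toy input in the same "number + 10-base groups" layout (not the real record).
raw = "1 gttcaATGGC TAAAGGTtgg"
seq = clean_sequence(raw)
orf = extract_orf(seq, start=6, length=12)
print(orf, check_orf(orf))
```

For this record, the same calls would take `start=66` and `length=1890`, with `expect_stop=False` because the notes state that the insert has no stop codon.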