Construct: ORF ccsbBroadEn_00487
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF013002.1_s300c1, BRDN0000397910
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- EGR1 (1958)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 1958 | EGR1 | early growth response 1 | NM_001964.3 | 100% | 100% | |
2 | mouse | 13653 | Egr1 | early growth response 1 | NM_007913.5 | 85.9% | 91.3% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1695
- ORF length:
- 1629
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGC CGCGGCCAAG GCCGAGATGC AGCTGATGTC CCCGCTGCAG ATCTCTGACC 121 CGTTCGGATC CTTTCCTCAC TCGCCCACCA TGGACAACTA CCCTAAGCTG GAGGAGATGA 181 TGCTGCTGAG CAACGGGGCT CCCCAGTTCC TCGGCGCCGC CGGGGCCCCA GAGGGCAGCG 241 GCAGCAACAG CAGCAGCAGC AGCAGCGGGG GCGGTGGAGG CGGCGGGGGC GGCAGCAACA 301 GCAGCAGCAG CAGCAGCACC TTCAACCCTC AGGCGGACAC GGGCGAGCAG CCCTACGAGC 361 ACCTGACCGC AGAGTCTTTT CCTGACATCT CTCTGAACAA CGAGAAGGTG CTGGTGGAGA 421 CCAGTTACCC CAGCCAAACC ACTCGACTGC CCCCCATCAC CTATACTGGC CGCTTTTCCC 481 TGGAGCCTGC ACCCAACAGT GGCAACACCT TGTGGCCCGA GCCCCTCTTC AGCTTGGTCA 541 GTGGCCTAGT GAGCATGACC AACCCACCGG CCTCCTCGTC CTCAGCACCA TCTCCAGCGG 601 CCTCCTCCGC CTCCGCCTCC CAGAGCCCAC CCCTGAGCTG CGCAGTGCCA TCCAACGACA 661 GCAGTCCCAT TTACTCAGCG GCACCCACCT TCCCCACGCC GAACACTGAC ATTTTCCCTG 721 AGCCACAAAG CCAGGCCTTC CCGGGCTCGG CAGGGACAGC GCTCCAGTAC CCGCCTCCTG 781 CCTACCCTGC CGCCAAGGGT GGCTTCCAGG TTCCCATGAT CCCCGACTAC CTGTTTCCAC 841 AGCAGCAGGG GGATCTGGGC CTGGGCACCC CAGACCAGAA GCCCTTCCAG GGCCTGGAGA 901 GCCGCACCCA GCAGCCTTCG CTAACCCCTC TGTCTACTAT TAAGGCCTTT GCCACTCAGT 961 CGGGCTCCCA GGACCTGAAG GCCCTCAATA CCAGCTACCA GTCCCAGCTC ATCAAACCCA 1021 GCCGCATGCG CAAGTACCCC AACCGGCCCA GCAAGACGCC CCCCCACGAA CGCCCTTACG 1081 CTTGCCCAGT GGAGTCCTGT GATCGCCGCT TCTCCCGCTC CGACGAGCTC ACCCGCCACA 1141 TCCGCATCCA CACAGGCCAG AAGCCCTTCC AGTGCCGCAT CTGCATGCGC AACTTCAGCC 1201 GCAGCGACCA CCTCACCACC CACATCCGCA CCCACACAGG CGAAAAGCCC TTCGCCTGCG 1261 ACATCTGTGG AAGAAAGTTT GCCAGGAGCG ATGAACGCAA GAGGCATACC AAGATCCACT 1321 TGCGGCAGAA GGACAAGAAA GCAGACAAAA GTGTTGTGGC CTCTTCGGCC ACCTCCTCTC 1381 TCTCTTCCTA CCCGTCCCCG GTTGCTACCT CTTACCCGTC CCCGGTTACT ACCTCTTATC 1441 CATCCCCGGC CACCACCTCA TACCCATCCC CTGTGCCCAC CTCCTTCTCC TCTCCCGGCT 1501 CCTCGACCTA CCCATCCCCT GTGCACAGTG GCTTCCCCTC CCCGTCGGTG GCCACCACGT 1561 ACTCCTCTGT TCCCCCTGCT TTCCCGGCCC AGGTCAGCAG CTTCCCTTCC TCAGCTGTCA 1621 CCAACTCCTT CAGCGCCTCC ACAGGGCTTT CGGACATGAC AGCAACCTTT TCTCCCAGGA 1681 CAATTGAAAT TTGCTACCCA ACTTTCTTGT ACAAAGTtgg cattataaga aagcattgct 1741 tatcaatttg ttgcaacgaa c