Construct: ORF ccsbBroadEn_11813
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF017186.1_s300c1, BRDN0000457157
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- EPC2 (26122)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 26122 | EPC2 | enhancer of polycomb homolog 2 | NM_015630.4 | 97% | 97% | 1_72del |
2 | human | 26122 | EPC2 | enhancer of polycomb homolog 2 | XM_011510941.2 | 96.7% | 96.7% | 1_72del;1717_1718insCAGTAA |
3 | human | 26122 | EPC2 | enhancer of polycomb homolog 2 | XM_011510943.3 | 90.9% | 90.2% | (many diffs) |
4 | mouse | 227867 | Epc2 | enhancer of polycomb homolo... | NM_172663.4 | 91.4% | 93.9% | (many diffs) |
5 | mouse | 227867 | Epc2 | enhancer of polycomb homolo... | XM_006498011.2 | 83.9% | 86.5% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 2418
- ORF length:
- 2349
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG 61 TTGGCACCAT GCCTGATCTC AACGACTGCG TCTCCATCAA CCGGGCCGTG CCCCAGATGC 121 CCACCGGGAT GGAGAAGGAG GAGGAATCGG AACATCATTT ACAGCGAGCA ATTTCAGCAC 181 AGCAAGTGTT TAGAGAAAAA AAAGAGAGTA TGGTCATTCC TGTTCCTGAG GCAGAGAGCA 241 ACGTCAACTA TTACAATCGC TTGTACAAAG GAGAGTTTAA ACAGCCAAAA CAGTTCATTC 301 ATATTCAGCC TTTTAATCTA GACAACGAGC AACCAGATTA TGATATGGAT TCAGAAGATG 361 AGACTTTATT AAATAGACTT AACAGAAAGA TGGAAATTAA GCCTTTGCAA TTTGAAATTA 421 TGATTGACAG ACTTGAAAAA GCCAGTTCTA ATCAGCTTGT AACACTTCAA GAAGCAAAAC 481 TGCTGCTAAA CGAAGATGAT TACCTTATTA AAGCTGTATA TGACTACTGG GTGAGAAAAC 541 GTAAAAACTG CAGGGGGCCA TCCCTCATTC CTCAGATAAA ACAAGAGAAA AGAGATGGCT 601 CTACCAACAA TGACCCTTAT GTTGCCTTTC GGAGAAGAAC AGAGAAAATG CAAACTCGAA 661 AGAATCGTAA GAATGATGAA GCCTCTTATG AAAAGATGTT GAAACTGAGA CGAGAATTTA 721 GTAGAGCCAT AACAATTTTG GAAATGATTA AGAGAAGAGA GAAAACAAAA CGAGAATTAT 781 TGCACTTAAC CTTAGAAGTT GTGGAGAAAA GATACCATTT GGGAGACTAT GGTGGTGAAA 841 TCCTTAATGA AGTAAAAATC AGTAGATCAG AAAAAGAGTT ATATGCCACT CCAGCAACTC 901 TTCATAATGG AAATCATCAC AAAGTTCAAG AATGTAAAAC TAAGCACCCT CATCATTTGT 961 CTTTGAAAGA AGAGGCTTCT GATGTGGTTC GTCAAAAGAA GAAGTACCCA AAGAAGCCTA 1021 AAGCAGAGGC TTTGATAACA TCTCAGCAAC CCACTCCTGA GACATTGCCT GTGATCAATA 1081 AGAGTGACAT TAAGCAATAT GATTTTCACA GCTCAGATGA AGATGAATTT CCACAGGTAT 1141 TGTCCCCAGT ATCAGAACCG GAAGAAGAAA ATGATCCTGA TGGTCCCTGT GCTTTCAGAA 1201 GGCGGGCAGG ATGCCAGTAT TATGCTCCTC GTTTGGACCA AGCTAACCAT TCATGTGAAA 1261 ATTCAGAATT GGCAGATTTG GATAAGTTGA GGTATAGGCA TTGCCTTACA ACACTTACAG 1321 TCCCAAGAAG ATGTATAGGA TTTGCAAGGA GGCGAATTGG CAGAGGTGGA AGGGTCATAA 1381 TGGACCGAAT ATCCACAGAA CATGACCCAG TCCTGAAACA GATAGACCCT GAAATGCTGA 1441 ATAGTTTTTC AAGCTCTTCC CAAACTATAG ACTTTTCTTC TAATTTCTCT CGGACCAATG 1501 CTTCCAGTAA ACATTGTGAA AATAGACTGT CTCTTTCTGA AATATTAAGC AATATCAGAT 1561 CATGTCGACT ACAGTGTTTC CAGCCAAGGC TACTAAATTT ACAGGACAGT GATAGTGAAG 1621 AATGTACCTC AAGAAAACCA GGGCAGACTG TGAACAATAA AAGAGTTTCT GCAGCATCTG 1681 TAGCTTTATT GAACACCAGC AAGAATGGCA TATCAGTAAC AGGGGGTATC ACAGAAGAGC 1741 AGTTTCAGAC ACATCAGCAG CAGTTAGTTC AGATGCAAAG GCAGCAACTT GCCCAGCTTC 1801 AGCAGAAACA GCAATCTCAG CATTCCTCGC AACAGACACA TCCAAAAGCA CAGGGCTCAA 1861 GCACCTCTGA CTGTATGTCT AAAACACTTG ACTCAGCCAG CGCCCACTTT GCTGCATCTG 1921 CAGTGGTCAG TGCACCTGTT CCAAGTCGCA GTGAGGTAGC CAAGGAACAG AACACTGGCC 1981 ACAACAACAT AAACGGTGTT GTCCAGCCTT CAGGAACCTC TAAAACATTA TACTCCACCA 2041 ATATGGCTTT ATCATCCAGC CCAGGGATTT CAGCTGTACA GCTTGTAAGG ACAGTTGGCC 2101 ACACCACTAC AAACCACTTA ATCCCAGCAT TGTGCACAAG CAGTCCTCAG ACACTTCCCA 2161 TGAACAATTC CTGCCTGACA AATGCAGTGC ACCTCAATAA TGTCAGTGTT GTTTCTCCAG 2221 TCAATGTGCA TATCAATACA CGGACTTCAG CACCATCGCC AACAGCCTTA AAACTTGCCA 2281 CAGTTGCTGC CAGTATGGAC AGAGTGCCAA AGGTTACTCC CAGCAGTGCC ATCAGCAGCA 2341 TAGCAAGAGA GAACCACGAA CCAGAAAGAT TGGGCTTAAA TGGAATAGCA GAGACAACAG 2401 TAGCTATGGA AGTGACATTG CCAACTTTCT TGTACAAAGT TGGcattata agaaagcatt 2461 gcttatcaat ttgttgcaac gaac