Construct: ORF ccsbBroadEn_02499
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF004542.1_s300c1, BRDN0000396860
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- CTCF (10664)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 10664 | CTCF | CCCTC-binding factor | NM_006565.4 | 100% | 100% | |
2 | human | 10664 | CTCF | CCCTC-binding factor | XM_017022868.1 | 100% | 100% | |
3 | human | 10664 | CTCF | CCCTC-binding factor | NM_001363916.1 | 99.7% | 99.7% | 1999_2000insCAACAG |
4 | human | 10664 | CTCF | CCCTC-binding factor | XM_005255775.4 | 99.7% | 99.7% | 1999_2000insCAACAG |
5 | human | 10664 | CTCF | CCCTC-binding factor | NM_001191022.2 | 54.8% | 54.8% | 0_1ins984 |
6 | mouse | 13018 | Ctcf | CCCTC-binding factor | XM_017312557.1 | 91.6% | 98% | (many diffs) |
7 | mouse | 13018 | Ctcf | CCCTC-binding factor | NM_181322.3 | 91.5% | 97.9% | (many diffs) |
8 | mouse | 13018 | Ctcf | CCCTC-binding factor | XM_006530648.2 | 91.5% | 97.9% | (many diffs) |
9 | mouse | 13018 | Ctcf | CCCTC-binding factor | XM_006530645.3 | 90.5% | 96.9% | (many diffs) |
10 | mouse | 13018 | Ctcf | CCCTC-binding factor | XM_006530644.2 | 90.4% | 96.7% | (many diffs) |
11 | mouse | 13018 | Ctcf | CCCTC-binding factor | XM_006530647.2 | 90.2% | 96.6% | (many diffs) |
12 | mouse | 13018 | Ctcf | CCCTC-binding factor | XM_006530646.3 | 90.1% | 96.5% | (many diffs) |
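
The match percentages in the table are defined as total matches divided by aligned length, gaps included. A minimal Python sketch of that formula follows. It assumes the two sequences are already globally aligned and padded with `-` for gaps; the function name `match_percent` and the toy alignment are illustrative only and are not taken from the database.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length, gap columns included.

    Assumes both strings come from the same global alignment, so they have
    equal length and use `gap` as the padding character.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # A column counts as a match only when both residues agree and are not gaps.
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment (hypothetical): 10 columns, one 2-base gap, 8 matching columns.
orf_aligned = "ATGGAAGGTG"
transcript_aligned = "ATGG--GGTG"
toy_result = round(match_percent(orf_aligned, transcript_aligned), 1)
print(toy_result)  # 80.0
```

By the same formula, a single 6-base gap in an alignment about 2,181 columns long works out to roughly 99.7%, which is consistent with rows 3 and 4. The exact aligned length used by the database is not stated in this record.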
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2247
- ORF length:
- 2181
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGA AGGTGATGCA GTCGAAGCCA TTGTGGAGGA GTCCGAAACT TTTATTAAAG
121 GAAAGGAGAG AAAGACTTAC CAGAGACGCC GGGAAGGGGG CCAGGAAGAA GATGCCTGCC
181 ACTTACCCCA GAACCAGACG GATGGGGGTG AGGTGGTCCA GGATGTCAAC AGCAGTGTAC
241 AGATGGTGAT GATGGAACAG CTGGACCCCA CCCTTCTTCA GATGAAGACT GAAGTAATGG
301 AGGGCACAGT GGCTCCAGAA GCAGAGGCTG CTGTGGACGA TACCCAGATT ATAACTTTAC
361 AGGTTGTAAA TATGGAGGAA CAGCCCATAA ACATAGGAGA ACTTCAGCTT GTTCAAGTAC
421 CTGTTCCTGT GACTGTACCT GTTGCTACCA CTTCAGTAGA AGAACTTCAG GGGGCTTATG
481 AAAATGAAGT GTCTAAAGAG GGCCTTGCGG AAAGTGAACC CATGATATGC CACACCCTAC
541 CTTTGCCTGA AGGGTTTCAG GTGGTTAAAG TGGGGGCCAA TGGAGAGGTG GAGACACTAG
601 AACAAGGGGA ACTTCCACCC CAGGAAGATC CTAGTTGGCA AAAAGACCCA GACTATCAGC
661 CACCAGCCAA AAAAACAAAG AAAACCAAAA AGAGCAAACT GCGTTATACA GAGGAGGGCA
721 AAGATGTAGA TGTGTCTGTC TACGATTTTG AGGAAGAACA GCAGGAGGGT CTGCTATCAG
781 AGGTTAATGC AGAGAAAGTG GTTGGTAATA TGAAGCCTCC AAAGCCAACA AAAATTAAAA
841 AGAAAGGTGT AAAGAAGACA TTCCAGTGTG AGCTTTGCAG TTACACGTGT CCACGGCGTT
901 CAAATTTGGA TCGTCACATG AAAAGCCACA CTGATGAGAG ACCACACAAG TGCCATCTCT
961 GTGGCAGGGC ATTCAGAACA GTCACCCTCC TGAGGAATCA CCTTAACACA CACACAGGTA
1021 CTCGTCCTCA CAAGTGCCCA GACTGCGACA TGGCCTTTGT GACCAGTGGA GAATTGGTTC
1081 GGCATCGTCG TTACAAACAC ACCCACGAGA AGCCATTCAA GTGTTCCATG TGCGATTACG
1141 CCAGTGTAGA AGTCAGCAAA TTAAAACGTC ACATTCGCTC TCATACTGGA GAGCGTCCGT
1201 TTCAGTGCAG TTTGTGCAGT TATGCCAGCA GGGACACATA CAAGCTGAAA AGGCACATGA
1261 GAACCCATTC AGGGGAAAAG CCTTATGAAT GTTATATTTG TCATGCTCGG TTTACCCAAA
1321 GTGGTACCAT GAAGATGCAC ATTTTACAGA AGCACACAGA AAATGTGGCC AAATTTCACT
1381 GTCCCCACTG TGACACAGTC ATAGCCCGAA AAAGTGATTT GGGTGTCCAC TTGCGAAAGC
1441 AGCATTCCTA TATTGAGCAA GGCAAGAAAT GCCGTTACTG TGATGCTGTG TTTCATGAGC
1501 GCTATGCCCT CATCCAGCAT CAGAAGTCAC ACAAGAATGA GAAGCGCTTT AAGTGTGACC
1561 AGTGTGATTA CGCTTGTAGA CAGGAGAGGC ACATGATCAT GCACAAGCGC ACCCACACCG
1621 GGGAGAAGCC TTACGCCTGC AGCCACTGCG ATAAGACCTT CCGCCAGAAG CAGCTTCTCG
1681 ACATGCACTT CAAGCGCTAT CACGACCCCA ACTTCGTCCC TGCGGCTTTT GTCTGTTCTA
1741 AGTGTGGGAA AACATTTACA CGTCGGAATA CCATGGCAAG ACATGCTGAT AATTGTGCTG
1801 GCCCAGATGG CGTAGAGGGG GAAAATGGAG GAGAAACGAA GAAGAGTAAA CGTGGAAGAA
1861 AAAGAAAGAT GCGCTCTAAG AAAGAAGATT CCTCTGACAG TGAAAATGCT GAACCAGATC
1921 TGGACGACAA TGAGGATGAG GAGGAGCCTG CCGTAGAAAT TGAACCTGAG CCAGAGCCTC
1981 AGCCTGTGAC CCCAGCCCCA CCACCCGCCA AGAAGCGGAG AGGACGACCC CCTGGCAGAA
2041 CCAACCAGCC CAAACAGAAC CAGCCAACAG CTATCATTCA GGTTGAAGAC CAGAATACAG
2101 GTGCAATTGA GAACATTATA GTTGAAGTAA AAAAAGAGCC AGATGCTGAG CCCGCAGAGG
2161 GAGAGGAAGA GGAGGCCCAG CCAGCTGCCA CAGATGCCCC CAACGGAGAC CTCACGCCCG
2221 AGATGATCCT CAGCATGATG GACCGGTGCC CAACTTTCTT GTACAAAGTt ggcattataa
2281 gaaagcattg cttatcaatt tgttgcaacg aac
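
To work with the numbered sequence above, the position labels have to be stripped before the ORF coordinates can be applied. The Python sketch below shows one way to do that. It assumes the ORF start is a 1-based position and slices by start plus length rather than by the end coordinate. In this record the ORF length (2181) equals ORF end minus ORF start (2247 - 66), which suggests the end value points one past the last ORF base, but that reading is an inference and is not stated in the record. The helper names `clean_sequence` and `extract_orf` are hypothetical.

```python
import re


def clean_sequence(numbered_text: str) -> str:
    """Drop position numbers and whitespace, keeping only the base letters."""
    return "".join(re.findall(r"[A-Za-z]+", numbered_text))


def extract_orf(seq: str, start: int, length: int) -> str:
    """Return `length` bases beginning at the 1-based position `start`."""
    return seq[start - 1 : start - 1 + length]


# First 120 bases of the record, in the numbered layout.
snippet = """
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGA AGGTGATGCA GTCGAAGCCA TTGTGGAGGA GTCCGAAACT TTTATTAAAG
"""

seq = clean_sequence(snippet)
# ORF start is 66 in the record; take the first three codons as a check.
first_codons = extract_orf(seq, start=66, length=9)
print(first_codons)  # ATGGAAGGT
```

The slice begins with ATG, as expected for position 66. Passing the full sequence with `start=66` and `length=2181` would return the whole ORF under the same assumption.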