Construct: ORF ccsbBroadEn_04555
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF001539.1_s300c1, BRDN0000397578
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- CENPL (91687)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 91687 | CENPL | centromere protein L | NM_001127181.2 | 100% | 100% | |
2 | human | 91687 | CENPL | centromere protein L | NM_001171182.1 | 88.2% | 88.2% | 419_420ins138 |
3 | human | 91687 | CENPL | centromere protein L | NM_033319.3 | 88.2% | 88.2% | 419_420ins138 |
4 | mouse | 70454 | Cenpl | centromere protein L | NM_001159930.2 | 76.1% | 74.9% | (many diffs) |
5 | mouse | 70454 | Cenpl | centromere protein L | NR_131028.1 | 38.7% | (many diffs) | |
6 | mouse | 70454 | Cenpl | centromere protein L | NR_131029.1 | 35.2% | (many diffs) | |
7 | mouse | 70454 | Cenpl | centromere protein L | NR_131030.1 | 32.9% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1236
- ORF length:
- 1170
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGGA TTCTTACAGT GCACCAGAGT CAACTCCTAG TGCATCCTCA AGACCTGAAG 121 ATTACTTTAT AGGTGCCACT CCTCTGCAGA AACGATTAGA ATCGGTCAGG AAGCAGAGTT 181 CATTTATCCT GACTCCACCT CGAAGGAAAA TTCCCCAGTG TTCGCAGTTG CAGGAAGATG 241 TTGACCCTCA AAAGGTTGCA TTCCTTCTGC ATAAACAGTG GACTTTATAT AGTTTAACTC 301 CCTTATATAA ATTCTCCTAT AGTAATCTCA AAGAGTATTC TAGACTTCTC AATGCTTTTA 361 TTGTTGCTGA AAAGCAAAAA GGACTTGCTG TGGAAGTGGG AGAAGACTTC AACATCAAAG 421 TGATTTTTTC TACTCTCCTA GGAATGAAAG GAACACAAAG GGACCCGGAA GCATTTCTTG 481 TCCAGGGTCT CATTTTGTCA CCCAGGCTGG AGTACAGTGG CACGATCTTG GTTGACTGCA 541 ACCTCTGTCT CCTGGGCTCA AGTGATCCTT CCACCTTAGC CTTCCAAGTA GCTGGGACTG 601 CAGGTGCATG CCACCACACT CGGATTGTGT CAAAATCTCA ATTGCCATCT GAGAATAGAG 661 AAGGTAAAGT GCTGTGGACT GGCTGGTTCT GCTGTGTATT TGGAGACAGT CTTCTGGAGA 721 CTGTTTCAGA AGATTTCACC TGTCTGCCCT TATTCCTTGC AAATGGAGCA GAGTCTAACA 781 CAGCAATAAT TGGAACTTGG TTTCAGAAAA CCTTTGACTG TTATTTCAGT CCTTTAGCAA 841 TCAATGCATT TAATCTTTCC TGGATGGCTG CCATGTGGAC TGCATGCAAA ATGGACCATT 901 ATGTGGCTAC TACTGAATTT CTTTGGTCTG TACCCTGTAG CCCTCAAAGT CTGGACATTT 961 CTTTCGCAAT ACATCCAGAG GATGCAAAAG CTCTATGGGA CAGTGTCCAC AAAACACCTG 1021 GGGAGGTTAC CCAGGAAGAA GTTGACCTAT TCATGGATTG CCTTTATTCA CATTTCCATA 1081 GACATTTCAA AATTCATTTA TCAGCCACAA GATTAGTTCG TGTTTCAACA TCTGTAGCTT 1141 CAGCACATAC TGATGGAAAA ATAAAGATTC TGTGTCATAA ATACCTTATT GGAGTGTTAG 1201 CATATTTGAC AGAACTGGCA ATTTTTCAAA TTGAGTGCCC AACTTTCTTG TACAAAGTtg 1261 gcattataag aaagcattgc ttatcaattt gttgcaacga ac