Construct: ORF ccsbBroadEn_04107
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF007999.1_s300c1, BRDN0000398479
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- CENPU (79682)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 79682 | CENPU | centromere protein U | NM_024629.4 | 100% | 100% | |
2 | human | 79682 | CENPU | centromere protein U | XM_005263218.4 | 92.7% | 91% | (many diffs) |
3 | human | 79682 | CENPU | centromere protein U | NR_104593.2 | 47.7% | 1_34del;957_958ins62;1227_2432del |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 1323
- ORF length:
- 1254
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG 61 TTGGCACCAT GGCCCCGCGG GGGCGGCGGC GGCCGCGGCC TCACAGGTCT GAGGGCGCAA 121 GACGTTCAAA GAACACTTTA GAAAGAACAC ATTCCATGAA AGATAAAGCT GGTCAAAAGT 181 GCAAGCCTAT TGACGTGTTC GACTTTCCTG ATAATTCTGA TGTCTCAAGC ATTGGCAGGC 241 TGGGTGAAAA TGAGAAAGAT GAAGAAACTT ATGAGACCTT TGATCCTCCT TTACATAGCA 301 CAGCTATATA TGCTGATGAA GAAGAATTCT CCAAACATTG TGGACTGTCT CTCTCTTCAA 361 CTCCTCCAGG AAAAGAAGCA AAAAGAAGTT CAGACACTTC TGGAAATGAA GCAAGTGAAA 421 TCGAATCTGT AAAAATTAGT GCAAAAAAGC CAGGAAGAAA GCTCAGGCCC ATTAGTGATG 481 ACTCTGAAAG CATTGAAGAA AGTGATACAA GGAGAAAAGT TAAATCAGCA GAGAAAATAA 541 GTACACAACG TCATGAGGTT ATTCGAACCA CAGCGTCTTC AGAACTTTCA GAGAAACCAG 601 CTGAGTCTGT CACTTCTAAA AAGACAGGAC CCCTTAGTGC CCAGCCCTCT GTTGAAAAAG 661 AGAACTTGGC AATAGAAAGT CAATCGAAAA CTCAGAAAAA AGGGAAGATA TCTCATGACA 721 AAAGGAAGAA ATCAAGAAGT AAAGCCATAG GCTCAGATAC TTCTGACATT GTGCACATTT 781 GGTGTCCAGA AGGAATGAAA ACCAGTGACA TCAAGGAGTT GAATATTGTT TTGCCTGAAT 841 TTGAGAAAAC CCACCTAGAG CATCAACAAA GAATAGAATC TAAAGTTTGT AAGGCAGCCA 901 TCGCCACATT TTATGTTAAT GTTAAAGAAC AATTCATCAA AATGCTTAAA GAAAGCCAGA 961 TGTTGACAAA TCTGAAAAGG AAGAATGCTA AGATGATTTC AGATATCGAA AAGAAAAGGC 1021 AGCGTATGAT TGAAGTCCAG GATGAACTGC TTCGGTTAGA GCCACAGCTG AAACAACTAC 1081 AAACAAAATA TGATGAACTT AAAGAGAGAA AGTCTTCCCT TAGGAATGCA GCATATTTCT 1141 TATCTAATTT AAAACAGCTT TATCAAGATT ATTCAGATGT TCAAGCTCAA GAACCAAACG 1201 TAAAGGAAAC GTATGATTCA TCCAGCCTTC CAGCTCTGTT ATTTAAAGCA AGAACACTTC 1261 TGGGAGCCGA AAGCCATCTG CGAAATATCA ACCATCAGTT AGAGAAGCTC CTTGACCAGG 1321 GATTGCCAAC TTTCTTGTAC AAAGTtggca ttataagaaa gcattgctta tcaatttgtt 1381 gcaacgaac