Construct: ORF ccsbBroadEn_10731
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF003231.1_s300c1, BRDN0000390597
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: CENPC (1060)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 1060 | CENPC | centromere protein C | XM_011531542.3 | 70% | 69.6% | 1614_1615insTG;1618_1639del;1646_2316delinsC |
2 | human | 1060 | CENPC | centromere protein C | NM_001362481.2 | 57.8% | 57.5% | 1614_1615insTG;1618_1639del;1646_2805delinsC |
3 | human | 1060 | CENPC | centromere protein C | NM_001812.4 | 57.3% | 57% | 1614_1615insTG;1618_1639del;1646_2829delinsC |
4 | human | 1060 | CENPC | centromere protein C | NR_155754.2 | 23.3% | | (many diffs) |
Column notes:
- Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
- Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
- Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.
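The match percentage is total matches divided by aligned length including gaps. As a rough consistency check, the sketch below estimates Nuc. Match % from a Match Diffs field. It is not the portal's own method. The helper names `parse_diffs` and `estimated_nuc_match` are hypothetical, and the sketch rests on three assumptions: the diff coordinates refer to the transcript coding sequence, the reference length `ref_len` equals the end coordinate of the last diff (a value read off the diff string, not stated on the page), and bases substituted by a `delins` count as non-matching.

```python
import re

# One HGVS-style edit: start[_end], an operation, optional inserted bases.
DIFF_RE = re.compile(r"(\d+)(?:_(\d+))?(delins|ins|del)([ACGT]*)")


def parse_diffs(field):
    """Split a Match Diffs field on ';' into (start, end, op, bases) tuples."""
    edits = []
    for token in field.split(";"):
        m = DIFF_RE.fullmatch(token.strip())
        if m is None:
            raise ValueError(f"unrecognised diff: {token!r}")
        start = int(m.group(1))
        end = int(m.group(2)) if m.group(2) else start
        edits.append((start, end, m.group(3), m.group(4)))
    return edits


def estimated_nuc_match(field, ref_len):
    """Estimate 100 * matches / aligned length (incl. gaps).

    Assumptions (not confirmed by the source): coordinates are on the
    reference, and delins replacement bases are treated as mismatches.
    """
    matches = ref_len   # start from a perfect match over the reference
    aligned = ref_len   # every reference base occupies one alignment column
    for start, end, op, bases in parse_diffs(field):
        span = end - start + 1
        if op == "ins":
            # Inserted bases add gap columns opposite the reference.
            aligned += len(bases)
        else:
            # del / delins: the covered reference bases no longer match.
            matches -= span
            if op == "delins":
                # Replacement longer than the span adds extra columns.
                aligned += max(0, len(bases) - span)
    return 100.0 * matches / aligned


field = "1614_1615insTG;1618_1639del;1646_2316delinsC"
print(round(estimated_nuc_match(field, ref_len=2316), 1))  # 70.0
```

Under these assumptions the estimate agrees with the listed values for rows 1 to 3 (70%, 57.8%, 57.3%) when `ref_len` is set to 2316, 2805, and 2829 respectively.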
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 66
- ORF end: 1692
- ORF length: 1626
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGC TGCGTCCGGT CTGGATCATC TCAAAAATGG CTACAGAAGA AGATTTTGTC
121 GACCTTCCAG GGCACGTGAC ATTAACACAG AGCAAGGCCA GAATGTTCTG GAAATCTTAC
181 AAGACTGTTT TGAAGAAAAA AGTCTTGCCA ATGATTTTAG TACAAATTCT ACAAAATCAG
241 TGCCTAATTC AACACGCAAA ATAAAAGACA CTTGTATTCA GTCACCAAGC AAAGAGTGCC
301 AGAAATCACA TCCAAAGTCA GTTCCAGTTT CTTCAAAGAA GAAAGAAGCC TCTCTACAGT
361 TTGTTGTAGA ACCAAGTGAA GCCACAAACA GATCAGTTCA GGCCCATGAA GTTCATCAGA
421 AAATTCTGGC AACTGATGTT AGTTCCAAAA ATACACCTGA CTCGAAAAAA ATATCAAGTA
481 GAAACATAAA TGATCATCAC AGTGAAGCTG ATGAAGAATT TTACTTATCC GTTGGCTCAC
541 CTTCTGTTCT TTTGGATGCA AAAACATCTG TATCACAAAA TGTTATTCCA TCTAGTGCCC
601 AAAAGAGAGA GACTTACACT TTTGAAAATT CAGTAAATAT GCTGCCTTCA AGTACAGAGG
661 TTTCAGTTAA AACCAAAAAA AGGTTAAACT TTGATGATAA AGTTATGTTA AAGAAAATAG
721 AAATAGATAA TAAAGTATCA GATGAAGAGG ATAAAACATC GGAAGGACAA GAAAGAAAAC
781 CATCAGGATC ATCTCAGAAT AGAATACGAG ATTCAGAATA TGAAATTCAA CGACAAGCTA
841 AAAAAAGTTT TTCAACATTG TTTTTAGAAA CAGTAAAACG AAAAAGTGAA TCCAGTCCCA
901 TTGTTAGGCA TGCGGCAACT GCTCCACCTC ATTCGTGTCC TCCCGATGAT ACGAAGTTGA
961 TAGAGGATGA ATTTATAATT GATGAGTCGG ATCAAAGTTT TGCCAGTAGA TCTTGGATTA
1021 CAATACCAAG AAAGGCAGGG TCTCTGAAAC AACGCACAAT ATCCCCGGCT GAGAGCACTG
1081 CACTCCTTCA AGGTAGAAAG TCAAGAGAAA AGCATCATAA TATATTACCT AAGACTTTGG
1141 CAAATGACAA ACATTCCCAT AAACCTCACC CAGTAGAGAC ATCTCAGCCC TCTGATAAAA
1201 CAGTACTGGA TACAAGTTAT GCTTTGATAG GTGAAACAGT AAATAATTAT AGATCTACAA
1261 AATATGAAAT GTATTCCAAG AATGCAGAAA AACCATCTAG AAGCAAAAGG ACTATAAAAC
1321 AAAAACAGAG AAGAAAATTC ATGGCTAAAC CAGCTGAAGA ACAGCTTGAT GTGGGACAGT
1381 CTAAAGATGA AAACATACAT ACATCACATA TTACCCAAGA CGAATTTCAA AGAAATTCAG
1441 ACAGAAATAT GGAAGAGCAT GAAGAGATGG GAAATGATTG TGTTTCCAAA AAACAGATGC
1501 CACCTGTGGG AAGCAAGAAA AGTAGCACTA GAAAAGATAA GGAAGAATCT AAAAAGAAGC
1561 GCTTTTCCAG TGAGTCCAAG AACAAACTTG TACCTGAAGA AGTGACTTCA ACTGTCACGA
1621 AAAGTCGAAG AATTTCCAGG CGTCCATCTG ATTGGTGGGT GGTAAAATCA GAGGAGAGTT
1681 GCCTGAAATG CTACCCAACT TTCTTGTACA AAGTtggcat tataagaaag cattgcttat
1741 caatttgttg caacgaac
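The ORF coordinates can be applied to the numbered sequence with a few lines of Python. This is a minimal sketch, not the portal's code: the helper names (`clean_sequence`, `extract_orf`, `has_in_frame_stop`) are hypothetical, and it assumes that "ORF start" is 1-based and "ORF end" is exclusive. That reading is an inference from the listed values (1692 - 66 = 1626, the stated ORF length) and is not stated on the page. The short `toy` string is made-up illustrative input, not this construct's sequence.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(numbered_text):
    """Drop position counters and whitespace; keep the case of each base."""
    return re.sub(r"[^A-Za-z]", "", numbered_text)


def extract_orf(seq, start, end):
    """Slice the ORF, assuming a 1-based start and an exclusive end."""
    return seq[start - 1:end - 1]


def has_in_frame_stop(orf):
    """Report whether any complete in-frame codon is a stop codon."""
    usable = len(orf) - len(orf) % 3  # ignore a trailing partial codon
    return any(orf[i:i + 3].upper() in STOP_CODONS for i in range(0, usable, 3))


# Toy input in the same layout as the record (counter, then 10-base blocks).
toy = "1 ttgcATGGCT GCGTGCtacc 21 caac"
seq = clean_sequence(toy)
orf = extract_orf(seq, start=5, end=17)
print(orf)                     # ATGGCTGCGTGC
print(len(orf))                # 12
print(has_in_frame_stop(orf))  # False
```

For this record the same calls would be `extract_orf(seq, 66, 1692)`, and a `False` result from `has_in_frame_stop` would be consistent with the "No stop codon in insert" note.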