Construct: ORF TRCN0000470764

Construct Description:

Construct Type:
ORF
Other Identifiers:
ORF003231.1_s317c1
Derived from:
ccsbBroadEn_10731
DNA Barcode:
GACTGTAATTTTACTCACTGTCGA
Epitope Tag:
V5
Notes:
No stop codon in insert

Originally Annotated References:

Gene:
CENPC (1060)

Vector Information:

Vector Backbone:
pLX_317
Pol II Cassette 1:
SV40-PuroR
Pol II Cassette 2:
EF1a-TRCN0000470764
Selection Marker:
PuroR
Visible Reporter:
n/a
Epitope Tag:
V5

Current transcripts matched by this ORF:

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs
1 | human | 1060 | CENPC | centromere protein C | XM_011531542.3 | 70% | 69.6% | 1614_1615insTG;1618_1639del;1646_2316delinsC
2 | human | 1060 | CENPC | centromere protein C | NM_001362481.2 | 57.8% | 57.5% | 1614_1615insTG;1618_1639del;1646_2805delinsC
3 | human | 1060 | CENPC | centromere protein C | NM_001812.4 | 57.3% | 57% | 1614_1615insTG;1618_1639del;1646_2829delinsC
4 | human | 1060 | CENPC | centromere protein C | NR_155754.2 | 23.3% | | (many diffs)
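The Match Diffs entries resemble HGVS-style edit descriptions (`ins`, `del`, `delins`) joined by semicolons. The following is a minimal sketch of splitting such a field into structured records. The names `Diff` and `parse_match_diffs` are hypothetical, only the three edit types seen in this record are handled, and the record does not state which coordinate system the positions refer to, so the positions are kept as plain integers.

```python
import re
from typing import List, NamedTuple


class Diff(NamedTuple):
    start: int   # first position named in the edit
    end: int     # last position named in the edit
    kind: str    # "ins", "del", or "delins"
    bases: str   # inserted bases; empty for a plain deletion


# "delins" must come before "del" so the longer keyword wins.
_DIFF_RE = re.compile(r"^(\d+)_(\d+)(delins|del|ins)([ACGT]*)$")


def parse_match_diffs(field: str) -> List[Diff]:
    """Parse a semicolon-separated Match Diffs field.

    Free-text placeholders such as "(many diffs)" yield an empty list.
    """
    field = field.strip()
    if not field or field.startswith("("):
        return []
    diffs = []
    for token in field.split(";"):
        m = _DIFF_RE.match(token.strip())
        if m is None:
            raise ValueError(f"unrecognized diff token: {token!r}")
        diffs.append(Diff(int(m.group(1)), int(m.group(2)), m.group(3), m.group(4)))
    return diffs


def span_length(diff: Diff) -> int:
    """Number of positions covered by start..end, inclusive."""
    return diff.end - diff.start + 1


row1 = parse_match_diffs("1614_1615insTG;1618_1639del;1646_2316delinsC")
```

For row 1, this gives three edits: an insertion of `TG`, a deletion, and a deletion-insertion that ends in `C`. Other edit types would need extra patterns.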

Sequence Information:

Note: uppercase bases indicate empirically verified sequence.

ORF start:
66
ORF end:
1692
ORF length:
1626
Sequence:
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61 ttggcatggc tgcgtccggt ctggatcatc tcaaaaatgg ctacagaaga agattttgtc
121 gaccttccag ggcacgtgac attaacacag agcaaggcca gaatgttctg gaaatcttac
181 aagactgttt tgaagaaaaa agtcttgcca atgattttag tacaaattct acaaaatcag
241 tgcctaattc aacacgcaaa ataaaagaca cttgtattca gtcaccaagc aaagagtgcc
301 agaaatcaca tccaaagtca gttccagttt cttcaaagaa gaaagaagcc tctctacagt
361 ttgttgtaga accaagtgaa gccacaaaca gatcagttca ggcccatgaa gttcatcaga
421 aaattctggc aactgatgtt agttccaaaa atacacctga ctcgaaaaaa atatcaagta
481 gaaacataaa tgatcatcac agtgaagctg atgaagaatt ttacttatcc gttggctcac
541 cttctgttct tttggatgca aaaacatctg tatcacaaaa tgttattcca tctagtgccc
601 aaaagagaga gacttacact tttgaaaatt cagtaaatat gctgccttca agtacagagg
661 tttcagttaa aaccaaaaaa aggttaaact ttgatgataa agttatgtta aagaaaatag
721 aaatagataa taaagtatca gatgaagagg ataaaacatc ggaaggacaa gaaagaaaac
781 catcaggatc atctcagaat agaatacgag attcagaata tgaaattcaa cgacaagcta
841 aaaaaagttt ttcaacattg tttttagaaa cagtaaaacg aaaaagtgaa tccagtccca
901 ttgttaggca tgcggcaact gctccacctc attcgtgtcc tcccgatgat acgaagttga
961 tagaggatga atttataatt gatgagtcgg atcaaagttt tgccagtaga tcttggatta
1021 caataccaag aaaggcaggg tctctgaaac aacgcacaat atccccggct gagagcactg
1081 cactccttca aggtagaaag tcaagagaaa agcatcataa tatattacct aagactttgg
1141 caaatgacaa acattcccat aaacctcacc cagtagagac atctcagccc tctgataaaa
1201 cagtactgga tacaagttat gctttgatag gtgaaacagt aaataattat agatctacaa
1261 aatatgaaat gtattccaag aatgcagaaa aaccatctag aagcaaaagg actataaaac
1321 aaaaacagag aagaaaattc atggctaaaC CAGCTGAAGA ACAGCTTGAT GTGGGACAGT
1381 CTAAAGATGA AAACATACAT ACATCACATA TTACCCAAGA CGAATTTCAA AGAAATTCAG
1441 ACAGAAATAT GGAAGAGCAT GAAGAGATGG GAAATGATTG TGTTTCCAAA AAACAGATGC
1501 CACCTGTGGG AAGCAAGAAA AGTAGCACTA GAAAAGATAA GGAAGAATCT AAAAAGAAGC
1561 GCTTTTCCAG TGAGTCCAAG AACAAACTTG TACCTGAAGA AGTGACTTCA ACTGTCACGA
1621 AAAGTCGAAG AATTTCCAGG CGTCCATCTG ATTGGTGGGT GGTAAAATCA GAGGAGAGTT
1681 GCCTGAAATG CTACCCAACT TTCTTGTACA AAGTGGTTGA TATCGGTAAG CCTATCCCTA
1741 ACCCTCTCCT CGGTCTCGAT TCTACGTAGT AATGAACTAG TCCGCAACTT GAAAGTATTT
1801 CGATTTCTTG GCTTTATATA TCTTGTGGAA AGGACGAGAC TGTAATTTTA CTCACTGTCG
1861 AACGCGTTAA GTCgacaatc aacctctgga ttacaaaatt tgtgaaagat t
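
The ORF coordinates and the numbered sequence block can be combined to pull out the insert. The following is a minimal sketch under stated assumptions: the function names are hypothetical, and the coordinate convention is inferred from this record rather than documented in it. Position 66 (1-based) falls on the `a` of the first `atg`, and end minus start equals the stated length (1692 - 66 = 1626), so the end is treated here as exclusive.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Drop position numbers, spaces, and line breaks; keep letters and their case."""
    return re.sub(r"[^A-Za-z]", "", block)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice the ORF, assuming a 1-based inclusive start and a 1-based exclusive end."""
    return seq[start - 1:end - 1]


def find_barcode(seq: str, barcode: str) -> int:
    """Return the 1-based position of the barcode (case-insensitive), or -1 if absent."""
    idx = seq.upper().find(barcode.upper())
    return idx + 1 if idx >= 0 else -1


# The first two lines of the sequence block above, used as a small check.
block = """1tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61ttggcatggc tgcgtccggt ctggatcatc tcaaaaatgg ctacagaaga agattttgtc"""

seq = parse_numbered_sequence(block)
first_codon = extract_orf(seq, 66, 69)  # three bases starting at the ORF start
```

Run on the full block, `extract_orf(seq, 66, 1692)` should return a string whose length matches the stated ORF length if the assumed convention holds. `find_barcode` can then confirm that the DNA barcode lies downstream of the insert.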
