Construct: ORF TRCN0000475393
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF009569.1_s317c1
- Derived from:
- ccsbBroadEn_07337
- DNA Barcode:
- CCCTTGACTGGCCGTATATAGAGC
- Epitope Tag:
- V5 (not translated due to prior stop codon)
- Notes:
- Early stop codon detected
Originally Annotated References:
- Gene:
- TAF1B (9014)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000475393
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 9014 | TAF1B | TATA-box binding protein as... | NM_005680.3 | 99.6% | 76.3% | (many diffs) |
2 | human | 9014 | TAF1B | TATA-box binding protein as... | XM_011510410.2 | 99.1% | 75.6% | (many diffs) |
3 | human | 9014 | TAF1B | TATA-box binding protein as... | XM_011510411.1 | 64.3% | 41.1% | (many diffs) |
4 | human | 9014 | TAF1B | TATA-box binding protein as... | NM_001318976.1 | 56.4% | 33.3% | (many diffs) |
5 | human | 9014 | TAF1B | TATA-box binding protein as... | NM_001318977.1 | 56.4% | 33.3% | (many diffs) |
6 | human | 9014 | TAF1B | TATA-box binding protein as... | XM_024453209.1 | 56.3% | 33.1% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 1437
- ORF length:
- 1371
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatgga cctcgaggag tcggaagagt ttaaagaacg ctgtactcag tgtgctgctg 121 tctcatgggg tcttactgat gaaggcaaat attattgcac ttcttgccac aatgttacag 181 agagatatca ggaagttaca aacactgatc ttattcctaa tacccaaata aaagccctca 241 accgggggct taaaaaaaaa aacaatactg aaaaaggctg ggattggtat gtgtgtgaag 301 gtttccagta tattctttat caacaagcag aagccttaaa gaaccttgga gtaggcccag 361 agttaaagaa cgatgtttta cataattttt ggaagcgcta ccttcagaag agcaagcagg 421 catattgtaa gaacccagtt tataccactg gaaggaaacc tacggtatta gaagataatc 481 taagtcattc agactgggct agtgagcctg agctgctaag tgatgtcagc tgtcctcctt 541 ttcttgaaag tggagcggag tctcagtctg acatccacac tcgaaaacct ttccccgtca 601 gcaaagcatc acaatcagaa acgtctgtct gctctggatc tctggatgga gttgaatact 661 cacaacgaaa ggagaaggga atcgtgaaga tgaccatgcc acagacactt gccttctgtt 721 atctgtcctt actttggcag agagaagcaa taacactttc agatcttttg aggtttgttg 781 aagaggacca tattccttac ataaatgctt ttcagcattt tccagaacag atgaaattat 841 atggacgtga cagaggaatc tttggtatag agtcttggcc tgactacgag gacatctata 901 aaaaaacaat agaagttgga acatttttag atttgcctcg ttttccagac ataactgaag 961 actgctatct tcatcccaac atactgtgta tgaaatactt gatggaagtc aacctccctg 1021 atgaaatgca tagcttaact tgccacgtgg taaaaatgac tggaatggga gaagtggatt 1081 ttctgacatt tgatcctata gccaaaatgg caaaagctgt taagtacgat gtacaagctg 1141 tagctatcat tgtggtggta ttgaaactgc tctttctatt ggatgacagt ttcgagtggt 1201 ctttgtctaa tcttgctgaa aagcataatg aaaagaacaa aaaagataag ccatggtttg 1261 atttcagaaa gtggtaccaa attatgaaga aagcttttga tgagaaaaaa caaaaatggg 1321 aagaagcaag ggccAAGTAC CTGTGGAAAA GTGAAAAGCC ACTCTACTAC TCATTTGTCG 1381 ACAAACCAGT AGCATATAAA AAAAGAGAAA ATGGTGGTGA ATCTACAGAA ACAATTTAGC 1441 ACACTGGTCG ATTCAACAGC AACTGCTGGA AAAAAAAGCC CTTCAAGTTT TCAGTTCAAC 1501 TGGACTGAAG AGGACACTGA TAGAACGTGT TTCCATGGAC ACAGCCTTCA GGGAGTCCTG 1561 AAAGAGAAAG GCCAATCACT GCTGACTAAG AATTCATTAT ATTGGCTTAG TACACAGAAA 1621 TTCTGCAGAT GCTATTGTAC ACATGTGACA ACCTATGAAG AATCAAATTA TTCTCTGAGT 1681 TATCAGTTTA TACTAAATCT CTTCTCCTTC CTGCTCAGAA TAAAGACTTC CCTTCTCCAT 1741 GAAGAAGTGA GCTTAGTTGA GAAGAAACTT TTTGAGAAAA AATACAGTGT AAAAAGAAAG 1801 AAATCAAGAT CCAAGAAAGT GAGACGACAT TGCCCAACTT TCTTGTACAA AGTGGTTGAT 1861 ATCGGTAAGC CTATCCCTAA CCCTCTCCTC GGTCTCGATT CTACGTAGTA ATGAACTAGT 1921 CCGTAACTTG AAAGTATTTC GATTTCTTGG CTTTATATAT CTTGTGGAAA GGACGACCCT 1981 TGACTGGCCG TATATAGAGC ACGCGTTAAG TCgacaatca acctctggat tacaaaattt 2041 gtgaaagatt