Construct: ORF TRCN0000472012
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF017064.1_s317c1
- Derived from:
- ccsbBroadEn_04278
- DNA Barcode:
- GCCTACTCCGCGGCAACCGCGGTG
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- THAP2 (83591)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000472012
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 83591 | THAP2 | THAP domain containing 2 | NM_031435.4 | 100% | 100% | |
2 | mouse | 66816 | Thap2 | THAP domain containing, apo... | NM_025780.4 | 86.1% | 84.2% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 750
- ORF length:
- 684
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatgcc gaccaattgc gctgcggcgg gctgtgccac tacctacaac aagcacatta 121 acatcagctt ccacaggttt cctttggatc ctaaaagaag aaaagaatgg gttcgcctgg 181 ttaggcgcaa aaattttgtg ccaggaaaac acacttttct ttgttcaaag cactttgaag 241 ccTCCTGTTT TGACCTAACA GGACAAACTC GACGACTTAA AATGGATGCT GTTCCAACCA 301 TTTTTGATTT TTGTACCCAT ATAAAGTCTA TGAAACTCAA GTCAAGGAAT CTTTTGAAGA 361 AAAACAACAG TTGTTCTCCA GCTGGACCAT CTAATTTAAA ATCAAACATT AGTAGTCAGC 421 AAGTACTACT TGAACACAGC TATGCCTTTA GGAATCCTAT GGAGGCAAAA AAGAGGATCA 481 TTAAACTGGA AAAAGAAATA GCAAGCTTAA GAAGAAAAAT GAAAACTTGC CTACAAAAGG 541 AACGCAGAGC AACTCGAAGA TGGATCAAAG CCACGTGTTT GGTAAAGAAT TTAGAAGCAA 601 ATAGTGTATT ACCTAAAGGT ACATCAGAAC ACATGTTACC AACTGCCTTA AGCAGTCTTC 661 CTTTGGAAGA TTTTAAGATC CTTGAACAAG ATCAACAAGA TAAAACACTG CTAAGTCTAA 721 ATCTAAAACA GACCAAGAGT ACCTTCATTT ACCCAACTTT CTTGTACAAA GTGGTTGATA 781 TCGGTAAGCC TATCCCTAAC CCTCTCCTCG GTCTCGATTC TACGTAGTAA TGAACTAGTC 841 CGTAACTTGA AAGTATTTCG ATTTCTTGGC TTTATATATC TTGTGGAAAG GACGAGCCTA 901 CTCCGCGGCA ACCGCGGTGA CGCGTTAAGT Cgacaatcaa cctctggatt acaaaatttg 961 tgaaagatt