Construct: ORF ccsbBroadEn_05179
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF018298.1_s300c1, BRDN0000397803
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- THAP8 (199745)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 199745 | THAP8 | THAP domain containing 8 | NM_152658.3 | 100% | 100% | |
2 | human | 199745 | THAP8 | THAP domain containing 8 | NM_001331102.1 | 85% | 85% | 548_549ins123 |
3 | human | 199745 | THAP8 | THAP domain containing 8 | NM_001331103.1 | 84.3% | 84.3% | 0_1ins129 |
4 | human | 199745 | THAP8 | THAP domain containing 8 | NM_001331104.1 | 84.3% | 84.3% | 0_1ins129 |
5 | human | 199745 | THAP8 | THAP domain containing 8 | NR_138539.1 | 34.4% | 1_545del;628_629ins193;1175_1632del |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 888
- ORF length:
- 822
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGCC CAAGTACTGC AGGGCGCCGA ACTGCTCCAA CACTGCGGGC CGCCTGGGTG 121 CAGACAACCG CCCTGTGAGC TTCTACAAGT TCCCACTGAA GGATGGTCCC CGGCTGCAGG 181 CCTGGCTGCA GCACATGGGC TGTGAGCACT GGGTGCCCAG CTGCCACCAG CACTTGTGCA 241 GCGAGCACTT CACACCCTCC TGCTTCCAGT GGCGCTGGGG TGTGCGCTAC CTGCGGCCTG 301 ATGCAGTGCC CTCCATCTTC TCCCGGGGAC CACCTGCCAA GAGTCAGCGG AGGACCCGAA 361 GCACCCAGAA GCCAGTCTCG CCGCCGCCTC CCCTACAGAA GAATACACCC CTGCCCCAGA 421 GCCCTGCCAT CCCAGTCTCT GGCCCAGTGC GCCTAGTGGT GCTGGGCCCC ACATCGGGGA 481 GCCCCAAGAC TGTGGCCACC ATGCTCCTGA CCCCCCTGGC CCCTGCGCCA ACTCCTGAGC 541 GGTCACAACC TGAAGTCCCT GCCCAACAGG CCCAGACCGG GCTGGGCCCA GTGCTGGGAG 601 CACTGCAACG CCGGGTGCGG AGGCTGCAAC GGTGCCAGGA GCGGCACCAG GCGCAGCTGC 661 AGGCCCTGGA ACGGCTGGCA CAGCAGCTAC ACGGGGAGAG CCTGCTGGCA CGGGCACGCC 721 GGGGTCTGCA GCGCCTGACA ACAGCCCAGA CCCTTGGACC TGAGGAATCC CAAACCTTCA 781 CCATCATCTG TGGAGGGCCT GACATAGCCA TGGTCCTTGC CCAGGACCCT GCACCTGCCA 841 CAGTGGATGC CAAGCCGGAG CTCCTGGACA CTCGGATCCC CAGTGCATAC CCAACTTTCT 901 TGTACAAAGT tggcattata agaaagcatt gcttatcaat ttgttgcaac gaac