Construct: ORF ccsbBroadEn_07501
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF001368.1_s300c1, BRDN0000386812
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: TOX4 (9878)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
- Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as (total nt. matches) / (aligned length, incl. gaps).
- Prot. Match %: a simple amino acid-based global alignment percentage, calculated as (total aa. matches) / (aligned length, incl. gaps).
- Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 9878 | TOX4 | TOX high mobility group box... | NM_014828.4 | 99.8% | 100% | 1041T>C;1194G>A |
2 | human | 9878 | TOX4 | TOX high mobility group box... | NM_001303523.2 | 96.1% | 96.1% | (many diffs) |
3 | mouse | 268741 | Tox4 | TOX high mobility group box... | NM_023434.3 | 89.5% | 95% | (many diffs) |
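
The match percentages in this table are described as total matches divided by aligned length, including gaps. A minimal Python sketch of that ratio follows. The function name `match_percent`, the `-` gap character, and the toy alignment are illustrative assumptions; the page does not state which aligner or rounding rule produced the values above.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length, gaps included.

    Both inputs must already be globally aligned (same length, with
    `gap` marking inserted positions). Comparison ignores letter case.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    a, b = aligned_a.upper(), aligned_b.upper()
    # A column counts as a match only if both residues agree and neither is a gap.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matches / len(a)


# Toy alignment (hypothetical data): 4 matching columns out of 6.
print(round(match_percent("ACGT-A", "ACCTTA"), 1))  # 66.7
```

Because gap columns count toward the denominator, an insertion or deletion lowers the percentage just as a substitution does.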
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 66
- ORF end: 1929
- ORF length: 1863
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGA GTTTCCCGGA GGAAATGACA ATTACCTGAC GATCACAGGG CCTTCGCACC
121 CCTTCCTGTC AGGGGCCGAG ACATTCCATA CACCAAGCTT GGGTGATGAG GAATTTGAAA
181 TCCCACCTAT CTCCTTGGAT TCTGATCCCT CATTGGCTGT CTCAGATGTG GTTGGCCACT
241 TTGATGACCT GGCAGACCCT TCCTCTTCAC AGGATGGCAG TTTTTCAGCC CAGTATGGGG
301 TCCAGACATT GGACATGCCT GTGGGCATGA CCCATGGCTT GATGGAGCAG GGCGGGGGGC
361 TCCTGAGTGG GGGCTTGACC ATGGACTTGG ACCACTCTAT AGGAACTCAG TATAGTGCCA
421 ACCCACCTGT TACAATTGAT GTACCAATGA CAGACATGAC ATCTGGCTTG ATGGGGCATA
481 GCCAGTTGAC CACCATTGAT CAGTCAGAAC TGAGTTCCCA GCTGGGTTTG AGCCTAGGGG
541 GTGGCACCAT CCTGCCACCT GCCCAGTCAC CTGAAGATCG TCTTTCAACC ACCCCTTCAC
601 CTACTAGTTC ACTTCACGAG GATGGTGTTG AGGATTTCCG GAGGCAACTT CCCAGCCAGA
661 AGACAGTCGT GGTGGAAGCA GGGAAAAAGC AGAAGGCCCC AAAGAAGAGA AAAAAGAAAG
721 ATCCTAATGA ACCTCAGAAA CCAGTTTCAG CATATGCTTT ATTCTTTCGT GATACACAGG
781 CTGCCATCAA GGGACAGAAT CCTAATGCCA CTTTTGGTGA GGTTTCAAAA ATTGTGGCCT
841 CCATGTGGGA TAGTCTTGGA GAGGAGCAAA AACAGGTATA TAAGAGGAAA ACTGAGGCTG
901 CCAAGAAAGA GTATCTGAAG GCACTGGCTG CTTACAAAGA CAACCAGGAG TGTCAGGCCA
961 CTGTGGAAAC AGTGGAATTG GATCCAGCAC CACCATCACA AACTCCTTCT CCACCTCCTA
1021 TGGCTACTGT TGACCCAGCA TCTCCAGCAC CAGCTTCAAT AGAGCCCCCT GCCCTGTCCC
1081 CATCCATTGT TGTTAACTCC ACCCTCTCAT CCTATGTGGC AAACCAGGCA TCTTCTGGAG
1141 CTGGGGGTCA GCCCAATATC ACCAAGTTGA TTATTACCAA ACAAATGTTG CCCTCTTCTA
1201 TTACTATGTC TCAAGGAGGG ATGGTTACTG TTATCCCAGC CACAGTGGTG ACCTCCCGAG
1261 GGCTCCAACT AGGCCAAACC AGTACAGCTA CTATCCAGCC CAGTCAACAA GCCCAGATTG
1321 TCACTCGGTC AGTGTTGCAG GCAGCAGCAG CTGCTGCTGC TGCTGCTTCT ATGCAACTGC
1381 CTCCACCCCG ACTACAGCCC CCTCCATTAC AACAGATGCC ACAGCCCCCG ACTCAGCAGC
1441 AAGTTACCAT TCTGCAGCAG CCTCCTCCAC TCCAGGCCAT GCAACAGCCT CCACCTCAGA
1501 AAGTTCGAAT CAATTTACAG CAACAGCCTC CTCCTCTGCA GATCAAGAGT GTGCCTCTAC
1561 CCACTTTGAA AATGCAGACT ACCTTAGTCC CACCAACTGT GGAAAGTAGT CCTGAGCGGC
1621 CTATGAACAA CAGCCCTGAG GCCCATACAG TGGAGGCACC TTCTCCTGAG ACTATCTGTG
1681 AGATGATCAC AGATGTAGTT CCTGAGGTTG AGTCTCCTTC TCAGATGGAT GTTGAATTGG
1741 TGAGTGGGTC TCCTGTGGCA CTCTCACCCC AGCCTCGATG TGTGAGGTCT GGTTGTGAGA
1801 ACCCTCCCAT TGTGAGTAAG GACTGGGACA ATGAATACTG CAGCAATGAG TGTGTGGTGA
1861 AGCACTGCAG GGATGTATTC TTGGCCTGGG TAGCCTCTAG AAATTCAAAC ACAGTGGTGT
1921 TTGTGAAATA CCCAACTTTC TTGTACAAAG Ttggcattat aagaaagcat tgcttatcaa
1981 tttgttgcaa cgaac
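
To work with the listing above programmatically, the position numbers have to be stripped before the ORF coordinates can be applied. The sketch below does that in Python and checks the first codon. It assumes the ORF start is a 1-based position, and the helper names `parse_numbered_sequence` and `extract_orf` are hypothetical. It relies on the start and length only: with start 66 and length 1863 the last ORF base falls at position 1928, and the page does not say how the listed end of 1929 should be read.

```python
def parse_numbered_sequence(text: str) -> str:
    """Remove position numbers and whitespace from a numbered sequence dump."""
    # Tokens made only of digits are position markers; everything else is sequence.
    return "".join(tok for tok in text.split() if not tok.isdigit())


def extract_orf(seq: str, start: int, length: int) -> str:
    """Return `length` bases beginning at 1-based position `start`."""
    return seq[start - 1:start - 1 + length]


# Positions 1-120 of the sequence listed above.
excerpt = """
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGGA GTTTCCCGGA GGAAATGACA ATTACCTGAC GATCACAGGG CCTTCGCACC
"""

seq = parse_numbered_sequence(excerpt)
first_codon = extract_orf(seq, start=66, length=3)
print(len(seq), first_codon)  # 120 ATG
```

Passing the full listing and `length=1863` instead of the excerpt would return the whole insert. Letter case is preserved, so the uppercase (empirically verified) bases remain distinguishable.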