Construct: ORF ccsbBroadEn_12116
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF017690.1_s300c1, BRDN0000396243
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- SOHLH2 (54937)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 54937 | SOHLH2 | spermatogenesis and oogenes... | NM_001282147.1 | 100% | 100% | |
2 | human | 54937 | SOHLH2 | spermatogenesis and oogenes... | NM_017826.3 | 51.6% | 50.2% | (many diffs) |
3 | human | 100526761 | CCDC169-SOHLH2 | CCDC169-SOHLH2 readthrough | NM_001198910.2 | 43.1% | 39.5% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 741
- ORF length:
- 675
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACGAAAAAG 61 TTGGCATGGC TTCCTCAATT ATCTGCCAGG AGCACTGCCA GATCTCGGGC CAGGCAAAAA 121 TAGACATCTT ATTAGTTGGA GATGTCACTG TGGGCTACCT GGCTGATACT GTACAGAAAC 181 TATTTGCAAA CATAGCAGAA GTCACCATCA CCATCAGTGA CACGAAGGAG GCAGCAGCGC 241 TTTTGGATGA TTGCATATTC AACATGGTTC TCTTGAAGGT GCCTTCTTCA CTAAGTGCCG 301 AGGAGCTGGA AGCCATCAAG TTAATTAGAT TTGGCAAAAA GAAAAATACA CATTCACTGT 361 TTGTTTTTAT AATCCCTGAA AATTTTAAAG GTTGTATTTC AGGGCATGGA ATGGATATTG 421 CTTTAACTGA ACCACTGACA ATGGAAAAAA TGAGTAATGT GGTAAAATAC TGGACAACAT 481 GTCCCTCAAA CACTGTTAAG ACTGAAAACG CAACTGGGCC TGAAGAACTT GGATTGCCCC 541 TGCAGAGGTC CTACAGCGAA CACCTGGGAT ATTTTCCTAC TGATCTATTT GCCTGCTCTG 601 AATCTTTAAG GAATGGCAAT GGGCTTGAAT TAAATGCTTC GTTGTCAGAG TTCGAGAAAA 661 ACAAAAAGAT CTCTCTTCTT CATTCAAGCA AGGAAAAACT AAGAAGGCTG TACAGGAAGC 721 ATAGCAGCTT CTGTTTCTGG TGCCCAACTT TCTTGTACAA AGTtggcatt ataagaaagc 781 attgcttatc aatttgttgc aacgaac