Construct: ORF ccsbBroadEn_04787
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF003663.1_s300c1, BRDN0000387901
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- USH1G (124590)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 124590 | USH1G | USH1 protein network compon... | NM_173477.5 | 100% | 100% | |
2 | human | 124590 | USH1G | USH1 protein network compon... | NM_001282489.3 | 77.6% | 77.6% | 0_1ins309 |
3 | human | 124590 | USH1G | USH1 protein network compon... | XM_011524296.2 | 77.6% | 77.6% | 0_1ins309 |
4 | mouse | 16470 | Ush1g | USH1 protein network compon... | NM_176847.3 | 90.8% | 96.3% | (many diffs) |
5 | mouse | 16470 | Ush1g | USH1 protein network compon... | XM_017314300.1 | 70.1% | 73.9% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 1452
- ORF length:
- 1383
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG 61 TTGGCACCAT GAACGACCAG TACCACCGGG CAGCCCGGGA TGGCTACCTG GAGCTCCTCA 121 AGGAGGCCAC CCGAAAGGAG CTGAATGCCC CCGACGAGGA TGGCATGACC CCCACTCTCT 181 GGGCTGCCTA CCATGGCAAC CTCGAGTCGC TGCGTCTCAT TGTGAGCCGC GGGGGTGACC 241 CGGACAAGTG TGACATCTGG GGCAACACAC CCCTGCATCT GGCAGCTTCC AATGGCCACT 301 TGCACTGCCT GTCCTTCCTG GTGTCCTTCG GAGCCAACAT CTGGTGCCTA GACAACGACT 361 ACCACACGCC GCTGGACATG GCTGCCATGA AGGGCCACAT GGAATGCGTG CGCTACCTGG 421 ACTCCATCGC GGCCAAGCAG AGCAGCCTCA ACCCCAAGCT GGTGGGTAAG CTGAAGGACA 481 AGGCCTTCCG CGAGGCGGAG CGGCGCATCC GCGAGTGCGC CAAGCTGCAG CGGAGGCACC 541 ACGAACGCAT GGAGCGGCGA TACCGGCGCG AGCTGGCCGA GCGTTCCGAC ACCCTCAGCT 601 TCTCCAGCCT CACGTCCAGC ACCCTGAGCC GCCGGCTGCA GCATCTGGCG CTGGGCAGCC 661 ACCTGCCGTA CTCTCAGGCC ACGCTGCACG GCACGGCCAG GGGCAAGACC AAGATGCAGA 721 AGAAGCTGGA GCGGCGCAAG CAGGGCGGCG AAGGCACCTT CAAGGTCTCC GAGGATGGGC 781 GCAAGAGCGC CCGCTCGCTC TCGGGCCTGC AGCTGGGCAG CGACGTGATG TTCGTGCGCC 841 AGGGCACCTA CGCCAATCCC AAGGAGTGGG GCCGAGCCCC GCTCCGGGAC ATGTTCCTCT 901 CGGACGAGGA CAGCGTCTCC CGTGCCACGC TGGCGGCCGA GCCTGCCCAC TCGGAGGTCA 961 GCACCGACTC AGGCCACGAC TCCCTGTTTA CCCGCCCCGG CCTGGGCACC ATGGTGTTCC 1021 GCAGAAATTA CTTGAGCAGT GGGCTGCACG GACTGGGCCG CGAGGATGGG GGTCTGGATG 1081 GGGTGGGAGC GCCGCGGGGT CGGCTGCAGA GCTCCCCCAG CCTGGACGAT GACAGCCTGG 1141 GCAGTGCCAA CAGCCTGCAG GACCGCAGCT GTGGGGAGGA GCTGCCCTGG GATGAGCTCG 1201 ATTTAGGCTT GGACGAGGAC CTGGAGCCCG AGACTAGCCC GCTGGAGACC TTCCTGGCCT 1261 CTCTGCACAT GGAGGACTTT GCCGCCCTCC TGCGGCAGGA GAAGATCGAC CTCGAGGCTT 1321 TGATGCTGTG CTCTGACCTC GACCTCCGCA GCATCAGCGT CCCACTGGGG CCCCGAAAGA 1381 AGATCTTGGG GGCCGTGAGG AGGCGGCGGC AGGCGATGGA GCGCCCGCCG GCCCTGGAGG 1441 ACACAGAGCT ATTGCCAACT TTCTTGTACA AAGTtggcat tataagaaag cattgcttat 1501 caatttgttg caacgaac