Construct: ORF ccsbBroadEn_06595
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF015637.1_s300c1, BRDN0000385472
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- MSH4 (4438)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 4438 | MSH4 | mutS homolog 4 | NM_002440.4 | 99.8% | 99.8% | (many diffs) |
2 | mouse | 55993 | Msh4 | mutS homolog 4 | NM_031870.3 | 81% | 81.7% | (many diffs) |
3 | mouse | 55993 | Msh4 | mutS homolog 4 | NM_001282054.1 | 69.9% | 75.3% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2874
- ORF length:
- 2808
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGCT GAGGCCTGAG ATCTCATCAA CCTCGCCTTC TGCCCCGGCG GTTTCCCCGT 121 CGTCGGGAGA AACCCGCTCA CCTCAGGGTC CCCGCTACAA TTTCGGACTC CAGGAGACTC 181 CACAGAGCCG CCCTTCGGTC CAGGTGGTCT CTGCATCCAC CTGTCCTGGC ACGTCAGGAG 241 CTGCGGGCGA CCGGAGCAGC AGCAGCAGCA GCCTTCCCTG CCCCGCGCCA AACTCCCGGC 301 CAGCTCAAGG TTCATACTTT GGAAACAAAA GAGCTTATGC AGAGAACACA GTTGCATCAA 361 ATTTTACTTT TGGTGCAAGC TCATCTTCTG CACGAGATAC TAATTATCCT CAAACACTTA 421 AAACTCCATT GTCTACTGGA AATCCTCAGA GATCAGGTTA TAAGAGCTGG ACACCACAAG 481 TGGGATATTC AGCTTCATCC TCATCTGCGA TTTCTGCACA CTCCCCATCA GTTATTGTAG 541 CTGTTGTAGA AGGGAGAGGA CTTGCCAGAG GTGAAATAGG AATGGCAAGT ATTGATTTAA 601 AAAACCCCCA AATTATACTA TCCCAGTTTG CAGACAACAC AACATATGCA AAGGTGATCA 661 CTAAACTTAA AATTTTATCA CCTTTGGAAA TAATAATGTC AAATACTGCT TGTGCTGTGG 721 GGAATTCCAC CAAGTTGTTC ACTCTGATCA CAGAAAATTT CAAGAATGTT AATTTCACTA 781 CTATCCAAAG GAAATACTTC AATGAAACAA AAGGATTAGA GTACATTGAA CAGTTATGCA 841 TAGCAGAATT CAGCACTGTC CTAATGGAGG TTCAGTCCAA GTATTACTGC CTTGCAGCTG 901 TTGCAGCTTT GTTAAAATAT GTTGAATTTA TTCAAAATTC AGTTTATGCA CCAAAATCAC 961 TGAAGATTTG TTTCCAGGGT AGTGAACAGA CAGCCATGAT AGATTCATCA TCAGCCCAAA 1021 ACCTTGAATT GTTAATTAAT AATCAAGACT ATAGGAATAA TCACACTCTC TTTGGTGTTC 1081 TAAATTATAC TAAGACTCCT GGAGGGAGTA GACGACTTCG TTCTAATATA TTAGAGCCTC 1141 TAGTTGATAT TGAAACCGTT AACATGAGAT TAGATTGTGT TCAAGAACTA CTTCAAGATG 1201 AGGAACTATT TTTTGGACTT CAATCAGTTA TATCAAGATT TCTTGATACA GAGCAGCTTC 1261 TTTCTGTTTT AGTCCAAATT CCAAAGCAAG ACACGGTCAA TGCTGCTGAA TCAAAGATAA 1321 CAAATTTAAT ATACTTAAAA CATACCTTGG AACTTGTGGA TCCTTTAAAG ATTGCTATGA 1381 AGAACTGTAA CACACCTTTA TTAAGAGCTT ACTATGGTTC CTTGGAAGAC AAGAGGTTTG 1441 GAATCATACT TGAAAAGATT AAAACAGTAA TTAATGATGA TGCAAGATAC ATGAAAGGAT 1501 GCCTAAACAT GAGGACTCAG AAGTGCTATG CAGTGAGGTC TAACATAAAT GAATTTCTTG 1561 ACATAGCAAG AAGAACATAC ACAGAGATTG TAGATGACAT AGCAGGAATG ATATCACAAC 1621 TTGGAGAAAA ATACAGTCTT CCTTTAAGGA CAAGTTTTAG CTCTGCTCGA GGATTTTTCA 1681 TCCAGATGAC TACAGATTGT ATAGCCCTAC CTAGTGATCA ACTTCCTTCA GAATTTATTA 1741 AGATTTCTAA AGTGAAAAAT TCTTACAGCT TTACATCAGC AGATTTAATT AAAATGAATG 1801 AAAGATGCCA AGAATCTTTG AGAGAAATCT ATCACATGAC TTATATGATA GTGTGCAAAC 1861 TGCTTAGTGA GATTTATGAA CATATTCATT GCTTATATAA ACTATCTGAC ACTGTGTCAA 1921 TGCTGGATAT GCTACTGTCA TTTGCTCATG CCTGCACTCT TTCTGACTAT GTTCGACCAG 1981 AATTTACTGA TACTTTAGCA ATCAAACAGG GATGGCATCC TATTCTTGAA AAAATATCTG 2041 CGGAAAAACC TATTGCCAAC AATACCTATG TTACAGAAGG GAGTAATTTT TTGATCATAA 2101 CTGGACCAAA CATGAGTGGA AAATCCACAT ATTTAAAACA GATTGCTCTT TGTCAGATTA 2161 TGGCCCAGAT TGGATCATAT GTTCCAGCAG AATATTCTTC CTTTAGAATT GCTAAACAGA 2221 TTTTTACAAG AATTAGTACT GATGATGATA TCGAAACAAA TTCATCAACA TTTATGAAAG 2281 AAATGAAAGA GATAGCATAT ATTCTACATA ATGCTAATGA CAAATCGCTC ATATTAATTG 2341 ATGAACTTGG CAGAGGTACT AATACGGAAG AAGGTATTGG CATTTGTTAT GCTGTTTGTG 2401 AATATCTACT GAGCTTAAAG GCATTTACAC TGTTTGCTAC ACATTTCCTG GAACTATGCC 2461 ATATTGATGC CCTGTATCCT AATGTAGAAA ACATGCATTT TGAAGTTCAA CATGTAAAGA 2521 ATACCTCAAG AAATAAAGAA GCAATTTTGT ATACCTACAA ACTTTCTAAG GGACTCACAG 2581 AAGAGAAAAA TTATGGATTA AAAGCTGCAG AGGTGTCATC ACTTCCACCA TCAATTGTCT 2641 TGGATGCCAA GGAAATCACA ACTCAAATTA CGAGACAAAT TTTGCAAAAC CAAAGGAGTA 2701 CCCCTGAGAT GGAAAGACAG AGAGCTGTGT ACCATCTAGC CACTAGGCTT GTTCAAACTG 2761 CTCGAAACTC TCAATTGGAT CCAGACAGTT TACGAATATA TTTAAGTAAC CTCAAGAAGA 2821 AGTACAAAGA AGATTTTCCC AGGACTGAAC AAGTTCCAGA AAAGACTGAA GAATACCCAA 2881 CTTTCTTGTA CAAAGTtggc attataagaa agcattgctt atcaatttgt tgcaacgaac