Construct: ORF TRCN0000471846
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF004067.1_s317c1
- Derived from:
- ccsbBroadEn_06597
- DNA Barcode:
- ACTTCCCCTTAACCTATTTACGTA
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- MSX2 (4488)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000471846
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
- Nuc. Match %:
- A simple nucleotide-based global alignment percentage, calculated as (total nt. matches) / (aligned length, incl. gaps)
- Prot. Match %:
- A simple amino acid-based global alignment percentage, calculated as (total aa. matches) / (aligned length, incl. gaps)
- Match Diffs:
- This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 4488 | MSX2 | msh homeobox 2 | NM_002449.5 | 99.8% | 99.6% | 386T>C |
2 | human | 4488 | MSX2 | msh homeobox 2 | NM_001363626.2 | 49.3% | 47.5% | (many diffs) |
3 | human | 55545 | MSX2P1 | msh homeobox 2 pseudogene 1 | NR_002307.1 | 34% | | (many diffs) |
4 | mouse | 17702 | Msx2 | msh homeobox 2 | NM_013601.2 | 87.8% | 92.1% | (many diffs) |
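
The match percentages in this table are defined as total matches divided by aligned length, including gaps. The following is a minimal sketch of that calculation, assuming the two sequences have already been globally aligned and padded with `-` for gaps. The function name `match_percent` is hypothetical, and the aligner the portal actually uses is not stated in this record.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the full aligned length (gap columns included).

    Assumes both inputs are rows of the same global alignment, so they
    must have equal length. Comparison is case-insensitive.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    a, b = aligned_a.upper(), aligned_b.upper()
    # A column counts as a match only if both residues agree and neither is a gap.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matches / len(a)


# Toy alignment (not taken from this record): 8 matching columns out of 10.
print(round(match_percent("ATGGC-TTCT", "ATGGCATTCA"), 1))  # 80.0
```

As a plausibility check only: a single amino acid mismatch over 267 aligned residues gives 266 / 267, or about 99.6%, which is consistent with the Prot. Match % in row 1. The alignment itself is not shown in this record.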
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 867
- ORF length:
- 801
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61 ttggcatggc ttctccgtcc aaaggcaatg acttgttttc gcccgacgag gagggcccag
121 cagtggtggc cggaccaggc ccggggcctg ggggcgccga gggggccgcg gaggagcgcc
181 gcgtcaaggt ctccagcctg cccttcagcg tggaggcgct catgtccgac aagaagccgc
241 ccaaggaggc gtccccgctg ccggccgaaa gcgcctcggc cggggccacc ctgcggccac
301 tgctgctgtc ggggcacggc gctcgggaag cgcacagccc cgggccgctg gtgaagccct
361 tcgagaccgc cTCGGTCAAG TCGGAAAATT CAGAAGATGG AGCGGCGTGG ATGCAGGAAC
421 CCGGCCGATA TTCGCCGCCG CCAAGACATA CGAGCCCTAC CACCTGCACC CTGAGGAAAC
481 ACAAGACCAA TCGGAAGCCG CGCACGCCCT TTACCACATC CCAGCTCCTC GCCCTGGAGC
541 GCAAGTTCCG TCAGAAACAG TACCTCTCCA TTGCAGAGCG TGCAGAGTTC TCCAGCTCTC
601 TGAACCTCAC AGAGACCCAG GTCAAAATCT GGTTCCAGAA CCGAAGGGCC AAGGCGAAAA
661 GACTGCAGGA GGCAGAACTG GAAAAGCTGA AAATGGCTGC AAAACCTATG CTGCCCTCCA
721 GCTTCAGTCT CCCTTTCCCC ATCAGCTCGC CCCTGCAGGC AGCGTCCATA TATGGAGCAT
781 CCTACCCGTT CCATAGACCT GTGCTTCCCA TCCCGCCTGT GGGACTCTAT GCCACGCCAG
841 TGGGATATGG CATGTACCAC CTGTCCTACC CAACTTTCTT GTACAAAGTG GTTGATATCG
901 GTAAGCCTAT CCCTAACCCT CTCCTCGGTC TCGATTCTAC GTAGTAATGA ACTAGTCCGT
961 AACTTGAAAG TATTTCGATT TCTTGGCTTT ATATATCTTG TGGAAAGGAC GAACTTCCCC
1021 TTAACCTATT TACGTAACGC GTTAAGTCga caatcaacct ctggattaca aaatttgtga
1081 aagatt
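
The ORF coordinates above can be applied to the numbered sequence with a few lines of code. This is a minimal sketch, not the portal's own tooling. It assumes that ORF start is 1-based and inclusive and that ORF end is exclusive, because 867 - 66 = 801 matches the stated ORF length. The helper names are hypothetical, and the demonstration uses only the first 80 bases of the sequence.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def parse_numbered_sequence(text: str) -> str:
    """Drop position numbers and whitespace, keeping base letters and their case."""
    return "".join(re.findall(r"[A-Za-z]+", text))


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice the ORF out of the full insert.

    Assumption: `start` is 1-based inclusive and `end` is 1-based exclusive,
    so the result has length end - start.
    """
    return seq[start - 1:end - 1]


def has_inframe_stop(orf: str) -> bool:
    """True if any complete codon in frame 0 is a stop codon."""
    orf = orf.upper()
    return any(orf[i:i + 3] in STOP_CODONS for i in range(0, len(orf) - 2, 3))


# Excerpt: the first 80 bases of the sequence in this record.
excerpt_text = (
    "1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag "
    "61 ttggcatggc ttctccgtcc"
)
excerpt_seq = parse_numbered_sequence(excerpt_text)

# With ORF start = 66, the first codon should be the start codon.
first_codons = extract_orf(excerpt_seq, 66, 72)
print(first_codons)  # atggct
```

Run against the full sequence with `extract_orf(seq, 66, 867)`, the same slice should return 801 bases, and `has_inframe_stop` can be used to check the "No stop codon in insert" note.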