Construct: ORF TRCN0000475097
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF018392.1_s317c1
- Derived from: ccsbBroadEn_07733
- DNA Barcode: TTAGTCGGTCTGTCTCAGTTTTGC
- Epitope Tag: V5
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: UPK1A (11045)
Vector Information:
- Vector Backbone: pLX_317
- Pol II Cassette 1: SV40-PuroR
- Pol II Cassette 2: EF1a-TRCN0000475097
- Selection Marker: PuroR
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
| | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
|---|---|---|---|---|---|---|---|---|
| 1 | human | 11045 | UPK1A | uroplakin 1A | NM_007000.3 | 99.8% | 99.6% | 770T>C |
| 2 | human | 11045 | UPK1A | uroplakin 1A | NM_001281443.1 | 82.6% | 75.9% | 647_744del;819_820ins53 |
| 3 | mouse | 109637 | Upk1a | uroplakin 1A | NM_026815.2 | 87.7% | 94.1% | (many diffs) |

- Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as total nt. matches / aligned length (incl. gaps).
- Prot. Match %: a simple amino acid-based global alignment percentage, calculated as total aa. matches / aligned length (incl. gaps).
- Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide.
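The match percentages are defined as matches divided by aligned length, gaps included. A minimal Python sketch of that ratio follows. It is an illustration only, not the portal's implementation: the function name `match_percent`, the use of `-` as the gap character, and the toy aligned strings are all assumptions, and the sketch does not perform the alignment itself.

```python
def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over a gapped pairwise alignment.

    Both inputs must already be aligned (equal length, gaps marked
    with `gap`). The denominator is the full aligned length,
    including gap columns, as in the definition above.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    a, b = aligned_a.upper(), aligned_b.upper()
    # A column counts as a match only if both residues agree and
    # neither is a gap.
    matches = sum(1 for x, y in zip(a, b) if x == y and x != gap)
    return 100.0 * matches / len(a)


# Toy alignment (hypothetical, not taken from this record):
# 4 matching columns out of 6 aligned columns.
toy_percent = match_percent("ACGT-A", "ACCTTA")
print(f"{toy_percent:.1f}%")
```

The same function applies to amino acid alignments, since it only compares characters column by column.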
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 69
- ORF end: 843
- ORF length: 774
- Sequence:

         1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
        61 ttggcaccat ggcgtctgcg gcagcagcgg aggccgagaa gggatctcca gttgtggtgg
       121 gcctgctagt tgtgggcaat atcattattc tgctgtcagg cctgtccctg tttgctgaga
       181 ccatatgggt gacagccgac cagtaccgtg tatacccact gatgggagtc tcaggcaagg
       241 atgacgtctt cgctggtgcc tggattgcca tcttctgcgg cttctccttc ttcatggtag
       301 ccagttttgg tgtgggtgcc gcactctgcc gccgccggtc catggtcctc acgtacctgg
       361 tgctcatgct catcgtctac atcttcgagt gcgcctcctg catcacgtcc tacacccacc
       421 GTGACTACAT GGTGTCCAAC CCATCCCTGA TCACCAAGCA GATGCTGACC TTCTACAGCG
       481 CGGACACCGA CCAGGGCCAG GAGCTGACCC GCCTCTGGGA CCGCGTCATG ATTGAGCAAG
       541 AATGCTGTGG CACATCTGGT CCCATGGACT GGGTGAACTT CACGTCAGCC TTCCGGGCGG
       601 CCACTCCGGA GGTGGTGTTC CCCTGGCCCC CACTGTGCTG TCGCCGGACG GGAAACTTCA
       661 TCCCCCTCAA CGAGGAGGGC TGCCGCCTGG GGCACATGGA CTACCTGTTC ACCAAGGGCT
       721 GCTTCGAACA CATCGGCCAC GCCATCGACA GCTACACGTG GGGTATCTCG TGGTTTGGGT
       781 TTGCCATCCT GATGTGGACG CTCCCGGTCA TGCTGATAGC CATGTATTTC TACACCACGC
       841 TCTTGCCAAC TTTCTTGTAC AAAGTGGTTG ATATCGGTAA GCCTATCCCT AACCCTCTCC
       901 TCGGTCTCGA TTCTACGTAG TAATGAACTA GTCCGTAACT TGAAAGTATT TCGATTTCTT
       961 GGCTTTATAT ATCTTGTGGA AAGGACGATT AGTCGGTCTG TCTCAGTTTT GCACGCGTTA
      1021 AGTCgacaat caacctctgg attacaaaat ttgtgaaaga tt
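
As an illustrative sketch (not part of the portal record), the Python below strips the position numbers from the sequence listing, slices out the ORF, and looks for the DNA barcode. It assumes the listed coordinates are 1-based and that the ORF end (843) is exclusive, since that reading yields the stated ORF length of 774. The names `RAW`, `parse_numbered_sequence`, and `BARCODE` are hypothetical.

```python
import re

# Sequence listing copied from this record (position numbers included).
RAW = """
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag
61 ttggcaccat ggcgtctgcg gcagcagcgg aggccgagaa gggatctcca gttgtggtgg
121 gcctgctagt tgtgggcaat atcattattc tgctgtcagg cctgtccctg tttgctgaga
181 ccatatgggt gacagccgac cagtaccgtg tatacccact gatgggagtc tcaggcaagg
241 atgacgtctt cgctggtgcc tggattgcca tcttctgcgg cttctccttc ttcatggtag
301 ccagttttgg tgtgggtgcc gcactctgcc gccgccggtc catggtcctc acgtacctgg
361 tgctcatgct catcgtctac atcttcgagt gcgcctcctg catcacgtcc tacacccacc
421 GTGACTACAT GGTGTCCAAC CCATCCCTGA TCACCAAGCA GATGCTGACC TTCTACAGCG
481 CGGACACCGA CCAGGGCCAG GAGCTGACCC GCCTCTGGGA CCGCGTCATG ATTGAGCAAG
541 AATGCTGTGG CACATCTGGT CCCATGGACT GGGTGAACTT CACGTCAGCC TTCCGGGCGG
601 CCACTCCGGA GGTGGTGTTC CCCTGGCCCC CACTGTGCTG TCGCCGGACG GGAAACTTCA
661 TCCCCCTCAA CGAGGAGGGC TGCCGCCTGG GGCACATGGA CTACCTGTTC ACCAAGGGCT
721 GCTTCGAACA CATCGGCCAC GCCATCGACA GCTACACGTG GGGTATCTCG TGGTTTGGGT
781 TTGCCATCCT GATGTGGACG CTCCCGGTCA TGCTGATAGC CATGTATTTC TACACCACGC
841 TCTTGCCAAC TTTCTTGTAC AAAGTGGTTG ATATCGGTAA GCCTATCCCT AACCCTCTCC
901 TCGGTCTCGA TTCTACGTAG TAATGAACTA GTCCGTAACT TGAAAGTATT TCGATTTCTT
961 GGCTTTATAT ATCTTGTGGA AAGGACGATT AGTCGGTCTG TCTCAGTTTT GCACGCGTTA
1021 AGTCgacaat caacctctgg attacaaaat ttgtgaaaga tt
"""

BARCODE = "TTAGTCGGTCTGTCTCAGTTTTGC"  # DNA Barcode field of this record


def parse_numbered_sequence(text: str) -> str:
    """Remove position numbers and whitespace, preserving base case."""
    return re.sub(r"[\d\s]", "", text)


seq = parse_numbered_sequence(RAW)

# Coordinates as listed in the record. Assumption: start is 1-based
# inclusive and end is exclusive, so the slice has length end - start.
orf_start, orf_end = 69, 843
orf = seq[orf_start - 1 : orf_end - 1]

# 1-based position of the barcode in the full sequence (0 if absent).
barcode_pos = seq.upper().find(BARCODE) + 1

print("ORF length:", len(orf))
print("ORF begins with:", orf[:6])
print("Barcode found at position:", barcode_pos)
```

Because case is preserved, the uppercase (empirically verified) portion can still be distinguished after parsing, for example with `sum(base.isupper() for base in seq)`.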