Construct: ORF TRCN0000475856
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF013554.1_s317c1
- Derived from:
- ccsbBroadEn_10600
- DNA Barcode:
- AAAGGATTTTAGCATTTTCCACAA
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- KRT18P55 (284085)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000475856
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 3875 | KRT18 | keratin 18 | NM_000224.3 | 42.6% | 35.3% | (many diffs) |
2 | human | 3875 | KRT18 | keratin 18 | NM_199187.1 | 42.6% | 35.3% | (many diffs) |
3 | human | 284085 | KRT18P55 | keratin 18 pseudogene 55 | NR_028334.1 | 39.8% | 1_487del;1265_1950del |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 69
- ORF end:
- 846
- ORF length:
- 777
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcaccat ggatcattgc ttaatttcgg gtcttagcca gttagattta ccttcagctt 121 taacaaaaaa ttggccgtca aaacctgagt cctgtcctct tgctctcctc cccggacagc 181 atgaacttca ccacttgctc caccctctcc accaactacc agtccctggg cactgtccag 241 gcacccagct atgtgcccag ctggtcagca gtgtggccag tgtctatgca ggcatcaggg 301 gctctggttc ccggatctcc atgtcctgct tcaccagctt ccagggcagc atggggtcca 361 ggggcctgcc cgcagtgatg gccgggggtc tggcaggaat gggagtcatc cagaatgaga 421 aggaaaccat gcaaagcctc aacgaccacc tggcctccta cctggacaga gttaggagcc 481 tggataccaa gaactggaag ctggagagcc aggagcacct ggagaagaag ggaccccagg 541 tcagagactg gagccatgac ttcaagacca tcgagaaccT GAGGGCTCAG ATCTTTGCAA 601 ATACTGTGGA CAGTGCCCAC ATTGTTCTGC AGATCGACAA TGCCTGTCTT GCTGGTGATG 661 ACTTTAGGGT CAAGTATGAG ACAGAGCTGG CCATGTGCCA GTCTGTGGAG AGTGACATCC 721 ATGGGCTCCA CAAGGTCATT GATGACACCA ATGTCACTTG GCTGCAGCTG GAAGCAGAGA 781 TCAAGGCTCT CAAGGAGAAG CTGCTCTTCA TGAAGAAGAA CCATGAAGAG GAAGTAAAGG 841 GCCTATTGCC AACTTTCTTG TACAAAGTGG TTGATATCGG TAAGCCTATC CCTAACCCTC 901 TCCTCGGTCT CGATTCTACG TAGTAATGAA CTAGTCCGTA ACTTGAAAGT ATTTCGATTT 961 CTTGGCTTTA TATATCTTGT GGAAAGGACG AAAAGGATTT TAGCATTTTC CACAAACGCG 1021 TTAAGTCgac aatcaacctc tggattacaa aatttgtgaa agatt