Construct: ORF ccsbBroadEn_06116
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF004894.1_s300c1, BRDN0000457101
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: DPP4 (1803)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
Note: Nuc. Match % and Prot. Match % are simple global alignment percentages (nucleotide-based and amino acid-based, respectively), each calculated as total matches / aligned length (incl. gaps). Match Diffs may contain sequence annotations in HGVS format.
# | Taxon | Gene ID | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 1803 | DPP4 | dipeptidyl peptidase 4 | NM_001935.4 | 99.9% | 99.8% | 278G>A |
2 | human | 1803 | DPP4 | dipeptidyl peptidase 4 | XM_005246371.3 | 99.8% | 99.6% | 94_95insCAG;275G>A |
3 | mouse | 13482 | Dpp4 | dipeptidylpeptidase 4 | NM_010074.3 | 85.4% | 84.3% | (many diffs) |
4 | mouse | 13482 | Dpp4 | dipeptidylpeptidase 4 | NM_001159543.1 | 82.2% | 80.6% | (many diffs) |
5 | mouse | 13482 | Dpp4 | dipeptidylpeptidase 4 | XM_006498692.4 | 76.1% | 74.4% | (many diffs) |
6 | mouse | 13482 | Dpp4 | dipeptidylpeptidase 4 | XM_011239274.3 | 75.8% | 71% | (many diffs) |
7 | mouse | 13482 | Dpp4 | dipeptidylpeptidase 4 | XM_006498691.4 | 75.6% | 74.5% | (many diffs) |
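The match percentages in this table are defined as total matches divided by aligned length, gaps included. A minimal Python sketch of that formula follows. The function names `match_percent` and `truncate_1dp` are hypothetical, the toy alignment is invented for illustration, and the record does not state how values are rounded for display. Truncating to one decimal is an assumption; it would give 99.9% and 99.8% for row 1 if the single 278G>A substitution were the only difference across a 2298 nt / 766 aa alignment.

```python
import math


def match_percent(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return total matches / aligned length (incl. gaps) as a percentage.

    Both inputs must already be globally aligned, i.e. equal in length,
    with `gap` marking gap positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap  # a gap column never counts as a match
    )
    return 100.0 * matches / len(aligned_a)


def truncate_1dp(percent: float) -> float:
    """Truncate (not round) to one decimal place; an assumed display rule."""
    return math.floor(percent * 10) / 10


# Toy alignment (invented): 5 matching columns out of 6, one gap column.
print(truncate_1dp(match_percent("ACGT-A", "ACGTTA")))  # 83.3

# One mismatch over 2298 aligned nucleotides, and over 766 aligned residues.
print(truncate_1dp(100.0 * 2297 / 2298))  # 99.9
print(truncate_1dp(100.0 * 765 / 766))    # 99.8
```

Because the denominator includes gap columns, an insertion such as 94_95insCAG lowers the percentage even though no base is substituted there.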
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 66
- ORF end: 2364
- ORF length: 2298
- Sequence:
   1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
  61 TTGGCATGAA GACACCGTGG AAGGTTCTTC TGGGACTGCT GGGTGCTGCT GCGCTTGTCA
 121 CCATCATCAC CGTGCCCGTG GTTCTGCTGA ACAAAGGCAC AGATGATGCT ACAGCTGACA
 181 GTCGCAAAAC TTACACTCTA ACTGATTACT TAAAAAATAC TTATAGACTG AAGTTATACT
 241 CCTTAAGATG GATTTCAGAT CATGAATATC TCTACAAACA AGAAAATAAT ATCTTGGTAT
 301 TCAATGCTGA ATATGGAAAC AGCTCAGTTT TCTTGGAGAA CAATACATTT GATGAGTTTG
 361 GACATTCTAT CAATGATTAT TCAATATCTC CTGATGGGCA GTTTATTCTC TTAGAATACA
 421 ACTACGTGAA GCAATGGAGG CATTCCTACA CAGCTTCATA TGACATTTAT GATTTAAATA
 481 AAAGGCAGCT GATTACAGAA GAGAGGATTC CAAACAACAC ACAGTGGGTC ACATGGTCAC
 541 CAGTGGGTCA TAAATTGGCA TATGTTTGGA ACAATGACAT TTATGTTAAA ATTGAACCAA
 601 ATTTACCAAG TTACAGAATC ACATGGACGG GGAAAGAAGA TATAATATAT AATGGAATAA
 661 CTGACTGGGT TTATGAAGAG GAAGTCTTCA GTGCCTACTC TGCTCTGTGG TGGTCTCCAA
 721 ACGGCACTTT TTTAGCATAT GCCCAATTTA ACGACACAGA AGTCCCACTT ATTGAATACT
 781 CCTTCTACTC TGATGAGTCA CTGCAGTACC CAAAGACTGT ACGGGTTCCA TATCCAAAGG
 841 CAGGAGCTGT GAATCCAACT GTAAAGTTCT TTGTTGTAAA TACAGACTCT CTCAGCTCAG
 901 TCACCAATGC AACTTCCATA CAAATCACTG CTCCTGCTTC TATGTTGATA GGGGATCACT
 961 ACTTGTGTGA TGTGACATGG GCAACACAAG AAAGAATTTC TTTGCAGTGG CTCAGGAGGA
1021 TTCAGAACTA TTCGGTCATG GATATTTGTG ACTATGATGA ATCCAGTGGA AGATGGAACT
1081 GCTTAGTGGC ACGGCAACAC ATTGAAATGA GTACTACTGG CTGGGTTGGA AGATTTAGGC
1141 CTTCAGAACC TCATTTTACC CTTGATGGTA ATAGCTTCTA CAAGATCATC AGCAATGAAG
1201 AAGGTTACAG ACACATTTGC TATTTCCAAA TAGATAAAAA AGACTGCACA TTTATTACAA
1261 AAGGCACCTG GGAAGTCATC GGGATAGAAG CTCTAACCAG TGATTATCTA TACTACATTA
1321 GTAATGAATA TAAAGGAATG CCAGGAGGAA GGAATCTTTA TAAAATCCAA CTTAGTGACT
1381 ATACAAAAGT GACATGCCTC AGTTGTGAGC TGAATCCGGA AAGGTGTCAG TACTATTCTG
1441 TGTCATTCAG TAAAGAGGCG AAGTATTATC AGCTGAGATG TTCCGGTCCT GGTCTGCCCC
1501 TCTATACTCT ACACAGCAGC GTGAATGATA AAGGGCTGAG AGTCCTGGAA GACAATTCAG
1561 CTTTGGATAA AATGCTGCAG AATGTCCAGA TGCCCTCCAA AAAACTGGAC TTCATTATTT
1621 TGAATGAAAC AAAATTTTGG TATCAGATGA TCTTGCCTCC TCATTTTGAT AAATCCAAGA
1681 AATATCCTCT ACTATTAGAT GTGTATGCAG GCCCATGTAG TCAAAAAGCA GACACTGTCT
1741 TCAGACTGAA CTGGGCCACT TACCTTGCAA GCACAGAAAA CATTATAGTA GCTAGCTTTG
1801 ATGGCAGAGG AAGTGGTTAC CAAGGAGATA AGATCATGCA TGCAATCAAC AGAAGACTGG
1861 GAACATTTGA AGTTGAAGAT CAAATTGAAG CAGCCAGACA ATTTTCAAAA ATGGGATTTG
1921 TGGACAACAA ACGAATTGCA ATTTGGGGCT GGTCATATGG AGGGTACGTA ACCTCAATGG
1981 TCCTGGGATC GGGAAGTGGC GTGTTCAAGT GTGGAATAGC CGTGGCGCCT GTATCCCGGT
2041 GGGAGTACTA TGACTCAGTG TACACAGAAC GTTACATGGG TCTCCCAACT CCAGAAGACA
2101 ACCTTGACCA TTACAGAAAT TCAACAGTCA TGAGCAGAGC TGAAAATTTT AAACAAGTTG
2161 AGTACCTCCT TATTCATGGA ACAGCAGATG ATAACGTTCA CTTTCAGCAG TCAGCTCAGA
2221 TCTCCAAAGC CCTGGTCGAT GTTGGAGTGG ATTTCCAGGC AATGTGGTAT ACTGATGAAG
2281 ACCATGGAAT AGCTAGCAGC ACAGCACACC AACATATATA TACCCACATG AGCCACTTCA
2341 TAAAACAATG TTTCTCTTTA CCTTACCCAA CTTTCTTGTA CAAAGTTGGc attataagaa
2401 agcattgctt atcaatttgt tgcaacgaac
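The ORF coordinates and the uppercase/lowercase convention above can be checked programmatically. The Python sketch below strips the position counters from a numbered sequence listing, reports the uppercase (empirically verified) span, and slices the ORF. All function names are hypothetical. It assumes that ORF start is 1-based and that ORF end is exclusive; the record does not say so, but 2364 - 66 = 2298 matches the stated ORF length. The excerpt contains only the first 120 bases of the record.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Drop position counters and whitespace; keep the original letter case."""
    return "".join(re.findall(r"[A-Za-z]+", text))


def orf_length(start: int, end: int) -> int:
    """Length under the assumed convention: 1-based start, exclusive end."""
    return end - start


def orf_slice(seq: str, start: int, end: int) -> str:
    """Extract the ORF using the same assumed coordinate convention."""
    return seq[start - 1 : end - 1]


def verified_span(seq: str):
    """1-based inclusive positions of the first and last uppercase base."""
    upper = [i for i, base in enumerate(seq, start=1) if base.isupper()]
    return (upper[0], upper[-1]) if upper else None


# First two lines of the sequence listing in this record.
excerpt = """
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG
61 TTGGCATGAA GACACCGTGG AAGGTTCTTC TGGGACTGCT GGGTGCTGCT GCGCTTGTCA
"""

seq = parse_numbered_sequence(excerpt)
print(len(seq))                 # 120
print(verified_span(seq))       # (46, 120)
print(orf_slice(seq, 66, 69))   # ATG, the first codon at ORF start 66

length = orf_length(66, 2364)
print(length, length % 3)       # 2298 0, i.e. 766 whole codons
```

Run against the full listing, `orf_slice(seq, 66, 2364)` would return the complete insert, which per the notes above carries no stop codon.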