Construct: ORF ccsbBroadEn_02765
Construct Description:
- Construct Type: ORF
- Other Identifiers: ORF013270.1_s300c1, BRDN0000395049
- DNA Barcode: None
- Epitope Tag: None
- Notes: No stop codon in insert
Originally Annotated References:
- Gene: AP4E1 (23431)
Vector Information:
- Vector Backbone: pDONR223
- Pol II Cassette 1: n/a
- Pol II Cassette 2: n/a
- Selection Marker: n/a
- Visible Reporter: n/a
- Epitope Tag: n/a
Current transcripts matched by this ORF:
Nuc. Match %: a simple nucleotide-based global alignment percentage, calculated as (total nt. matches) / (aligned length, incl. gaps).
Prot. Match %: a simple amino acid-based global alignment percentage, calculated as (total aa. matches) / (aligned length, incl. gaps).
Match Diffs: this field may contain sequence annotations in HGVS format. For more information about HGVS annotations, refer to the HGVS Quick Reference Guide.

# | Taxon | Gene | Symbol | Description | Transcript | Nuc. Match % | Prot. Match % | Match Diffs |
---|---|---|---|---|---|---|---|---|
1 | human | 23431 | AP4E1 | adaptor related protein com... | NM_007347.5 | 100% | 100% | |
2 | human | 23431 | AP4E1 | adaptor related protein com... | NM_001252127.2 | 93.4% | 93.4% | 0_1ins225 |
3 | human | 23431 | AP4E1 | adaptor related protein com... | XM_005254264.4 | 93.4% | 93.4% | 0_1ins225 |
4 | human | 23431 | AP4E1 | adaptor related protein com... | XM_006720447.4 | 93.4% | 93.4% | 0_1ins225 |
5 | human | 23431 | AP4E1 | adaptor related protein com... | XR_001751184.1 | 76.5% | | 1_107del;2073_2074ins124;3395_4168del |
6 | human | 23431 | AP4E1 | adaptor related protein com... | XM_017022042.2 | 74.1% | 74.1% | 0_1ins882 |
7 | human | 23431 | AP4E1 | adaptor related protein com... | XR_001751185.1 | 52.5% | | 1_107del;1424_1425delAC;1960_1961ins1560 |
8 | human | 23431 | AP4E1 | adaptor related protein com... | XR_001751183.1 | 47.6% | | 1_107del;3009_3010ins191;3328_6568del |
9 | mouse | 108011 | Ap4e1 | adaptor-related protein com... | XM_006498553.2 | 47.9% | 49.1% | (many diffs) |
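
The match percentage is defined as total matches divided by aligned length (including gaps), and the Match Diffs column lists semicolon-separated HGVS-style events. The following is a minimal sketch of how those two pieces relate, not the portal's own implementation. The function names `parse_diffs` and `percent_after_orf_insertions` are hypothetical. It assumes that an entry such as `0_1ins225` means 225 ORF bases have no counterpart in the transcript and that every other ORF base matches; under that assumption the aligned length equals the ORF length (3411) and the figures for the rows with a single insertion are reproduced.

```python
import re

# Hypothetical helper: split a Match Diffs cell into structured events.
# Handles only the simple forms seen in this table (e.g. "1_107del",
# "2073_2074ins124", "1424_1425delAC"), not the full HGVS grammar.
_EVENT = re.compile(r"^(\d+)_(\d+)(ins|del)(\w*)$")


def parse_diffs(diffs: str):
    """Return a list of (start, end, kind, detail) tuples."""
    events = []
    for item in diffs.split(";"):
        item = item.strip()
        if not item:
            continue
        m = _EVENT.match(item)
        if m is None:
            raise ValueError(f"unrecognized diff annotation: {item!r}")
        start, end, kind, detail = m.groups()
        events.append((int(start), int(end), kind, detail))
    return events


def percent_after_orf_insertions(orf_length: int, diffs: str) -> float:
    """Match % = matches / aligned length, rounded to one decimal.

    Assumption (not stated in the record): inserted bases belong to the
    ORF only, so aligned length == orf_length and
    matches == orf_length - inserted bases.
    """
    inserted = sum(
        int(detail)
        for _, _, kind, detail in parse_diffs(diffs)
        if kind == "ins" and detail.isdigit()
    )
    return round(100.0 * (orf_length - inserted) / orf_length, 1)


print(percent_after_orf_insertions(3411, "0_1ins225"))  # 93.4
print(percent_after_orf_insertions(3411, "0_1ins882"))  # 74.1
print(parse_diffs("1_107del;2073_2074ins124;3395_4168del"))
```

The two printed percentages agree with the 93.4% and 74.1% values in the table. Rows that combine deletions and insertions, or that read "(many diffs)", would need a real alignment rather than this arithmetic.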
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start: 69
- ORF end: 3480
- ORF length: 3411
- Sequence:
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaacttTG TACAAAAAAG 61 TTGGCACCAT GAGCGACATA GTGGAGAAGA CGCTGACGGC GCTGCCGGGA CTCTTTCTGC 121 AGAACCAGCC CGGTGGTGGG CCCGCGGCCG CCAAGGCGTC CTTCTCCTCG AGGCTGGGCA 181 GCCTTGTCCG CGGCATCACA GCCCTCACCT CCAAGCACGA AGAAGAAAAA TTAATCCAGC 241 AGGAACTGAG TAGTCTGAAA GCGACTGTTT CTGCTCCTAC TACAACACTG AAAATGATGA 301 AGGAATGTAT GGTGAGACTT ATATATTGTG AAATGCTTGG ATATGATGCT TCCTTTGGCT 361 ATATTCATGC AATCAAGTTA GCCCAACAAG GAAACCTCTT AGAAAAAAGA GTAGGTTATT 421 TGGCTGTTTC CTTATTTCTA CATGAAAGTC ATGAATTATT GCTTCTCCTT GTGAATACAG 481 TTGTAAAGGA TCTGCAGAGC ACTAACCTAG TAGAAGTGTG TATGGCACTG ACTGTTGTTA 541 GCCAGATTTT CCCCTGCGAA ATGATTCCAG CTGTTCTTCC ATTAATAGAA GATAAACTTC 601 AACATTCTAA GGAGATTGTA CGAAGAAAAG CTGTTCTGGC ATTATACAAA TTCCATCTCA 661 TTGCTCCTAA TCAAGTACAA CATATTCATA TTAAGTTTCG GAAAGCACTT TGTGACAGAG 721 ATGTTGGGGT CATGGCTGCC TCCTTGCATA TATATCTTAG AATGATTAAG GAGAATTCAT 781 CTGGATATAA AGACTTGACT GGGAGTTTTG TAACCATTTT GAAGCAAGTA GTTGGAGGAA 841 AGCTCCCAGT AGAATTCAAT TACCACAGTG TGCCAGCACC ATGGTTACAA ATTCAGCTCT 901 TGAGAATACT GGGACTTCTA GGAAAAGATG ATCAAAGGAC AAGTGAATTA ATGTATGATG 961 TTCTTGATGA ATCCTTACGA AGAGCTGAGT TAAATCACAA TGTCACATAT GCTATTTTGT 1021 TTGAATGTGT GCATACAGTC TATTCTATTT ATCCTAAATC GGAATTACTT GAGAAGGCTG 1081 CCAAGTGCAT TGGAAAATTT GTTCTGTCAC CTAAAATAAA TCTAAAATAT TTAGGACTGA 1141 AGGCTCTTAC CTATGTTATC CAACAGGATC CTACTCTGGC TCTTCAACAC CAGATGACAA 1201 TAATTGAATG TTTAGATCAT CCTGATCCCA TTATTAAAAG AGAGACTCTG GAACTTCTTT 1261 ACAGAATTAC TAATGCACAG AATATAACAG TTATTGTCCA GAAAATGCTT GAATATTTAC 1321 ATCAGAGCAA AGAAGAGTAT GTCATCGTCA ATTTGGTCGG CAAAATAGCA GAGCTGGCTG 1381 AGAAATATGC TCCTGATAAT GCATGGTTTA TTCAGACAAT GAATGCTGTG TTTTCAGTAG 1441 GAGGAGATGT AATGCATCCT GATATTCCCA ATAACTTTCT GAGACTACTA GCGGAAGGTT 1501 TTGATGATGA AACAGAAGAT CAGCAATTAA GACTCTATGC AGTTCAGTCT TATCTCACTT 1561 TACTGGATAT GGAAAATGTG TTCTATCCAC AGAGATTTCT TCAAGTTATG AGTTGGGTAT 1621 TAGGGGAATA TTCCTACCTC TTAGATAAGG AAACGCCAGA GGAAGTTATA GCTAAGCTCT 1681 ACAAGTTACT TATGAATGAC 
TCTGTGTCTT CAGAAACAAA AGCCTGGTTA ATTGCTGCTG 1741 TGACCAAATT GACATCTCAG GCGCACTCTT CTAATACAGT TGAGAGATTA ATCCATGAAT 1801 TTACCATATC TTTGGATACT TGTATGAGAC AACATGCATT TGAATTAAAA CATTTGCATG 1861 AGAATGTGGA ACTTATGAAG AGCTTGCTTC CAGTTGACAG GAGTTGTGAA GACTTGGTGG 1921 TAGATGCTTC TTTATCTTTT CTGGATGGTT TTGTGGCTGA AGGACTCAGT CAGGGTGCAG 1981 CGCCTTACAA ACCTCCCCAT CAACGCCAGG AGGAAAAGCT TTCTCAGGAA AAAGTTCTCA 2041 ATTTTGAACC ATATGGACTC TCCTTTTCTT CATCTGGCTT CACTGGACGA CAGTCTCCTG 2101 CTGGCATTTC TCTTGGTTCA GATGTATCTG GGAATAGTGC TGAGACAGGA CTGAAAGAGA 2161 CAAATAGCTT GAAGCTGGAA GGTATAAAGA AATTGTGGGG GAAAGAAGGC TATCTTCCCA 2221 AGAAGGAAAG CAAAACTGGT GATGAAAGTG GAGCTCTGCC TGTTCCTCAA GAGAGTATAA 2281 TGGAGAATGT AGATCAAGCT ATAACTAAAA AGGATCAATC TCAAGTTCTT ACCCAATCTA 2341 AAGAGGAGAA AGAAAAGCAG CTGCTGGCAT CATCATTATT TGTTGGTCTA GGATCAGAAA 2401 GTACAATCAA CCTGCTGGGA AAAGCAGATA CTGTCTCTCA CAAGTTCAGA AGGAAATCAA 2461 AAGTCAAAGA AGCTAAAAGT GGCGAAACAA CCAGTACTCA TAATATGACC TGTTCTTCCT 2521 TTAGTTCTTT GTCAAATGTG GCATATGAAG ATGATTATTA TTCGAATACT TTGCACGATA 2581 CAGGAGACAA GGAATTAAAG AAATTTTCTC TCACTTCAGA ACTTTTGGAT TCTGAGTCAC 2641 TCACAGAACT GCCCTTGGTT GAGAAATTCT CATATTGTAG TCTGTCTACA CCTTCATTGT 2701 TTGCTAATAA CAACATGGAA ATTTTTCACC CTCCTCAATC TACTGCAGCC TCAGTTGCCA 2761 AGGAAAGCTC TTTAGCTTCA TCTTTTTTGG AAGAAACTAC TGAATACATA CACTCAAATG 2821 CTATGGAAGT CTGTAATAAT GAAACTATAT CAGTGTCTTC TTATAAAATT TGGAAAGATG 2881 ATTGTTTATT GATGGTCTGG TCAGTCACTA ATAAGAGTGG TTTGGAATTG AAAAGTGCTG 2941 ACTTAGAAAT TTTTCCTGCA GAAAATTTCA AGGTGACTGA GCAACCTGGA TGCTGTTTGC 3001 CTGTAATGGA AGCAGAAAGC ACCAAAAGCT TTCAATATAG TGTGCAGATA GAAAAACCTT 3061 TTACAGAAGG AAATCTTACT GGTTTTATTA GTTATCATAT GATGGATACT CATTCTGCTC 3121 AGCTGGAATT TTCTGTAAAC TTATCACTAT TAGATTTCAT TAGACCATTA AAAATCTCAA 3181 GTGACGACTT TGGGAAACTC TGGTTATCCT TCGCAAATGA TGTGAAACAA AATGTAAAAA 3241 TGTCAGAATC TCAAGCTGCA CTTCCTTCTG CACTAAAGAC TCTGCAACAG AAACTAAGAC 3301 TCCATATTAT TGAGATTATA GGCAATGAAG GGCTATTGGC CTGTCAGCTG CTCCCATCCA 3361 TCCCCTGCTT ACTGCATTGC CGAGTTCATG 
CAGATGTATT AGCCCTGTGG TTCAGATCCT 3421 CCTGTTCTAC TCTTCCTGAC TATTTACTGT ATCAGTGTCA AAAGGTGATG GAGGGATCCT 3481 TGCCAACTTT CTTGTACAAA GTtggcatta taagaaagca ttgcttatca atttgttgca 3541 acgaac
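
The ORF coordinates above can be checked against the numbered sequence. The sketch below is illustrative only, with hypothetical function names and a short toy sequence standing in for the full insert. It assumes the start coordinate is 1-based and the end coordinate is one past the last ORF base, which is the reading under which end minus start equals the stated length (3480 - 69 = 3411).

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(numbered: str) -> str:
    """Drop position numbers and whitespace, preserving letter case."""
    return re.sub(r"[^A-Za-z]", "", numbered)


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice the ORF out of the full sequence.

    Assumption: start is 1-based and end is exclusive (one past the
    last ORF base), so len(result) == end - start.
    """
    return seq[start - 1:end - 1]


def verified_span(seq: str):
    """Return 1-based (first, last) positions of uppercase bases, or None."""
    positions = [i for i, base in enumerate(seq, start=1) if base.isupper()]
    return (positions[0], positions[-1]) if positions else None


# Toy input in the same layout as the record (NOT the real sequence).
toy = "1 ggttcaATGG 11 CTTCAGGAtt"
seq = clean_sequence(toy)
orf = extract_orf(seq, start=7, end=16)

print(orf)                              # ATGGCTTCA
print(len(orf) == 16 - 7)               # True
print(orf[-3:].upper() in STOP_CODONS)  # False, i.e. no stop codon
print(verified_span(seq))               # (7, 18)
```

Applied to the full record with start 69 and end 3480, the same slice should begin with ATG and contain 3411 bases, consistent with the note that the insert carries no stop codon.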