Construct: ORF TRCN0000470020
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF016357.1_s317c1
- Derived from:
- ccsbBroadEn_08497
- DNA Barcode:
- ACATCCTCTCAAAATGCTGCGGTC
- Epitope Tag:
- V5
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- YY1AP1 (55249)
Vector Information:
- Vector Backbone:
- pLX_317
- Pol II Cassette 1:
- SV40-PuroR
- Pol II Cassette 2:
- EF1a-TRCN0000470020
- Selection Marker:
- PuroR
- Visible Reporter:
- n/a
- Epitope Tag:
- V5
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198899.1 | 99.9% | 99.8% | 1487C>T |
2 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198900.1 | 99.9% | 99.8% | 1487C>T |
3 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_018253.3 | 99.9% | 99.8% | 1487C>T |
4 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198901.1 | 98.4% | 98.4% | 1_33del;1520C>T |
5 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198902.1 | 98.4% | 98.4% | 1_33del;1520C>T |
6 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139119.2 | 98.4% | 98.4% | 1_33del;1520C>T |
7 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198905.1 | 95.8% | 95.7% | 1_33del;728_729ins60;1460C>T |
8 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139121.2 | 92.5% | 92.4% | 0_1ins165;1322C>T |
9 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139118.2 | 88% | 87.9% | 1_231del;926_927ins60;1658C>T |
10 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198903.1 | 83.1% | 83.1% | 1_447del;1934C>T |
11 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198904.1 | 80.9% | 80.8% | 1_447del;1142_1143ins60;1874C>T |
12 | human | 54856 | GON4L | gon-4 like | NM_001282861.2 | 46.5% | 44.9% | (many diffs) |
13 | human | 54856 | GON4L | gon-4 like | NM_032292.6 | 46.5% | 44.9% | (many diffs) |
14 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198906.2 | 41.3% | 39% | 1_231del;1193_1212del;1272_1273ins1196 |
15 | human | 54856 | GON4L | gon-4 like | XM_005245284.3 | 34.4% | 33.2% | (many diffs) |
16 | human | 54856 | GON4L | gon-4 like | XM_011509659.2 | 34.4% | 33.2% | (many diffs) |
17 | human | 54856 | GON4L | gon-4 like | NM_001282858.2 | 31.7% | 30.6% | (many diffs) |
18 | human | 54856 | GON4L | gon-4 like | XM_006711393.3 | 31.7% | 30.6% | (many diffs) |
19 | human | 54856 | GON4L | gon-4 like | XM_006711394.4 | 31.7% | 30.6% | (many diffs) |
20 | human | 54856 | GON4L | gon-4 like | NM_001282856.1 | 31.7% | 30.6% | (many diffs) |
21 | human | 54856 | GON4L | gon-4 like | NM_001282860.1 | 31.7% | 30.6% | (many diffs) |
22 | human | 54856 | GON4L | gon-4 like | XM_005245286.3 | 31.5% | 30.4% | (many diffs) |
23 | human | 54856 | GON4L | gon-4 like | XM_011509658.2 | 30.5% | 29.5% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2283
- ORF length:
- 2217
- Sequence:
-
1 tcttccattt caggtgtcgt gaggctagca tcgattgatc aacaagtttg tacaaaaaag 61 ttggcatggg attctccaac atggaagatg atggcccaga agaggaggag cgtgtggctg 121 agcctcaagc taactttaac acccctcaag ctctacggtt tgaggaacta ctggccaacc 181 tactaaatga acaacatcag atagcgaagg aactatttga acagctgaag atgaagaaac 241 cttcagccaa acagcagaag gaggtagaga aggttaaacc ccagtgtaag gaagttcatc 301 agaccctgat tctggaccca gcacaaagga agagactcca gcagcagatg cagcagcatg 361 ttcagctctt gacacaaatc caccttcttg ccacctgcaa ccccaatctc aatccggagg 421 ccagtagcac caggatatgt cttaaagagc tgggaacctt tgctcaaagc tccatcgccc 481 ttcaccatca gtacaacccc aagtttcaga ccctgttcca accctgtaac ttgatgggag 541 ctatgcagct gattgaagac ttcagcacac atgtcagcat tgactgcagc cctcataaaa 601 ctgtcaagaa gactgccaat gaatttccct gtttgccaaa gcaagtggct tggatcctgg 661 ccacaagcaa ggttttcatg tatccagagt tacttccagt gtgttccctg aaggcaaaga 721 atccccagga taagatcctc ttcaccaagg ctgaggacaa tttgttagct ttaggactga 781 agcattttga agggactgag tttcttaacc ctctaatcag caagtacctt ctaacctgca 841 agactgcccg ccaactgaca gtgagaatca agaacctcaa catgaacaga gctcctgaca 901 acatcattaa attttataag aagaccaaac agctgccagt cctaggaaaa tgctgtgaag 961 agatccagcc acatcagtgg aagccaccta tagagagaga agaacaccgg ctcccattct 1021 ggttaaaggc cagtctgcca tccatccagg aagaactgcg gcacatggct gatggtgcta 1081 gagaggtagg aaatatgact ggaaccactg agatcaactc agatcaaggc ctagaaaaag 1141 acaactcaga gttggggagt gaaactcggt acccactgct attgcctaag ggtgtagtcc 1201 tgaaactgaa gccagttgcc gaccgtttcc ccaagaaggc ttggagacag aagcgttcat 1261 cagtcctgaa acccctcctt atccaaccca gcccctctct ccagcccagc ttcaaccctg 1321 ggaaaacacc agcccaatca actcattcag aagcccctcc gagcaaaatg gtgctccgga 1381 ttcctcaccc aatacagcca gccactgttt tacagacagt tccaggtgtc cctccactgg 1441 gggtcagtgg aggtgagagt tttgagtctc ctgcagcact gcctgctatg ccccctgagg 1501 ccaggacaag cttccctctg tctgagtccc agactttgct ctcttctgcc cttgtgccca 1561 aggtaatgat gccctcccct gcctcttcca tgtttcgaaa gccatatgtg agacggagac 1621 cctcaaaaag aaggggagcc agggcctttc gctgtatcaa acctgcccct gttatccacc 1681 ctgcatctgt tatcttcact gttcctgcta ccactgtgaa gattgtgagc cttggcggtg 1741 gctgtaacat gatccagcct gtcaatgcgg ctgtggccca gagtccccag actattccca 1801 tcgccaccct cttggttaac cctacttcct tcccctgtcc attgaaccag ccccttgtgg 1861 cctcctctgt ctcaccctta attgtttctg gcaattctgt gaaTCTTCCT ATACCATCCA 1921 CCCCTGAAGA TAAGGCCCAC ATGAATGTGG ACATTGCTTG TGCTGTGGCT GATGGGGAAA 1981 ATGCCTTTCA GGGCCTAGAA CCCAAATTAG AGCCCCAGGA ACTATCTCCT CTCTCTGCTA 2041 CTGTTTTCCC CAAAGTGGAA CATAGCCCAG GGCCTCCACC AGTCGATAAA CAGTGCCAAG 2101 AAGGATTGTC AGAGAACAGT GCCTATCGCT GGACCGTTGT GAAAACAGAG GAGGGAAGGC 2161 AAGCTCTGGA GCCGCTCCCT CAGGGCATCC AGGAGTCTCT AAACAACTCT TCCCCTGGGG 2221 ATTTAGAGGA AGTTGTCAAG ATGGAACCTG AAGATGCTAC AGAGGAAATC AGTGGATTTC 2281 TTTGCCCAAC TTTCTTGTAC AAAGTGGTTG ATATCGGTAA GCCTATCCCT AACCCTCTCC 2341 TCGGTCTCGA TTCTACGTAG TAATGAACTA GTCCGTAACT TGAAAGTATT TCGATTTCTT 2401 GGCTTTATAT ATCTTGTGGA AAGGACGAAC ATCCTCTCAA AATGCTGCGG TCACGCGTTA 2461 AGTCgacaat caacctctgg attacaaaat ttgtgaaaga tt