Construct: ORF ccsbBroadEn_12191
Construct Description:
- Construct Type:
- ORF
- Other Identifiers:
- ORF013968.1_s300c1, BRDN0000388518
- DNA Barcode:
- None
- Epitope Tag:
- None
- Notes:
- No stop codon in insert
Originally Annotated References:
- Gene:
- YY1AP1 (55249)
Vector Information:
- Vector Backbone:
- pDONR223
- Pol II Cassette 1:
- n/a
- Pol II Cassette 2:
- n/a
- Selection Marker:
- n/a
- Visible Reporter:
- n/a
- Epitope Tag:
- n/a
Current transcripts matched by this ORF:
Taxon | Gene | Symbol | Description | Transcript | Nuc. Match %[?]A simple nucleotide-based global alignment percentage, calculated as follows: total nt. matches ---------------------------------- aligned length (incl. gaps) |
Prot. Match %[?]A simple amino acid-based global alignment percentage, calculated as follows: total aa. matches ---------------------------------- aligned length (incl. gaps) |
Match Diffs[?]This field may contain sequence annotations in HGVS format. For more information about HGVS annotations, please refer to the HGVS Quick Reference Guide. | |
---|---|---|---|---|---|---|---|---|
1 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139121.2 | 94.2% | 94.2% | 1_117del |
2 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198899.1 | 87.2% | 87.2% | 1_282del |
3 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198900.1 | 87.2% | 87.2% | 1_282del |
4 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_018253.3 | 87.2% | 87.2% | 1_282del |
5 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198901.1 | 86% | 86% | 1_315del |
6 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198902.1 | 86% | 86% | 1_315del |
7 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139119.2 | 86% | 86% | 1_315del |
8 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198905.1 | 83.3% | 83.3% | 1_315del;728_729ins60 |
9 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_139118.2 | 76.5% | 76.5% | 1_513del;926_927ins60 |
10 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198903.1 | 72.6% | 72.6% | 1_729del |
11 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198904.1 | 70.3% | 70.3% | 1_729del;1142_1143ins60 |
12 | human | 54856 | GON4L | gon-4 like | NM_001282861.2 | 40.8% | 39.6% | (many diffs) |
13 | human | 54856 | GON4L | gon-4 like | NM_032292.6 | 40.8% | 39.6% | (many diffs) |
14 | human | 54856 | GON4L | gon-4 like | XM_005245286.3 | 33.4% | 32.3% | (many diffs) |
15 | human | 54856 | GON4L | gon-4 like | XM_005245284.3 | 30.2% | 29.3% | (many diffs) |
16 | human | 54856 | GON4L | gon-4 like | XM_011509659.2 | 30.2% | 29.3% | (many diffs) |
17 | human | 55249 | YY1AP1 | YY1 associated protein 1 | NM_001198906.2 | 29.9% | 27.7% | 1_513del;1193_1212del;1272_1273ins1196 |
18 | human | 54856 | GON4L | gon-4 like | NM_001282858.2 | 27.9% | 27% | (many diffs) |
19 | human | 54856 | GON4L | gon-4 like | XM_006711393.3 | 27.9% | 27% | (many diffs) |
20 | human | 54856 | GON4L | gon-4 like | XM_006711394.4 | 27.9% | 27% | (many diffs) |
21 | human | 54856 | GON4L | gon-4 like | NM_001282856.1 | 27.8% | 27% | (many diffs) |
22 | human | 54856 | GON4L | gon-4 like | NM_001282860.1 | 27.8% | 27% | (many diffs) |
23 | human | 54856 | GON4L | gon-4 like | XM_011509658.2 | 27.6% | 25.9% | (many diffs) |
Sequence Information
Note: uppercase bases indicate empirically verified sequence.
- ORF start:
- 66
- ORF end:
- 2001
- ORF length:
- 1935
- Sequence:
-
1 gttcgttgca acaaattgat gagcaatgct tttttataat gccaaCTTTG TACAAAAAAG 61 TTGGCATGCA GCAGCATGTT CAGCTCTTGA CACAAATCCA CCTTCTTGCC ACCTGCAACC 121 CCAATCTCAA TCCGGAGGCC AGTAGCACCA GGATATGTCT TAAAGAGCTG GGAACCTTTG 181 CTCAAAGCTC CATCGCCCTT CACCATCAGT ACAACCCCAA GTTTCAGACC CTGTTCCAAC 241 CCTGTAACTT GATGGGAGCT ATGCAGCTGA TTGAAGACTT CAGCACACAT GTCAGCATTG 301 ACTGCAGCCC TCATAAAACT GTCAAGAAGA CTGCCAATGA ATTTCCCTGT TTGCCAAAGC 361 AAGTGGCTTG GATCCTGGCC ACAAGCAAGG TTTTCATGTA TCCAGAGTTA CTTCCAGTGT 421 GTTCCCTGAA GGCAAAGAAT CCCCAGGATA AGATCCTCTT CACCAAGGCT GAGGACAATT 481 TGTTAGCTTT AGGACTGAAG CATTTTGAAG GGACTGAGTT TCTTAACCCT CTAATCAGCA 541 AGTACCTTCT AACCTGCAAG ACTGCCCGCC AACTGACAGT GAGAATCAAG AACCTCAACA 601 TGAACAGAGC TCCTGACAAC ATCATTAAAT TTTATAAGAA GACCAAACAG CTGCCAGTCC 661 TAGGAAAATG CTGTGAAGAG ATCCAGCCAC ATCAGTGGAA GCCACCTATA GAGAGAGAAG 721 AACACCGGCT CCCATTCTGG TTAAAGGCCA GTCTGCCATC CATCCAGGAA GAACTGCGGC 781 ACATGGCTGA TGGTGCTAGA GAGGTAGGAA ATATGACTGG AACCACTGAG ATCAACTCAG 841 ATCAAGGCCT AGAAAAAGAC AACTCAGAGT TGGGGAGTGA AACTCGGTAC CCACTGCTAT 901 TGCCTAAGGG TGTAGTCCTG AAACTGAAGC CAGTTGCCGA CCGTTTCCCC AAGAAGGCTT 961 GGAGACAGAA GCGTTCATCA GTCCTGAAAC CCCTCCTTAT CCAACCCAGC CCCTCTCTCC 1021 AGCCCAGCTT CAACCCTGGG AAAACACCAG CCCAATCAAC TCATTCAGAA GCCCCTCCGA 1081 GCAAAATGGT GCTCCGGATT CCTCACCCAA TACAGCCAGC CACTGTTTTA CAGACAGTTC 1141 CAGGTGTCCC TCCACTGGGG GTCAGTGGAG GTGAGAGTTT TGAGTCTCCT GCAGCACTGC 1201 CTGCTATGCC CCCTGAGGCC AGGACAAGCT TCCCTCTGTC TGAGTCCCAG ACTTTGCTCT 1261 CTTCTGCCCC TGTGCCCAAG GTAATGATGC CCTCCCCTGC CTCTTCCATG TTTCGAAAGC 1321 CATATGTGAG ACGGAGACCC TCAAAAAGAA GGGGAGCCAG GGCCTTTCGC TGTATCAAAC 1381 CTGCCCCTGT TATCCACCCT GCATCTGTTA TCTTCACTGT TCCTGCTACC ACTGTGAAGA 1441 TTGTGAGCCT TGGCGGTGGC TGTAACATGA TCCAGCCTGT CAATGCGGCT GTGGCCCAGA 1501 GTCCCCAGAC TATTCCCATC GCCACCCTCT TGGTTAACCC TACTTCCTTC CCCTGTCCAT 1561 TGAACCAGCC CCTTGTGGCC TCCTCTGTCT CACCCTTAAT TGTTTCTGGC AATTCTGTGA 1621 ATCTTCCTAT ACCATCCACC CCTGAAGATA AGGCCCACAT GAATGTGGAC ATTGCTTGTG 1681 CTGTGGCTGA TGGGGAAAAT GCCTTTCAGG GCCTAGAACC CAAATTAGAG CCCCAGGAAC 1741 TATCTCCTCT CTCTGCTACT GTTTTCCCCA AAGTGGAACA TAGCCCAGGG CCTCCACCAG 1801 TCGATAAACA GTGCCAAGAA GGATTGTCAG AGAACAGTGC CTATCGCTGG ACCGTTGTGA 1861 AAACAGAGGA GGGAAGGCAA GCTCTGGAGC CGCTCCCTCA GGGCATCCAG GAGTCTCTAA 1921 ACAACTCTTC CCCTGGGGAT TTAGAGGAAG TTGTCAAGAT GGAACCTGAA GATGCTACAG 1981 AGGAAATCAG TGGATTTCTT TGCCCAACTT TCTTGTACAA AGTtggcatt ataagaaagc 2041 attgcttatc aatttgttgc aacgaac