ANOTAÇÃO DE GENES DE INTERESSE EM GENOMAS ESPECIFICOS
Por: Luzia Gabrielle Zeferino • 29/11/2018 • Relatório de pesquisa • 2.117 Palavras (9 Páginas) • 232 Visualizações
ANOTAÇÃO DE GENES DE INTERESSE EM GENOMAS ESPECIFICOS:
- Seleção do gene/proteína: Busca de uma isca/seq de referencia pra fazer buscas (BLAST) nos bancos genomicos de interesse:
- Determinar num genoma a quantidade de genes que codificam uma proteína de interesse.
- Coletar cada um dos genes presentes no genoma
- Em cada um dos genes determinar a região transcrita [exons / introns(se existentes)]; a região do promotor.
- Deduzir o cDNA da região transcrita (remoção de introns se existentes)
- Deduzir a sequencia da proteína a partir do cDNA
- VALIDAÇÃO DA ANOTAÇÃO: Consiste em comparar Proteina/cDNA obtidos com sequencias depositadas em bancos de dados.
- ISCA: AB052799.1 Homo sapiens gene for CD19, complete cds
Com essa isca, posso fazer buscas em genomas de humanos bem como outros animais filogeneticamente próximos. Sempre busca uma isca próxima da espécie que se quer anotar os genes.
CENTRALIZAR / JUSTIFICAR e substituir ^p por nada (WORD)
"BAB60954.1"/translation="
MPPPRLLFFLLFLTPMEVRPEEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLASWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCVPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFFKVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALGSRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPEDEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILYAAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR"
MPPPRLLFFLLFLTPMEVRPEEPLVVKVEGEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLASWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCVPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFFKVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALGSRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPEDEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILYAAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR
>AB052799.1:1407-1494,1752-2018,2317-2520,2640-2915,4849-4959,5558-5606,5917-6005,6428-6541,6676-6780,6860-6928,7030-7086,7175-7231,8083-8175,8279-8370 Homo sapiens gene for CD19, complete cds
ATGCCACCTCCTCGCCTCCTCTTCTTCCTCCTCTTCCTCACCCCCATGGAAGTCAGGCCCGAGGAACCTCTAGTGGTGAAGGTGGAAGAGGGAGATAACGCTGTGCTGCAGTGCCTCAAGGGGACCTCAGATGGCCCCACTCAGCAGCTGACCTGGTCTCGGGAGTCCCCGCTTAAACCCTTCTTAAAACTCAGCCTGGGGCTGCCAGGCCTGGGAATCCACATGAGGCCCCTGGCATCCTGGCTTTTCATCTTCAACGTCTCTCAACAGATGGGGGGCTTCTACCTGTGCCAGCCGGGGCCCCCCTCTGAGAAGGCCTGGCAGCCTGGCTGGACAGTCAATGTGGAGGGCAGCGGGGAGCTGTTCCGGTGGAATGTTTCGGACCTAGGTGGCCTGGGCTGTGGCCTGAAGAACAGGTCCTCAGAGGGCCCCAGCTCCCCTTCCGGGAAGCTCATGAGCCCCAAGCTGTATGTGTGGGCCAAAGACCGCCCTGAGATCTGGGAGGGAGAGCCTCCGTGTGTCCCACCGAGGGACAGCCTGAACCAGAGCCTCAGCCAGGACCTCACCATGGCCCCTGGCTCCACACTCTGGCTGTCCTGTGGGGTACCCCCTGACTCTGTGTCCAGGGGCCCCCTCTCCTGGACCCATGTGCACCCCAAGGGGCCTAAGTCATTGCTGAGCCTAGAGCTGAAGGACGATCGCCCGGCCAGAGATATGTGGGTAATGGAGACGGGTCTGTTGTTGCCCCGGGCCACAGCTCAAGACGCTGGAAAGTATTATTGTCACCGTGGCAACCTGACCATGTCATTCCACCTGGAGATCACTGCTCGGCCAGTACTATGGCACTGGCTGCTGAGGACTGGTGGCTGGAAGGTCTCAGCTGTGACTTTGGCTTATCTGATCTTCTGCCTGTGTTCCCTTGTGGGCATTCTTCATCTTCAAAGAGCCCTGGTCCTGAGGAGGAAAAGAAAGCGAATGACTGACCCCACCAGGAGATTCTTCAAAGTGACGCCTCCCCCAGGAAGCGGGCCCCAGAACCAGTACGGGAACGTGCTGTCTCTCCCCACACCCACCTCAGGCCTCGGACGCGCCCAGCGTTGGGCCGCAGGCCTGGGGGGCACTGCCCCGTCTTATGGAAACCCGAGCAGCGACGTCCAGGCGGATGGAGCCTTGGGGTCCCGGAGCCCGCCGGGAGTGGGCCCAGAAGAAGAGGAAGGGGAGGGCTATGAGGAACCTGACAGTGAGGAGGACTCCGAGTTCTATGAGAACGACTCCAACCTTGGGCAGGACCAGCTCTCCCAGGATGGCAGCGGCTACGAGAACCCTGAGGATGAGCCCCTGGGTCCTGAGGATGAAGACTCCTTCTCCAACGCTGAGTCTTATGAGAACGAGGATGAAGAGCTGACCCAGCCGGTCGCCAGGACAATGGACTTCCTGAGCCCTCATGGGTCAGCCTGGGACCCCAGCCGGGAAGCAACCTCCCTGGGGTCCCAGTCCTATGAGGATATGAGAGGAATCCTGTATGCAGCCCCCCAGCTCCGCTCCATTCGGGGCCAGCCTGGACCCAATCATGAGGAAGATGCAGACTCTTATGAGAACATGGATAATCCCGATGGGCCAGACCCAGCCTGGGGAGGAGGGGGCCGCATGGGCACCTGGAGCACCAGGTGA
>AH005421.2:321-421,665-931,1230-1433,1554-1829,2057-2167,2769-2817,3128-3216,3591-3704,3896-4000,4174-4242,4343-4399,4488-4544,4813-4905,5231-5337,5496-5701 Homo sapiens CD19 (CD19) gene, complete cds
TCTGACCACCATGCCACCTCCTCGCCTCCTCTTCTTCCTCCTCTTCCTCACCCCCATGGAAGTCAGGCCCGAGGAACCTCTAGTGGTGAAGGTGGAAGGTGAGGGAGATAACGCTGTGCTGCAGTGCCTCAAGGGGACCTCAGATGGCCCCACTCAGCAGCTGACCTGGTCTCGGGAGTCCCCGCTTAAACCCTTCTTAAAACTCAGCCTGGGGCTGCCAGGCCTGGGAATCCACATGAGGCCCCTGGCATCCTGGCTTTTCATCTTCAACGTCTCTCAACAGATGGGGGGCTTCTACCTGTGCCAGCCGGGGCCCCCCTCTGAGAAGGCCTGGCAGCCTGGCTGGACAGTCAATGTGGAGGGCAGCGGGGAGCTGTTCCGGTGGAATGTTTCGGACCTAGGTGGCCTGGGCTGTGGCCTGAAGAACAGGTCCTCAGAGGGCCCCAGCTCCCCTTCCGGGAAGCTCATGAGCCCCAAGCTGTATGTGTGGGCCAAAGACCGCCCTGAGATCTGGGAGGGAGAGCCTCCGTGTGTCCCACCGAGGGACAGCCTGAACCAGAGCCTCAGCCAGGACCTCACCATGGCCCCTGGCTCCACACTCTGGCTGTCCTGTGGGGTACCCCCTGACTCTGTGTCCAGGGGCCCCCTCTCCTGGACCCATGTGCACCCCAAGGGGCCTAAGTCATTGCTGAGCCTAGAGCTGAAGGACGATCGCCCGGCCAGAGATATGTGGGTAATGGAGACGGGTCTGTTGTTGCCCCGGGCCACAGCTCAAGACGCTGGAAAGTATTATTGTCACCGTGGCAACCTGACCATGTCATTCCACCTGGAGATCACTGCTCGGCCAGTACTATGGCACTGGCTGCTGAGGACTGGTGGCTGGAAGGTCTCAGCTGTGACTTTGGCTTATCTGATCTTCTGCCTGTGTTCCCTTGTGGGCATTCTTCATCTTCAAAGAGCCCTGGTCCTGAGGAGGAAAAGAAAGCGAATGACTGACCCCACCAGGAGATTCTTCAAAGTGACGCCTCCCCCAGGAAGCGGGCCCCAGAACCAGTACGGGAACGTGCTGTCTCTCCCCACACCCACCTCAGGCCTCGGACGCGCCCAGCGTTGGGCCGCAGGCCTGGGGGGCACTGCCCCGTCTTATGGAAACCCGAGCAGCGACGTCCAGGCGGATGGAGCCTTGGGGTCCCGGAGCCCGCCGGGAGTGGGCCCAGAAGAAGAGGAAGGGGAGGGCTATGAGGAACCTGACAGTGAGGAGGACTCCGAGTTCTATGAGAACGACTCCAACCTTGGGCAGGACCAGCTCTCCCAGGATGGCAGCGGCTACGAGAACCCTGAGGATGAGCCCCTGGGTCCTGAGGATGAAGACTCCTTCTCCAACGCTGAGTCTTATGAGAACGAGGATGAAGAGCTGACCCAGCCGGTCGCCAGGACAATGGACTTCCTGAGCCCTCATGGGTCAGCCTGGGACCCCAGCCGGGAAGCAACCTCCCTGGGGTCCCAGTCCTATGAGGATATGAGAGGAATCCTGTATGCAGCCCCCCAGCTCCGCTCCATTCGGGGCCAGCCTGGACCCAATCATGAGGAAGATGCAGACTCTTATGAGAACATGGATAATCCCGATGGGCCAGACCCAGCCTGGGGAGGAGGGGGCCGCATGGGCACCTGGAGCACCAGGTGATCCTCAGGTGGCCAGCCTGGATCTCCTCAAGTCCCCAAGATTCACACCTGACTCTGAAATCTGAAGACCTCGAGCAGATGATGCCAACCTCTGGAGCAATGTTGCTTAGGATGTGTGCATGTGTGTAAGTGTGTGTGTGTGTGTGTGTGTGTGTATACATGCCAGTGACACTTCCAGTCCCCTTTGTATTCCTTAAATAAACTCAATGAGCTCTTCCAATC
QUAL NOSSO GENOMA DE INTERESSE PARA FAZER AS BUSCAS?
ESCOLHEMOS O GENOMA Felis catus. BUSCAR NUMERO DE GENES; DEPOIS ANOTAR...
Banco de dados:
[pic 1]
Tem apenas 1 genoma de alta qualidade
Blastn; felis catus; More dissimilar sequences (discontiguous megablast) > 50% identidade
- HÁ APENAS 1 GENE PARA CD19 NO GENOMA do gato:
EVIDENCIAS: blastn = 1 sequencias (sequencia 1 83% cobertura; 76% identidade; evalue
1e-47 | |
- Na sequencia tem mais de gene???
[pic 2]
NÃO TEM MAIS DE UM GENE POIS A CONTAGEM OLHANDO PARA O QUERY É CONTíNUA E NÃO REPETITIVA ENTRE OS RANGES.
COLETAR O GENE CD19
Query 87 AGAGGGAGATAACGCTGTGCTGCAGTGCCTCAAGGGGACCTCAGATGGCCCCACTCAGCA 146
...