
ANNOTATION OF GENES OF INTEREST IN SPECIFIC GENOMES

29/11/2018 • Research report



  1. Gene/protein selection: find a bait (reference sequence) to use in BLAST searches against the genomic databases of interest.
  2. Determine how many genes in the genome encode the protein of interest.
  3. Collect each of the genes present in the genome.
  4. For each gene, determine the transcribed region [exons / introns (if any)] and the promoter region.
  5. Deduce the cDNA from the transcribed region (removing introns, if any).
  6. Deduce the protein sequence from the cDNA.
  7. ANNOTATION VALIDATION: compare the resulting protein/cDNA with sequences deposited in databases.

BAIT: AB052799.1 Homo sapiens gene for CD19, complete cds

With this bait I can search the human genome as well as the genomes of phylogenetically close animals. Always choose a bait from a species close to the one whose genes you want to annotate.

FORMATTING TIP (WORD): center/justify the text and replace ^p (paragraph marks) with nothing, so that the wrapped sequence becomes a single line.
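The same clean-up can be done outside a word processor. This is a minimal sketch in Python, assuming the sequence was pasted as wrapped text (possibly with GenBank-style position numbers and spaces); the function name `join_sequence_lines` is hypothetical and not part of the original notes.

```python
def join_sequence_lines(raw: str) -> str:
    """Join a wrapped sequence into one line.

    Keeps letters only, so line breaks, spaces and position
    numbers are all dropped, then upper-cases the result.
    """
    return "".join(ch for ch in raw if ch.isalpha()).upper()


# First residues of the protein, wrapped as they might be when copied
wrapped = "MPPPRLLFFL\nLFLTPMEVRP\r\nEEPLVVKVEE"
print(join_sequence_lines(wrapped))
```

Note that this sketch also discards any `*` stop symbol, since it keeps letters only.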

"BAB60954.1"/translation="

MPPPRLLFFLLFLTPMEVRPEEPLVVKVEEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLASWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCVPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFFKVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALGSRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPEDEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILYAAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR"

MPPPRLLFFLLFLTPMEVRPEEPLVVKVEGEGDNAVLQCLKGTSDGPTQQLTWSRESPLKPFLKLSLGLPGLGIHMRPLASWLFIFNVSQQMGGFYLCQPGPPSEKAWQPGWTVNVEGSGELFRWNVSDLGGLGCGLKNRSSEGPSSPSGKLMSPKLYVWAKDRPEIWEGEPPCVPPRDSLNQSLSQDLTMAPGSTLWLSCGVPPDSVSRGPLSWTHVHPKGPKSLLSLELKDDRPARDMWVMETGLLLPRATAQDAGKYYCHRGNLTMSFHLEITARPVLWHWLLRTGGWKVSAVTLAYLIFCLCSLVGILHLQRALVLRRKRKRMTDPTRRFFKVTPPPGSGPQNQYGNVLSLPTPTSGLGRAQRWAAGLGGTAPSYGNPSSDVQADGALGSRSPPGVGPEEEEGEGYEEPDSEEDSEFYENDSNLGQDQLSQDGSGYENPEDEPLGPEDEDSFSNAESYENEDEELTQPVARTMDFLSPHGSAWDPSREATSLGSQSYEDMRGILYAAPQLRSIRGQPGPNHEEDADSYENMDNPDGPDPAWGGGGRMGTWSTR
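The two translations above are not identical: near the N-terminus one reads ...VKVEEGDNAV... and the other ...VKVEGEGDNAV.... For the validation step, a comparison like this can be automated. The sketch below is one simple way to do it, assuming the sequences differ in a single region; it trims the shared prefix and suffix and reports what is left. The function name `differing_region` is hypothetical, and only short fragments copied from the sequences above are used as input.

```python
def differing_region(a: str, b: str):
    """Return (position, segment_a, segment_b) for the region where a and b
    differ, or None if they are identical. Position is 1-based."""
    if a == b:
        return None
    shortest = min(len(a), len(b))
    # Length of the shared prefix
    prefix = 0
    while prefix < shortest and a[prefix] == b[prefix]:
        prefix += 1
    # Length of the shared suffix, not allowed to overlap the prefix
    suffix = 0
    while suffix < shortest - prefix and a[-1 - suffix] == b[-1 - suffix]:
        suffix += 1
    return prefix + 1, a[prefix:len(a) - suffix], b[prefix:len(b) - suffix]


fragment_1 = "PLVVKVEEGDNAVLQ"   # from the first translation
fragment_2 = "PLVVKVEGEGDNAVLQ"  # from the second translation
print(differing_region(fragment_1, fragment_2))  # (8, '', 'G')
```

An empty first segment and a one-letter second segment mean that the second fragment carries one extra residue at that position. If the sequences differ in several places, everything between the first and last difference is reported as one region, so a proper pairwise alignment would be the better tool.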

>AB052799.1:1407-1494,1752-2018,2317-2520,2640-2915,4849-4959,5558-5606,5917-6005,6428-6541,6676-6780,6860-6928,7030-7086,7175-7231,8083-8175,8279-8370 Homo sapiens gene for CD19, complete cds

ATGCCACCTCCTCGCCTCCTCTTCTTCCTCCTCTTCCTCACCCCCATGGAAGTCAGGCCCGAGGAACCTCTAGTGGTGAAGGTGGAAGAGGGAGATAACGCTGTGCTGCAGTGCCTCAAGGGGACCTCAGATGGCCCCACTCAGCAGCTGACCTGGTCTCGGGAGTCCCCGCTTAAACCCTTCTTAAAACTCAGCCTGGGGCTGCCAGGCCTGGGAATCCACATGAGGCCCCTGGCATCCTGGCTTTTCATCTTCAACGTCTCTCAACAGATGGGGGGCTTCTACCTGTGCCAGCCGGGGCCCCCCTCTGAGAAGGCCTGGCAGCCTGGCTGGACAGTCAATGTGGAGGGCAGCGGGGAGCTGTTCCGGTGGAATGTTTCGGACCTAGGTGGCCTGGGCTGTGGCCTGAAGAACAGGTCCTCAGAGGGCCCCAGCTCCCCTTCCGGGAAGCTCATGAGCCCCAAGCTGTATGTGTGGGCCAAAGACCGCCCTGAGATCTGGGAGGGAGAGCCTCCGTGTGTCCCACCGAGGGACAGCCTGAACCAGAGCCTCAGCCAGGACCTCACCATGGCCCCTGGCTCCACACTCTGGCTGTCCTGTGGGGTACCCCCTGACTCTGTGTCCAGGGGCCCCCTCTCCTGGACCCATGTGCACCCCAAGGGGCCTAAGTCATTGCTGAGCCTAGAGCTGAAGGACGATCGCCCGGCCAGAGATATGTGGGTAATGGAGACGGGTCTGTTGTTGCCCCGGGCCACAGCTCAAGACGCTGGAAAGTATTATTGTCACCGTGGCAACCTGACCATGTCATTCCACCTGGAGATCACTGCTCGGCCAGTACTATGGCACTGGCTGCTGAGGACTGGTGGCTGGAAGGTCTCAGCTGTGACTTTGGCTTATCTGATCTTCTGCCTGTGTTCCCTTGTGGGCATTCTTCATCTTCAAAGAGCCCTGGTCCTGAGGAGGAAAAGAAAGCGAATGACTGACCCCACCAGGAGATTCTTCAAAGTGACGCCTCCCCCAGGAAGCGGGCCCCAGAACCAGTACGGGAACGTGCTGTCTCTCCCCACACCCACCTCAGGCCTCGGACGCGCCCAGCGTTGGGCCGCAGGCCTGGGGGGCACTGCCCCGTCTTATGGAAACCCGAGCAGCGACGTCCAGGCGGATGGAGCCTTGGGGTCCCGGAGCCCGCCGGGAGTGGGCCCAGAAGAAGAGGAAGGGGAGGGCTATGAGGAACCTGACAGTGAGGAGGACTCCGAGTTCTATGAGAACGACTCCAACCTTGGGCAGGACCAGCTCTCCCAGGATGGCAGCGGCTACGAGAACCCTGAGGATGAGCCCCTGGGTCCTGAGGATGAAGACTCCTTCTCCAACGCTGAGTCTTATGAGAACGAGGATGAAGAGCTGACCCAGCCGGTCGCCAGGACAATGGACTTCCTGAGCCCTCATGGGTCAGCCTGGGACCCCAGCCGGGAAGCAACCTCCCTGGGGTCCCAGTCCTATGAGGATATGAGAGGAATCCTGTATGCAGCCCCCCAGCTCCGCTCCATTCGGGGCCAGCCTGGACCCAATCATGAGGAAGATGCAGACTCTTATGAGAACATGGATAATCCCGATGGGCCAGACCCAGCCTGGGGAGGAGGGGGCCGCATGGGCACCTGGAGCACCAGGTGA
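The FASTA header above lists the exon coordinates of the coding sequence inside the genomic record. As a sketch of the "determine exons/introns" and "deduce the cDNA" steps, the Python below parses those ranges, derives the introns as the gaps between consecutive exons, and joins exon slices of a genomic string. The function names are hypothetical, and the `splice` demo uses a toy string because the genomic sequence itself is not reproduced in these notes.

```python
def parse_exon_ranges(header: str):
    """Extract 1-based inclusive (start, end) exon ranges from a header
    such as '>ACC.1:10-20,30-45 description'."""
    # The ranges sit between the first ':' and the first space
    coords = header.split(":", 1)[1].split(" ", 1)[0]
    return [tuple(int(n) for n in part.split("-")) for part in coords.split(",")]


def intron_ranges(exons):
    """Introns are the gaps between consecutive exons."""
    return [(prev_end + 1, next_start - 1)
            for (_, prev_end), (next_start, _) in zip(exons, exons[1:])]


def splice(genomic: str, exons) -> str:
    """Join the exon slices of a genomic sequence (1-based, inclusive)."""
    return "".join(genomic[start - 1:end] for start, end in exons)


header = (">AB052799.1:1407-1494,1752-2018,2317-2520,2640-2915,4849-4959,"
          "5558-5606,5917-6005,6428-6541,6676-6780,6860-6928,7030-7086,"
          "7175-7231,8083-8175,8279-8370 Homo sapiens gene for CD19, complete cds")

exons = parse_exon_ranges(header)
cds_length = sum(end - start + 1 for start, end in exons)
print(len(exons), "exons,", len(intron_ranges(exons)), "introns")  # 14 exons, 13 introns
print("CDS length:", cds_length, "nt")                              # CDS length: 1671 nt
print(splice("AAACCCGGGTTT", [(1, 3), (7, 9)]))                     # toy example: AAAGGG
```

A length of 1671 nt is a multiple of 3 (557 codons, stop codon included), which is a quick sanity check that the exon boundaries keep the reading frame.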

>AH005421.2:321-421,665-931,1230-1433,1554-1829,2057-2167,2769-2817,3128-3216,3591-3704,3896-4000,4174-4242,4343-4399,4488-4544,4813-4905,5231-5337,5496-5701 Homo sapiens CD19 (CD19) gene, complete cds

TCTGACCACCATGCCACCTCCTCGCCTCCTCTTCTTCCTCCTCTTCCTCACCCCCATGGAAGTCAGGCCCGAGGAACCTCTAGTGGTGAAGGTGGAAGGTGAGGGAGATAACGCTGTGCTGCAGTGCCTCAAGGGGACCTCAGATGGCCCCACTCAGCAGCTGACCTGGTCTCGGGAGTCCCCGCTTAAACCCTTCTTAAAACTCAGCCTGGGGCTGCCAGGCCTGGGAATCCACATGAGGCCCCTGGCATCCTGGCTTTTCATCTTCAACGTCTCTCAACAGATGGGGGGCTTCTACCTGTGCCAGCCGGGGCCCCCCTCTGAGAAGGCCTGGCAGCCTGGCTGGACAGTCAATGTGGAGGGCAGCGGGGAGCTGTTCCGGTGGAATGTTTCGGACCTAGGTGGCCTGGGCTGTGGCCTGAAGAACAGGTCCTCAGAGGGCCCCAGCTCCCCTTCCGGGAAGCTCATGAGCCCCAAGCTGTATGTGTGGGCCAAAGACCGCCCTGAGATCTGGGAGGGAGAGCCTCCGTGTGTCCCACCGAGGGACAGCCTGAACCAGAGCCTCAGCCAGGACCTCACCATGGCCCCTGGCTCCACACTCTGGCTGTCCTGTGGGGTACCCCCTGACTCTGTGTCCAGGGGCCCCCTCTCCTGGACCCATGTGCACCCCAAGGGGCCTAAGTCATTGCTGAGCCTAGAGCTGAAGGACGATCGCCCGGCCAGAGATATGTGGGTAATGGAGACGGGTCTGTTGTTGCCCCGGGCCACAGCTCAAGACGCTGGAAAGTATTATTGTCACCGTGGCAACCTGACCATGTCATTCCACCTGGAGATCACTGCTCGGCCAGTACTATGGCACTGGCTGCTGAGGACTGGTGGCTGGAAGGTCTCAGCTGTGACTTTGGCTTATCTGATCTTCTGCCTGTGTTCCCTTGTGGGCATTCTTCATCTTCAAAGAGCCCTGGTCCTGAGGAGGAAAAGAAAGCGAATGACTGACCCCACCAGGAGATTCTTCAAAGTGACGCCTCCCCCAGGAAGCGGGCCCCAGAACCAGTACGGGAACGTGCTGTCTCTCCCCACACCCACCTCAGGCCTCGGACGCGCCCAGCGTTGGGCCGCAGGCCTGGGGGGCACTGCCCCGTCTTATGGAAACCCGAGCAGCGACGTCCAGGCGGATGGAGCCTTGGGGTCCCGGAGCCCGCCGGGAGTGGGCCCAGAAGAAGAGGAAGGGGAGGGCTATGAGGAACCTGACAGTGAGGAGGACTCCGAGTTCTATGAGAACGACTCCAACCTTGGGCAGGACCAGCTCTCCCAGGATGGCAGCGGCTACGAGAACCCTGAGGATGAGCCCCTGGGTCCTGAGGATGAAGACTCCTTCTCCAACGCTGAGTCTTATGAGAACGAGGATGAAGAGCTGACCCAGCCGGTCGCCAGGACAATGGACTTCCTGAGCCCTCATGGGTCAGCCTGGGACCCCAGCCGGGAAGCAACCTCCCTGGGGTCCCAGTCCTATGAGGATATGAGAGGAATCCTGTATGCAGCCCCCCAGCTCCGCTCCATTCGGGGCCAGCCTGGACCCAATCATGAGGAAGATGCAGACTCTTATGAGAACATGGATAATCCCGATGGGCCAGACCCAGCCTGGGGAGGAGGGGGCCGCATGGGCACCTGGAGCACCAGGTGATCCTCAGGTGGCCAGCCTGGATCTCCTCAAGTCCCCAAGATTCACACCTGACTCTGAAATCTGAAGACCTCGAGCAGATGATGCCAACCTCTGGAGCAATGTTGCTTAGGATGTGTGCATGTGTGTAAGTGTGTGTGTGTGTGTGTGTGTGTGTATACATGCCAGTGACACTTCCAGTCCCCTTTGTATTCCTTAAATAAACTCAATGAGCTCTTCCAATC
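Unlike the AB052799.1 coding sequence, the AH005421.2 sequence above starts a few bases before the ATG and continues past the stop codon, so the protein has to be deduced from the open reading frame rather than from the first base. A minimal sketch of that step in Python, using the standard genetic code and assuming that the first ATG is the true start codon (this holds for the fragment used here but is not guaranteed in general); `translate_first_orf` is a hypothetical name.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate_first_orf(cdna: str) -> str:
    """Translate from the first ATG up to (not including) the first in-frame stop."""
    cdna = cdna.upper()
    start = cdna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(cdna) - 2, 3):
        aa = CODON_TABLE.get(cdna[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First bases of the sequence above, with a stop codon appended for the demo
fragment = "TCTGACCACCATGCCACCTCCTCGCCTCCTC" + "TGA"
print(translate_first_orf(fragment))  # MPPPRLL
```

The result matches the first residues of the deposited translation (MPPPRLL...), which is the kind of agreement the validation step looks for.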

WHICH GENOME OF INTEREST WILL WE SEARCH?

WE CHOSE THE Felis catus GENOME. FIRST DETERMINE THE NUMBER OF GENES, THEN ANNOTATE...

Database:

[pic 1]

There is only 1 high-quality genome.

blastn; Felis catus; More dissimilar sequences (discontiguous megablast); > 50% identity

  1. THERE IS ONLY 1 CD19 GENE IN THE CAT GENOME:

EVIDENCE: blastn = 1 sequence (sequence 1: 83% coverage; 76% identity; e-value 1e-47)
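The search here was run on the BLAST web page, but the same evidence (hits above an identity threshold, counted per subject sequence) can be checked from a tabular export. The sketch below assumes the standard 12-column tabular format (qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue, bitscore). The rows are hypothetical illustration data, not the real results, and the `max_evalue` default is an assumption of this sketch.

```python
def filter_hits(tabular: str, min_identity: float = 50.0, max_evalue: float = 1e-5):
    """Parse 12-column tabular BLAST output and keep hits that pass both cut-offs."""
    hits = []
    for line in tabular.strip().splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank and comment lines
        f = line.split("\t")
        hit = {
            "subject": f[1],
            "identity": float(f[2]),
            "qstart": int(f[6]), "qend": int(f[7]),
            "sstart": int(f[8]), "send": int(f[9]),
            "evalue": float(f[10]),
        }
        if hit["identity"] >= min_identity and hit["evalue"] <= max_evalue:
            hits.append(hit)
    return hits


# Hypothetical rows for illustration only
rows = [
    ["bait", "cat_seq_1", "76.0", "264", "60", "3", "87", "350", "1000", "1263", "1e-47", "200"],
    ["bait", "cat_seq_2", "45.0", "40", "22", "0", "10", "49", "500", "539", "2.1", "20"],
]
table = "\n".join("\t".join(r) for r in rows)

hits = filter_hits(table)
print(sorted({h["subject"] for h in hits}))  # ['cat_seq_1']
```

Only one subject sequence passing the filter is what supports the "1 sequence" evidence. Whether that sequence holds one or several gene copies is a separate question.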

  2. Does the sequence contain more than one gene?

[pic 2]

NO, THERE IS NOT MORE THAN ONE GENE: THE QUERY COORDINATES ARE CONTINUOUS AND NON-REPEATING ACROSS THE RANGES.
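The reasoning is that a second gene copy would make the alignment ranges cover the same stretch of the query twice. A minimal sketch of that check in Python, assuming all ranges lie on the plus strand of one subject sequence; the coordinates are hypothetical, and `max_overlap` (a small tolerance for ranges that overlap at their ends) is an assumption of this sketch.

```python
def looks_like_single_gene(ranges, max_overlap: int = 10) -> bool:
    """ranges: list of (query_start, query_end, subject_start, subject_end).

    Walks along the subject sequence and checks that the query coordinates
    only move forward, never back over a region that was already matched.
    """
    ordered = sorted(ranges, key=lambda r: r[2])  # order by subject start
    for prev, curr in zip(ordered, ordered[1:]):
        if curr[0] < prev[1] - max_overlap:
            return False  # the query restarts: the same region matched twice
    return True


# Hypothetical ranges: exons of one gene, in order along the subject
one_copy = [(87, 350, 1000, 1263), (356, 560, 1500, 1704), (561, 830, 1900, 2169)]
# Same ranges plus a second match of query 90-340 further downstream
two_copies = one_copy + [(90, 340, 9000, 9250)]

print(looks_like_single_gene(one_copy))    # True
print(looks_like_single_gene(two_copies))  # False
```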

COLLECT THE CD19 GENE

Query  87        AGAGGGAGATAACGCTGTGCTGCAGTGCCTCAAGGGGACCTCAGATGGCCCCACTCAGCA  146

...
