{"id":574,"date":"2025-01-28T07:03:48","date_gmt":"2025-01-28T07:03:48","guid":{"rendered":"https:\/\/janusai.pro\/?p=574"},"modified":"2025-01-28T08:08:08","modified_gmt":"2025-01-28T08:08:08","slug":"released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut","status":"publish","type":"post","link":"https:\/\/janusai.pro\/pt\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/","title":{"rendered":"Released late at night! DeepSeek redefines AI image generation and understanding as the groundbreaking Janus-Pro comprehensive model makes its debut!"},"content":{"rendered":"<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"915\" height=\"564\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png\" alt=\"\" class=\"wp-image-580\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png 915w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-300x185.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-768x473.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-18x12.png 18w\" sizes=\"auto, (max-width: 915px) 100vw, 915px\" \/><\/figure>\n\n\n\n<p><strong>Key highlights<\/strong><br>\ud83d\udd39&nbsp;<strong>Unified Transformer architecture<\/strong>: A single model handles both image understanding&nbsp;<em>and<\/em>&nbsp;generation, eliminating the need for separate systems.<br>\ud83d\udd39&nbsp;<strong>Scalable and open source<\/strong>: Available in&nbsp;<strong>1B<\/strong>&nbsp;and&nbsp;<strong>7B<\/strong>&nbsp;parameter versions (MIT-licensed), optimized for a range of applications and commercial 
use.<br>\ud83d\udd39&nbsp;<strong>State-of-the-art performance<\/strong>: Outperforms Stable Diffusion and OpenAI's DALL-E 3 on benchmarks such as GenEval and DPG-Bench.<br>\ud83d\udd39&nbsp;<strong>Simplified deployment<\/strong>: The streamlined architecture reduces training and inference costs while retaining flexibility.<\/p>\n\n\n\n<p><strong>Model links<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Janus-Pro-7B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-7B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">HuggingFace<\/a><\/li>\n\n\n\n<li><strong>Janus-Pro-1B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-1B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">HuggingFace<\/a><\/li>\n\n\n\n<li><strong>GitHub<\/strong>:&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Code and documentation<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_82_2 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of contents<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle table of contents\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewbox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" 
fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewbox=\"0 0 24 24\" version=\"1.2\" baseprofile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1' ><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/janusai.pro\/pt\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Why_Janus-Pro_Stands_Out\" >Why Janus-Pro Stands Out<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/janusai.pro\/pt\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Benchmark_Dominance\" >Benchmark Dominance<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/janusai.pro\/pt\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Technical_Breakdown\" >Technical Breakdown<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/janusai.pro\/pt\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Community_Buzz\" >Community Buzz<\/a><\/li><\/ul><\/nav><\/div>\n<h3 class=\"wp-block-heading\"><span 
class=\"ez-toc-section\" id=\"Why_Janus-Pro_Stands_Out\"><\/span><strong>Why Janus-Pro Stands Out<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>1. Dual superpowers in a single model<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Understanding mode<\/strong>: Uses&nbsp;<strong>SigLIP-L<\/strong>&nbsp;(the \"super glasses\") to analyze images (up to 384\u00d7384) and text.<\/li>\n\n\n\n<li><strong>Generation mode<\/strong>: Leverages&nbsp;<strong>Rectified Flow<\/strong>&nbsp;+&nbsp;<strong>SDXL-VAE<\/strong>&nbsp;(the \"magic brush\") to create high-quality images.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Brainpower and training<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>LLM core<\/strong>: Built on DeepSeek's powerful language model (1.5B\/7B parameters), which excels at contextual reasoning.<\/li>\n\n\n\n<li><strong>Training pipeline<\/strong>: Pretraining on massive datasets \u2192 supervised fine-tuning \u2192 EMA optimization for peak performance.<\/li>\n<\/ul>\n\n\n\n<p><strong>3. Why a Transformer over diffusion?<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Task versatility<\/strong>: Prioritizes unified understanding + generation, whereas diffusion models focus purely on image quality.<\/li>\n\n\n\n<li><strong>Efficiency<\/strong>: Autoregressive generation (single step) vs. 
diffusion's iterative denoising (e.g., 20 steps for Stable Diffusion).<\/li>\n\n\n\n<li><strong>Cost-effectiveness<\/strong>: A single Transformer backbone simplifies training and deployment.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"955\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg\" alt=\"\" class=\"wp-image-578\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-300x280.jpeg 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-768x716.jpeg 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-13x12.jpeg 13w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4.jpeg 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Benchmark_Dominance\"><\/span><strong>Benchmark Dominance<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>Multimodal understanding<\/strong><br>Janus-Pro-7B outperforms specialized models (e.g., LLaVA) on four key benchmarks and scales smoothly with parameter count.<\/p>\n\n\n\n<p><strong>Text-to-image generation<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>GenEval<\/strong>: Matches or exceeds SDXL and DALL-E 3.<\/li>\n\n\n\n<li><strong>DPG-Bench<\/strong>:&nbsp;<strong>84.2% accuracy<\/strong>&nbsp;(Janus-Pro-7B), surpassing all competitors.<\/li>\n<\/ul>\n\n\n\n<p><strong>Real-world 
tests<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Speed<\/strong>: ~15 seconds\/image (L4 GPU, 22GB VRAM).<\/li>\n\n\n\n<li><strong>Quality<\/strong>: Strong prompt adherence, though fine details still need refinement.<\/li>\n\n\n\n<li><strong>Colab demo<\/strong>:&nbsp;<a href=\"https:\/\/colab.research.google.com\/drive\/1V3bH2oxhikj_B_EYy5yRG_9yqSqxxqgS?usp=sharing\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Try Janus-Pro-7B<\/a>&nbsp;(Pro tier required).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Technical_Breakdown\"><\/span><strong>Technical Breakdown<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>Architecture<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"376\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png\" alt=\"\" class=\"wp-image-579\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-300x110.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-768x282.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-18x7.png 18w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640.png 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Understanding path<\/strong>: Clean image \u2192 SigLIP-L encoder \u2192 LLM \u2192 text response.<\/li>\n\n\n\n<li><strong>Generation path<\/strong>: Noisy image \u2192 Rectified Flow decoder + LLM \u2192 iterative denoising.<\/li>\n<\/ul>\n\n\n\n<p><strong>Key 
innovations<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Decoupled visual encoding<\/strong>: Separate paths for understanding and generation avoid \"role conflict\" in the vision modules.<\/li>\n\n\n\n<li><strong>Shared Transformer core<\/strong>: Enables knowledge transfer across tasks (e.g., learning the concept of \"cat\" helps with both recognizing and drawing one).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Community_Buzz\"><\/span><strong>Community Buzz<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>AK (AI researcher)<\/strong>:&nbsp;<em>\"Janus-Pro's simplicity and flexibility make it a prime candidate for next-generation multimodal systems. By decoupling the vision pathways while keeping a unified Transformer, it balances specialization with generalization - a rare feat.\"<\/em><\/p>\n<\/blockquote>\n\n\n\n<p><strong>Why the MIT license matters<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Freedom<\/strong>: Use, modify, and distribute commercially with minimal restrictions.<\/li>\n\n\n\n<li><strong>Transparency<\/strong>: Full access to the code accelerates community-driven improvements.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>Final takeaway<\/strong><br>DeepSeek's Janus-Pro is not just another AI model - it is a paradigm shift. 
By unifying understanding and generation in a single model, it opens the door to smarter creative tools, real-time applications, and cost-effective deployments. With open-source access and MIT licensing, it could be the catalyst for the next wave of multimodal innovation. \ud83d\ude80<\/p>\n\n\n\n<p><em>For developers: Check out the&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">ComfyUI nodes<\/a>&nbsp;and join the wave of experimentation!<\/em><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>This post is sponsored by:<\/p>\n\n\n\n<a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.prod.website-files.com\/63d8afd87da01fb58ea3fbcb\/6487e2868c6c8f93b4828827_dang-badge.png\" alt=\"Dang.ai\" style=\"width: 150px; height: 54px;\" width=\"150\" height=\"54\"\/><\/a>\n\n\n\n<p><a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noreferrer noopener\"><\/a><\/p>\n\n\n\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Key highlights \ud83d\udd39 Unified Transformer architecture: A single model handles both image understanding and generation, eliminating the need for separate systems. \ud83d\udd39 Scalable and open source: Available in 1B and 7B parameter versions (MIT-licensed), optimized for a range of applications and commercial use. \ud83d\udd39 State-of-the-art performance: Outperforms Stable Diffusion and OpenAI's DALL-E 3 on benchmarks such as GenEval and DPG-Bench. \ud83d\udd39 Simplified deployment: The streamlined architecture reduces training and inference costs while retaining flexibility. 
Model links...<\/p>","protected":false},"author":1,"featured_media":580,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kadence_starter_templates_imported_post":false,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-574","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/posts\/574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/comments?post=574"}],"version-history":[{"count":3,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/posts\/574\/revisions"}],"predecessor-version":[{"id":609,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/posts\/574\/revisions\/609"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/media\/580"}],"wp:attachment":[{"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/media?parent=574"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/categories?post=574"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/janusai.pro\/pt\/wp-json\/wp\/v2\/tags?post=574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}