{"id":574,"date":"2025-01-28T07:03:48","date_gmt":"2025-01-28T07:03:48","guid":{"rendered":"https:\/\/janusai.pro\/?p=574"},"modified":"2025-01-28T08:08:08","modified_gmt":"2025-01-28T08:08:08","slug":"released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut","status":"publish","type":"post","link":"https:\/\/janusai.pro\/fr\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/","title":{"rendered":"Released late at night! DeepSeek redefines AI image generation and understanding with the launch of the comprehensive Janus-Pro model!"},"content":{"rendered":"<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"915\" height=\"564\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png\" alt=\"\" class=\"wp-image-580\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png 915w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-300x185.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-768x473.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-18x12.png 18w\" sizes=\"auto, (max-width: 915px) 100vw, 915px\" \/><\/figure>\n\n\n\n<p><strong>Key Highlights<\/strong><br>\ud83d\udd39&nbsp;<strong>Unified Transformer architecture<\/strong>: A single model handles both image understanding&nbsp;<em>and<\/em>&nbsp;image generation, eliminating the need for separate systems.<br>\ud83d\udd39&nbsp;<strong>Scalable and open-source<\/strong>: Available in&nbsp;<strong>1B<\/strong>&nbsp;and&nbsp;<strong>7B<\/strong>&nbsp;parameter versions (MIT licensed), optimized for diverse 
applications and commercial use.<br>\ud83d\udd39&nbsp;<strong>State-of-the-art performance<\/strong>: Outperforms Stable Diffusion and OpenAI's DALL-E 3 on benchmarks such as GenEval and DPG-Bench.<br>\ud83d\udd39&nbsp;<strong>Simplified deployment<\/strong>: The streamlined architecture reduces training and inference costs while maintaining flexibility.<\/p>\n\n\n\n<p><strong>Model Links<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Janus-Pro-7B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-7B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Hugging Face<\/a><\/li>\n\n\n\n<li><strong>Janus-Pro-1B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-1B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Hugging Face<\/a><\/li>\n\n\n\n<li><strong>GitHub<\/strong>:&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Code and docs<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_82_2 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle Table of Contents\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewbox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 
0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewbox=\"0 0 24 24\" version=\"1.2\" baseprofile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1' ><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/janusai.pro\/fr\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Why_Janus-Pro_Stands_Out\" >Why Janus-Pro Stands Out<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/janusai.pro\/fr\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Benchmark_Dominance\" >Benchmark Dominance<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/janusai.pro\/fr\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Technical_Breakdown\" >Technical Breakdown<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/janusai.pro\/fr\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Community_Buzz\" >Community 
Buzz<\/a><\/li><\/ul><\/nav><\/div>\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Why_Janus-Pro_Stands_Out\"><\/span><strong>Why Janus-Pro Stands Out<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>1. Two superpowers in one model<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Understanding mode<\/strong>: Uses&nbsp;<strong>SigLIP-L<\/strong>&nbsp;(the \"super glasses\") to analyze images (up to 384\u00d7384) and text.<\/li>\n\n\n\n<li><strong>Generation mode<\/strong>: Leverages&nbsp;<strong>Rectified Flow<\/strong>&nbsp;+&nbsp;<strong>SDXL-VAE<\/strong>&nbsp;(the \"magic brush\") to create high-quality images.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Brain and training<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Core LLM<\/strong>: Built on DeepSeek's language model (1.5B\/7B parameters), which excels at contextual reasoning.<\/li>\n\n\n\n<li><strong>Training pipeline<\/strong>: Pretraining on massive datasets \u2192 supervised fine-tuning \u2192 EMA optimization for peak performance.<\/li>\n<\/ul>\n\n\n\n<p><strong>3. 
Why Transformers over diffusion?<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Task versatility<\/strong>: Prioritizes unified understanding and generation, whereas diffusion models focus solely on image quality.<\/li>\n\n\n\n<li><strong>Efficiency<\/strong>: Autoregressive generation (single-step) versus diffusion's iterative denoising (e.g., 20 steps for Stable Diffusion).<\/li>\n\n\n\n<li><strong>Cost-effectiveness<\/strong>: A single Transformer backbone simplifies training and deployment.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"955\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg\" alt=\"\" class=\"wp-image-578\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-300x280.jpeg 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-768x716.jpeg 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-13x12.jpeg 13w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4.jpeg 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Benchmark_Dominance\"><\/span><strong>Benchmark Dominance<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>\ud83d\udcca Multimodal understanding<\/strong><br>Janus-Pro-7B outperforms specialized 
models (e.g., LLaVA) on four key benchmarks, scaling smoothly with parameter count.<\/p>\n\n\n\n<p><strong>\ud83c\udfa8 Text-to-image generation<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>GenEval<\/strong>: Matches SDXL and DALL-E 3.<\/li>\n\n\n\n<li><strong>DPG-Bench<\/strong>:&nbsp;<strong>84.2% accuracy<\/strong>&nbsp;(Janus-Pro-7B), surpassing all competitors.<\/li>\n<\/ul>\n\n\n\n<p><strong>Real-world testing<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Speed<\/strong>: ~15 seconds\/image (L4 GPU, 22GB VRAM).<\/li>\n\n\n\n<li><strong>Quality<\/strong>: Strong prompt adherence, though minor details still need refinement.<\/li>\n\n\n\n<li><strong>Colab demo<\/strong>:&nbsp;<a href=\"https:\/\/colab.research.google.com\/drive\/1V3bH2oxhikj_B_EYy5yRG_9yqSqxxqgS?usp=sharing\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Try Janus-Pro-7B<\/a>&nbsp;(Pro tier required).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Technical_Breakdown\"><\/span><strong>Technical Breakdown<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>Architecture<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"376\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png\" alt=\"\" class=\"wp-image-579\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-300x110.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-768x282.png 768w, 
https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-18x7.png 18w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640.png 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Understanding pathway<\/strong>: Clean image \u2192 SigLIP-L encoder \u2192 LLM \u2192 text response.<\/li>\n\n\n\n<li><strong>Generation pathway<\/strong>: Noisy image \u2192 Rectified Flow decoder + LLM \u2192 iterative denoising.<\/li>\n<\/ul>\n\n\n\n<p><strong>Key innovations<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Decoupled visual encoding<\/strong>: Separate pathways for understanding and generation prevent \"role conflicts\" in the vision modules.<\/li>\n\n\n\n<li><strong>Shared Transformer core<\/strong>: Enables knowledge transfer across tasks (e.g., learning the concept of \"cat\" helps with both recognizing and drawing one).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Community_Buzz\"><\/span><strong>Community Buzz<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>AK (AI researcher)<\/strong>:&nbsp;<em>\"Janus-Pro's simplicity and flexibility make it a prime candidate for next-generation multimodal systems. 
By decoupling the vision pathways while keeping a unified Transformer, it balances specialization and generalization, which is a rare feat.\"<\/em><\/p>\n<\/blockquote>\n\n\n\n<p><strong>Why the MIT license matters<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Freedom<\/strong>: Use, modify, and distribute commercially with minimal restrictions.<\/li>\n\n\n\n<li><strong>Transparency<\/strong>: Full access to the code accelerates community-driven improvements.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>Final Take<\/strong><br>DeepSeek's Janus-Pro is not just another AI model; it is a paradigm shift. By unifying understanding and generation under one roof, it opens the door to smarter creative tools, real-time applications, and cost-efficient deployments. With open-source access and MIT licensing, it could be the catalyst for the next wave of multimodal innovation. 
\ud83d\ude80<\/p>\n\n\n\n<p><em>For developers: Check out the&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">ComfyUI nodes<\/a>&nbsp;and join the wave of experimentation!<\/em><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>This article is sponsored by:<\/p>\n\n\n\n<a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.prod.website-files.com\/63d8afd87da01fb58ea3fbcb\/6487e2868c6c8f93b4828827_dang-badge.png\" alt=\"Dang.ai\" style=\"width: 150px; height: 54px;\" width=\"150\" height=\"54\"\/><\/a>\n\n\n\n<p><a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noreferrer noopener\"><\/a><\/p>\n\n\n\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Key Highlights\ud83d\udd39 Unified Transformer architecture: A single model handles both image understanding and generation, eliminating the need for separate systems.\ud83d\udd39 Scalable &amp; open-source: Available in 1B and 7B parameter versions (MIT licensed), optimized for diverse applications and commercial use.\ud83d\udd39 State-of-the-art performance: Outperforms Stable Diffusion and OpenAI's DALL-E 3 on benchmarks such as GenEval and DPG-Bench.\ud83d\udd39 Simplified deployment: The streamlined architecture reduces training\/inference costs while maintaining flexibility. 
Model Links...<\/p>","protected":false},"author":1,"featured_media":580,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kadence_starter_templates_imported_post":false,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-574","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/posts\/574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/comments?post=574"}],"version-history":[{"count":3,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/posts\/574\/revisions"}],"predecessor-version":[{"id":609,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/posts\/574\/revisions\/609"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/media\/580"}],"wp:attachment":[{"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/media?parent=574"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/categories?post=574"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/janusai.pro\/fr\/wp-json\/wp\/v2\/tags?post=574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}