{"id":574,"date":"2025-01-28T07:03:48","date_gmt":"2025-01-28T07:03:48","guid":{"rendered":"https:\/\/janusai.pro\/?p=574"},"modified":"2025-01-28T08:08:08","modified_gmt":"2025-01-28T08:08:08","slug":"released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut","status":"publish","type":"post","link":"https:\/\/janusai.pro\/sv\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/","title":{"rendered":"Sl\u00e4ppt sent p\u00e5 kv\u00e4llen! DeepSeek omdefinierar AI-bildgenerering och -f\u00f6rst\u00e5else n\u00e4r den banbrytande Janus-Pro Comprehensive Model g\u00f6r sin debut!"},"content":{"rendered":"<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"915\" height=\"564\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png\" alt=\"\" class=\"wp-image-580\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png 915w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-300x185.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-768x473.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-18x12.png 18w\" sizes=\"auto, (max-width: 915px) 100vw, 915px\" \/><\/figure>\n\n\n\n<p><strong>Viktiga h\u00f6jdpunkter<\/strong><br>\ud83d\udd39&nbsp;<strong>Enhetlig transformatorarkitektur<\/strong>: En enda modell hanterar b\u00e5de bildf\u00f6rst\u00e5else&nbsp;<em>och<\/em>&nbsp;generation, vilket eliminerar behovet av separata system.<br>\ud83d\udd39&nbsp;<strong>Skalbar och \u00f6ppen k\u00e4llkod<\/strong>: Tillg\u00e4nglig i&nbsp;<strong>1B<\/strong>&nbsp;och&nbsp;<strong>7B<\/strong>&nbsp;parameterversioner (MIT-licensierade), optimerade f\u00f6r olika applikationer och kommersiell anv\u00e4ndning.<br>\ud83d\udd39&nbsp;<strong>Toppmodern prestanda<\/strong>: \u00f6vertr\u00e4ffar OpenAI:s DALL-E 3 och Stable Diffusion i benchmarks som GenEval och DPG-Bench.<br>\ud83d\udd39&nbsp;<strong>F\u00f6renklad drifts\u00e4ttning<\/strong>: Str\u00f6mlinjeformad arkitektur minskar kostnaderna f\u00f6r utbildning\/inferens samtidigt som flexibiliteten bibeh\u00e5lls.<\/p>\n\n\n\n<p><strong>L\u00e4nkar till modeller<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Janus-Pro-7B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-7B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Kramande ansikte<\/a><\/li>\n\n\n\n<li><strong>Janus-Pro-1B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-1B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Kramande ansikte<\/a><\/li>\n\n\n\n<li><strong>GitHub<\/strong>:&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Kod och dokument<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_82_2 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Inneh\u00e5llsf\u00f6rteckning<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"V\u00e4xla inneh\u00e5llsf\u00f6rteckning\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewbox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewbox=\"0 0 24 24\" version=\"1.2\" baseprofile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1' ><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/janusai.pro\/sv\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Why_Janus-Pro_Stands_Out\" >Varf\u00f6r Janus-Pro sticker ut<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/janusai.pro\/sv\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Benchmark_Dominance\" >Benchmark-dominans<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/janusai.pro\/sv\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Technical_Breakdown\" >Teknisk uppdelning<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/janusai.pro\/sv\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Community_Buzz\" >Samh\u00e4llsinformation<\/a><\/li><\/ul><\/nav><\/div>\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Why_Janus-Pro_Stands_Out\"><\/span><strong>Varf\u00f6r Janus-Pro sticker ut<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>1. Dubbla superkrafter i en modell<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>F\u00f6rst\u00e5else av l\u00e4get<\/strong>: Anv\u00e4ndningar&nbsp;<strong>SigLIP-L<\/strong>&nbsp;(\"superglas\u00f6gonen\") f\u00f6r att analysera bilder (upp till 384\u00d7384) och text.<\/li>\n\n\n\n<li><strong>Generationsl\u00e4ge<\/strong>: H\u00e4vst\u00e5ngseffekt&nbsp;<strong>Rektifierat fl\u00f6de<\/strong>&nbsp;+&nbsp;<strong>SDXL-VAE<\/strong>&nbsp;(den \"magiska penseln\") f\u00f6r att skapa h\u00f6gkvalitativa bilder.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Hj\u00e4rnkraft &amp; tr\u00e4ning<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Grundl\u00e4ggande LLM<\/strong>: Bygger p\u00e5 DeepSeeks kraftfulla spr\u00e5kmodell (1,5 miljarder\/7 miljarder parametrar), som \u00e4r utm\u00e4rkt f\u00f6r kontextuella resonemang.<\/li>\n\n\n\n<li><strong>Utbildning Pipeline<\/strong>: F\u00f6rtr\u00e4ning p\u00e5 massiva datam\u00e4ngder \u2192 \u00d6vervakad finjustering \u2192 EMA-optimering f\u00f6r b\u00e4sta prestanda.<\/li>\n<\/ul>\n\n\n\n<p><strong>3. Varf\u00f6r transformator i st\u00e4llet f\u00f6r diffusion?<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>M\u00e5ngsidighet i arbetsuppgifterna<\/strong>: Prioriterar enhetlig f\u00f6rst\u00e5else + generering, medan diffusionsmodeller enbart fokuserar p\u00e5 bildkvalitet.<\/li>\n\n\n\n<li><strong>Effektivitet<\/strong>: Autoregressiv generering (ett steg) j\u00e4mf\u00f6rt med diffusionens iterativa denoising (t.ex. 20 steg f\u00f6r Stable Diffusion).<\/li>\n\n\n\n<li><strong>Kostnadseffektivitet<\/strong>: Ett enda Transformer-backbone f\u00f6renklar utbildning och drifts\u00e4ttning.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"955\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg\" alt=\"\" class=\"wp-image-578\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-300x280.jpeg 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-768x716.jpeg 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-13x12.jpeg 13w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4.jpeg 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Benchmark_Dominance\"><\/span><strong>Benchmark-dominans<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>\ud83d\udcca Multimodal f\u00f6rst\u00e5else<\/strong><br>Janus-Pro-7B \u00f6vertr\u00e4ffar specialiserade modeller (t.ex. LLaVA) p\u00e5 fyra viktiga riktm\u00e4rken och skalar j\u00e4mnt med parameterstorleken.<\/p>\n\n\n\n<p><strong>\ud83c\udfa8 Generering av text-till-bild<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>GenEval<\/strong>: Motsvarar SDXL och DALL-E 3.<\/li>\n\n\n\n<li><strong>DPG-b\u00e4nk<\/strong>:&nbsp;<strong>84,2% noggrannhet<\/strong>&nbsp;(Janus-Pro-7B), vilket \u00f6vertr\u00e4ffar alla konkurrenter.<\/li>\n<\/ul>\n\n\n\n<p><strong>Testning i den verkliga v\u00e4rlden<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Hastighet<\/strong>: ~15 sekunder\/bild (L4 GPU, 22 GB VRAM).<\/li>\n\n\n\n<li><strong>Kvalitet<\/strong>: Mycket snabb efterlevnad, men mindre detaljer beh\u00f6ver finjusteras.<\/li>\n\n\n\n<li><strong>Colab Demo<\/strong>:&nbsp;<a href=\"https:\/\/colab.research.google.com\/drive\/1V3bH2oxhikj_B_EYy5yRG_9yqSqxxqgS?usp=sharing\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">F\u00f6rs\u00f6k Janus-Pro-7B<\/a>&nbsp;(Pro-niv\u00e5 kr\u00e4vs).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Technical_Breakdown\"><\/span><strong>Teknisk uppdelning<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>Arkitektur<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"376\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png\" alt=\"\" class=\"wp-image-579\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-300x110.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-768x282.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-18x7.png 18w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640.png 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>F\u00f6rst\u00e5else f\u00f6r Path<\/strong>: Ren bild \u2192 SigLIP-L-kodare \u2192 LLM \u2192 Textsvar.<\/li>\n\n\n\n<li><strong>Generationsv\u00e4g<\/strong>: Brusig bild \u2192 Rektifierad fl\u00f6desavkodare + LLM \u2192 Iterativ denoising.<\/li>\n<\/ul>\n\n\n\n<p><strong>Viktiga innovationer<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Frikopplad visuell kodning<\/strong>: Separata v\u00e4gar f\u00f6r f\u00f6rst\u00e5else\/generering f\u00f6rhindrar \"rollkonflikt\" i visionsmoduler.<\/li>\n\n\n\n<li><strong>Delad transformatork\u00e4rna<\/strong>: M\u00f6jligg\u00f6r kunskaps\u00f6verf\u00f6ring mellan olika uppgifter (t.ex. att l\u00e4ra sig \"katt\"-begrepp underl\u00e4ttar b\u00e5de igenk\u00e4nning och ritning).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Community_Buzz\"><\/span><strong>Samh\u00e4llsinformation<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>AK (AI-forskare)<\/strong>:&nbsp;<em>\"Janus-Pro:s enkelhet och flexibilitet g\u00f6r den till en utm\u00e4rkt kandidat f\u00f6r n\u00e4sta generations multimodala system. Genom att frikoppla synbanorna och samtidigt beh\u00e5lla en enhetlig transformator balanserar den specialisering med generalisering - en s\u00e4llsynt bedrift.\"<\/em><\/p>\n<\/blockquote>\n\n\n\n<p><strong>Varf\u00f6r MIT-licensen \u00e4r viktig<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Frihet<\/strong>: Anv\u00e4nd, modifiera och distribuera kommersiellt med minimala begr\u00e4nsningar.<\/li>\n\n\n\n<li><strong>\u00d6ppenhet<\/strong>: Full kod\u00e5tkomst p\u00e5skyndar f\u00f6rb\u00e4ttringar som drivs av samh\u00e4llet.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>Sista ordet<\/strong><br>DeepSeeks Janus-Pro \u00e4r inte bara ytterligare en AI-modell - det \u00e4r ett paradigmskifte. Genom att f\u00f6rena f\u00f6rst\u00e5else och generering under ett tak \u00f6ppnar den d\u00f6rrar f\u00f6r smartare kreativa verktyg, realtidsapplikationer och kostnadseffektiva implementeringar. Med tillg\u00e5ng till \u00f6ppen k\u00e4llkod och MIT-licensering kan detta vara katalysatorn f\u00f6r n\u00e4sta v\u00e5g av multimodal innovation. \ud83d\ude80<\/p>\n\n\n\n<p><em>F\u00f6r utvecklare: Ta en titt p\u00e5&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">ComfyUI noder<\/a>&nbsp;och h\u00e4ng med p\u00e5 experimentv\u00e5gen!<\/em><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>detta inl\u00e4gg \u00e4r sponsrat av:<\/p>\n\n\n\n<a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.prod.website-files.com\/63d8afd87da01fb58ea3fbcb\/6487e2868c6c8f93b4828827_dang-badge.png\" alt=\"Dang.ai\" style=\"width: 150px; height: 54px;\" width=\"150\" height=\"54\"\/><\/a>\n\n\n\n<p><a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noreferrer noopener\"><\/a><\/p>\n\n\n\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Viktiga h\u00f6jdpunkter\ud83d\udd39 Unified Transformer Architecture: En enda modell hanterar b\u00e5de bildf\u00f6rst\u00e5else och bildgenerering, vilket eliminerar behovet av separata system.\ud83d\udd39 Skalbar och \u00f6ppen k\u00e4llkod: Finns i 1B- och 7B-parameterversioner (MIT-licensierade), optimerade f\u00f6r olika applikationer och kommersiell anv\u00e4ndning.\ud83d\udd39 Toppmodern prestanda: B\u00e4ttre \u00e4n OpenAI:s DALL-E 3 och Stable Diffusion i benchmarks som GenEval och DPG-Bench.\ud83d\udd39 F\u00f6renklad implementering: Str\u00f6mlinjeformad arkitektur minskar kostnaderna f\u00f6r utbildning\/inferens samtidigt som flexibiliteten bibeh\u00e5lls. Modelll\u00e4nkar ...<\/p>","protected":false},"author":1,"featured_media":580,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kadence_starter_templates_imported_post":false,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-574","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/posts\/574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/comments?post=574"}],"version-history":[{"count":3,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/posts\/574\/revisions"}],"predecessor-version":[{"id":609,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/posts\/574\/revisions\/609"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/media\/580"}],"wp:attachment":[{"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/media?parent=574"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/categories?post=574"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/janusai.pro\/sv\/wp-json\/wp\/v2\/tags?post=574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}