{"id":574,"date":"2025-01-28T07:03:48","date_gmt":"2025-01-28T07:03:48","guid":{"rendered":"https:\/\/janusai.pro\/?p=574"},"modified":"2025-01-28T08:08:08","modified_gmt":"2025-01-28T08:08:08","slug":"released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut","status":"publish","type":"post","link":"https:\/\/janusai.pro\/da\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/","title":{"rendered":"Released late at night! DeepSeek redefines AI image generation and understanding as the groundbreaking Janus-Pro comprehensive model makes its debut!"},"content":{"rendered":"<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"915\" height=\"564\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png\" alt=\"\" class=\"wp-image-580\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png 915w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-300x185.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-768x473.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-18x12.png 18w\" sizes=\"auto, (max-width: 915px) 100vw, 915px\" \/><\/figure>\n\n\n\n<p><strong>Key Highlights<\/strong><br>\ud83d\udd39&nbsp;<strong>Unified Transformer architecture<\/strong>: A single model handles both image understanding&nbsp;<em>and<\/em>&nbsp;generation, eliminating the need for separate systems.<br>\ud83d\udd39&nbsp;<strong>Scalable and open source<\/strong>: Available in&nbsp;<strong>1B<\/strong>&nbsp;and&nbsp;<strong>7B<\/strong>&nbsp;parameter versions (MIT license), optimized for diverse applications and commercial 
use.<br>\ud83d\udd39&nbsp;<strong>State-of-the-art performance<\/strong>: Outperforms OpenAI's DALL-E 3 and Stable Diffusion on benchmarks such as GenEval and DPG-Bench.<br>\ud83d\udd39&nbsp;<strong>Simplified deployment<\/strong>: The streamlined architecture reduces training\/inference costs while preserving flexibility.<\/p>\n\n\n\n<p><strong>Model links<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Janus-Pro-7B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-7B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">HuggingFace<\/a><\/li>\n\n\n\n<li><strong>Janus-Pro-1B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-1B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">HuggingFace<\/a><\/li>\n\n\n\n<li><strong>GitHub<\/strong>:&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Code and documentation<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_72 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Table of Contents<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle table of contents\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewbox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" 
class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewbox=\"0 0 24 24\" version=\"1.2\" baseprofile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1' ><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/janusai.pro\/da\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Why_Janus-Pro_Stands_Out\" title=\"Why Janus-Pro Stands Out\">Why Janus-Pro Stands Out<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/janusai.pro\/da\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Benchmark_Dominance\" title=\"Benchmark Dominance\">Benchmark Dominance<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/janusai.pro\/da\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Technical_Breakdown\" title=\"Technical Breakdown\">Technical Breakdown<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/janusai.pro\/da\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Community_Buzz\" title=\"Community Buzz\">Community Buzz<\/a><\/li><\/ul><\/nav><\/div>\n<h3 
class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Why_Janus-Pro_Stands_Out\"><\/span><strong>Why Janus-Pro Stands Out<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>1. Two superpowers in one model<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Understanding mode<\/strong>: Uses&nbsp;<strong>SigLIP-L<\/strong>&nbsp;(the \"super glasses\") to analyze images (up to 384\u00d7384) and text.<\/li>\n\n\n\n<li><strong>Generation mode<\/strong>: Leverages&nbsp;<strong>Rectified Flow<\/strong>&nbsp;+&nbsp;<strong>SDXL-VAE<\/strong>&nbsp;(the \"magic brush\") to create high-quality images.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Brainpower and training<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Base LLM<\/strong>: Built on DeepSeek's powerful language model (1.5B\/7B parameters), which excels at contextual reasoning.<\/li>\n\n\n\n<li><strong>Training pipeline<\/strong>: Pretraining on massive datasets \u2192 Supervised fine-tuning \u2192 EMA optimization for peak performance.<\/li>\n<\/ul>\n\n\n\n<p><strong>3. Why Transformer over diffusion?<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Task versatility<\/strong>: Prioritizes unified understanding + generation, whereas diffusion models focus solely on image quality.<\/li>\n\n\n\n<li><strong>Efficiency<\/strong>: Autoregressive generation (single step) vs. diffusion's iterative denoising (e.g. 
20 steps for Stable Diffusion).<\/li>\n\n\n\n<li><strong>Cost-effectiveness<\/strong>: A single Transformer backbone simplifies training and deployment.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"955\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg\" alt=\"\" class=\"wp-image-578\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-300x280.jpeg 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-768x716.jpeg 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-13x12.jpeg 13w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4.jpeg 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Benchmark_Dominance\"><\/span><strong>Benchmark Dominance<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>\ud83d\udcca Multimodal understanding<\/strong><br>Janus-Pro-7B outperforms specialized models (e.g. 
LLaVA) on four key benchmarks and scales smoothly with parameter count.<\/p>\n\n\n\n<p><strong>\ud83c\udfa8 Text-to-image generation<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>GenEval<\/strong>: Matches or exceeds SDXL and DALL-E 3.<\/li>\n\n\n\n<li><strong>DPG-Bench<\/strong>:&nbsp;<strong>84.2% score<\/strong>&nbsp;(Janus-Pro-7B), ahead of every competitor in the comparison.<\/li>\n<\/ul>\n\n\n\n<p><strong>Real-world testing<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Speed<\/strong>: ~15 seconds\/image (L4 GPU, 22 GB VRAM).<\/li>\n\n\n\n<li><strong>Quality<\/strong>: Strong prompt adherence, although fine details still need polish.<\/li>\n\n\n\n<li><strong>Colab Demo<\/strong>:&nbsp;<a href=\"https:\/\/colab.research.google.com\/drive\/1V3bH2oxhikj_B_EYy5yRG_9yqSqxxqgS?usp=sharing\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Try Janus-Pro-7B<\/a>&nbsp;(Pro tier required).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Technical_Breakdown\"><\/span><strong>Technical Breakdown<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>Architecture<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"376\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png\" alt=\"\" class=\"wp-image-579\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-300x110.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-768x282.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-18x7.png 18w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640.png 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" 
\/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Understanding path<\/strong>: Clean image \u2192 SigLIP-L encoder \u2192 LLM \u2192 Text response.<\/li>\n\n\n\n<li><strong>Generation path<\/strong>: Noisy image \u2192 Rectified Flow decoder + LLM \u2192 Iterative denoising.<\/li>\n<\/ul>\n\n\n\n<p><strong>Key innovations<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Decoupled visual encoding<\/strong>: Separate pathways for understanding\/generation prevent \"role conflict\" in the vision modules.<\/li>\n\n\n\n<li><strong>Shared Transformer core<\/strong>: Enables knowledge transfer across tasks (e.g. learning the concept of \"cat\" helps with both recognition and drawing).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Community_Buzz\"><\/span><strong>Community Buzz<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>AK (AI researcher)<\/strong>:&nbsp;<em>\"Janus-Pro's simplicity and flexibility make it a prime candidate for next-generation multimodal systems. 
By decoupling the vision pathways while keeping a unified Transformer, it balances specialization with generalization - a rare feat.\"<\/em><\/p>\n<\/blockquote>\n\n\n\n<p><strong>Why the MIT license matters<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Freedom<\/strong>: Use, modify, and distribute commercially with minimal restrictions.<\/li>\n\n\n\n<li><strong>Transparency<\/strong>: Full access to the code accelerates community-driven improvements.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>Final take<\/strong><br>DeepSeek's Janus-Pro is not just another AI model - it is a paradigm shift. By unifying understanding and generation under one roof, it opens the door to smarter creative tools, real-time applications, and cost-effective deployments. With open-source access and an MIT license, it could be the catalyst for the next wave of multimodal innovation. 
\ud83d\ude80<\/p>\n\n\n\n<p><em>For developers: Check out the new&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">ComfyUI nodes<\/a>&nbsp;and join the wave of experimentation!<\/em><\/p>\n\n\n\n<p>This post is sponsored by:<\/p>\n\n\n\n<a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.prod.website-files.com\/63d8afd87da01fb58ea3fbcb\/6487e2868c6c8f93b4828827_dang-badge.png\" alt=\"Dang.ai\" style=\"width: 150px; height: 54px;\" width=\"150\" height=\"54\"\/><\/a>","protected":false},"excerpt":{"rendered":"<p>Key Highlights\ud83d\udd39 Unified Transformer architecture: A single model handles both image understanding and generation, eliminating the need for separate systems.\ud83d\udd39 Scalable and open source: Available in 1B and 7B parameter versions (MIT license), optimized for diverse applications and commercial use.\ud83d\udd39 State-of-the-art performance: Outperforms OpenAI's DALL-E 3 and Stable Diffusion on benchmarks such as GenEval and DPG-Bench.\ud83d\udd39 Simplified deployment: The streamlined architecture reduces training\/inference costs while preserving flexibility. 
Model links ...<\/p>","protected":false},"author":1,"featured_media":580,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kadence_starter_templates_imported_post":false,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-574","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/posts\/574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/comments?post=574"}],"version-history":[{"count":3,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/posts\/574\/revisions"}],"predecessor-version":[{"id":609,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/posts\/574\/revisions\/609"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/media\/580"}],"wp:attachment":[{"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/media?parent=574"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/categories?post=574"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/janusai.pro\/da\/wp-json\/wp\/v2\/tags?post=574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}