{"id":574,"date":"2025-01-28T07:03:48","date_gmt":"2025-01-28T07:03:48","guid":{"rendered":"https:\/\/janusai.pro\/?p=574"},"modified":"2025-01-28T08:08:08","modified_gmt":"2025-01-28T08:08:08","slug":"released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut","status":"publish","type":"post","link":"https:\/\/janusai.pro\/id\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/","title":{"rendered":"Dirilis Larut Malam! DeepSeek Mendefinisikan Ulang Pembuatan dan Pemahaman Gambar AI saat Model Komprehensif Janus-Pro yang inovatif memulai debutnya!"},"content":{"rendered":"<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"915\" height=\"564\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png\" alt=\"\" class=\"wp-image-580\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2.png 915w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-300x185.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-768x473.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-2-18x12.png 18w\" sizes=\"auto, (max-width: 915px) 100vw, 915px\" \/><\/figure>\n\n\n\n<p><strong>Sorotan Utama<\/strong><br>\ud83d\udd39&nbsp;<strong>Arsitektur Trafo Terpadu<\/strong>: Model tunggal menangani kedua pemahaman gambar&nbsp;<em>dan<\/em>&nbsp;generasi, sehingga tidak memerlukan sistem yang terpisah.<br>\ud83d\udd39&nbsp;<strong>Dapat diskalakan &amp; Sumber Terbuka<\/strong>: Tersedia dalam&nbsp;<strong>1B<\/strong>&nbsp;dan&nbsp;<strong>7B<\/strong>&nbsp;versi parameter (berlisensi MIT), dioptimalkan untuk beragam aplikasi dan penggunaan komersial.<br>\ud83d\udd39&nbsp;<strong>Pertunjukan Seni Mutakhir<\/strong>: Mengungguli DALL-E 3 dan Stable Diffusion dari OpenAI dalam benchmark seperti GenEval dan DPG-Bench.<br>\ud83d\udd39&nbsp;<strong>Penerapan yang Disederhanakan<\/strong>: Arsitektur yang ramping mengurangi biaya pelatihan\/penyimpulan sekaligus mempertahankan fleksibilitas.<\/p>\n\n\n\n<p><strong>Tautan Model<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Janus-Pro-7B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-7B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">HuggingFace<\/a><\/li>\n\n\n\n<li><strong>Janus-Pro-1B<\/strong>:&nbsp;<a href=\"https:\/\/huggingface.co\/deepseek-ai\/Janus-Pro-1B\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">HuggingFace<\/a><\/li>\n\n\n\n<li><strong>GitHub<\/strong>:&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Kode &amp; Dokumen<\/a><\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_82_2 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">Daftar Isi<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Beralih Daftar Isi\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Beralih<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewbox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewbox=\"0 0 24 24\" version=\"1.2\" baseprofile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1' ><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/janusai.pro\/id\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Why_Janus-Pro_Stands_Out\" >Mengapa Janus-Pro Menonjol<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/janusai.pro\/id\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Benchmark_Dominance\" >Dominasi Tolok Ukur<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/janusai.pro\/id\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Technical_Breakdown\" >Perincian Teknis<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-3'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/janusai.pro\/id\/released-late-at-night-deepseek-redefines-ai-image-generation-and-understanding-as-the-groundbreaking-janus-pro-comprehensive-model-makes-its-debut\/#Community_Buzz\" >Gebrakan Komunitas<\/a><\/li><\/ul><\/nav><\/div>\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Why_Janus-Pro_Stands_Out\"><\/span><strong>Mengapa Janus-Pro Menonjol<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>1. Kekuatan Super Ganda dalam Satu Model<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Memahami Mode<\/strong>: Penggunaan&nbsp;<strong>SigLIP-L<\/strong>&nbsp;(\"kacamata super\") untuk menganalisis gambar (hingga 384\u00d7384) dan teks.<\/li>\n\n\n\n<li><strong>Mode Generasi<\/strong>: Leverage&nbsp;<strong>Aliran yang Diperbaiki<\/strong>&nbsp;+&nbsp;<strong>SDXL-VAE<\/strong>&nbsp;(\"kuas ajaib\") untuk menciptakan gambar berkualitas tinggi.<\/li>\n<\/ul>\n\n\n\n<p><strong>2. Kekuatan Otak &amp; Pelatihan<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Inti LLM<\/strong>: Dibangun di atas model bahasa DeepSeek yang kuat (parameter 1,5B\/7B), unggul dalam penalaran kontekstual.<\/li>\n\n\n\n<li><strong>Jalur Pelatihan<\/strong>: Pra-pelatihan pada dataset yang sangat besar \u2192 Penyempurnaan yang diawasi \u2192 Optimalisasi EMA untuk kinerja puncak.<\/li>\n<\/ul>\n\n\n\n<p><strong>3. Mengapa Transformasi Dibanding Difusi?<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Keserbagunaan Tugas<\/strong>: Memprioritaskan pemahaman + generasi terpadu, sedangkan model difusi hanya berfokus pada kualitas gambar.<\/li>\n\n\n\n<li><strong>Efisiensi<\/strong>: Pembangkitan autoregresif (satu langkah) vs. denoising iteratif difusi (misalnya, 20 langkah untuk Difusi Stabil).<\/li>\n\n\n\n<li><strong>Efektivitas Biaya<\/strong>: Satu tulang punggung Transformer menyederhanakan pelatihan dan penerapan.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"955\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg\" alt=\"\" class=\"wp-image-578\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-1024x955.jpeg 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-300x280.jpeg 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-768x716.jpeg 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4-13x12.jpeg 13w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/b84eb858a5b578c05460fcee5e528fd4.jpeg 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Benchmark_Dominance\"><\/span><strong>Dominasi Tolok Ukur<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>\ud83d\udcca Pemahaman Multimodal<\/strong><br>Janus-Pro-7B mengungguli model khusus (mis., LLaVA) pada empat tolok ukur utama, menskalakan dengan lancar dengan ukuran parameter.<\/p>\n\n\n\n<p><strong>\ud83c\udfa8 Pembuatan Teks-ke-Gambar<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>GenEval<\/strong>: Cocok dengan SDXL dan DALL-E 3.<\/li>\n\n\n\n<li><strong>DPG-Bench<\/strong>:&nbsp;<strong>Akurasi 84,2%<\/strong>&nbsp;(Janus-Pro-7B), mengungguli semua pesaing.<\/li>\n<\/ul>\n\n\n\n<p><strong>Pengujian Dunia Nyata<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Kecepatan<\/strong>: ~15 detik\/gambar (GPU L4, VRAM 22GB).<\/li>\n\n\n\n<li><strong>Kualitas<\/strong>: Kepatuhan yang kuat, meskipun detail kecil perlu disempurnakan.<\/li>\n\n\n\n<li><strong>Demo Colab<\/strong>:&nbsp;<a href=\"https:\/\/colab.research.google.com\/drive\/1V3bH2oxhikj_B_EYy5yRG_9yqSqxxqgS?usp=sharing\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Coba Janus-Pro-7B<\/a>&nbsp;(Diperlukan tingkat Pro).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Technical_Breakdown\"><\/span><strong>Perincian Teknis<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<p><strong>Arsitektur<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"376\" src=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png\" alt=\"\" class=\"wp-image-579\" srcset=\"https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-1024x376.png 1024w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-300x110.png 300w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-768x282.png 768w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640-18x7.png 18w, https:\/\/janusai.pro\/wp-content\/uploads\/2025\/01\/640.png 1080w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Memahami Path<\/strong>: Gambar bersih \u2192 Penyandi SigLIP-L \u2192 LLM \u2192 Tanggapan teks.<\/li>\n\n\n\n<li><strong>Jalur Generasi<\/strong>: Gambar berisik \u2192 Dekoder Aliran yang diperbaiki + LLM \u2192 Denoisasi berulang.<\/li>\n<\/ul>\n\n\n\n<p><strong>Inovasi Utama<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Pengkodean Visual Terpisah<\/strong>: Jalur terpisah untuk pemahaman\/pembangkitan mencegah \"konflik peran\" dalam modul visi.<\/li>\n\n\n\n<li><strong>Inti Transformator Bersama<\/strong>: Memungkinkan transfer pengetahuan lintas tugas (misalnya, mempelajari konsep \"kucing\" membantu pengenalan dan menggambar).<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<h3 class=\"wp-block-heading\"><span class=\"ez-toc-section\" id=\"Community_Buzz\"><\/span><strong>Gebrakan Komunitas<\/strong><span class=\"ez-toc-section-end\"><\/span><\/h3>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><strong>AK (Peneliti AI)<\/strong>:&nbsp;<em>\"Kesederhanaan dan fleksibilitas Janus-Pro menjadikannya kandidat utama untuk sistem multimodal generasi berikutnya. Dengan memisahkan jalur penglihatan sekaligus mempertahankan Transformer yang terpadu, Transformer ini menyeimbangkan spesialisasi dengan generalisasi - suatu hal yang jarang terjadi.\"<\/em><\/p>\n<\/blockquote>\n\n\n\n<p><strong>Mengapa Lisensi MIT Penting<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Kebebasan<\/strong>: Menggunakan, memodifikasi, dan mendistribusikan secara komersial dengan batasan minimal.<\/li>\n\n\n\n<li><strong>Transparansi<\/strong>: Akses kode penuh mempercepat peningkatan yang digerakkan oleh komunitas.<\/li>\n<\/ul>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p><strong>Final Take<\/strong><br>Janus-Pro DeepSeek bukan sekadar model AI lainnya-ini adalah perubahan paradigma. Dengan menyatukan pemahaman dan generasi di bawah satu atap, ini membuka pintu untuk alat kreatif yang lebih cerdas, aplikasi real-time, dan penerapan yang hemat biaya. Dengan akses sumber terbuka dan lisensi MIT, ini bisa menjadi katalisator untuk gelombang inovasi multimodal berikutnya. \ud83d\ude80<\/p>\n\n\n\n<p><em>Untuk para pengembang: Lihat bagian&nbsp;<a href=\"https:\/\/github.com\/deepseek-ai\/Janus\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Node ComfyUI<\/a>&nbsp;dan bergabunglah dengan gelombang eksperimen!<\/em><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p>postingan ini disponsori oleh:<\/p>\n\n\n\n<a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/cdn.prod.website-files.com\/63d8afd87da01fb58ea3fbcb\/6487e2868c6c8f93b4828827_dang-badge.png\" alt=\"Dang.ai\" style=\"width: 150px; height: 54px;\" width=\"150\" height=\"54\"\/><\/a>\n\n\n\n<p><a href=\"https:\/\/dang.ai\/\" target=\"_blank\" rel=\"noreferrer noopener\"><\/a><\/p>\n\n\n\n<p><\/p>","protected":false},"excerpt":{"rendered":"<p>Sorotan Utama\ud83d\udd39 Arsitektur Transformer Terpadu: Satu model menangani pemahaman dan pembuatan gambar, sehingga tidak memerlukan sistem yang terpisah.\ud83d\udd39 Dapat diskalakan &amp; Sumber Terbuka: Tersedia dalam versi parameter 1B dan 7B (berlisensi MIT), dioptimalkan untuk beragam aplikasi dan penggunaan komersial \ud83d\udd39 Performa Canggih: Mengungguli DALL-E 3 dan Stable Diffusion dari OpenAI dalam tolok ukur seperti GenEval dan DPG-Bench.\ud83d\udd39 Penerapan yang Disederhanakan: Arsitektur yang disederhanakan mengurangi biaya pelatihan \/ kesimpulan dengan tetap mempertahankan fleksibilitas. Tautan Model...<\/p>","protected":false},"author":1,"featured_media":580,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_kadence_starter_templates_imported_post":false,"_kad_post_transparent":"","_kad_post_title":"","_kad_post_layout":"","_kad_post_sidebar_id":"","_kad_post_content_style":"","_kad_post_vertical_padding":"","_kad_post_feature":"","_kad_post_feature_position":"","_kad_post_header":false,"_kad_post_footer":false,"footnotes":""},"categories":[1],"tags":[],"class_list":["post-574","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/posts\/574","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/comments?post=574"}],"version-history":[{"count":3,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/posts\/574\/revisions"}],"predecessor-version":[{"id":609,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/posts\/574\/revisions\/609"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/media\/580"}],"wp:attachment":[{"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/media?parent=574"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/categories?post=574"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/janusai.pro\/id\/wp-json\/wp\/v2\/tags?post=574"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}