#vulkan

7 posts · 7 participants · 0 posts today

Hm, not happy with the SPIR-V to ISA compilation on my RDNA3: it reorders loads from shared memory in ways that are actually slightly less efficient...

Is there a way to instruct the ISA compiler via GLSL not to reorder loads from shared memory? (Not workgroup/subgroup barriers, just plain no-reordering.) I haven't yet tried whether volatile could do the trick 🤔

Does anyone have thoughts?

Updated node connections in the video mixer / synthesizer.

Before, changing the output size of a node broke the connection, which then had to be recreated. Changing the resolution of many nodes was an annoyance, especially with mixer connections, which have some values of their own.

It is now much more streamlined, and the resolution of any node can be changed at any time without the need to reconnect.

gitlab.freedesktop.org/AdeptVe

GitLab · Adept Veritatis / pw-videomix

#Hawaii is back in the spotlight! 🌋 The Kilauea volcano is spewing lava again and causing a stir. Experts are monitoring the situation closely, while residents and tourists marvel at the impressive natural spectacle from a safe distance. 🌊✨ More information here: de.euronews.com/2025/03/27/haw

#Vulkan #Kilauea #Naturereignis

Stay safe and informed! 📲

euronews · Hawaii: Kilauea volcano spews lava again
The Kilauea volcano in Hawaii has spewed lava again. The eruption is the 15th in a series of eruptions since December. There is no danger to the surrounding residential areas, the Volcanoes National Park confirmed.

📣 NEWS FOR #HAIKU: #NVIDIA GPU support coming soon! 🚀

Developer @X512 has successfully ported Nvidia kernel drivers to Haiku. The driver will support Turing+ GPUs and already includes Vulkan integration via Mesa's NVK.

Initial tests are working and show potential for future uses, including AI acceleration with llama.cpp.

A major step forward for the Haiku ecosystem and hardware compatibility!

#OpenSource #GPU #Vulkan #Driver #AlternativeOS

🔗 desktoponfire.com/haikuos/soft

Etna's Southeast Crater is active again! 🔥 Vulkane.net reports that it is firing from several vents. Strombolian eruptions and a lava flow were observed. INGV issued a warning. No danger to air traffic. More here: vulkane.net/blogmobil/aetna-su #Ätna #Vulkan #Eruption #Italien 🌋🇮🇹 #newz

Vulkane Net Newsblog · Etna: Southeast Crater fired from several vents
Renewed phase of Strombolian eruptions at Etna; INGV issued a warning

Starting to think that APIs like Vulkan are limited on purpose 😒

Testing an RDNA3 RX 7600 XT, which is supposed to have 128KB of LDS per WGP. The ISA documentation says, "A single work-group may allocate up to 64kB of LDS space". It also says later that if the dispatch is done in WGP mode (I can't tell how to achieve this in Vulkan), a work-group can access the full 128KB.

Now, Vulkan reports only 32KB accessible! Why so little? I don't think ROCm is limited this way, for instance.

Anybody from AMD around?
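
To put the three LDS figures from the post side by side, here is a rough back-of-the-envelope sketch. The limit Vulkan reports is presumably `VkPhysicalDeviceLimits::maxComputeSharedMemorySize`. The scenario (two square fp16 tiles staged in shared memory, tile edge a multiple of 16) and the helper names are assumptions made purely for illustration, not something taken from the post.

```python
# Sizes quoted in the post, in bytes.
LDS_PER_WGP = 128 * 1024            # hardware LDS per WGP
ISA_MAX_PER_WORKGROUP = 64 * 1024   # per-work-group limit quoted from the ISA docs
VULKAN_REPORTED = 32 * 1024         # shared memory limit Vulkan reports

FP16_BYTES = 2


def max_square_tile(lds_bytes, tiles=2, elem_bytes=FP16_BYTES, step=16):
    """Largest tile edge (multiple of `step`) such that `tiles` square
    tiles of that edge fit into `lds_bytes` (hypothetical helper)."""
    edge = 0
    while tiles * (edge + step) ** 2 * elem_bytes <= lds_bytes:
        edge += step
    return edge


def workgroups_per_wgp(lds_per_workgroup):
    """How many work-groups fit in one WGP if LDS is the only constraint."""
    return LDS_PER_WGP // lds_per_workgroup


for name, size in [("Vulkan", VULKAN_REPORTED),
                   ("ISA per work-group", ISA_MAX_PER_WORKGROUP),
                   ("WGP mode", LDS_PER_WGP)]:
    print(name, size, max_square_tile(size), workgroups_per_wgp(size))
```

Under these assumptions, the 32KB limit caps the two staged tiles at 80x80 elements each, versus 128x128 with 64KB. On the other hand, four work-groups each using the full 32KB would still fill the 128KB of a WGP, if LDS were the only constraint.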

I tried Nsight Graphics today for GPU Compute profiling (Vulkan not CUDA) and it's amazing what we get these days for GPU optimizations, great improvements! (I used it a lot many years ago) 🚀

My main complaint so far, and what makes it less useful, is that you don't have access to the ISA disassembly unless you are a company and have signed an NDA (!) 😒

So, credit to team red: at least with the AMD Radeon GPU Analyzer, I can get the ISA disassembly along with the register pressure 😎

So, I just received an AMD RX 7600 XT to test VK_KHR_COOPERATIVE_MATRIX on it and, for now, it seems a bit disappointing: only 25-30 TFLOPS with fp16, while the specs say it can go up to ~45 TFLOPS. My current Nvidia RTX 4700 can do 100 TFLOPS.
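
For context on how figures like these are usually derived, here is a minimal sketch. It uses the common convention of counting 2·M·N·K floating-point operations for an M×N×K matrix multiply. The matrix size and the timing are made-up values for illustration; the post does not say how its numbers were measured.

```python
def matmul_tflops(m, n, k, seconds):
    """Achieved TFLOPS for one M x N x K matrix multiply,
    counting one multiply and one add per inner-product term."""
    return 2.0 * m * n * k / seconds / 1e12


def fraction_of_peak(achieved_tflops, peak_tflops):
    """Share of the advertised peak that was actually reached."""
    return achieved_tflops / peak_tflops


# Hypothetical measurement: a 4096^3 fp16 matmul taking 5 ms.
achieved = matmul_tflops(4096, 4096, 4096, 0.005)
print(f"{achieved:.1f} TFLOPS")

# The figures from the post: 25-30 TFLOPS measured against a ~45 TFLOPS spec.
print(f"{fraction_of_peak(25, 45):.0%} to {fraction_of_peak(30, 45):.0%} of peak")
```

By this measure, the reported 25-30 TFLOPS is roughly 56% to 67% of the ~45 TFLOPS spec figure.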

Tried the Radeon GPU Profiler, but it doesn't seem to work in my setup with multiple non-AMD GPUs, what a shame (and frankly, this profiler is just so weird to use). Will try to see if I can disable my main display GPU, super annoying 😒