NVIDIA is excited to announce the release of the stdexec library on GitHub and in the 22.11 release of the NVIDIA HPC Software Development Kit. The stdexec library is a proof-of-concept implementation of the Sender model (or Senders) for asynchronous computing—expected to become part of the C++ Standard in C++26. Senders help you separate the logic of your algorithm – the work you actually care about – from the details of where and how it gets executed. Unlike the C++17 parallel algorithms, with Senders you can chain multiple asynchronous computations (like GPU kernels) without unnecessary synchronization.
Read MoreLast month I discussed why standards-based parallel programming should be in your HPC toolbox. Now, I am highlighting the successes of some of the developers who have already made standards-based parallelism an integral part of their strategy. As you will see, success with standards-based programming isn’t limited to just mini-apps.
Read MoreHPC application developers have long relied on programming abstractions that were developed and used almost exclusively within the realm of traditional HPC. In this post I discuss why you should be parallel programming natively with ISO C++ and ISO Fortran.
Read MoreIn this post we discuss the benefits of using Fortran DO CONCURRENT loops for GPU programming.
Read More