Super Computing Packages
-
Pigeons.jl34Distributed and parallel sampling from intractable distributions
-
FluxMPI.jl48Distributed Data Parallel Training of Deep Neural Networks
-
ThreadPinning.jl56Readily pin Julia threads to CPU processors
-
PreallocationTools.jl78Tools for building non-allocating pre-cached functions in Julia, allowing for GC-free usage of automatic differentiation in complex codes
-
AzureClusterlessHPC.jl33A Julia package for clusterless distributed computing on Azure
-
ConcurrentCollections.jl44Concurrent data structures for Julia
-
BenchmarkHistograms.jl33-
-
Polyester.jl169The cheapest threads you can find!
-
CheapThreads.jl169The cheapest threads you can find!
-
ParallelStencil.jl238Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
-
PartitionedArrays.jl76Vectors and sparse matrices partitioned into pieces for parallel distributed-memory computations.
-
LIKWID.jl41Julia wrapper for the performance monitoring and benchmarking suite LIKWID.
-
FoldsCUDA.jl50Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
-
Schedulers.jl8Elastic and fault tolerant parallel map and parallel map reduce methods. Part of the COFII framework.
-
PencilArrays.jl48Distributed Julia arrays using the MPI protocol
-
AMDGPU.jl228AMD GPU (ROCm) programming in Julia
-
CuCountMap.jl4Fast `StatsBase.countmap` for small types on the GPU via CUDA.jl
-
Metal.jl266Metal programming in Julia
-
LinuxPerf.jl37-
-
SystemBenchmark.jl41Julia package for benchmarking a system
-
GridapDistributed.jl70Parallel distributed-memory version of Gridap
-
FLoops.jl284Fast sequential, threaded, and distributed for-loops for Julia—fold for humans™
-
GPUCompiler.jl115Reusable compiler infrastructure for Julia GPU backends.
-
DaggerGPU.jl37GPU integrations for Dagger.jl
-
Gaius.jl113Divide and Conquer Linear Algebra
-
OwnTime.jl33A Julia profiling package that provides an "own time" and "total time" view of profiling data
-
KernelAbstractions.jl250Heterogeneous programming in Julia
-
BenchmarkCI.jl46-
-
ExaPF.jl47A Power Flow Solver for GPUs in Julia
-
ImplicitGlobalGrid.jl115Almost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid
-
Decentralized-Internet.jl488A SDK/library for decentralized web and distributing computing projects
-
StaticCompiler.jl395Compiles Julia code to a standalone library (experimental)
-
AnyMOD.jl55Julia framework for energy system models with a focus on multi-period capacity expansion
-
CUDA.jl974CUDA programming in Julia.
-
MPIClusterManagers.jl40Julia parallel constructs over MPI
-
DiffEqGPU.jl202GPU-acceleration routines for DifferentialEquations.jl and the broader SciML scientific machine learning ecosystem
-
CalibrateEmulateSample.jl50Stochastic Optimization, Learning, Uncertainty and Sampling
-
Heptapus.jl8-
-
GPUifyLoops.jl61-
-
DispatcherCache.jl1Adaptive persistency-based mechanism for Dispatcher task graphs
Loading more...