Appearance
AI
- AI Prompts
- The bfloat16 numerical format
- Independent analysis of AI language models and API providers
- What is 1 petaFLOPS equal to?
Google Gemini
Apple Intelligence
SemiAnalysis
AI Playground
karpathy
CUDA
The NVIDIA® CUDA® Toolkit provides a development environment for creating high-performance, GPU-accelerated applications.
The toolkit includes GPU-accelerated libraries, debugging and optimization tools, a C/C++ compiler, and a runtime library.
cuDNN
The NVIDIA CUDA® Deep Neural Network library (cuDNN) is a GPU-accelerated library of primitives for deep neural networks.
cuDNN provides highly tuned implementations for standard routines such as forward and backward convolution, attention, matmul, pooling, and normalization.
OpenMPI
The Open MPI Project is an open source Message Passing Interface implementation that is developed and maintained by a consortium of academic, research, and industry partners.
Metal
Metal powers hardware-accelerated graphics on Apple platforms by providing a low-overhead API, rich shading language, tight integration between graphics and compute, and an unparalleled suite of GPU profiling and debugging tools.
GPU Clouds
Google Cloud GPUs
Lambda
On-demand & reserved cloud NVIDIA GPUs for AI training & inference