Skip to main content

NVIDIA CUTLASS Python DSL

Project description

CUTLASS 4.x provides a Python native interfaces for writing high-performance CUDA kernels based on core CUTLASS and CuTe concepts without any performance compromises. This allows for a much smoother learning curve, orders of magnitude faster compile times, native integration with DL frameworks without writing glue code, and much more intuitive metaprogramming that does not require deep C++ expertise.

Overall we envision CUTLASS DSLs as a family of domain-specific languages (DSLs). With the release of 4.0, we are releasing the first of these in CuTe DSL. This is a low level programming model that is fully consistent with CuTe C++ abstractions — exposing core concepts such as layouts, tensors, hardware atoms, and full control over the hardware thread and data hierarchy.

CuTe DSL demonstrates optimal matrix multiply and other linear algebra operations targeting the programmable, high-throughput Tensor Cores implemented by NVIDIA's Ampere, Hopper, and Blackwell architectures.

We believe it will become an indispensable tool for students, researchers, and performance engineers alike — flattening the learning curve of GPU programming, rapidly prototyping kernel designs, and bringing optimized solutions into production.

CuTe DSL is currently in public beta and will graduate out of beta by end of summer 2025.

For more details please visit CUTLASS Documentation or CUTLASS Github.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

nvidia_cutlass_dsl_libs_base-4.5.0-cp314-cp314-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0-cp314-cp314-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0-cp313-cp313-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0-cp313-cp313-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0-cp312-cp312-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0-cp312-cp312-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0-cp311-cp311-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0-cp311-cp311-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

nvidia_cutlass_dsl_libs_base-4.5.0-cp310-cp310-manylinux_2_28_x86_64.whl (74.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

nvidia_cutlass_dsl_libs_base-4.5.0-cp310-cp310-manylinux_2_28_aarch64.whl (75.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0a60dfce3349984315306ef719ed1edf0e225527158f26019a5cf266e06cc45d
MD5 57f8afa1f7bd802f7fee73e14d540479
BLAKE2b-256 04c65aaa2dff6dfc615a83687df4462a91dad2ac1af85d6a9c91d9a6b9760a02

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp314-cp314-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp314-cp314-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 2e121b20f0a48122c9b48227d00a7d681189e1de2fd4d211f9661a4e1658f066
MD5 eb12f51b2d09247932ca0b44a328c3df
BLAKE2b-256 59852799e4de2fe7070cc4126ac501443d1cd7796b07ed880118e31956ae266a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8e58b016da5bb09bd1d809d0c025433edb36b279adfbcd107e96361b214bd8bc
MD5 f3bf13b7ae302ca58e5b1b3a6eeb1080
BLAKE2b-256 1c790dca3b465711ffb4c44b4252940cc5f51d2d4905e405707e5c6c2a83d3d6

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 90a4d802a03963fa36eb287fbc9b40a1374590fc7e8cc1b9673dee8872f75713
MD5 38f7021985813b76b708ee2bbb21ecab
BLAKE2b-256 ff2c21d5fc62e030a43c0f1a3dab6749fb632026a27d6a60f59975cd29a5d165

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 bd18322d9247f8c033a10ed4e519c4985ca6b4fb578ade382e5a264422ebd915
MD5 a1c580700b37b9fd2298d05601dd8ff6
BLAKE2b-256 c38b2c187400d85f7d2acb328f20499b7b05745dca8485cf6ad247d5f2b434cf

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 3f7c133d31fa82ae7db697fd6943a5f9a2c97c8a40ee1056c67ef29fe00974d8
MD5 2797b7b6299453de61e06ebeb57d5e48
BLAKE2b-256 f8d0924048cfa43e1cb546735cb332b05a4fb92c63c1a1ac566f06445f9eca58

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 7bb6de91b00a2b392cd834fec174a1461bf0f10a9b6d28086c8f4885aed27218
MD5 d87a70f8d95a19b6761d715f2e9e3638
BLAKE2b-256 a4362c2b3fc81a45a1bbbdcfd10c6d9793fd28848e6fefa6d4ed7c7c477f7d2a

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f8635ad1e0a670323cc729f167067fa880cb56577ec2e79afb80a35ab371912e
MD5 f76d61765a71fd6bb27e0a703390a8ec
BLAKE2b-256 b439155dcbcf942b2c170aa0d1115ef5f2d358d9916ddc7200ab6e70541b97a0

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 5cfdf52bea8feede5e512a094484956693cb87adaafa310991d2876653b1a88e
MD5 b3b79120bbf4a56d0e949abbdc46e27e
BLAKE2b-256 fe811229637e8a14e1129475b8260a6ce66058148fa85faf10c94f9f95de5ef6

See more details on using hashes here.

File details

Details for the file nvidia_cutlass_dsl_libs_base-4.5.0-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for nvidia_cutlass_dsl_libs_base-4.5.0-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 c78b18f2b44ca10a91bc76380ebd65bb7b86aa97a9330bae9b73eb0a1bc51d55
MD5 0b953dc49842549d44e0996569537eaf
BLAKE2b-256 044d1d0dc5f36f929885417acfff02af94f61d49e6d34acb480c080d4d887555

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page