Home
[HN Gopher] Triton Fork for Windows Support ___________________________________________________________________ Triton Fork for Windows Support Author : lnyan Score : 17 points Date : 2024-10-15 15:40 UTC (1 days ago) HTML web link (github.com) TEXT w3m dump (github.com) | yjftsjthsd-h wrote: | "Triton" here is apparently a programming language, which | upstream describes as | | > This is the development repository of Triton, a language and | compiler for writing highly efficient custom Deep-Learning | primitives. The aim of Triton is to provide an open-source | environment to write fast code at higher productivity than CUDA, | but also with higher flexibility than other existing DSLs. | | So if you clicked in expecting the illumos-based virtualization | platform, this isn't that. Though | | > This is the basis for torchao, which crucially changes some | large models from "can't run" to "can run" on consumer GPUs. | That's easier than supporting them in other quantization | frameworks, or letting the consumers use Linux or WSL | | Does sound neat on its own merits. | Scene_Cast2 wrote: | This is pretty great. PyTorch uses triton as the backend for | torch.compile (the big feature of PyTorch 2.0, and the necessary | part for making Flex Attention in the about to be released 2.5 | usably fast). | | Triton's team doesn't support Windows, and, worse yet, does not | accept community PRs to enable any sort of support. | | Here's the github issue: https://github.com/triton- | lang/triton/issues/1640 | | And here's the performance comparison of Flex Attention with and | without torch.compile (tldr it's 3x slower than a standard MHA | when not compiled): https://github.com/rasbt/LLMs-from- | scratch/blob/76e9a9ec02a1... | | EDIT: after taking a look at the repo, the only thing changed in | the "46 commits ahead of [official triton]" is the README. | Somewhat sketchy. | zorgmonkey wrote: | It is mentioned at the top of the readme, but the actual code | changes appear to be on the branch v3.1.x-windows (also a few | other branches with -windows in the name). Also the triton seem | willing to collaborate as long as the patches sent are | reasonably minimal and high quality https://github.com/triton- | lang/triton/pull/4045#issuecomment... ___________________________________________________________________ (page generated 2024-10-17 06:01 UTC)